Movatterモバイル変換

Multiset

From Wikipedia, the free encyclopedia

Mathematical set with repetitions allowed

This article is about the mathematical concept. For the computer science data structure, seeMultiset (abstract data type).

Inmathematics, amultiset (orbag, ormset) is a modification of the concept of aset that, unlike a set,^[1] allows for multiple instances for each of itselements. The number of instances given for each element is called themultiplicity of that element in the multiset. As a consequence, an infinite number of multisets exist that contain only elementsa andb, but vary in the multiplicities of their elements:

The set{a,b} contains only elementsa andb, each having multiplicity 1 when{a,b} is seen as a multiset.
In the multiset{a,a,b}, the elementa has multiplicity 2, andb has multiplicity 1.
In the multiset{a,a,a,b,b,b},a andb both have multiplicity 3.

These objects are all different when viewed as multisets, although they are the same set, since they all consist of the same elements. As with sets, and in contrast totuples, the order in which elements are listed does not matter in discriminating multisets, so{a,a,b} and{a,b,a} denote the same multiset. To distinguish between sets and multisets, a notation that incorporates square brackets is sometimes used: the multiset{a,a,b} can be denoted by[a,a,b].^[2]

Thecardinality or "size" of a multiset is the sum of the multiplicities of all its elements. For example, in the multiset{a,a,b,b,b,c} the multiplicities of the membersa,b, andc are respectively 2, 3, and 1, and therefore the cardinality of this multiset is 6.

Nicolaas Govert de Bruijn coined the wordmultiset in the 1970s, according toDonald Knuth.^[3]^: 694 However, the concept of multisets predates the coinage of the wordmultiset by many centuries. Knuth himself attributes the first study of multisets to the Indian mathematicianBhāskarāchārya, who describedpermutations of multisets around 1150. Other names have been proposed or used for this concept, includinglist,bunch,bag,heap,sample,weighted set,collection, andsuite.^[3]^: 694

History

[edit]

Wayne Blizard traced multisets back to the very origin of numbers, arguing that "in ancient times, the numbern was often represented by a collection ofn strokes,tally marks, or units."^[4] These and similar collections of objects can be regarded as multisets, because strokes, tally marks, or units are considered indistinguishable. This shows that people implicitly used multisets even before mathematics emerged.

Practical needs for this structure have caused multisets to be rediscovered several times, appearing in literature under different names.^[5]^: 323 For instance, they were important in earlyAI languages, such as QA4, where they were referred to asbags, a term attributed toPeter Deutsch.^[6] A multiset has been also called an aggregate, heap, bunch, sample, weighted set, occurrence set, and fireset (finitely repeated element set).^[5]^: 320^[7]

Although multisets were used implicitly from ancient times, their explicit exploration happened much later. The first known study of multisets is attributed to the Indian mathematicianBhāskarāchārya circa 1150, who described permutations of multisets.^[3]^: 694 The work ofMarius Nizolius (1498–1576) contains another early reference to the concept of multisets.^[8]Athanasius Kircher found the number of multiset permutations when one element can be repeated.^[9]Jean Prestet published a general rule for multiset permutations in 1675.^[10]John Wallis explained this rule in more detail in 1685.^[11]

Multisets appeared explicitly in the work ofRichard Dedekind.^[12]^[13]

Other mathematicians formalized multisets and began to study them as precise mathematical structures in the 20th century. For example,Hassler Whitney (1933) describedgeneralized sets ("sets" whosecharacteristic functions may take anyinteger value: positive, negative or zero).^[5]^: 326^[14]^: 405 Monro (1987) investigated thecategoryMul of multisets and theirmorphisms, defining amultiset as a set with anequivalence relation between elements "of the samesort", and amorphism between multisets as afunction that respectssorts. He also introduced amultinumber: a functionf (x) from a multiset to thenatural numbers, giving themultiplicity of elementx in the multiset. Monro argued that the concepts of multiset and multinumber are often mixed indiscriminately, though both are useful.^[5]^{: 327–328}^[15]

Examples

[edit]

One of the simplest and most natural examples is the multiset ofprime factors of a natural numbern. Here the underlying set of elements is the set of prime factors ofn. For example, the number120 has theprime factorization $120=2^{3}3^{1}5^{1},$ which gives the multiset{2, 2, 2, 3, 5}.

A related example is the multiset of solutions of analgebraic equation. Aquadratic equation, for example, has two solutions. However, in some cases they are both the same number. Thus the multiset of solutions of the equation could be{3, 5}, or it could be{4, 4}. In the latter case it has a solution of multiplicity 2. More generally, thefundamental theorem of algebra asserts that thecomplex solutions of apolynomial equation ofdegreed always form a multiset of cardinalityd.

A special case of the above are theeigenvalues of amatrix, whose multiplicity is usually defined as their multiplicity asroots of thecharacteristic polynomial. However two other multiplicities are naturally defined for eigenvalues, their multiplicities as roots of theminimal polynomial, and thegeometric multiplicity, which is defined as thedimension of thekernel ofA −λI (whereλ is an eigenvalue of the matrixA). These three multiplicities define three multisets of eigenvalues, which may be all different: LetA be an × n matrix inJordan normal form that has a single eigenvalue. Its multiplicity isn, its multiplicity as a root of the minimal polynomial is the size of the largest Jordan block, and its geometric multiplicity is the number of Jordan blocks.

Definition

[edit]

Amultiset may be formally defined as anordered pair(U,m) whereU is aset called auniverse or theunderlying set, and $m\colon U\to \mathbb {Z} _{\geq 0}$ is a function fromU to thenonnegative integers. The value⁠ $m(a)$ ⁠ for an element⁠ $a\in U$ ⁠ is called themultiplicity of⁠ $a {\displaystyle a}$ ⁠ in the multiset and intepreted as the number of occurrences of⁠ $a {\displaystyle a}$ ⁠ in the multiset.

Thesupport of a multiset is the subset of⁠ $U {\displaystyle U}$ ⁠ formed by the elements⁠ $a\in U$ ⁠ such that⁠ $m(a)>0$ ⁠. Afinite multiset is a multiset with afinite support. Most authors definemultisets as finite multisets. This is the case in this article, where, unless otherwise stated, all multisets are finite multisets.

Some authors^[16] define multisets with the additional constraint that⁠ $m(a)>0$ ⁠ for every⁠ $a {\displaystyle a}$ ⁠, or, equivalently, the support equals the underlying set.Multisets with infinite multiplicities have also been studied;^[17] they are not considered in this article.Some authors^[who?] define a multiset in terms of a finite index set⁠ $I {\displaystyle I}$ ⁠ and a function⁠ $f\colon I\rightarrow U$ ⁠ where the multiplicity of an element⁠ $a\in U$ ⁠ is⁠ $|f^{-1}(a)|$ ⁠, the number of elements of⁠ $I {\displaystyle I}$ ⁠ that get mapped to⁠ $a {\displaystyle a}$ ⁠ by⁠ $f {\displaystyle f}$ ⁠.

Multisets may be represented as sets, with some elements repeated. For example, the multiset with support⁠ $\{a,b\}$ ⁠ and multiplicity function such that⁠ $m(a)=2,\;m(b)=1$ ⁠ can be represented as{a,a,b}. A more compact notation, in case of high multiplicities is⁠ $\{(a,2),(b,1)\}$ ⁠ for the same multiset.

If $A=\{a_{1},\ldots ,a_{n}\},$ a multiset with support included in⁠ $A {\displaystyle A}$ ⁠ is often represented as $a_{1}^{m(a_{1})}\cdots a_{n}^{m(a_{n})},$ to which the computation rules ofindeterminates can be applied; that is, exponents 1 and factors with exponent 0 can be removed, and the multiset does not depend on the order of the factors. This allows extending the notation to infinite underlying sets as $\prod _{a\in U}a^{m(a)}.$ An advantage of notation is that it allows using the notation without knowing the exact support. For example, theprime factors of anatural number⁠ $n {\displaystyle n}$ ⁠ form a multiset such that $n=\prod _{p\;{\text{prime}}}p^{m(p)}=2^{m(2)}3^{m(3)}5^{m(5)}\cdots .$

The finite subsets of a set⁠ $U {\displaystyle U}$ ⁠ are exactly the multisets with underlying set⁠ $U {\displaystyle U}$ ⁠, such that⁠ $m(a)\leq 1$ ⁠ for every⁠ $a\in U$ ⁠.

Basic properties and operations

[edit]

Elements of a multiset are generally taken in a fixed setU, sometimes called auniverse, which is often the set ofnatural numbers. An element ofU that does not belong to a given multiset is said to have a multiplicity 0 in this multiset. This extends the multiplicity function of the multiset to a function fromU to the set $\mathbb {N}$ of non-negative integers. This defines aone-to-one correspondence between these functions and the multisets that have their elements inU.

This extended multiplicity function is commonly called simply themultiplicity function, and suffices for defining multisets when the universe containing the elements has been fixed. This multiplicity function is a generalization of theindicator function of asubset, and shares some properties with it.

Thesupport of a multiset $A {\displaystyle A}$ in a universeU is the underlying set of the multiset. Using the multiplicity function $m {\displaystyle m}$ , it is characterized as $\operatorname {Supp} (A):=\{x\in U\mid m_{A}(x)>0\}.$

A multiset isfinite if its support is finite, or, equivalently, if its cardinality $|A|=\sum _{x\in \operatorname {Supp} (A)}m_{A}(x)=\sum _{x\in U}m_{A}(x)$ is finite. Theempty multiset is the unique multiset with anempty support (underlying set), and thus a cardinality 0.

The usual operations of sets may be extended to multisets by using the multiplicity function, in a similar way to using the indicator function for subsets. In the following,A andB are multisets in a given universeU, with multiplicity functions $m_{A}$ and $m_{B}.$

Inclusion:A isincluded inB, denotedA ⊆B, if $m_{A}(x)\leq m_{B}(x)\quad \forall x\in U.$
Union: theunion (called, in some contexts, themaximum orlowest common multiple) ofA andB is the multisetC with multiplicity function^[13] $m_{C}(x)=\max(m_{A}(x),m_{B}(x))\quad \forall x\in U.$
Intersection: theintersection (called, in some contexts, theinfimum orgreatest common divisor) ofA andB is the multisetC with multiplicity function $m_{C}(x)=\min(m_{A}(x),m_{B}(x))\quad \forall x\in U.$
Sum: thesum ofA andB is the multisetC with multiplicity function $m_{C}(x)=m_{A}(x)+m_{B}(x)\quad \forall x\in U.$ It may be viewed as a generalization of thedisjoint union of sets. It defines acommutative monoid structure on the finite multisets in a given universe. This monoid is afree commutative monoid, with the universe as a basis.
Difference: thedifference ofA andB is the multisetC with multiplicity function $m_{C}(x)=\max(m_{A}(x)-m_{B}(x),0)\quad \forall x\in U.$

Two multisets aredisjoint if their supports aredisjoint sets. This is equivalent to saying that their intersection is the empty multiset or that their sum equals their union.

There is an inclusion–exclusion principle for finite multisets (similar tothe one for sets), stating that a finite union of finite multisets is the difference of two sums of multisets: in the first sum we consider all possible intersections of anodd number of the given multisets, while in the second sum we consider all possible intersections of aneven number of the given multisets.^{[citation needed]}

Counting multisets

[edit]

Bijection between 3-subsets of a 7-set (left)
and 3-multisets with elements from a 5-set (right)
So this illustrates that ${\textstyle {7 \choose 3}=\left(\!\!{5 \choose 3}\!\!\right).}$

Bijection between 3-subsets of a 7-set (left)
and 3-multisets with elements from a 5-set (right)
So this illustrates that ${\textstyle {7 \choose 3}=\left(\!\!{5 \choose 3}\!\!\right).}$

Recurrence relation

[edit]

Arecurrence relation for multiset coefficients may be given as $\left(\!\!{n \choose k}\!\!\right)=\left(\!\!{n \choose k-1}\!\!\right)+\left(\!\!{n-1 \choose k}\!\!\right)\quad {\mbox{for }}n,k>0$ with $\left(\!\!{n \choose 0}\!\!\right)=1,\quad n\in \mathbb {N} ,\quad {\mbox{and}}\quad \left(\!\!{0 \choose k}\!\!\right)=0,\quad k>0.$

The above recurrence may be interpreted as follows.Let $[n]:=\{1,\dots ,n\}$ be the source set. There is always exactly one (empty) multiset of size 0, and ifn = 0 there are no larger multisets, which gives the initial conditions.

Now, consider the case in whichn,k > 0. A multiset of cardinalityk with elements from[n] might or might not contain any instance of the final elementn. If it does appear, then by removingn once, one is left with a multiset of cardinalityk − 1 of elements from[n], and every such multiset can arise, which gives a total of $\left(\!\!{n \choose k-1}\!\!\right)$ possibilities.

Ifn does not appear, then our original multiset is equal to a multiset of cardinalityk with elements from[n − 1], of which there are $\left(\!\!{n-1 \choose k}\!\!\right).$

Thus, $\left(\!\!{n \choose k}\!\!\right)=\left(\!\!{n \choose k-1}\!\!\right)+\left(\!\!{n-1 \choose k}\!\!\right).$

Generating series

[edit]

Thegenerating function of the multiset coefficients is very simple, being $\sum _{d=0}^{\infty }\left(\!\!{n \choose d}\!\!\right)t^{d}={\frac {1}{(1-t)^{n}}}.$ As multisets are in one-to-one correspondence withmonomials, $\left(\!\!{n \choose d}\!\!\right)$ is also the number of monomials ofdegreed inn indeterminates. Thus, the above series is also theHilbert series of thepolynomial ring $k[x_{1},\ldots ,x_{n}].$

As $\left(\!\!{n \choose d}\!\!\right)$ is a polynomial inn, it and the generating function are well defined for anycomplex value ofn.

Generalization and connection to the negative binomial series

[edit]

The multiplicative formula allows the definition of multiset coefficients to be extended by replacingn by an arbitrary numberα (negative,real, or complex): $\left(\!\!{\alpha \choose k}\!\!\right)={\frac {\alpha ^{\overline {k}}}{k!}}={\frac {\alpha (\alpha +1)(\alpha +2)\cdots (\alpha +k-1)}{k(k-1)(k-2)\cdots 1}}\quad {\text{for }}k\in \mathbb {N} {\text{ and arbitrary }}\alpha .$

With this definition one has a generalization of the negative binomial formula (with one of the variables set to 1), which justifies calling the $\left(\!\!{\alpha \choose k}\!\!\right)$ negative binomial coefficients: $(1-X)^{-\alpha }=\sum _{k=0}^{\infty }\left(\!\!{\alpha \choose k}\!\!\right)X^{k}.$

ThisTaylor series formula is valid for all complex numbersα andX with|X| < 1. It can also be interpreted as anidentity offormal power series inX, where it actually can serve as definition of arbitrary powers of series with constant coefficient equal to 1; the point is that with this definition all identities hold that one expects forexponentiation, notably

$(1-X)^{-\alpha }(1-X)^{-\beta }=(1-X)^{-(\alpha +\beta )}\quad {\text{and}}\quad ((1-X)^{-\alpha })^{-\beta }=(1-X)^{-(-\alpha \beta )},$ and formulas such as these can be used to prove identities for the multiset coefficients.

Ifα is a nonpositive integern, then all terms withk > −n are zero, and the infinite series becomes a finite sum. However, for other values ofα, including positive integers andrational numbers, the series is infinite.

Applications

[edit]

Multisets have various applications.^[7] They are becoming fundamental incombinatorics.^[18]^[19]^[20]^[21] Multisets have become an important tool in the theory ofrelational databases, which often uses the synonymbag.^[22]^[23]^[24] For instance, multisets are often used to implement relations in database systems. In particular, a table (without a primary key) works as a multiset, because it can have multiple identical records. Similarly,SQL operates on multisets and returns identical records. For instance, consider "SELECT name FROM Student". In the case that there are multiple records with name "Sara" in the student table, all of them are shown. That means the result of an SQL query is a multiset; if the result were instead a set, the repetitive records in the result set would have been eliminated. Another application of multisets is in modelingmultigraphs. In multigraphs there can be multiple edges between any two givenvertices. As such, the entity that specifies the edges is a multiset, and not a set.

There are also other applications. For instance,Richard Rado used multisets as a device to investigate the properties of families of sets. He wrote, "The notion of a set takes no account of multiple occurrence of any one of its members, and yet it is just this kind of information that is frequently of importance. We need only think of the set of roots of a polynomialf (x) or thespectrum of alinear operator."^[5]^{: 328–329}

Generalizations

[edit]

Different generalizations of multisets have been introduced, studied and applied to solving problems.

Signed multisets (in which multiplicity of an element can be anyinteger)^[25]
Real-valued multisets (in which multiplicity of an element can be anyreal number)^[26]
Fuzzy multisets^[27]
Rough multisets^[28]
Hybrid sets^[29]
Multisets whose multiplicity is any real-valuedstep function^[30]
Soft multisets^[31]
Soft fuzzy multisets^[32]
Named sets (unification of all generalizations of sets)^[33]^[34]^[35]^[36]

Notes

[edit]

^The formula⁠ ${\tbinom {n+k-1}{k}}$ ⁠ does not work forn = 0 (where necessarily alsok = 0) if viewed as an ordinary binomial coefficient since it evaluates to⁠ ${\tbinom {-1}{0}}$ ⁠, however the formulan(n+1)(n+2)...(n+k−1)/k! does work in this case because the numerator is anempty product giving1/0! = 1. However⁠ ${\tbinom {n+k-1}{k}}$ ⁠ does make sense forn =k = 0 if interpreted as ageneralized binomial coefficient; indeed⁠ ${\tbinom {n+k-1}{k}}$ ⁠ seen as a generalized binomial coefficient equals the extreme right-hand side of the above equation.

References

[edit]

^Cantor, Georg; Jourdain, Philip E.B. (Translator) (1895)."beiträge zur begründung der transfiniten Mengenlehre" [contributions to the founding of the theory of transfinite numbers].Mathematische Annalen (in German). xlvi, xlix. New York Dover Publications (1954 English translation):481–512,207–246. Archived fromthe original on 2011-06-10.By a set (Menge) we are to understand any collection into a whole (Zusammenfassung zu einem Gansen) M of definite andseparate objects m (p.85)
^Hein, James L. (2003).Discrete mathematics. Jones & Bartlett Publishers. pp. 29–30.ISBN 0-7637-2210-3.
^^a ^b ^cKnuth, Donald E. (1998).Seminumerical Algorithms.The Art of Computer Programming. Vol. 2 (3rd ed.).Addison Wesley.ISBN 0-201-89684-2.
^Blizard, Wayne D (1989)."Multiset theory".Notre Dame Journal of Formal Logic.30 (1):36–66.doi:10.1305/ndjfl/1093634995.
^^a ^b ^c ^d ^eBlizard, Wayne D. (1991)."The Development of Multiset Theory".Modern Logic.1 (4):319–352.
^Rulifson, J. F.; Derkson, J. A.; Waldinger, R. J. (November 1972).QA4: A Procedural Calculus for Intuitive Reasoning (Technical report). SRI International. 73.
^^a ^bSingh, D.; Ibrahim, A. M.; Yohanna, T.; Singh, J. N. (2007). "An overview of the applications of multisets".Novi Sad Journal of Mathematics.37 (2):73–92.
^Angelelli, I. (1965). "Leibniz's misunderstanding of Nizolius' notion of 'multitudo'".Notre Dame Journal of Formal Logic (6):319–322.
^Kircher, Athanasius (1650).Musurgia Universalis. Rome: Corbelletti.
^Prestet, Jean (1675).Elemens des Mathematiques. Paris: André Pralard.
^Wallis, John (1685).A treatise of algebra. London: John Playford.
^Dedekind, Richard (1888).Was sind und was sollen die Zahlen?. Braunschweig: Vieweg. p. 114.
^^a ^bSyropoulos, Apostolos (2000). "Mathematics of multisets". In Calude, Cristian; Paun, Gheorghe; Rozenberg, Grzegorz; Salomaa, Arto (eds.).Multiset Processing, Mathematical, Computer Science, and Molecular Computing Points of View [Workshop on Multiset Processing, WMP 2000, Curtea de Arges, Romania, August 21–25, 2000]. Lecture Notes in Computer Science. Vol. 2235. Springer. pp. 347–358.doi:10.1007/3-540-45523-X_17.ISBN 978-3-540-43063-6.
^Whitney, Hassler (1933). "Characteristic Functions and the Algebra of Logic".Annals of Mathematics.34 (3):405–414.doi:10.2307/1968168.JSTOR 1968168.
^Monro, G. P. (1987). "The Concept of Multiset".Zeitschrift für Mathematische Logik und Grundlagen der Mathematik.33 (2):171–178.doi:10.1002/malq.19870330212.
^Aluffi, Paolo (2009).Algebra: Chapter 0. American Mathematical Society.ISBN 978-0821847817.
^Cf., for instance, Richard Brualdi,Introductory Combinatorics, Pearson.
^Aigner, M. (1979).Combinatorial Theory. New York/Berlin: Springer Verlag.
^Anderson, I. (1987).Combinatorics of Finite Sets. Oxford: Clarendon Press.ISBN 978-0-19-853367-2.
^Stanley, Richard P. (1997).Enumerative Combinatorics. Vol. 1. Cambridge University Press.ISBN 0-521-55309-1.
^Stanley, Richard P. (1999).Enumerative Combinatorics. Vol. 2. Cambridge University Press.ISBN 0-521-56069-1.
^Grumbach, S.; Milo, T (1996)."Towards tractable algebras for bags".Journal of Computer and System Sciences.52 (3):570–588.doi:10.1006/jcss.1996.0042.
^Libkin, L.; Wong, L. (1994). "Some properties of query languages for bags".Proceedings of the Workshop on Database Programming Languages. Springer Verlag. pp. 97–114.
^Libkin, L.; Wong, L. (1995). "On representation and querying incomplete information in databases with bags".Information Processing Letters.56 (4):209–214.doi:10.1016/0020-0190(95)00154-5.
^Blizard, Wayne D. (1990)."Negative Membership".Notre Dame Journal of Formal Logic.31 (3):346–368.doi:10.1305/ndjfl/1093635499.S2CID 42766971.
^Blizard, Wayne D. (1989). "Real-valued Multisets and Fuzzy Sets".Fuzzy Sets and Systems.33 (1):77–97.doi:10.1016/0165-0114(89)90218-2.
^Yager, R. R. (1986). "On the Theory of Bags".International Journal of General Systems.13 (1):23–37.doi:10.1080/03081078608934952.
^Grzymala-Busse, J. (1987). "Learning from examples based on rough multisets".Proceedings of the 2nd International Symposium on Methodologies for Intelligent Systems. Charlotte, North Carolina. pp. 325–332.{{cite book}}: CS1 maint: location missing publisher (link)
^Loeb, D. (1992)."Sets with a negative numbers of elements".Advances in Mathematics.91 (1):64–74.doi:10.1016/0001-8708(92)90011-9.
^Miyamoto, S. (2001). "Fuzzy Multisets and Their Generalizations".Multiset Processing. Lecture Notes in Computer Science. Vol. 2235. Berlin, Heidelberg: Springer. pp. 225–235.doi:10.1007/3-540-45523-X_11.ISBN 978-3-540-43063-6.
^Alkhazaleh, S.; Salleh, A. R.; Hassan, N. (2011). "Soft Multisets Theory".Applied Mathematical Sciences.5 (72):3561–3573.
^Alkhazaleh, S.; Salleh, A. R. (2012)."Fuzzy Soft Multiset Theory".Abstract and Applied Analysis.2012 350603:1–20.doi:10.1155/2012/350603.
^Burgin, Mark (1990)."Theory of Named Sets as a Foundational Basis for Mathematics".Structures in Mathematical Theories. San Sebastian. pp. 417–420.
^Burgin, Mark (1992). "On the concept of a multiset in cybernetics".Cybernetics and Systems Analysis.3:165–167.
^Burgin, Mark (2004). "Unified Foundations of Mathematics".arXiv:math/0403186.
^Burgin, Mark (2011).Theory of Named Sets. Mathematics Research Developments. Nova Science Pub Inc.ISBN 978-1-61122-788-8.