Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Shapley–Folkman lemma

This is a good article. Click here for more information.
From Wikipedia, the free encyclopedia
Sums of sets of vectors are nearly convex

The Shapley–Folkman lemma depicted by a diagram with two panes, one on the left and the other on the right. The left-hand pane displays four sets, which are displayed in a two-by-two array. Each of the sets contains exactly two points, which are displayed in red. In each set, the two points are joined by a pink line-segment, which is the convex hull of the original set. Each set has exactly one point that is indicated with a plus-symbol. In the top row of the two-by-two array, the plus-symbol lies in the interior of the line segment; in the bottom row, the plus-symbol coincides with one of the red-points. This completes the description of the left-hand pane of the diagram. The right-hand pane displays the Minkowski sum of the sets, which is the union of the sums having exactly one point from each summand-set; for the displayed sets, the sixteen sums are distinct points, which are displayed in red: The right-hand red sum-points are the sums of the left-hand red summand-points. The convex hull of the sixteen red-points is shaded in pink. In the pink interior of the right-hand sumset lies exactly one plus-symbol, which is the (unique) sum of the plus-symbols from the right-hand side. Comparing the left array and the right pane, one confirms that the right-hand plus-symbol is indeed the sum of the four plus-symbols from the left-hand sets, precisely two points from the original non-convex summand-sets and two points from the convex hulls of the remaining summand-sets.
The Shapley–Folkman lemma is illustrated by theMinkowski addition of four sets. The point (+) in theconvex hull of the Minkowski sum of the fournon-convex sets (right) is the sum of four points (+) from the (left-hand) sets—two points in two non-convex sets plus two points in the convex hulls of two sets. The convex hulls are shaded pink. The original sets each have exactly two points (shown as red dots).[1]

TheShapley–Folkman lemma is a result inconvex geometry that describes theMinkowski addition ofsets in avector space. The lemma may be intuitively understood as saying that, if the number of summed sets exceeds thedimension of the vector space, then their Minkowski sum is approximately convex.[1][2] It is named after mathematiciansLloyd Shapley andJon Folkman, but was first published by the economistRoss M. Starr.

Related results provide more refined statements about how close the approximation is. For example, theShapley–Folkman theorem provides anupper bound on thedistance between any point in the Minkowski sum and itsconvex hull. This upper bound is sharpened by theShapley–Folkman–Starr theorem (alternatively,Starr's corollary).[3]

The Shapley–Folkman lemma has applications ineconomics,optimization andprobability theory.[3] In economics, it can be used to extend results proved forconvex preferences to non-convex preferences. In optimization theory, it can be used to explain the successful solution of minimization problems that are sums of manyfunctions.[4][5] In probability, it can be used to prove alaw of large numbers forrandom sets.[6]

Introductory example

[edit]

Aset is said to beconvex if everyline segment joining two of its points is asubset in the set. The soliddisk{\displaystyle \bullet } is a convex set, but thecircle{\displaystyle \circ } is not, because the line segment joining two distinct points{\displaystyle \oslash } is not a subset of the circle.[7] Theconvex hull of a setQ{\displaystyle Q} is the smallest convex set that containsQ{\displaystyle Q}.[8]

Minkowski addition is an operation on sets that forms the set of sums ofmembers of the sets, with one member from each set.[9] For example, adding the set consisting of theintegers zero and one to itself yields the set consisting of zero, one, and two:{0,1}+{0,1}={0+0,0+1,1+0,1+1}={0,1,2}.{\displaystyle \{0,1\}+\{0,1\}=\{0+0,0+1,1+0,1+1\}=\{0,1,2\}.} This subset of the integers{0,1,2}{\displaystyle \{0,1,2\}} is contained in theinterval ofreal numbers[0,2]{\displaystyle [0,2]}, which is its convex hull. The Shapley–Folkman lemma implies that every point in[0,2]{\displaystyle [0,2]} is the sum of an integer from{0,1}{\displaystyle \{0,1\}} and a real number from[0,1]{\displaystyle [0,1]}: to get the convex hull of the Minkowski sum of{0,1}{\displaystyle \{0,1\}} with itself, only one of the summands needs to be replaced by its convex hull.[10]

The non-convex set{0,1}{\displaystyle \{0,1\}} and its convex hull[0,1]{\displaystyle [0,1]} are atHausdorff distance12{\displaystyle {\tfrac {1}{2}}} from each other,and this distance remains the same for the non-convex set{0,1,2}{\displaystyle \{0,1,2\}} and its convex hull[0,2]{\displaystyle [0,2]}. In both cases the convex hull contains points such as12{\displaystyle {\tfrac {1}{2}}} that are at distance12{\displaystyle {\tfrac {1}{2}}} from the members of the non-convex set. In this example, the Minkowski sum operation does not decrease the distance between the sum and its convex hull. But when summing is replaced byaveraging, by scaling the sum by the number of terms in the sum, the distance between the scaled Minkowski sum and its convex hull does go down. The distance between the average Minkowski sum12({0,1}+{0,1})={0,12,1}{\displaystyle {\frac {1}{2}}\left(\{0,1\}+\{0,1\}\right)=\left\{0,{\tfrac {1}{2}},1\right\}}and its convex hull[0,1]{\displaystyle [0,1]} is only14{\displaystyle {\tfrac {1}{4}}}, which is half the distance12{\displaystyle {\tfrac {1}{2}}} between itssummand{0,1}{\displaystyle \{0,1\}} and its convex hull[0,1]{\displaystyle [0,1]}. As more sets are added together, the average of their sum "fills out" its convex hull: The maximum distance between the average and its convex hull approaches zero as the average includes more summands. This reduction in distance can be stated more formally as theShapley–Folkman theorem andShapley–Folkman–Starr theorem, as consequences of the Shapley–Folkman lemma.[10]

Preliminaries

[edit]

The Shapley–Folkman lemma depends upon the following definitions and results fromconvex geometry.

Real vector spaces

[edit]

Arealvector space of two dimensions can be given aCartesian coordinate system in which every point is identified by anordered pair of real numbers, called "coordinates", which are conventionally denoted byx{\displaystyle x} andy{\displaystyle y}. Two points in the Cartesian plane can beadded coordinate-wise:(x1,y1)+(x2,y2)=(x1+x2,y1+y2);{\displaystyle (x_{1},y_{1})+(x_{2},y_{2})=(x_{1}+x_{2},y_{1}+y_{2});}further, a point can bemultiplied by each real numberλ{\displaystyle \lambda } coordinate-wise:λ(x,y)=(λx,λy).{\displaystyle \lambda (x,y)=(\lambda x,\lambda y).}

More generally, any real vector space of (finite) dimensionD{\displaystyle D} can be viewed as theset of allD{\displaystyle D}-tuples ofD{\displaystyle D} real numbers(x1,x2,,xD){\displaystyle (x_{1},x_{2},\ldots ,x_{D})} on which two operations are defined:vector addition andmultiplication by a real number. For finite-dimensional vector spaces, the operations of vector addition and real-number multiplication can each be defined coordinate-wise, following the example of the Cartesian plane.[11]

Convex sets

[edit]
Illustration of a convex set, which looks somewhat like a disk: A (green) convex set contains the (black) line-segment joining the points x and y. The entire line-segment is a subset of the convex set.
In aconvex set Q, theline segment connecting any two of its points is a subset of Q.
Illustration of a green non-convex set, which looks somewhat like a boomerang or cashew nut. The black line-segment joins the points x and y of the green non-convex set. Part of the line segment is not contained in the green non-convex set.
In anon-convex set Q, a point in someline-segment joining two of its points is not a member of Q.
Line segments test whether a subset isconvex.

In a real vector space, anon-empty setQ{\displaystyle Q} is defined to beconvex if, for each pair of its points, every point on theline segment that joins them is still inQ{\displaystyle Q}. For example, a soliddisk {\displaystyle \bullet } is convex but acircle {\displaystyle \circ } is not, because it does not contain a line segment joining opposite points.[7] A solidcube is convex; however, anything that is hollow or dented, for example, acrescent shape, is non-convex. Theempty set is convex, either by definition[12] orvacuously.[13]

More formally, a setQ{\displaystyle Q} is convex if, for all pointsq1{\displaystyle q_{1}} andq2{\displaystyle q_{2}} inQ{\displaystyle Q} and for every real numberλ{\displaystyle \lambda } in theunit interval[0,1]{\displaystyle [0,1]}, the point(1λ)q1+λq2{\displaystyle (1-\lambda )q_{1}+\lambda q_{2}}is amember ofQ{\displaystyle Q}. Bymathematical induction, a setQ{\displaystyle Q} is convex if and only if everyconvex combination of members ofQ{\displaystyle Q} also belongs toQ{\displaystyle Q}. By definition, aconvex combination of indexed pointsv1,v2,vD{\displaystyle v_{1},v_{2},\ldots v_{D}} of a vector space is any weighted averageλ1v1+λ2v2++λDvD{\displaystyle \lambda _{1}v_{1}+\lambda _{2}v_{2}+\cdots +\lambda _{D}v_{D}} for indexed real numbersλ1,λ2,λD{\displaystyle \lambda _{1},\lambda _{2},\ldots \lambda _{D}} satisfying the equationλ1+λ2++λD=1{\displaystyle \lambda _{1}+\lambda _{2}+\cdots +\lambda _{D}=1}.[14]

The definition of a convex set implies that theintersection of two convex sets is a convex set. More generally, the intersection of a family of convex sets is a convex set. In particular, the intersection of twodisjoint sets is the empty set, which is convex.[12]

Convex hull

[edit]
A picture of a smoothed triangle, like a triangular (Mexican) tortilla-chip or a triangular road-sign. Each of the three rounded corners is drawn with a red curve. The remaining interior points of the triangular shape are shaded with blue.
In theconvex hull of the red set, each blue point is aconvex combination of some red points.

For every subsetQ{\displaystyle Q} of a real vector space, itsconvex hullconvQ{\displaystyle \operatorname {conv} Q} is theminimal convex set that containsQ{\displaystyle Q}. ThusconvQ{\displaystyle \operatorname {conv} Q} is the intersection of all the convex sets thatcoverQ{\displaystyle Q}.[8] The convex hull of a set can be equivalently defined to be the set of all convex combinations of points inQ{\displaystyle Q}.[15] For example, the convex hull of the set ofintegers{0,1}{\displaystyle \{0,1\}} is the closedinterval ofreal numbers[0,1]{\displaystyle [0,1]}, which has the maximum and minimum of the given set as its endpoints.[10] The convex hull of theunit circle is the closedunit disk, which contains the unit circle and its interior.[16]

Minkowski addition

[edit]
Three squares are shown in the non-negative quadrant of the Cartesian plane.
Minkowski addition of sets. The sum of the squaresQ1=[0,1]2{\displaystyle Q_{1}=[0,1]^{2}} andQ2=[1,2]2{\displaystyle Q_{2}=[1,2]^{2}} is the squareQ1+Q2=[1,3]2{\displaystyle Q_{1}+Q_{2}=[1,3]^{2}}.

In any vector space (or algebraic structure with addition),X{\displaystyle X}, theMinkowski sum of two non-empty setsA,BX{\displaystyle A,B\subseteq X} is defined to be the element-wise operation[17]A+B={x+yxA,yB}.{\displaystyle A+B=\{x+y\mid x\in A,y\in B\}.} For example,{0,1}+{0,1}={0+0,0+1,1+0,1+1}={0,1,2}.{\displaystyle {\begin{aligned}\{0,1\}+\{0,1\}&=\{0+0,0+1,1+0,1+1\}\\&=\{0,1,2\}.\end{aligned}}}This operation is clearly commutative and associative on the collection of non-empty sets. All such operations extend in a well-defined manner to recursive formsn=1NQn=Q1+Q2++QN.{\textstyle \sum _{n=1}^{N}Q_{n}=Q_{1}+Q_{2}+\ldots +Q_{N}.} By the principle of induction,[18]n=1NQn={n=1Nqn|qnQn, 1nN}.{\displaystyle \sum _{n=1}^{N}Q_{n}=\left\{\sum _{n=1}^{N}q_{n}\mathrel {\Bigg |} q_{n}\in Q_{n},~1\leq n\leq N\right\}.}

Convex hulls of Minkowski sums

[edit]

Minkowski addition behaves well with respect to taking convex hulls. Specifically, for all subsetsA,BX{\displaystyle A,B\subseteq X} of a real vector space,X{\displaystyle X}, theconvex hull of their Minkowski sum is the Minkowski sum of their convex hulls. That is,conv(A+B)=(convA)+(convB).{\displaystyle \operatorname {conv} (A+B)=(\operatorname {conv} A)+(\operatorname {conv} B).}And by induction it follows thatconvn=1NQn=n=1NconvQn{\displaystyle \operatorname {conv} \sum _{n=1}^{N}Q_{n}=\sum _{n=1}^{N}\operatorname {conv} Q_{n}}for anyNN{\displaystyle N\in \mathbb {N} } and non-empty subsetsQnX, 1nN{\displaystyle Q_{n}\in X,\ 1\leq n\leq N}.[19][20]

Statements of the three main results

[edit]

Notation

[edit]

In the following statements,D{\displaystyle D} andD{\displaystyle D} represent positive integers.D{\displaystyle D} is the dimension of the ambient spaceRD{\displaystyle \mathbb {R} ^{D}}.

Q1,,QN{\displaystyle Q_{1},\dots ,Q_{N}} represent nonempty, bounded subsets ofRD{\displaystyle \mathbb {R} ^{D}}. They are also called "summands".N{\displaystyle N} is the number of summands.

Q=n=1NQn{\displaystyle Q=\sum _{n=1}^{N}Q_{n}} denotes the Minkowski sum of the summands.

The variablex{\displaystyle x} is used to represent an arbitrary vector inconvQ{\displaystyle \operatorname {conv} Q}.

Shapley–Folkman lemma

[edit]

Because the convex hull and Minkowski sum operations can always be interchanged, asconvQ=n=1Nconv(Qn){\displaystyle \operatorname {conv} Q=\sum _{n=1}^{N}\operatorname {conv} (Q_{n})}, it follows that for everyxconvQ{\displaystyle x\in \operatorname {conv} Q}, there exist elementsqnconvQn{\displaystyle q_{n}\in \operatorname {conv} Q_{n}} such thatn=1Nqn=x{\displaystyle \sum _{n=1}^{N}q_{n}=x}. TheShapley–Folkman lemma refines this statement.[21]

Shapley–Folkman lemmaFor everyxconvQ{\displaystyle x\in \operatorname {conv} Q}, there exist elementsqn{\displaystyle q_{n}} such thatn=1Nqn=x{\displaystyle \sum _{n=1}^{N}q_{n}=x}, and such that at mostD{\displaystyle D} of these elementsqn{\displaystyle q_{n}} belong to(convQn)Qn{\displaystyle (\operatorname {conv} Q_{n})\setminus Q_{n}}, while each remaining elementqn{\displaystyle q_{n}} belongs toQn{\displaystyle Q_{n}}.[21]

For example, every point in[0,2]=[0,1]+[0,1]=conv{0,1}+conv{0,1}{\displaystyle [0,2]=[0,1]+[0,1]=\operatorname {conv} \{0,1\}+\operatorname {conv} \{0,1\}} is the sum of an element in{0,1}{\displaystyle \{0,1\}} and an element in[0,1]{\displaystyle [0,1]}.[10]

Shuffling indices if necessary, this means that every point inconvQ{\displaystyle \operatorname {conv} Q} can be decomposed as

x=n=1Dqn+n=D+1Nqn{\displaystyle x=\sum _{n=1}^{D}q_{n}+\sum _{n=D+1}^{N}q_{n}}

whereqnconvQn{\displaystyle q_{n}\in \operatorname {conv} Q_{n}} for1nD{\displaystyle 1\leq n\leq D} andqnQn{\displaystyle q_{n}\in \operatorname {Q} _{n}} forD+1nN{\displaystyle D+1\leq n\leq N}. Note that the reindexing depends on the pointx{\displaystyle x}.[22]

The lemma may be stated succinctly as

conv(n=1NQn)I{1,2,N}: |I|=D(nIconvQn+nIQn).{\displaystyle \operatorname {conv} \left(\sum _{n=1}^{N}Q_{n}\right)\subseteq \bigcup _{I\subseteq \{1,2,\ldots N\}:~|I|=D}\left(\sum _{n\in I}\operatorname {conv} Q_{n}+\sum _{n\notin I}Q_{n}\right).}

The converse of Shapley–Folkman lemma

[edit]

converse of Shapley–Folkman lemmaIf a vector space obeys the Shapley–Folkman lemma for a natural numberD{\displaystyle D}, and for no number less thanD{\displaystyle D}, then its dimension is finite, and exactlyD{\displaystyle D}.[23]

In particular, the Shapley–Folkman lemma requires the vector space to be finite-dimensional.[23]

Shapley–Folkman theorem

[edit]

Shapley and Folkman used their lemma to prove the following theorem, which quantifies the difference betweenQ{\displaystyle Q} andconvQ{\displaystyle \operatorname {conv} Q} usingHausdorff distance. Hausdorff distances measure how close two sets are. For two setsX{\displaystyle X} andY{\displaystyle Y}, the Hausdorff distance is, intuitively, the smallest amount by which each must be expanded to cover the other. More formally,if the distance from a pointx{\displaystyle x} to a setY{\displaystyle Y} is defined as theinfimum of pairwise distances,d(x,Y)=infyYxy,{\displaystyle d(x,Y)=\inf _{y\in Y}\|x-y\|,}then letXε{\displaystyle X_{\varepsilon }} denote the set of all points withindistanceε{\displaystyle \varepsilon } ofX{\displaystyle X}; equivalently this is closure of the Minkowski sum ofX{\displaystyle X} with a ball of radiusε{\displaystyle \varepsilon }. Then forXY{\displaystyle X\subset Y}, the Hausdorff distance is[24]dH(X,Y)=inf{εXεY}.{\displaystyle d_{\mathrm {H} }(X,Y)=\inf\{\varepsilon \mid X_{\varepsilon }\supset Y\}.}

The Shapley–Folkman theorem quantifies how close to convexityQ{\displaystyle Q} is by upper-bounding its Hausdorff distance toconvQ{\displaystyle \operatorname {conv} Q}, using the circumradii of its constituent sets. For anybounded setSRD,{\displaystyle S\subset \mathbb {R} ^{D},} define its circumradiusradS{\displaystyle \operatorname {rad} S} to be thesmallest radius of a ball containing it. More formally, lettingx{\displaystyle x} denote the center of the smallest enclosing ball, it can be defined as[25]radS=inf{εxRN:{x}εS}.{\displaystyle \operatorname {rad} S=\inf\{\varepsilon \mid \exists x\in \mathbb {R} ^{N}:\{x\}_{\varepsilon }\supset S\}.} With this notation in place, the Shapley–Folkman theorem can be stated as:

Shapley–Folkman theoremdH(Q,convQ)2  maxD(radQn)2.{\displaystyle d_{\mathrm {H} }(Q,\operatorname {conv} Q)^{2}~\leq ~\sum _{\max D}(\operatorname {rad} Q_{n})^{2}.}[26][27]

Here the notationmaxD{\textstyle \sum _{\max D}} means "the sum of theD{\displaystyle D} largest terms". This upper bound depends on the dimension of ambient space and the shapes of the summands, but not on the number of summands.[28]

Shapley–Folkman–Starr theorem

[edit]
A blue disk contains red points. A smaller green disk sits in the largest concavity in among these red points.
The circumradius (blue) and inner radius (green) of a point set (dark red, with its convex hull shown as the lighter red dashed lines). The inner radius is smaller than the circumradius except for subsets of a single circle, for which they are equal.

The Shapley–Folkman theorem can be strengthened by replacing the circumradius of the terms in a Minkowski by a smaller value, theinner radius, which intuitively measures the radius of the holes in a set rather than the radius of a set.Define the inner radiusr(S){\displaystyle r(S)} of a bounded subsetSRD{\displaystyle S\subset \mathbb {R} ^{D}} to be the infimum ofr{\displaystyle r} such that, for anyxconvS{\displaystyle x\in \operatorname {conv} S}, there exists a ballB{\displaystyle B} of radiusr{\displaystyle r} such thatxconv(SB){\displaystyle x\in \operatorname {conv} (S\cap B)}.[29]

Shapley–Folkman–Starr theoremdH(Q,convQ)2maxDr(Qn)2{\displaystyle d_{\mathrm {H} }(Q,\operatorname {conv} Q)^{2}\leq \sum _{\max D}r(Q_{n})^{2}}.[29][30]

Proofs

[edit]

There have been many proofs of these results, from the original,[29] to the laterArrow andHahn,[31]Cassels,[32] Schneider,[33] etc. An abstract and elegant proof byEkeland[34] has been extended by Artstein.[35] Different proofs have also appeared in unpublished papers.[2][36] An elementary proof of the Shapley–Folkman lemma can be found in the book byBertsekas, together with applications in estimating the duality gap in separable optimization problems and zero-sum games.[37]

Usual proofs of these results are nonconstructive: they establish only theexistence of the representation, but do not provide analgorithm for computing the representation. In 1981, Starr published aniterative algorithm for a less sharp version of the Shapley–Folkman–Starr theorem.[28]

Via Carathéodory's theorem

[edit]

The following proof of Shapley–Folkman lemma is fromZhou (1993). The proof idea is to lift the representation ofx{\displaystyle x} fromRD{\displaystyle \mathbb {R} ^{D}}toRD+N{\displaystyle \mathbb {R} ^{D+N}}, useCarathéodory's theorem for conic hulls, then drop back toRD{\displaystyle \mathbb {R} ^{D}}.[38]

Proof of Shapley–Folkman lemma

For eachn{\displaystyle n}, representqnconvQn{\displaystyle q_{n}\in \operatorname {conv} Q_{n}} asqn=k=1Kwn,kqn,k{\displaystyle q_{n}=\sum _{k=1}^{K}w_{n,k}q_{n,k}}, whereK{\displaystyle K} is a large finite number,wn,k0{\displaystyle w_{n,k}\geq 0}, andkwn,k=1{\displaystyle \sum _{k}w_{n,k}=1}.

Now "lift" the representationx=nkwn,kqn,k{\displaystyle x=\sum _{n}\sum _{k}w_{n,k}q_{n,k}} fromRD{\displaystyle \mathbb {R} ^{D}} toRD+N{\displaystyle \mathbb {R} ^{D+N}}. Definex¯=(x,1,...,1);q¯n,k=(qn,k,en){\displaystyle {\bar {x}}=(x,1,...,1);\quad {\bar {q}}_{n,k}=(q_{n,k},e_{n})}whereen{\displaystyle e_{n}} is the vector inRN{\displaystyle \mathbb {R} ^{N}} that has 1 at coordinaten{\displaystyle n}, and 0 at all other coordinates.

With this, we have a lifted representation

x¯=nkwn,kq¯n,k.{\displaystyle {\bar {x}}=\sum _{n}\sum _{k}w_{n,k}{\bar {q}}_{n,k}.}That is,x¯{\displaystyle {\bar {x}}} is in the conic hull of{q¯n,k}n1:N,k1:K{\displaystyle \{{\bar {q}}_{n,k}\}_{n\in 1:N,k\in 1:K}}.

By Carathéodory's theorem for conic hulls, we have an alternative representation

x¯=nkwn,kq¯n,k{\displaystyle {\bar {x}}=\sum _{n}\sum _{k}w'_{n,k}{\bar {q}}_{n,k}}such thatwn,k0{\displaystyle w'_{n,k}\geq 0}, and at mostN+D{\displaystyle N+D} of them are nonzero. Since we defined

x¯=(x,1,...,1);q¯n,k=(qn,k,en){\displaystyle {\bar {x}}=(x,1,...,1);\quad {\bar {q}}_{n,k}=(q_{n,k},e_{n})}this alternative representation is also a representation forx{\displaystyle x}.

We argue that for anyn01:N{\displaystyle n_{0}\in 1:N}, there must be at least one value ofk{\displaystyle k} for whichwn0,k{\displaystyle w'_{n_{0},k}} is nonzero. Remember that we defined(x¯)D+n0{\displaystyle ({\bar {x}})_{D+n_{0}}}, the(D+n0){\displaystyle (D+n_{0})} entry ofx¯{\displaystyle {\bar {x}}}, to be1{\displaystyle 1}. At the same time, from the lifted representation ofx¯{\displaystyle {\bar {x}}},(x¯)D+n0=(nkwn,kq¯n,k)D+n0{\displaystyle ({\bar {x}})_{D+n_{0}}=\left(\sum _{n}\sum _{k}w'_{n,k}{\bar {q}}_{n,k}\right)_{D+n_{0}}}=nkwn,k(q¯n,k)D+n0.{\displaystyle =\sum _{n}\sum _{k}w'_{n,k}({\bar {q}}_{n,k})_{D+n_{0}}.}We drop all terms on the r.h.s. for whichnn0{\displaystyle n\neq n_{0}} since they are zero. The remaining terms take the formwn0,k(q¯n0,k)D+n0=wn0,k{\displaystyle w'_{n_{0},k}({\bar {q}}_{n_{0},k})_{D+n_{0}}=w'_{n_{0},k}}, so we find the equation1=kwn0,k.{\displaystyle 1=\sum _{k}w'_{n_{0},k}.}It follows that there is at least one element of the sum on the r.h.s. that is non-zero.

Combining the fact that for each value ofn{\displaystyle n} there is a non-zerown,k{\displaystyle w_{n,k}'}, together with the fact that there are at mostN+D{\displaystyle N+D} ofwn,k{\displaystyle w_{n,k}'} that are nonzero, we conclude that there can only be at mostD{\displaystyle D} elements ofn1:N{\displaystyle n\in 1:N} for which there are at least two ofwn,k{\displaystyle w_{n,k}'} that are nonzero.

Thus we obtain a representation

x¯=n(kwn,kq¯n,k){\displaystyle {\bar {x}}=\sum _{n}\left(\sum _{k}w'_{n,k}{\bar {q}}_{n,k}\right)}where for at mostD{\displaystyle D} ofn{\displaystyle n}, the termkwn,kq¯n,k{\displaystyle \sum _{k}w'_{n,k}{\bar {q}}_{n,k}} is not inQn{\displaystyle Q_{n}}.

Probabilistic

[edit]

The following "probabilistic" proof of Shapley–Folkman–Starr theorem is fromCassels (1975).[32]

We can interpretconvS{\displaystyle \operatorname {conv} S} in probabilistic terms:xconvS{\displaystyle \forall x\in \operatorname {conv} S}, sincex=wnqn{\displaystyle x=\sum w_{n}q_{n}} for someqnS{\displaystyle q_{n}\in S}, we can define a random vectorX{\displaystyle X}, finitely supported inS{\displaystyle S}, such thatPr(X=qn)=wn{\displaystyle Pr(X=q_{n})=w_{n}}, andx=E[X]{\displaystyle x=\mathbb {E} [X]}.

Then, it is natural to consider the "variance" of a setS{\displaystyle S} asVar(S):=supxconvSinfE[X]=x,X is finitely supported in SVar[X]{\displaystyle Var(S):=\sup _{x\in \operatorname {conv} S}\inf _{\mathbb {E} [X]=x,X{\text{ is finitely supported in }}S}Var[X]}With that,d(S,conv(S))2Var(S)r(S)rad(S){\displaystyle d(S,\operatorname {conv} (S))^{2}\leq Var(S)\leq r(S)\leq rad(S)}.

Proof

d(S,conv(S))2Var(S){\displaystyle d(S,\operatorname {conv} (S))^{2}\leq Var(S)}: Expand their definitions.

Var(S)rad(S){\displaystyle Var(S)\leq rad(S)}: ifxconv(S){\displaystyle x\in \operatorname {conv} (S)} then letX{\displaystyle X} be finitely supported inS{\displaystyle S} such thatE[X]=x{\displaystyle E[X]=x}. Now sinceS{\displaystyle S} is bounded in a ball of radiusrad(S)+ϵ{\displaystyle rad(S)+\epsilon } centered at someo{\displaystyle o}, we haveVar[X]=Var[Xo]E[Xo2]rad(S)+ϵ{\displaystyle Var[X]=Var[X-o]\leq E[\|X-o\|^{2}]\leq rad(S)+\epsilon }.

Var(S)r(S){\displaystyle Var(S)\leq r(S)}: use the previous result.

Proof of Shapley–Folkman–Starr theorem

It suffices to showVar(Q)maxDVar(Qn){\displaystyle Var\left(Q\right)\leq \sum _{\max D}Var(Q_{n})}.

xconvQ{\displaystyle \forall x\in \operatorname {conv} Q}, by Shapley–Folkman lemma, there exists a representationx=nIxn+nJqn{\displaystyle x=\sum _{n\in I}x_{n}+\sum _{n\in J}q_{n}}, such thatI,J{\displaystyle I,J} partitions{1,2,...,N},|I|D,xnconvQn,qnQn{\displaystyle \{1,2,...,N\},|I|\leq D,x_{n}\in \operatorname {conv} Q_{n},q_{n}\in Q_{n}}.

Now, for eachnI{\displaystyle n\in I}, construct random vectorsXn{\displaystyle X_{n}} such thatXn{\displaystyle X_{n}} is finitely supported onQn{\displaystyle Q_{n}}, withE[Xn]=xn{\displaystyle \mathbb {E} [X_{n}]=x_{n}} andVar[Xn]<Var[Qn]+ϵ{\displaystyle Var[X_{n}]<Var[Q_{n}]+\epsilon }, whereϵ>0{\displaystyle \epsilon >0} is an arbitrary small number.

Let all suchXn{\displaystyle X_{n}} be independent. Then letX=nIXn+nJqn{\displaystyle X=\sum _{n\in I}X_{n}+\sum _{n\in J}q_{n}}. Since eachqn{\displaystyle q_{n}} is a deterministic vector, we have

Var[X]=Var[nIXn]=nIVar[Xn]nI(Var(Qn)+ϵ)maxDVar(Qn)+Dϵ.{\displaystyle Var[X]=Var\left[\sum _{n\in I}X_{n}\right]=\sum _{n\in I}Var\left[X_{n}\right]\leq \sum _{n\in I}\left(Var(Q_{n})+\epsilon \right)\leq \sum _{\max D}Var(Q_{n})+D\epsilon .}

Since this is true for arbitraryϵ>0{\displaystyle \epsilon >0}, we haveVar[X]maxDVar(Qn){\displaystyle Var[X]\leq \sum _{\max D}Var(Q_{n})}, and we are done.

History

[edit]
Picture of Lloyd Shapley
A Winner of the 2012 Nobel Award in Economics,Lloyd Shapley proved the Shapley–Folkman lemma withJon Folkman.[1]

The lemma ofLloyd Shapley andJon Folkman was first[38] published by the economistRoss M. Starr, who was investigating the existence ofeconomic equilibria while studying withKenneth Arrow. In his paper, Starr studied aconvexified economy, in which non-convex sets were replaced by their convex hulls; Starr proved that the convexified economy has equilibria that are closely approximated by "quasi-equilibria" of the original economy; moreover, he proved that every quasi-equilibrium has many of the optimal properties of true equilibria, which are proved to exist for convex economies.[1]

Following Starr's 1969 paper, the Shapley–Folkman–Starr results have been widely used to show that central results of (convex) economic theory are good approximations to largeeconomies with non-convexities; for example, quasi-equilibria closely approximate equilibria of a convexified economy. "The derivation of these results in general form has been one of the major achievements of postwar economic theory", wroteRoger Guesnerie.[39]

Applications

[edit]

The Shapley–Folkman lemma enables researchers to extend results for Minkowski sums of convex sets to sums of general sets, which need not be convex. Such sums of sets arise ineconomics, inmathematical optimization, and inprobability theory; in each of these three mathematical sciences, non-convexity is an important feature of applications.

Economics

[edit]
The nonnegative quadrant of the Cartesian plane appears. A blue straight-line slopes downward as a secant joining two points, one on each of the axes. This blue line is tangent to a red curve that touches it at a marked point, whose coordinates are labeled Qx and Qy.
The consumerprefers every basket of goods on theindifference curve I3 over each basket on  I2. The basket (QxQy), where the budget line (shown in blue)supports I2, is optimal and also feasible, unlike any basket lying on  I3 which is preferred but unfeasible.
See also:Convexity in economics

Ineconomics, a consumer'spreferences are defined over all "baskets" of goods. Each basket is represented as a non-negative vector, whose coordinates represent the quantities of the goods. On this set of baskets, anindifference curve is defined for each consumer; a consumer's indifference curve contains all the baskets of commodities that the consumer regards as equivalent: That is, for every pair of baskets on the same indifference curve, the consumer does not prefer one basket over another. Through each basket of commodities passes one indifference curve. A consumer'spreference set (relative to an indifference curve) is theunion of the indifference curve and all the commodity baskets that the consumer prefers over the indifference curve. A consumer'spreferences areconvex if all such preference sets are convex.[40]

An optimal basket of goods occurs where the budget-linesupports a consumer's preference set, as shown in the diagram. This means that an optimal basket is on the highest possible indifference curve given the budget-line, which is defined in terms of a price vector and the consumer's income (endowment vector). Thus, the set of optimal baskets is afunction of the prices, and this function is called the consumer'sdemand. If the preference set is convex, then at every price the consumer's demand is a convex set, for example, a unique optimal basket or a line-segment of baskets.[41]

Non-convex preferences

[edit]
Image of a non-convex preference set with a concavity un-supported by the budget line
When the consumer's preferences have concavities, the consumer may jump between two separate optimal baskets.
See also:Non-convexity (economics)

However, if a preference set isnon-convex, then some prices determine a budget-line that supports twoseparate optimal-baskets. For example, we can imagine that, for zoos, a lion costs as much as an eagle, and further that a zoo's budget suffices for one eagle or one lion. We can suppose also that a zoo-keeper views either animal as equally valuable. In this case, the zoo would purchase either one lion or one eagle. Of course, a contemporary zoo-keeper does not want to purchase half of an eagle and half of a lion (or agriffin)! Thus, the zoo-keeper's preferences are non-convex: The zoo-keeper prefers having either animal to having any strictly convex combination of both.[42]

When the consumer's preference set is non-convex, then (for some prices) the consumer's demand is notconnected; a disconnected demand implies some discontinuous behavior by the consumer, as discussed byHarold Hotelling:

If indifference curves for purchases be thought of as possessing a wavy character, convex to the origin in some regions and concave in others, we are forced to the conclusion that it is only the portions convex to the origin that can be regarded as possessing any importance, since the others are essentially unobservable. They can be detected only by the discontinuities that may occur in demand with variation in price-ratios, leading to an abrupt jumping of a point of tangency across a chasm when the straight line is rotated. But, while such discontinuities may reveal the existence of chasms, they can never measure their depth. The concave portions of the indifference curves and their many-dimensional generalizations, if they exist, must forever remain in unmeasurable obscurity.[43]

The difficulties of studying non-convex preferences were emphasized byHerman Wold[44] and again byPaul Samuelson, who wrote that non-convexities are "shrouded in eternal darkness",[45][a] according to Diewert.[46]

Nonetheless, non-convex preferences were illuminated from 1959 to 1961 by a sequence of papers inThe Journal of Political Economy (JPE). The main contributors were Farrell,[47] Bator,[48]Koopmans,[49] and Rothenberg.[50] In particular, Rothenberg's paper discussed the approximate convexity of sums of non-convex sets.[51] TheseJPE-papers stimulated a paper byLloyd Shapley andMartin Shubik, which considered convexified consumer-preferences and introduced the concept of an "approximate equilibrium".[52] TheJPE-papers and the Shapley–Shubik paper influenced another notion of "quasi-equilibria", due toRobert Aumann.[53][54]

Starr's 1969 paper and contemporary economics

[edit]
Picture of Kenneth Arrow
Kenneth Arrow (1972Nobel laureate) helpedRoss M. Starr to studynon-convexeconomies.[55]

Previous publications onnon-convexity and economics were collected in an annotated bibliography byKenneth Arrow. He gave the bibliography toStarr, who was then an undergraduate enrolled in Arrow's (graduate) advanced mathematical-economics course.[55] In his term-paper, Starr studied the general equilibria of an artificial economy in which non-convex preferences were replaced by their convex hulls. In the convexified economy, at each price, theaggregate demand was the sum of convex hulls of the consumers' demands. Starr's ideas interested the mathematiciansLloyd Shapley andJon Folkman, who proved theireponymous lemma and theorem in "private correspondence", which was reported by Starr's published paper of 1969.[1]

In his 1969 publication, Starr applied the Shapley–Folkman–Starr theorem. Starr proved that the "convexified" economy has general equilibria that can be closely approximated by "quasi-equilibria" of the original economy, when the number of agents exceeds the dimension of the goods: Concretely, Starr proved that there exists at least one quasi-equilibrium of pricespopt{\displaystyle p_{\mathrm {opt} }} with the following properties:

  • For each quasi-equilibrium's pricespopt{\displaystyle p_{\mathrm {opt} }}, all consumers can choose optimal baskets (maximally preferred and meeting their budget constraints).
  • At quasi-equilibrium pricespopt{\displaystyle p_{\mathrm {opt} }} in the convexified economy, every good's market is in equilibrium: Its supply equals its demand.
  • For each quasi-equilibrium, the prices "nearly clear" the markets for the original economy: anupper bound on thedistance between the set of equilibria of the "convexified" economy and the set of quasi-equilibria of the original economy followed from Starr's corollary to the Shapley–Folkman theorem.[56]

Starr established that

"in the aggregate, the discrepancy between an allocation in the fictitious economy generated by [taking the convex hulls of all of the consumption and production sets] and some allocation in the real economy is bounded in a way that is independent of the number of economic agents. Therefore, the average agent experiences a deviation from intended actions that vanishes in significance as the number of agents goes to infinity".[57]

Following Starr's 1969 paper, the Shapley–Folkman–Starr results have been widely used in economic theory.Roger Guesnerie summarized their economic implications: "Some key results obtained under the convexity assumption remain (approximately) relevant in circumstances where convexity fails. For example, in economies with a large consumption side, preference nonconvexities do not destroy the standard results".[58] "The derivation of these results in general form has been one of the major achievements of postwar economic theory", wrote Guesnerie.[39] The topic ofnon-convex sets in economics has been studied by manyNobel laureates: Arrow (1972),Robert Aumann (2005),Gérard Debreu (1983),Tjalling Koopmans (1975),Paul Krugman (2008), andPaul Samuelson (1970); the complementary topic ofconvex sets in economics has been emphasized by these laureates, along withLeonid Hurwicz,Leonid Kantorovich (1975), andRobert Solow (1987).[59] The Shapley–Folkman–Starr results have been featured in the economics literature: inmicroeconomics,[60] in general-equilibrium theory,[61] inpublic economics[62] (includingmarket failures),[63] as well as ingame theory,[64] inmathematical economics,[65] and inapplied mathematics (for economists).[66][67] The Shapley–Folkman–Starr results have also influenced economics research usingmeasure andintegration theory.[68]

Mathematical optimization

[edit]
A graph of a convex function, which is drawn in black. Its epigraph, the area above its graph, is solid green.
Afunction isconvex if the region above itsgraph is aconvex set.

The Shapley–Folkman lemma has been used to explain why largeminimization problems withnon-convexities can be nearly solved (withiterative methods whose convergence proofs are stated for onlyconvex problems). The Shapley–Folkman lemma has encouraged the use of methods of convex minimization on other applications with sums of many functions.[69]

Preliminaries of optimization theory

[edit]

Nonlinear optimization relies on the following definitions forfunctions:

  • Thegraph of a function f is the set of the pairs ofarguments x and function evaluations f(x)
Graph(f) ={(xf(x))}
A graph of the sine function, which periodically oscillates up and down between −1 and +1, with the period 2π.
Thesine function isnon-convex.
Epi(f) ={ (xu) : f(x) ≤ u }.

For example, thequadratic function f(x) = x2 is convex, as is theabsolute value function g(x) = |x|. However, thesine function (pictured) is non-convex on theinterval (0, π).

Additive optimization problems

[edit]

In many optimization problems, theobjective function f isseparable: that is,f is the sum ofmany summand-functions, each of which has its own argument:f(x)=f((x1,,xN))=fn(xn).{\displaystyle f(x)=f{\bigl (}(x_{1},\dots ,x_{N}){\bigr )}=\sum f_{n}(x_{n}).}

For example, problems oflinear optimization are separable. Given a separable problem with an optimal solution, fix an optimal solutionxmin=(x1,,xN)min{\displaystyle x_{\min }=(x_{1},\dots ,x_{N})_{\min }}with the minimum valuef(xmin){\displaystyle f(x_{\min })}. For this separable problem, consider an optimal solution(xmin,f(xmin)){\displaystyle {\bigl (}x_{\min },f(x_{\min }){\bigr )}} to theconvexified problem, where convex hulls are taken of the graphs of the summand functions. Such an optimal solution is thelimit of a sequence[b] of points in the convexified problem(xj,f(xj))convGraph(fn).{\displaystyle {\bigl (}x_{j},f(x_{j}){\bigr )}\in \sum \operatorname {conv} \operatorname {Graph} (f_{n}).}The given optimal-point is a sum of points in the graphs of the original summands and of a small number of convexified summands, by the Shapley–Folkman lemma.[4]

This analysis was published byIvar Ekeland in 1974 to explain the apparent convexity of separable problems with many summands, despite the non-convexity of the summand problems. In 1973, the young mathematicianClaude Lemaréchal was surprised by his success withconvex minimizationmethods on problems that were known to be non-convex; forminimizing nonlinear problems, a solution of thedual problem need not provide useful information for solving the primal problem, unless the primal problem be convex and satisfy aconstraint qualification. Lemaréchal's problem was additively separable, and each summand function was non-convex; nonetheless, a solution to the dual problem provided a close approximation to the primal problem's optimal value.[72][4][73] Ekeland's analysis explained the success of methods of convex minimization onlarge andseparable problems, despite the non-convexities of the summand functions. Ekeland and later authors argued that additive separability produced an approximately convex aggregate problem, even though the summand functions were non-convex. The crucial step in these publications is the use of the Shapley–Folkman lemma.[4][73][74][c] The Shapley–Folkman lemma has encouraged the use of methods of convex minimization on other applications with sums of many functions.[4][5][66][69]

Probability and measure theory

[edit]

Convex sets are often studied withprobability theory. Each point in the convex hull of a (non-empty) subset Q of a finite-dimensional space is theexpected value of asimplerandom vector that takes its values in Q, as a consequence ofCarathéodory's lemma. Thus, for a non-empty set Q, the collection of the expected values of the simple,Q-valued random vectors equals Q's convex hull; this equality implies that the Shapley–Folkman–Starr results are useful in probability theory.[77] In the other direction, probability theory provides tools to examine convex sets generally and the Shapley–Folkman–Starr results specifically.[78] The Shapley–Folkman–Starr results have been widely used in theprobabilistic theory of random sets,[79] for example, to prove alaw of large numbers,[6][80] acentral limit theorem,[80][81] and alarge-deviations principle.[82] These proofs ofprobabilistic limit theorems used the Shapley–Folkman–Starr results to avoid the assumption that all the random sets be convex.

Aprobability measure is a finitemeasure, and the Shapley–Folkman lemma has applications in non-probabilistic measure theory, such as the theories ofvolume and ofvector measures. The Shapley–Folkman lemma enables a refinement of theBrunn–Minkowski inequality, which bounds the volume of sums in terms of the volumes of their summand-sets.[83] The volume of a set is defined in terms of theLebesgue measure, which is defined on subsets ofEuclidean space. In advanced measure-theory, the Shapley–Folkman lemma has been used to proveLyapunov's theorem, which states that therange of avector measure is convex.[84] Here, the traditional term "range" (alternatively, "image") is the set of values produced by the function. Avector measure is a vector-valued generalization of a measure; for example, if p1 and p2 areprobability measures defined on the samemeasurable space, then theproduct function p1 p2 is a vector measure, where p1 p2 is defined for everyevent ω by

(p1 p2)(ω)=(p1(ω), p2(ω)).

Lyapunov's theorem has been used ineconomics,[53][85] in ("bang-bang")control theory, and instatistical theory.[86] Lyapunov's theorem has been called acontinuous counterpart of the Shapley–Folkman lemma,[3] which has itself been called adiscrete analogue of Lyapunov's theorem.[87]

Notes

[edit]
  1. ^"Eternal darkness" describes the Hell ofJohn Milton'sParadise Lost, whose concavity is compared to theSerbonian Bog inBook II, lines 592–594:

    A gulf profound as that Serbonian Bog
    Betwixt Damiata and Mount Casius old,
    Where Armies whole have sunk.

    Milton's description of concavity serves as theliterary epigraph prefacing chapter seven ofArrow & Hahn (1980, p. 169), "Markets with non-convex preferences and production", which presents the results ofStarr (1969).
  2. ^Thelimit of a sequence is a member of theclosure of the original set, which is the smallestclosed set that contains the original set. The Minkowski sum of twoclosed sets need not be closed, so the followinginclusion (wherecl{\displaystyle \operatorname {cl} } denotes thetopological closure operator) can be strict:clP+clQcl(clP+clQ).{\displaystyle \operatorname {cl} P+\operatorname {cl} Q\subseteq \operatorname {cl} (\operatorname {cl} P+\operatorname {cl} Q).}The inclusion can be strict even for twoconvex closed summand-sets.[71] Ensuring that the Minkowski sum of sets be closed requires the closure operation, which appends limits of convergent sequences.
  3. ^Aubin and Ekeland also considered theconvex closure of a problem of non-convex minimization—that is, the problem defined as theclosedconvexhull of theepigraph of the original problem.[75] Their study of duality gaps was extended by Di Guglielmo to thequasiconvex closure of a non-convexminimization problem—that is, the problem defined as theclosedconvexhull of thelowerlevel sets.[76]

Footnotes

[edit]
  1. ^abcdeStarr (1969)
  2. ^abHowe (1979, p. 1)
  3. ^abcStarr (2008)
  4. ^abcdeEkeland (1999, pp. 357–359): Published in the first English edition of 1976, Ekeland's appendix proves the Shapley–Folkman lemma, also acknowledgingLemaréchal's experiments on page 373.
  5. ^abBertsekas (1996, pp. 364–381) acknowledgingEkeland (1999) on page 374 andAubin & Ekeland (1976) on page 381:

    Bertsekas (1996, pp. 364–381) describes an application ofLagrangian dual methods to thescheduling ofelectrical power plants ("unit commitment problems"), where non-convexity appears because ofinteger constraints:

    Bertsekas et al. (1983)

  6. ^abArtstein & Vitale (1975, pp. 881–882)
  7. ^abTakayama (1985), p. 20.
  8. ^abRockafellar (1997), p. 12.
  9. ^Schneider (1993), pp. 126–196, Chapter 3: Minkowski addition.
  10. ^abcdCarter (2001), p. 94.
  11. ^Arrow & Hahn (1980, p. 375)
  12. ^abRockafellar (1997), p. 10.
  13. ^Blumson (2019).
  14. ^Arrow & Hahn (1980, p. 376),Rockafellar (1997, pp. 10–11), andGreen & Heller (1981, p. 37)
  15. ^Arrow & Hahn (1980, p. 385) andRockafellar (1997, pp. 11–12)
  16. ^Boules (2021), p. 83.
  17. ^Schneider (1993, p. xi) andRockafellar (1997, p. 16)
  18. ^Rockafellar (1997, p. 17) andStarr (1997, p. 78)
  19. ^Schneider (1993, pp. 2–3)
  20. ^Arrow & Hahn (1980, p. 387)
  21. ^abCarter (2001), p. 93.
  22. ^Starr (1969, pp. 35–36)
  23. ^abSchneider (1993, p. 140) credits this result toBorwein & O'Brien (1978)
  24. ^Papadopoulos (2005, pp. 105–110, 4.1 The Hausdorff distance). The formula for Hausdorff distance is simplified here using the assumption thatXY{\displaystyle X\subset Y} to avoid requiring also thatXYε{\displaystyle X\subset Y_{\varepsilon }}.
  25. ^Balestro, Martini & Teixeira (2024), p. 151.
  26. ^Starr (1969, p. 36)
  27. ^Schneider (1993, p. 129)
  28. ^abStarr (1981).
  29. ^abcStarr (1969, p. 37)
  30. ^Schneider (1993, pp. 129–130)
  31. ^Arrow & Hahn (1980, pp. 392–395)
  32. ^abCassels (1975, pp. 435–436)
  33. ^Schneider (1993, p. 128)
  34. ^Ekeland (1999, pp. 357–359)
  35. ^Artstein (1980, p. 180)
  36. ^Anderson (2005).
  37. ^Bertsekas (2009).
  38. ^abZhou (1993).
  39. ^abGuesnerie (1989, p. 138)
  40. ^Mas-Colell (1985, pp. 58–61) andArrow & Hahn (1980, pp. 76–79)
  41. ^Arrow & Hahn (1980, pp. 79–81)
  42. ^Starr (1969, p. 26): "After all,one may be indifferent between an automobile and a boat, but in most cases one can neither drive nor sail the combination of half boat, half car."
  43. ^Hotelling (1935, p. 74)
  44. ^Wold (1943b, pp. 231 and 239–240) andWold (1953, p. 146)
  45. ^Samuelson (1950, pp. 359–360):

    It will be noted that any point where the indifference curves are convex rather than concave cannot be observed in a competitive market. Such points are shrouded in eternal darkness—unless we make our consumer a monopsonist and let him choose between goods lying on a very convex "budget curve" (along which he is affecting the price of what he buys). In this monopsony case, we could still deduce the slope of the man's indifference curve from the slope of the observed constraint at the equilibrium point.

  46. ^Diewert (1982, pp. 552–553)
  47. ^Farrell (1959,1961a,1961b)
  48. ^Bator (1961a,1961b)
  49. ^Koopmans (1961, p. 478) and others—for example,Farrell (1959, pp. 390–391) andFarrell (1961a, p. 484),Bator (1961a, pp. 482–483),Rothenberg (1960, p. 438), andStarr (1969, p. 26)—commented onKoopmans (1957, pp. 1–126, especially 9–16 [1.3 Summation of opportunity sets], 23–35 [1.6 Convex sets and the price implications of optimality], and 35–37 [1.7 The role of convexity assumptions in the analysis])
  50. ^Rothenberg (1960, p. 447,1961)
  51. ^Arrow & Hahn (1980, p. 182)
  52. ^Shapley & Shubik (1966, p. 806)
  53. ^abAumann (1966, pp. 1–2) uses results from Aumann (1964,1965)
  54. ^Taking the convex hull of non-convex preferences had been discussed earlier byWold (1943b, p. 243) and byWold (1953, p. 146), according toDiewert (1982, p. 552).
  55. ^abStarr & Stinchcombe (1999, pp. 217–218)
  56. ^Arrow & Hahn (1980, pp. 169–182) andStarr (1969, pp. 27–33)
  57. ^Green & Heller (1981, p. 44)
  58. ^Guesnerie (1989, pp. 99)
  59. ^Mas-Colell (1987)
  60. ^Varian (1992, pp. 393–394)

    Mas-Colell, Whinston & Green (1995, pp. 627–630)

  61. ^Arrow & Hahn (1980, pp. 169–182)

    Mas-Colell (1985, pp. 52–55, 145–146, 152–153, and 274–275)

    Hildenbrand (1974, pp. 37, 115–116, 122, and 168)

    Starr (1997, p. 169)

    Ellickson (1994, pp. xviii, 306–310, 312, 328–329, 347, and 352)

  62. ^Laffont (1988).
  63. ^Salanié (2000, pp. 112–113 and 107–115)
  64. ^Ichiishi (1983, pp. 24–25)
  65. ^Cassels (1981, pp. 127 and 33–34)
  66. ^abAubin (2007, pp. 458–476)
  67. ^Carter (2001, pp. 93–94, 143, 318–319, 375–377, and 416)
  68. ^Trockel (1984, p. 30)
  69. ^abBertsekas (1999, p. 496)
  70. ^Rockafellar (1997, p. 23)
  71. ^Rockafellar (1997), pp. 49 and 75.
  72. ^Lemaréchal (1973, p. 38)Lemaréchal's experiments were discussed in later publications:

    Aardal (1995, pp. 2–3)

    Hiriart-Urruty & Lemaréchal (1993, pp. 143–145, 151, 153, and 156)

  73. ^abEkeland (1974)
  74. ^Aubin & Ekeland (1976, pp. 226, 233, 235, 238, and 241)
  75. ^Aubin & Ekeland (1976) andEkeland (1999, pp. 362–364)
  76. ^Di Guglielmo (1977, pp. 287–288)
  77. ^Schneider & Weil (2008, p. 45)
  78. ^Cassels (1975, pp. 433–434)
  79. ^Molchanov (2005, pp. 195–198, 218, 232, 237–238 and 407)
  80. ^abPuri & Ralescu (1985, pp. 154–155)
  81. ^Weil (1982, pp. 203, and 205–206)
  82. ^Cerf (1999, pp. 243–244) uses applications of the Shapley–Folkman lemma fromPuri & Ralescu (1985, pp. 154–155).
  83. ^Ruzsa (1997, p. 345)
  84. ^Tardella (1990, pp. 478–479)
  85. ^Vind (1964, pp. 168 and 175) was noted by the winner of the 1983Nobel Prize in Economics,Gérard Debreu.Debreu (1991, p. 4) wrote:

    The concept of a convex set (i.e., a set containing the segment connecting any two of its points) had repeatedly been placed at the center of economic theory before 1964. It appeared in a new light with the introduction of integration theory in the study of economic competition: If one associates with every agent of an economy an arbitrary set in the commodity space andif one averages those individual sets over a collection of insignificant agents,then the resulting set is necessarily convex. [Debreu appends this footnote: "On this direct consequence of a theorem of A. A. Lyapunov, seeVind (1964)."] But explanations of the ... functions of prices ... can be made to rest on theconvexity of sets derived by that averaging process.Convexity in the commodity spaceobtained by aggregation over a collection of insignificant agents is an insight that economic theory owes ... to integration theory. [Italics added]

  86. ^Artstein (1980, pp. 172–183)
  87. ^Mas-Colell (1978, p. 210)

References

[edit]

External links

[edit]
Euclidean
geometry
Fundamental Concepts (Euclidean)
Non-Euclidean
geometry
Based on Methods or Structures
Topology
Lists
Glossaries
Miscellaneous
Related
Supply and demand
Markets andFirms
Decision theory andbehavioural economics
Welfare economics
Econometrics andmathematical economics
Other subfields
See also
Basic concepts
Topics (list)
Maps
Mainresults (list)
Sets
Series
Duality
Applications and related

Retrieved from "https://en.wikipedia.org/w/index.php?title=Shapley–Folkman_lemma&oldid=1298729813"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp