Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Power law

From Wikipedia, the free encyclopedia
Functional relationship between two quantities
Not to be confused withForce (law).For other uses, seePower (disambiguation).
"Scaling law" redirects here. For statistical laws of scaling deep learning models, seeNeural scaling law.
An example power-law graph that demonstrates ranking of popularity. To the right is thelong tail, and to the left are the few that dominate (also known as the80–20 rule).

Instatistics, apower law is afunctional relationship between two quantities, where arelative change in one quantity results in a relative change in the other quantity proportional to the change raised to a constantexponent: one quantity varies as a power of another. The change is independent of the initial size of those quantities.

For instance, the area of a square has a power law relationship with the length of its side, since if the length is doubled, the area is multiplied by 22, while if the length is tripled, the area is multiplied by 32, and so on.[1]

Empirical examples

[edit]

The distributions of a wide variety of physical, biological, and human-made phenomena approximately follow a power law over a wide range of magnitudes: these include the sizes of craters on themoon and ofsolar flares,[2] cloud sizes,[3] the foraging pattern of various species,[4] the sizes of activity patterns of neuronal populations,[5] the frequencies ofwords in most languages, frequencies offamily names, thespecies richness inclades of organisms,[6] the sizes ofpower outages, volcanic eruptions,[7] human judgments of stimulus intensity[8][9] and many other quantities.[10] Empirical distributions can only fit a power law for a limited range of values, because a pure power law would allow for arbitrarily large or small values.Acoustic attenuation follows frequency power-laws within wide frequency bands for many complex media. Allometric scaling laws for relationships between biological variables are among the best known power-law functions in nature.

Properties

[edit]

Statistical incompleteness

[edit]

The power-law model does not obey the treasured paradigm of statistical completeness. Especially probability bounds, the suspected cause of typical bending and/or flattening phenomena in the high- and low-frequency graphical segments, are parametrically absent in the standard model.[11]

Scale invariance

[edit]

One attribute of power laws is theirscale invariance. Given a relationf(x)=axk{\displaystyle f(x)=ax^{-k}}, scaling the argumentx{\displaystyle x} by a constant factorc{\displaystyle c} causes only a proportionate scaling of the function itself. That is,

f(cx)=a(cx)k=ckf(x)f(x),{\displaystyle f(cx)=a(cx)^{-k}=c^{-k}f(x)\propto f(x),\!}

where{\displaystyle \propto } denotesdirect proportionality. That is, scaling by a constantc{\displaystyle c} simply multiplies the original power-law relation by the constantck{\displaystyle c^{-k}}. Thus, it follows that all power laws with a particular scaling exponent are equivalent up to constant factors, since each is simply a scaled version of the others. This behavior is what produces the linear relationship when logarithms are taken of bothf(x){\displaystyle f(x)} andx{\displaystyle x}, and the straight-line on thelog–log plot is often called thesignature of a power law. With real data, such straightness is a necessary, but not sufficient, condition for the data following a power-law relation. In fact, there are many ways to generate finite amounts of data that mimic this signature behavior, but, in their asymptotic limit, are not true power laws.[citation needed] Thus, accurately fitting andvalidating power-law models is an active area of research in statistics; see below.

Lack of well-defined average value

[edit]

A power-lawxk{\displaystyle x^{-k}} has a well-definedmean overx[1,){\displaystyle x\in [1,\infty )} only ifk>2{\displaystyle k>2}, and it has a finitevariance only ifk>3{\displaystyle k>3}; most identified power laws in nature have exponents such that the mean is well-defined but the variance is not, implying they are capable ofblack swan behavior.[2] This can be seen in the following thought experiment:[12] imagine a room with your friends and estimate the average monthly income in the room. Now imagine theworld's richest person entering the room, with a monthly income of about 1billion US$. What happens to the average income in the room? Income is distributed according to a power-law known as thePareto distribution (for example, the net worth of Americans is distributed according to a power law with an exponent of 2).

On the one hand, this makes it incorrect to apply traditional statistics that are based onvariance andstandard deviation (such asregression analysis).[13] On the other hand, this also allows for cost-efficient interventions.[12] For example, given that car exhaust is distributed according to a power-law among cars (very few cars contribute to most contamination) it would be sufficient to eliminate those very few cars from the road to reduce total exhaust substantially.[14]

The median does exist, however: for a power lawxk, with exponentk>1{\displaystyle k>1}, it takes the value 21/(k – 1)xmin, wherexmin is the minimum value for which the power law holds.[2]

Universality

[edit]

The equivalence of power laws with a particular scaling exponent can have a deeper origin in the dynamical processes that generate the power-law relation. In physics, for example,phase transitions in thermodynamic systems are associated with the emergence of power-law distributions of certain quantities, whose exponents are referred to as thecritical exponents of the system. Diverse systems with the same critical exponents—that is, which display identical scaling behaviour as they approachcriticality—can be shown, viarenormalization group theory, to share the same fundamental dynamics. For instance, the behavior of water and CO2 at their boiling points fall in the same universality class because they have identical critical exponents.[citation needed][clarification needed] In fact, almost all material phase transitions are described by a small set of universality classes. Similar observations have been made, though not as comprehensively, for variousself-organized critical systems, where the critical point of the system is anattractor. Formally, this sharing of dynamics is referred to asuniversality, and systems with precisely the same critical exponents are said to belong to the sameuniversality class.

Power-law functions

[edit]

Scientific interest in power-law relations stems partly from the ease with which certain general classes of mechanisms generate them.[15] The demonstration of a power-law relation in some data can point to specific kinds of mechanisms that might underlie the natural phenomenon in question, and can indicate a deep connection with other, seemingly unrelated systems;[16] see alsouniversality above. The ubiquity of power-law relations in physics is partly due todimensional constraints, while incomplex systems, power laws are often thought to be signatures of hierarchy or of specificstochastic processes. A few notable examples of power laws arePareto's law of income distribution, structural self-similarity offractals,scaling laws in biological systems, andscaling laws in cities. Research on the origins of power-law relations, and efforts to observe and validate them in the real world, is an active topic of research in many fields of science, includingphysics,computer science,linguistics,geophysics,neuroscience,systematics,sociology,economics and more.

However, much of the recent interest in power laws comes from the study ofprobability distributions: The distributions of a wide variety of quantities seem to follow the power-law form, at least in their upper tail (large events). The behavior of these large events connects these quantities to the study oftheory of large deviations (also calledextreme value theory), which considers the frequency of extremely rare events likestock market crashes and largenatural disasters. It is primarily in the study of statistical distributions that the name "power law" is used.

In empirical contexts, an approximation to a power-lawo(xk){\displaystyle o(x^{k})} often includes a deviation termε{\displaystyle \varepsilon }, which can represent uncertainty in the observed values (perhaps measurement or sampling errors) or provide a simple way for observations to deviate from the power-law function (perhaps for stochastic reasons):

y=axk+ε.{\displaystyle y=ax^{k}+\varepsilon .}

Mathematically, a strict power law cannot be a probability distribution, but a distribution that is a truncatedpower function is possible:p(x)=Cxα{\displaystyle p(x)=Cx^{-\alpha }} forx>xmin{\displaystyle x>x_{\text{min}}} where the exponentα{\displaystyle \alpha } (Greek letteralpha, not to be confused with scaling factora{\displaystyle a} used above) is greater than 1 (otherwise the tail has infinite area), the minimum valuexmin{\displaystyle x_{\text{min}}} is needed otherwise the distribution has infinite area asx approaches 0, and the constantC is a scaling factor to ensure that the total area is 1, as required by a probability distribution. More often one uses an asymptotic power law – one that is only true in the limit; seepower-law probability distributions below for details. Typically the exponent falls in the range2<α<3{\displaystyle 2<\alpha <3}, though not always.[10]

Examples

[edit]

More than a hundred power-law distributions have been identified in physics (e.g. sandpile avalanches), biology (e.g. species extinction and body mass), and the social sciences (e.g. city sizes and income).[17] Among them are:

Artificial Intelligence

[edit]

Astronomy

[edit]

Biology

[edit]

Chemistry

[edit]

Climate science

[edit]
  • Sizes of cloud areas and perimeters, as viewed from space[3]
  • The size of rain-shower cells[22]
  • Energy dissipation in cyclones[23]
  • Diameters ofdust devils on Earth and Mars[24]

General science

[edit]

Economics

[edit]

Finance

[edit]

Mathematics

[edit]

Physics

[edit]

Political Science

[edit]

Psychology

[edit]

Variants

[edit]

Broken power law

[edit]
Some models of theinitial mass function use a broken power law; here Kroupa (2001) in red.

A broken power law is apiecewise function, consisting of two or more power laws, combined with a threshold. For example, with two power laws:[49]

f(x)xα1{\displaystyle f(x)\propto x^{\alpha _{1}}} forx<xth{\displaystyle x<x_{\text{th}}},f(x)xthα1α2xα2 for x>xth.{\displaystyle f(x)\propto x_{\text{th}}^{\alpha _{1}-\alpha _{2}}x^{\alpha _{2}}{\text{ for }}x>x_{\text{th}}.}

Smoothly broken power law

[edit]

The pieces of a broken power law can be smoothly spliced together to construct a smoothly broken power law.

There are different possible ways to splice together power laws. One example is the following:[50]ln(yy0+a)=c0ln(xx0)+i=1ncici1filn(1+(xxi)fi){\displaystyle \ln \left({\frac {y}{y_{0}}}+a\right)=c_{0}\ln \left({\frac {x}{x_{0}}}\right)+\sum _{i=1}^{n}{\frac {c_{i}-c_{i-1}}{f_{i}}}\ln \left(1+\left({\frac {x}{x_{i}}}\right)^{f_{i}}\right)}where0<x0<x1<<xn{\displaystyle 0<x_{0}<x_{1}<\cdots <x_{n}}.


When the function is plotted as alog-log plot with horizontal axis beinglnx{\displaystyle \ln x} and vertical axis beingln(y/y0+a){\displaystyle \ln(y/y_{0}+a)}, the plot is composed ofn+1{\displaystyle n+1} linear segments with slopesc0,c1,,cn{\displaystyle c_{0},c_{1},\dots ,c_{n}}, separated atx=x1,,xn{\displaystyle x=x_{1},\dots ,x_{n}}, smoothly spliced together. The size offi{\displaystyle f_{i}} determines the sharpness of splicing between segmentsi1,i{\displaystyle i-1,i}.

Power law with exponential cutoff

[edit]

A power law with an exponential cutoff is simply a power law multiplied by an exponential function:[10]

f(x)xαeβx.{\displaystyle f(x)\propto x^{-\alpha }e^{-\beta x}.}

Curved power law

[edit]

f(x)xα+βx{\displaystyle f(x)\propto x^{\alpha +\beta x}}[51]

Power-law probability distributions

[edit]

In a looser sense, a power-lawprobability distribution is a distribution whose density function (or mass function in the discrete case) has the form, for large values ofx{\displaystyle x},[52]

P(X>x)L(x)x(α1){\displaystyle P(X>x)\sim L(x)x^{-(\alpha -1)}}

whereα>1{\displaystyle \alpha >1}, andL(x){\displaystyle L(x)} is aslowly varying function, which is any function that satisfieslimxL(rx)/L(x)=1{\textstyle \lim _{x\to \infty }L(rx)/L(x)=1} for any positive factorr{\displaystyle r}. This property ofL(x){\displaystyle L(x)} follows directly from the requirement thatp(x){\displaystyle p(x)} be asymptotically scale invariant; thus, the form ofL(x){\displaystyle L(x)} only controls the shape and finite extent of the lower tail. For instance, ifL(x){\displaystyle L(x)} is the constant function, then we have a power law that holds for all values ofx{\displaystyle x}. In many cases, it is convenient to assume a lower boundxmin{\displaystyle x_{\mathrm {min} }} from which the law holds. Combining these two cases, and wherex{\displaystyle x} is a continuous variable, the power law has the form of thePareto distribution

p(x)=α1xmin(xxmin)α,{\displaystyle p(x)={\frac {\alpha -1}{x_{\min }}}\left({\frac {x}{x_{\min }}}\right)^{-\alpha },}

where the pre-factor toα1xmin{\displaystyle {\frac {\alpha -1}{x_{\min }}}} is thenormalizing constant. We can now consider several properties of this distribution. For instance, itsmoments are given by

E(Xm)=xminxmp(x)dx=α1α1mxminm{\displaystyle \mathbb {E} \left(X^{m}\right)=\int _{x_{\min }}^{\infty }x^{m}p(x)\,\mathrm {d} x={\frac {\alpha -1}{\alpha -1-m}}x_{\min }^{m}}

which is only well defined form<α1{\displaystyle m<\alpha -1}. That is, all momentsmα1{\displaystyle m\geq \alpha -1} diverge: whenα2{\displaystyle \alpha \leq 2}, the average and all higher-order moments are infinite; when2<α<3{\displaystyle 2<\alpha <3}, the mean exists, but the variance and higher-order moments are infinite, etc. For finite-size samples drawn from such distribution, this behavior implies that thecentral moment estimators (like the mean and the variance) for diverging moments will never converge – as more data is accumulated, they continue to grow. These power-law probability distributions are also called Pareto-type distributions, distributions with Pareto tails, or distributions with regularly varying tails.

A modification, which does not satisfy the general form above, with an exponential cutoff,[10] is

p(x)L(x)xαeλx.{\displaystyle p(x)\propto L(x)x^{-\alpha }\mathrm {e} ^{-\lambda x}.}

In this distribution, the exponential decay termeλx{\displaystyle \mathrm {e} ^{-\lambda x}} eventually overwhelms the power-law behavior at very large values ofx{\displaystyle x}. This distribution does not scale[further explanation needed] and is thus not asymptotically as a power law; however, it does approximately scale over a finite region before the cutoff. The pure form above is a subset of this family, withλ=0{\displaystyle \lambda =0}. This distribution is a common alternative to the asymptotic power-law distribution because it naturally captures finite-size effects.

TheTweedie distributions are a family of statistical models characterized byclosure under additive and reproductive convolution as well as under scale transformation. Consequently, these models all express a power-law relationship between the variance and the mean. These models have a fundamental role as foci of mathematicalconvergence similar to the role that thenormal distribution has as a focus in thecentral limit theorem. This convergence effect explains why the variance-to-mean power law manifests so widely in natural processes, as withTaylor's law in ecology and with fluctuation scaling[53] in physics. It can also be shown that this variance-to-mean power law, when demonstrated by themethod of expanding bins, implies the presence of 1/f noise and that 1/f noise can arise as a consequence of this Tweedie convergence effect.[54]

Graphical methods for identification

[edit]

Although more sophisticated and robust methods have been proposed, the most frequently used graphical methods of identifying power-law probability distributions using random samples are Pareto quantile-quantile plots (or ParetoQ–Q plots),[citation needed] mean residual life plots[55][56] andlog–log plots. Another, more robust graphical method uses bundles of residual quantile functions.[57] (Please keep in mind thatpower-law distributions are also called Pareto-type distributions.) It is assumed here that a random sample is obtained from a probability distribution, and that we want to know if the tail of the distribution follows a power law (in other words, we want to know if the distribution has a "Pareto tail"). Here, the random sample is called "the data".

Pareto Q–Q plots

[edit]

Pareto Q–Q plots compare thequantiles of the log-transformed data to the corresponding quantiles of an exponential distribution with mean 1 (or to the quantiles of a standard Pareto distribution) by plotting the former versus the latter. If the resultant scatterplot suggests that the plotted pointsasymptotically converge to a straight line, then a power-law distribution should be suspected. A limitation of Pareto Q–Q plots is that they behave poorly when the tail indexα{\displaystyle \alpha } (also called Pareto index) is close to 0, because Pareto Q–Q plots are not designed to identify distributions with slowly varying tails.[57]

Mean residual life plots

[edit]

On the other hand, in its version for identifying power-law probability distributions, the mean residual life plot consists of first log-transforming the data, and then plotting the average of those log-transformed data that are higher than thei-th order statistic versus thei-th order statistic, fori = 1, ..., n, where n is the size of the random sample. If the resultant scatterplot suggests that the plotted points tend to stabilize about a horizontal straight line, then a power-law distribution should be suspected. Since the mean residual life plot is very sensitive to outliers (it is not robust), it usually produces plots that are difficult to interpret; for this reason, such plots are usually called Hill horror plots.[58]

Log-log plots

[edit]
A straight line on a log–log plot is necessary but insufficient evidence for power-laws, the slope of the straight line corresponds to the power law exponent.

Log–log plots are an alternative way of graphically examining the tail of a distribution using a random sample. Taking the logarithm of a power law of the formf(x)=axk{\displaystyle f(x)=ax^{k}} results in:[59]

log(f(x))=log(axk)=log(a)+log(xk)=log(a)+klog(x),{\displaystyle {\begin{aligned}\log(f(x))&=\log(ax^{k})\\&=\log(a)+\log(x^{k})\\&=\log(a)+k\cdot \log(x),\end{aligned}}}

which forms a straight line with slopek{\displaystyle k} on a log-log scale. Caution has to be exercised however as a log–log plot is necessary but insufficient evidence for a power law relationship, as many non power-law distributions will appear as straight lines on a log–log plot.[10][60] This method consists of plotting the logarithm of an estimator of the probability that a particular number of the distribution occurs versus the logarithm of that particular number. Usually, this estimator is the proportion of times that the number occurs in the data set. If the points in the plot tend to converge to a straight line for large numbers in the x axis, then the researcher concludes that the distribution has a power-law tail. Examples of the application of these types of plot have been published.[61] A disadvantage of these plots is that, in order for them to provide reliable results, they require huge amounts of data. In addition, they are appropriate only for discrete (or grouped) data.

Bundle plots

[edit]

Another graphical method for the identification of power-law probability distributions using random samples has been proposed.[57] This methodology consists of plotting abundle for the log-transformed sample. Originally proposed as a tool to explore the existence of moments and the moment generation function using random samples, the bundle methodology is based on residualquantile functions (RQFs), also called residual percentile functions,[62][63][64][65][66][67][68] which provide a full characterization of the tail behavior of many well-known probability distributions, including power-law distributions, distributions with other types of heavy tails, and even non-heavy-tailed distributions. Bundle plots do not have the disadvantages of Pareto Q–Q plots, mean residual life plots and log–log plots mentioned above (they are robust to outliers, allow visually identifying power laws with small values ofα{\displaystyle \alpha }, and do not demand the collection of much data).[citation needed] In addition, other types of tail behavior can be identified using bundle plots.

Plotting power-law distributions

[edit]

In general, power-law distributions are plotted ondoubly logarithmic axes, which emphasizes the upper tail region. The most convenient way to do this is via the (complementary)cumulative distribution (ccdf) that is, thesurvival function,P(x)=Pr(X>x){\displaystyle P(x)=\Pr(X>x)},

P(x)=Pr(X>x)=Cxp(X)dX=α1xminα+1xXαdX=(xxmin)(α1).{\displaystyle P(x)=\Pr(X>x)=C\int _{x}^{\infty }p(X)\,\mathrm {d} X={\frac {\alpha -1}{x_{\min }^{-\alpha +1}}}\int _{x}^{\infty }X^{-\alpha }\,\mathrm {d} X=\left({\frac {x}{x_{\min }}}\right)^{-(\alpha -1)}.}

The cdf is also a power-law function, but with a smaller scaling exponent. For data, an equivalent form of the cdf is the rank-frequency approach, in which we first sort then{\displaystyle n} observed values in ascending order, and plot them against the vector[1,n1n,n2n,,1n]{\displaystyle \left[1,{\frac {n-1}{n}},{\frac {n-2}{n}},\dots ,{\frac {1}{n}}\right]}.

Although it can be convenient to log-bin the data, or otherwise smooth the probability density (mass) function directly, these methods introduce an implicit bias in the representation of the data, and thus should be avoided.[10][69] The survival function, on the other hand, is more robust to (but not without) such biases in the data and preserves the linear signature on doubly logarithmic axes. Though a survival function representation is favored over that of the pdf while fitting a power law to the data with the linear least square method, it is not devoid of mathematical inaccuracy. Thus, while estimating exponents of a power law distribution, maximum likelihood estimator is recommended.

Estimating the exponent from empirical data

[edit]

There are many ways of estimating the value of the scaling exponent for a power-law tail, however not all of them yieldunbiased and consistent answers. Some of the most reliable techniques are often based on the method ofmaximum likelihood. Alternative methods are often based on making a linear regression on either the log–log probability, the log–log cumulative distribution function, or on log-binned data, but these approaches should be avoided as they can all lead to highly biased estimates of the scaling exponent.[10]

Maximum likelihood

[edit]

For real-valued,independent and identically distributed data, we fit a power-law distribution of the form

p(x)=α1xmin(xxmin)α{\displaystyle p(x)={\frac {\alpha -1}{x_{\min }}}\left({\frac {x}{x_{\min }}}\right)^{-\alpha }}

to the dataxxmin{\displaystyle x\geq x_{\min }}, where the coefficientα1xmin{\displaystyle {\frac {\alpha -1}{x_{\min }}}} is included to ensure that the distribution isnormalized. Given a choice forxmin{\displaystyle x_{\min }}, the log likelihood function becomes:

L(α)=logi=1nα1xmin(xixmin)α{\displaystyle {\mathcal {L}}(\alpha )=\log \prod _{i=1}^{n}{\frac {\alpha -1}{x_{\min }}}\left({\frac {x_{i}}{x_{\min }}}\right)^{-\alpha }} The maximum of this likelihood is found by differentiating with respect to parameterα{\displaystyle \alpha }, setting the result equal to zero. Upon rearrangement, this yields the estimator equation:

α^=1+n[i=1nlnxixmin]1{\displaystyle {\hat {\alpha }}=1+n\left[\sum _{i=1}^{n}\ln {\frac {x_{i}}{x_{\min }}}\right]^{-1}}

where{xi}{\displaystyle \{x_{i}\}} are then{\displaystyle n} data pointsxixmin{\displaystyle x_{i}\geq x_{\min }}.[2][70] This estimator exhibits a small finite sample-size bias of orderO(n1){\displaystyle O(n^{-1})}, which is small whenn > 100. Further, the standard error of the estimate isσ=α^1n+O(n1){\displaystyle \sigma ={\frac {{\hat {\alpha }}-1}{\sqrt {n}}}+O(n^{-1})}. This estimator is equivalent to the popular[citation needed]Hill estimator fromquantitative finance andextreme value theory.[citation needed]

For a set ofn integer-valued data points{xi}{\displaystyle \{x_{i}\}}, again where eachxixmin{\displaystyle x_{i}\geq x_{\min }}, the maximum likelihood exponent is the solution to the transcendental equation

ζ(α^,xmin)ζ(α^,xmin)=1ni=1nlnxixmin{\displaystyle {\frac {\zeta '({\hat {\alpha }},x_{\min })}{\zeta ({\hat {\alpha }},x_{\min })}}=-{\frac {1}{n}}\sum _{i=1}^{n}\ln {\frac {x_{i}}{x_{\min }}}}

whereζ(α,xmin){\displaystyle \zeta (\alpha ,x_{\mathrm {min} })} is theincomplete zeta function. The uncertainty in this estimate follows the same formula as for the continuous equation. However, the two equations forα^{\displaystyle {\hat {\alpha }}} are not equivalent, and the continuous version should not be applied to discrete data, nor vice versa.

Further, both of these estimators require the choice ofxmin{\displaystyle x_{\min }}. For functions with a non-trivialL(x){\displaystyle L(x)} function, choosingxmin{\displaystyle x_{\min }} too small produces a significant bias inα^{\displaystyle {\hat {\alpha }}}, while choosing it too large increases the uncertainty inα^{\displaystyle {\hat {\alpha }}}, and reduces thestatistical power of our model. In general, the best choice ofxmin{\displaystyle x_{\min }} depends strongly on the particular form of the lower tail, represented byL(x){\displaystyle L(x)} above.

More about these methods, and the conditions under which they can be used, can be found in .[10] Further, this comprehensive review article providesusable code (Matlab, Python, R and C++) for estimation and testing routines for power-law distributions.

Kolmogorov–Smirnov estimation

[edit]

Another method for the estimation of the power-law exponent, which does not assumeindependent and identically distributed (iid) data, uses the minimization of theKolmogorov–Smirnov statistic,D{\displaystyle D}, between the cumulative distribution functions of the data and the power law:

α^=argminαDα{\displaystyle {\hat {\alpha }}={\underset {\alpha }{\operatorname {arg\,min} }}\,D_{\alpha }}

with

Dα=maxx|Pemp(x)Pα(x)|{\displaystyle D_{\alpha }=\max _{x}\left|P_{\mathrm {emp} }(x)-P_{\alpha }(x)\right|}

wherePemp(x){\displaystyle P_{\mathrm {emp} }(x)} andPα(x){\displaystyle P_{\alpha }(x)} denote the cdfs of the data and the power law with exponentα{\displaystyle \alpha }, respectively. As this method does not assume iid data, it provides an alternative way to determine the power-law exponent for data sets in which the temporal correlation can not be ignored.[5]

Validating power laws

[edit]

Although power-law relations are attractive for many theoretical reasons, demonstrating that data does indeed follow a power-law relation requires more than simply fitting a particular model to the data.[34] This is important for understanding the mechanism that gives rise to the distribution: superficially similar distributions may arise for significantly different reasons, and different models yield different predictions, such as extrapolation.

For example,log-normal distributions are often mistaken for power-law distributions:[71]. When you take the log of its probability density function, the log-normal distribution has terms that are constant, log, and log-squared. When the mean is small and variance is large, the constant in front of the log-squared term is very small. In that case, for most of the distribution, it will be linear on a log-log plot. It is only for extreme values that the log-squared term asserts itself and shows that it is not a power-law.

For example,Gibrat's law about proportional growth processes produce distributions that are lognormal, although their log–log plots look linear over a limited range. An explanation of this is that although the logarithm of thelognormal density function is quadratic inlog(x), yielding a "bowed" shape in a log–log plot, if the quadratic term is small relative to the linear term then the result can appear almost linear, and the lognormal behavior is only visible when the quadratic term dominates, which may require significantly more data. Therefore, a log–log plot that is slightly "bowed" downwards can reflect a log-normal distribution – not a power law.

In general, many alternative functional forms can appear to follow a power-law form for some extent.[72]Stumpf & Porter (2012) proposed plotting the empirical cumulative distribution function in the log-log domain and claimed that a candidate power-law should cover at least two orders of magnitude.[73] Also, researchers usually have to face the problem of deciding whether or not a real-world probability distribution follows a power law. As a solution to this problem, Diaz[57] proposed a graphical methodology based on random samples that allow visually discerning between different types of tail behavior. This methodology uses bundles of residual quantile functions, also called percentile residual life functions, which characterize many different types of distribution tails, including both heavy and non-heavy tails. However,Stumpf & Porter (2012) claimed the need for both a statistical and a theoretical background in order to support a power-law in the underlying mechanism driving the data generating process.[73]

One method to validate a power-law relation tests many orthogonal predictions of a particular generative mechanism against data. Simply fitting a power-law relation to a particular kind of data is not considered a rational approach. As such, the validation of power-law claims remains a very active field of research in many areas of modern science.[10]

See also

[edit]

References

[edit]

Notes

  1. ^Yaneer Bar-Yam."Concepts: Power Law". New England Complex Systems Institute. Retrieved18 August 2015.
  2. ^abcdNewman, M. E. J. (2005). "Power laws, Pareto distributions and Zipf's law".Contemporary Physics.46 (5):323–351.arXiv:cond-mat/0412004.Bibcode:2005ConPh..46..323N.doi:10.1080/00107510500052444.S2CID 202719165.
  3. ^abDeWitt, Thomas D.; Garrett, Timothy J.; Rees, Karlie N.; Bois, Corey; Krueger, Steven K.; Ferlay, Nicolas (2024-01-05)."Climatologically invariant scale invariance seen in distributions of cloud horizontal sizes".Atmospheric Chemistry and Physics.24 (1):109–122.Bibcode:2024ACP....24..109D.doi:10.5194/acp-24-109-2024.ISSN 1680-7316.
  4. ^Humphries NE, Queiroz N, Dyer JR, Pade NG, Musyl MK, Schaefer KM, Fuller DW, Brunnschweiler JM, Doyle TK, Houghton JD, Hays GC, Jones CS, Noble LR, Wearmouth VJ, Southall EJ, Sims DW (2010)."Environmental context explains Lévy and Brownian movement patterns of marine predators"(PDF).Nature.465 (7301):1066–1069.Bibcode:2010Natur.465.1066H.doi:10.1038/nature09116.PMID 20531470.S2CID 4316766.
  5. ^abcKlaus A, Yu S, Plenz D (2011). Zochowski M (ed.)."Statistical Analyses Support Power Law Distributions Found in Neuronal Avalanches".PLOS ONE.6 (5). e19779.Bibcode:2011PLoSO...619779K.doi:10.1371/journal.pone.0019779.PMC 3102672.PMID 21720544.
  6. ^Albert & Reis 2011, p. [page needed].
  7. ^Cannavò, Flavio; Nunnari, Giuseppe (2016-03-01)."On a Possible Unified Scaling Law for Volcanic Eruption Durations".Scientific Reports.6 22289.Bibcode:2016NatSR...622289C.doi:10.1038/srep22289.ISSN 2045-2322.PMC 4772095.PMID 26926425.
  8. ^Stevens, S. S. (1957). "On the psychophysical law".Psychological Review.64 (3):153–181.doi:10.1037/h0046162.PMID 13441853.
  9. ^Staddon, J. E. R. (1978). "Theory of behavioral power functions".Psychological Review.85 (4):305–320.doi:10.1037/0033-295x.85.4.305.hdl:10161/6003.
  10. ^abcdefghiClauset, Shalizi & Newman 2009.
  11. ^Van Droogenbroeck, Frans J.,'Completion of the standard power-law model by including upper and lower probability bounds' (2023).
  12. ^ab"9na CEPAL Charlas Sobre Sistemas Complejos Sociales (CCSSCS): Leyes de potencias".YouTube. 31 December 2013.
  13. ^Taleb, Nassim Nicholas; Bar-Yam, Yaneer; Cirillo, Pasquale (2020-10-20)."On single point forecasts for fat-tailed variables".International Journal of Forecasting.38 (2):413–422.doi:10.1016/j.ijforecast.2020.08.008.ISSN 0169-2070.PMC 7572356.PMID 33100449.S2CID 220919883.
  14. ^Malcolm Gladwell (February 13, 2006)."Million-Dollar Murray". Archived fromthe original on 2015-03-18. Retrieved2015-06-14.
  15. ^Sornette 2006.
  16. ^Simon 1955.
  17. ^Andriani, P.; McKelvey, B. (2007)."Beyond Gaussian averages: redirecting international business and management research toward extreme events and power laws".Journal of International Business Studies.38 (7):1212–1230.doi:10.1057/palgrave.jibs.8400324.S2CID 512642.
  18. ^Lacquaniti, Francesco; Terzuolo, Carlo; Viviani, Paolo (1983). "The law relating the kinematic and figural aspects of drawing movements".Acta Psychologica.54 (1–3):115–130.doi:10.1016/0001-6918(83)90027-6.PMID 6666647.S2CID 5144040.
  19. ^Albert, J. S.; Bart, H. J.; Reis, R. E. "Species richness & cladal diversity". InAlbert & Reis (2011), pp. 89–104.
  20. ^Yu, Frank H.; Willson, Timothy; Frye, Stephen; Edwards, Aled; Bader, Gary D.; Isserlin, Ruth (2011-02-02). "The human genome and drug discovery after a decade. Roads (still) not taken".Nature.470 (7333):163–165.arXiv:1102.0448v2.Bibcode:2011Natur.470..163E.doi:10.1038/470163a.PMID 21307913.S2CID 4429387.
  21. ^Saravia, Leonardo A.; Doyle, Santiago R.; Bond-Lamberty, Ben (2018-12-10)."Power laws and critical fragmentation in global forests".Scientific Reports.8 (1): 17766.Bibcode:2018NatSR...817766S.doi:10.1038/s41598-018-36120-w.ISSN 2045-2322.PMC 6288094.PMID 30532065.
  22. ^Machado L, Rossow, WB (1993)."Structural characteristics and radial properties of tropical cloud clusters".Monthly Weather Review.121 (12):3234–3260.doi:10.1175/1520-0493(1993)121<3234:scarpo>2.0.co;2.
  23. ^Corral, A, Osso, A, Llebot, JE (2010). "Scaling of tropical cyclone dissipation".Nature Physics.6 (9):693–696.arXiv:0910.0054.Bibcode:2010NatPh...6..693C.doi:10.1038/nphys1725.S2CID 67754747.
  24. ^Lorenz RD (2009). "Power Law of Dust Devil Diameters on Earth and Mars".Icarus.203 (2):683–684.Bibcode:2009Icar..203..683L.doi:10.1016/j.icarus.2009.06.029.
  25. ^"Horton's Laws – Example".www.engr.colostate.edu. Retrieved2018-09-30.
  26. ^Sutton, J. (1997), "Gibrat's Legacy",Journal of Economic Literature XXXV, 40–59.
  27. ^Li, W. (November 1999). "Random texts exhibit Zipf's-law-like word frequency distribution".IEEE Transactions on Information Theory.38 (6):1842–1845.doi:10.1109/18.165464.ISSN 0018-9448.
  28. ^Curtis, Vickie (2018-04-20).Online Citizen Science and the Widening of Academia: Distributed Engagement with Research and Knowledge Production. Springer.ISBN 978-3-319-77664-4.
  29. ^Croteau, David; Hoynes, William (2013-11-06).Media/Society: Industries, Images, and Audiences. SAGE Publications.ISBN 978-1-4833-2355-8.
  30. ^Lewis Fry Richardson (1950).The Statistics of Deadly Quarrels.
  31. ^Berreby, David (July 31, 2014)."Cloudy With a Chance of War".Nautilus Magazine. RetrievedOctober 22, 2020.
  32. ^Martin, Charles H.; Mahoney, Michael W. (2018-10-02). "Implicit Self-Regularization in Deep Neural Networks: Evidence from Random Matrix Theory and Implications for Learning".arXiv:1810.01075 [cs.LG].
  33. ^Reed, W. J.; Hughes, B. D. (2002)."From gene families and genera to incomes and internet file sizes: Why power laws are so common in nature"(PDF).Phys Rev E.66 (6) 067103.Bibcode:2002PhRvE..66f7103R.doi:10.1103/physreve.66.067103.PMID 12513446.
  34. ^abHilbert, Martin (2013)."Scale-free power-laws as interaction between progress and diffusion".Complexity (Submitted manuscript).19 (4):56–65.Bibcode:2014Cmplx..19d..56H.doi:10.1002/cplx.21485.
  35. ^Etro, F.; Stepanova, E. (2018). "Power-laws in art".Physica A: Statistical Mechanics and Its Applications.506:217–220.Bibcode:2018PhyA..506..217E.doi:10.1016/j.physa.2018.04.057.hdl:11382/522706.S2CID 126347599.
  36. ^Fricke, Daniel; Lux, Thomas (2015-02-13)."On the distribution of links in the interbank network: evidence from the e-MID overnight money market"(PDF).Empirical Economics.49 (4). Springer Science and Business Media LLC:1463–1495.doi:10.1007/s00181-015-0919-x.ISSN 0377-7332.S2CID 154684126.
  37. ^"The Granular Origins of Aggregate Fluctuations".Econometrica.79 (3):733–772. 2011.doi:10.3982/ecta8769.ISSN 0012-9682.
  38. ^Gomez-Lievano, Andres; Patterson-Lomba, Oscar; Hausmann, Ricardo (2016-12-22)."Explaining the prevalence, scaling and variance of urban phenomena".Nature Human Behaviour.1 (1) 0012.arXiv:1604.07876.doi:10.1038/s41562-016-0012.ISSN 2397-3374.
  39. ^Bettencourt, Luís M. A.; Lobo, José; Helbing, Dirk; Kühnert, Christian; West, Geoffrey B. (2007-04-24)."Growth, innovation, scaling, and the pace of life in cities".Proceedings of the National Academy of Sciences.104 (17):7301–7306.Bibcode:2007PNAS..104.7301B.doi:10.1073/pnas.0610172104.ISSN 0027-8424.PMC 1852329.PMID 17438298.
  40. ^Neumann, Jerry (2015-06-25)."Power Laws in Venture".ReactionWheel. Retrieved2023-10-11.
  41. ^Müller, Ulrich A.; Dacorogna, Michel M.; Olsen, Richard B.; Pictet, Olivier V.; Schwarz, Matthias; Morgenegg, Claude (1990-12-01). "Statistical study of foreign exchange rates, empirical evidence of a price change scaling law, and intraday analysis".Journal of Banking & Finance.14 (6):1189–1208.doi:10.1016/0378-4266(90)90009-Q.ISSN 0378-4266.
  42. ^Lux, Thomas A.; Alfarano, Simone (2016). "Financial power laws: Empirical evidence, models, and mechanisms".Chaos, Solitons & Fractals.88:3–18.Bibcode:2016CSF....88....3L.doi:10.1016/j.chaos.2016.01.020.
  43. ^Glattfelder, J. B.; Dupuis, A.; Olsen, R. B. (2011-04-01). "Patterns in high-frequency FX data: discovery of 12 empirical scaling laws".Quantitative Finance.11 (4):599–614.arXiv:0809.1040.doi:10.1080/14697688.2010.481632.ISSN 1469-7688.S2CID 154979612.
  44. ^Bolmatov, D.; Brazhkin, V. V.; Trachenko, K. (2013). "Thermodynamic behaviour of supercritical matter".Nature Communications.4 2331.arXiv:1303.3153.Bibcode:2013NatCo...4.2331B.doi:10.1038/ncomms3331.PMID 23949085.S2CID 205319155.
  45. ^Moret, M.; Zebende, G. (2007). "Amino acid hydrophobicity and accessible surface area".Physical Review E.75 (1 Pt 1). 011920.Bibcode:2007PhRvE..75a1920M.doi:10.1103/PhysRevE.75.011920.PMID 17358197.
  46. ^Mackay, D. M. (1963). "Psychophysics of perceived intensity:A theoretical basis for Fechner's and Stevens' laws".Science.139 (3560):1213–1216.Bibcode:1963Sci...139.1213M.doi:10.1126/science.139.3560.1213-a.S2CID 122501807.
  47. ^Staddon, J. E. R. (1978)."Theory of behavioral power functions"(PDF).Psychological Review.85 (4):305–320.doi:10.1037/0033-295x.85.4.305.hdl:10161/6003.
  48. ^John T. Wixted; Shana K. Carpenter."The Wickelgren Power Law and the Ebbinghaus Savings Function"(PDF).Psychological Science. Archived fromthe original(PDF) on April 8, 2016. RetrievedAugust 31, 2016.
  49. ^Jóhannesson, Gudlaugur; Björnsson, Gunnlaugur; Gudmundsson, Einar H. (2006). "Afterglow Light Curves and Broken Power Laws: A Statistical Study".The Astrophysical Journal.640 (1): L5.arXiv:astro-ph/0602219.Bibcode:2006ApJ...640L...5J.doi:10.1086/503294.S2CID 16139116.
  50. ^Caballero, Ethan; Gupta, Kshitij; Rish, Irina; Krueger, David (2023-04-24). "Broken Neural Scaling Laws".arXiv:2210.14891 [cs.LG].
  51. ^"Curved-power law". Archived fromthe original on 2016-02-08. Retrieved2013-07-07.
  52. ^N. H. Bingham, C. M. Goldie, and J. L. Teugels, Regular variation. Cambridge University Press, 1989
  53. ^Kendal, WS; Jørgensen, B (2011). "Taylor's power law and fluctuation scaling explained by a central-limit-like convergence".Phys. Rev. E.83 (6) 066115.Bibcode:2011PhRvE..83f6115K.doi:10.1103/physreve.83.066115.PMID 21797449.
  54. ^Kendal, WS; Jørgensen, BR (2011)."Tweedie convergence: a mathematical basis for Taylor's power law, 1/f noise and multifractality"(PDF).Phys. Rev. E.84 (6) 066120.Bibcode:2011PhRvE..84f6120K.doi:10.1103/physreve.84.066120.PMID 22304168.
  55. ^Beirlant, J., Teugels, J. L., Vynckier, P. (1996)Practical Analysis of Extreme Values, Leuven: Leuven University Press
  56. ^Coles, S. (2001)An introduction to statistical modeling of extreme values. Springer-Verlag, London.
  57. ^abcdDiaz, F. J. (1999). "Identifying Tail Behavior by Means of Residual Quantile Functions".Journal of Computational and Graphical Statistics.8 (3):493–509.doi:10.2307/1390871.JSTOR 1390871.
  58. ^Resnick, S. I. (1997)."Heavy Tail Modeling and Teletraffic Data".The Annals of Statistics.25 (5):1805–1869.doi:10.1214/aos/1069362376.
  59. ^"Power-law fitting and log-log graphs"(PDF). Archived fromthe original(PDF) on 2016-07-05.
  60. ^"So You Think You Have a Power Law — Well Isn't That Special?".bactra.org. Retrieved27 March 2018.
  61. ^Jeong, H.; Tombor, B. Albert; Oltvai, Z.N.; Barabasi, A.-L. (2000). "The large-scale organization of metabolic networks".Nature.407 (6804):651–654.arXiv:cond-mat/0010278.Bibcode:2000Natur.407..651J.doi:10.1038/35036627.PMID 11034217.S2CID 4426931.
  62. ^Arnold, B. C.; Brockett, P. L. (1983). "When does the βth percentile residual life function determine the distribution?".Operations Research.31 (2):391–396.doi:10.1287/opre.31.2.391.
  63. ^Joe, H.; Proschan, F. (1984). "Percentile residual life functions".Operations Research.32 (3):668–678.doi:10.1287/opre.32.3.668.
  64. ^Joe, H. (1985), "Characterizations of life distributions from percentile residual lifetimes",Ann. Inst. Statist. Math. 37, Part A, 165–172.
  65. ^Csorgo, S.; Viharos, L. (1992)."Confidence bands for percentile residual lifetimes"(PDF).Journal of Statistical Planning and Inference.30 (3):327–337.doi:10.1016/0378-3758(92)90159-p.hdl:2027.42/30190.
  66. ^Schmittlein, D. C.; Morrison, D. G. (1981). "The median residual lifetime: A characterization theorem and an application".Operations Research.29 (2):392–399.doi:10.1287/opre.29.2.392.
  67. ^Morrison, D. G.; Schmittlein, D. C. (1980). "Jobs, strikes, and wars: Probability models for duration".Organizational Behavior and Human Performance.25 (2):224–251.doi:10.1016/0030-5073(80)90065-3.
  68. ^Gerchak, Y (1984). "Decreasing failure rates and related issues in the social sciences".Operations Research.32 (3):537–546.doi:10.1287/opre.32.3.537.
  69. ^Bauke, H. (2007). "Parameter estimation for power-law distributions by maximum likelihood methods".European Physical Journal B.58 (2):167–173.arXiv:0704.1867.Bibcode:2007EPJB...58..167B.doi:10.1140/epjb/e2007-00219-y.S2CID 119602829.
  70. ^Hall, P. (1982). "On Some Simple Estimates of an Exponent of Regular Variation".Journal of the Royal Statistical Society, Series B.44 (1):37–42.doi:10.1111/j.2517-6161.1982.tb01183.x.JSTOR 2984706.
  71. ^Mitzenmacher 2004.
  72. ^Laherrère & Sornette 1998.
  73. ^abStumpf & Porter 2012.

Bibliography

External links

[edit]
Wikimedia Commons has media related toPower laws.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Power_law&oldid=1322904773"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp