Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Arithmetic mean

From Wikipedia, the free encyclopedia
Type of average of a collection of numbers
"X̄" redirects here. For the character, seeMacron (diacritic).
For broader coverage of this topic, seeMean.

Inmathematics andstatistics, thearithmetic mean (/ˌærɪθˈmɛtɪk/ arr-ith-MET-ik),arithmetic average, or just themean oraverage is the sum of a collection of numbers divided by the count of numbers in the collection.[1] The collection is often a set of results from anexperiment, anobservational study, or asurvey. The term "arithmetic mean" is preferred in some contexts in mathematics and statistics because it helps to distinguish it from other types of means, such asgeometric andharmonic.

Arithmetic means are also frequently used ineconomics,anthropology,history, and almost every otheracademic field to some extent. For example,per capita income is the arithmetic average of theincome of anation'spopulation.

While the arithmetic mean is often used to reportcentral tendencies, it is not arobust statistic: it is greatly influenced byoutliers (values much larger or smaller than most others). Forskewed distributions, such as thedistribution of income for which a few people's incomes are substantially higher than most people's, the arithmetic mean may not coincide with one's notion of "middle". In that case, robust statistics, such as themedian, may provide a better description ofcentral tendency.

Definition

[edit]
Further information:summation for an explanation of the summation operator

The arithmetic mean of a set of observed data is equal to the sum of the numerical values of each observation, divided by the total number of observations. Symbolically, for a data set consisting of the valuesx1,,xn{\displaystyle x_{1},\dots ,x_{n}}, the arithmetic mean is defined by the formula:[2]

x¯=1n(i=1nxi)=x1+x2++xnn{\displaystyle {\bar {x}}={\frac {1}{n}}\left(\sum _{i=1}^{n}{x_{i}}\right)={\frac {x_{1}+x_{2}+\dots +x_{n}}{n}}}

In simpler terms, the formula for the arithmetic mean is:

Total of all numbers within the dataAmount of total numbers within the data{\displaystyle {\frac {\text{Total of all numbers within the data}}{\text{Amount of total numbers within the data}}}}

For example, if the monthly salaries of5{\displaystyle 5} employees are{2500,2700,2300,2650,2450}{\displaystyle \{2500,2700,2300,2650,2450\}}, then the arithmetic mean is:

2500+2700+2300+2650+24505=2520{\displaystyle {\frac {2500+2700+2300+2650+2450}{5}}=2520}
Example
PersonSalary
A2500
B2700
C2300
D2650
E2450
Average2520

If the data set is astatistical population (i.e. consists of every possible observation and not just asubset of them), then the mean of that population is called thepopulation mean and denoted by theGreek letterμ{\displaystyle \mu }. If the data set is astatistical sample (a subset of the population), it is called thesample mean (which for a data setX{\displaystyle X} is denoted asX¯{\displaystyle {\overline {X}}}).

The arithmetic mean can be similarly defined forvectors in multiple dimensions, not onlyscalar values; this is often referred to as acentroid. More generally, because the arithmetic mean is aconvex combination (meaning its coefficients sum to1{\displaystyle 1}), it can be defined on aconvex space, not only avector space.

History

[edit]

StatisticianChurchill Eisenhart, senior researcher fellow at theU. S. National Bureau of Standards, traced the history of the arithmetic mean in detail. In the modern age, it started to be used as a way of combining various observations that should be identical, but were not such asestimates of the direction ofmagnetic north. In 1635, mathematicianHenry Gellibrand described as "meane" themidpoint of a lowest and highest number, not quite the arithmetic mean. In 1668, a person known as "D. B." was quoted in theTransactions of the Royal Society describing "taking the mean" of five values:[3]

In this Table, he [Capt. Sturmy] notes the greatest difference to be 14 minutes; and so taking the mean for the true Variation, he concludes it then and there to be just 1. deg. 27. min.

— D.B., p. 726

Motivating properties

[edit]

The arithmetic mean has several properties that make it interesting, especially as a measure of central tendency. These include:

Additional properties

[edit]
  • The arithmetic mean of a sample is always between the largest and smallest values in that sample.
  • The arithmetic mean of any amount of equal-sized number groups together is the arithmetic mean of the arithmetic means of each group.

Contrast with median

[edit]

The arithmetic mean differs from themedian, which is the value that separates the higher half and lower half of a data set. When the values in a data set form an arithmetic progression, the median and arithmetic mean are equal. For example, in the data set1,2,3,4{\displaystyle {1,2,3,4}}, both the mean and median are2.5{\displaystyle 2.5}.

In other cases, the mean and median can differ significantly. For instance, in the data set1,2,4,8,16{\displaystyle {1,2,4,8,16}}, the arithmetic mean is6.2{\displaystyle 6.2}, while the median is4{\displaystyle 4}. This occurs because the mean is sensitive to extreme values and may not accurately reflect the central tendency of most data points.

This distinction has practical implications across different fields. For example, since the 1980s, themedian income in theUnited States has increased at a slower rate than the arithmetic mean income.[5]

Similarly, in climate studies, daily mean temperature distributions tend to approximate a normal distribution, whereas annual or monthly rainfall totals often display a skewed distribution, with some periods having unusually high totals while most have relatively low amounts. In such cases, the median can provide a more representative measure of central tendency.[6]

Generalizations

[edit]

Weighted average

[edit]
Main article:Weighted average

A weighted average, or weighted mean, is an average in which some data points count more heavily than others in that they are given more weight in the calculation.[7] For example, the arithmetic mean of3{\displaystyle 3} and5{\displaystyle 5} is3+52=4{\displaystyle {\frac {3+5}{2}}=4}, or equivalently312+512=4{\displaystyle 3\cdot {\frac {1}{2}}+5\cdot {\frac {1}{2}}=4}. In contrast, aweighted mean in which the first number receives, for example, twice as much weight as the second (perhaps because it is assumed to appear twice as often in the general population from which these numbers were sampled) would be calculated as323+513=113{\displaystyle 3\cdot {\frac {2}{3}}+5\cdot {\frac {1}{3}}={\frac {11}{3}}}. Here the weights, which necessarily sum to one, are23{\displaystyle {\frac {2}{3}}} and13{\displaystyle {\frac {1}{3}}}, the former being twice the latter. The arithmetic mean (sometimes called the "unweighted average" or "equally weighted average") can be interpreted as a special case of a weighted average in which all weights are equal to the same number (12{\displaystyle {\frac {1}{2}}} in the above example and1n{\displaystyle {\frac {1}{n}}} in a situation withn{\displaystyle n} numbers being averaged).

Functions

[edit]
This section is an excerpt fromMean of a function.[edit]
Incalculus, and especiallymultivariable calculus, themean of a function is loosely defined as the ”average" value of thefunction over itsdomain.

Continuous probability distributions

[edit]
Comparison of twolog-normal distributions with equal median, but differentskewness, resulting in various means andmodes

If a numerical property, and any sample of data from it, can take on any value from a continuous range instead of, for example, just integers, then theprobability of a number falling into some range of possible values can be described by integrating acontinuous probability distribution across this range, even when the naive probability for a sample number taking one certain value from infinitely many is zero. In this context, the analog of a weighted average, in which there are infinitely many possibilities for the precise value of the variable in each range, is called themean of theprobability distribution. The most widely encountered probability distribution is called thenormal distribution; it has the property that all measures of its central tendency, including not just the mean but also the median mentioned above and the mode (the three Ms[8]), are equal. This equality does not hold for other probability distributions, as illustrated for the log-normal distribution here.

Angles

[edit]
Main article:Mean of circular quantities

Particular care is needed when using cyclic data, such asphases orangles. Taking the arithmetic mean of 1° and 359° yields a result of 180°. This is incorrect for two reasons:

  1. Angle measurements are only defined up to an additive constant of 360° (2π{\displaystyle 2\pi } orτ{\displaystyle \tau }, if measuring inradians). Thus, these could easily be called 1° and -1°, or 361° and 719°, since each one of them produces a different average.
  2. In this situation, 0° (or 360°) is geometrically a betteraverage value: there is lowerdispersion about it (the points are both 1° from it and 179° from 180°, the putative average).

In general application, such an oversight will lead to the average value artificially moving towards the middle of the numerical range. A solution to this problem is to use the optimization formulation (that is, define the mean as the central point: the point about which one has the lowest dispersion) and redefine the difference as a modular distance (i.e. the distance on the circle: so the modular distance between 1° and 359° is 2°, not 358°).

Proof without words of theAM–GM inequality:
PR is the diameter of a circle centered on O; its radius AO is thearithmetic mean ofa andb. Triangle PGR is aright triangle fromThales's theorem, enabling use of thegeometric mean theorem to show that itsaltitude GQ is thegeometric mean. For any ratioa:b,AO ≥ GQ.

Symbols and encoding

[edit]

The arithmetic mean is often denoted by a bar (vinculum ormacron), as inx¯{\displaystyle {\bar {x}}}.[4]

See also

[edit]
Geometricproof without words thatmax (a,b) >root mean square (RMS) orquadratic mean (QM) >arithmetic mean (AM) >geometric mean (GM) >harmonic mean (HM) >min (a,b) of two distinct positive numbersa andb[note 1]

Notes

[edit]
  1. ^If NM =a and PM =b. AM =AM ofa andb, and radiusr = AQ = AG.
    UsingPythagoras' theorem, QM² = AQ² + AM² ∴ QM = √AQ² + AM² =QM.
    Using Pythagoras' theorem, AM² = AG² + GM² ∴ GM = √AM² − AG² =GM.
    Usingsimilar triangles,HM/GM =GM/AM ∴ HM =GM²/AM =HM.

References

[edit]
  1. ^Jacobs, Harold R. (1994).Mathematics: A Human Endeavor (Third ed.).W. H. Freeman. p. 547.ISBN 0-7167-2426-X.
  2. ^Weisstein, Eric W."Arithmetic Mean".mathworld.wolfram.com. Retrieved21 August 2020.
  3. ^Eisenhart, Churchill (24 August 1971)."The Development of the Concept of the Best Mean of a Set of Measurements from Antiquity to the Present Day"(PDF). Presidential Address, 131st Annual Meeting of the American Statistical Association, Colorado State University. pp. 68–69.
  4. ^abMedhi, Jyotiprasad (1992).Statistical Methods: An Introductory Text. New Age International. pp. 53–58.ISBN 9788122404197.
  5. ^Krugman, Paul (4 June 2014) [Fall 1992]."The Rich, the Right, and the Facts: Deconstructing the Income Distribution Debate".The American Prospect.
  6. ^Barry, Roger Graham; Chorley, Richard John (2005).Atmosphere, Weather and Climate (8th ed.). London: Routledge. p. 407.ISBN 978-0-415-27170-7.
  7. ^"Mean | mathematics".Encyclopedia Britannica. Retrieved21 August 2020.
  8. ^Thinkmap Visual Thesaurus (30 June 2010)."The Three M's of Statistics: Mode, Median, Mean June 30, 2010".www.visualthesaurus.com. Retrieved3 December 2018.

Further reading

[edit]
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis (see alsoTemplate:Least squares and regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Portal:
International
National
Retrieved from "https://en.wikipedia.org/w/index.php?title=Arithmetic_mean&oldid=1317358915"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp