Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Nonparametric statistics

From Wikipedia, the free encyclopedia
Type of statistical analysis

Nonparametric statistics is a type of statistical analysis that makes minimal assumptions about the underlyingdistribution of the data being studied. Often these models are infinite-dimensional, rather than finite dimensional, as inparametric statistics.[1] Nonparametric statistics can be used fordescriptive statistics orstatistical inference. Nonparametric tests are often used when the assumptions of parametric tests are evidently violated.[2]

Definitions

[edit]

The term "nonparametric statistics" has been defined imprecisely in the following two ways, among others:

The first meaning ofnonparametric involves techniques that do not rely on data belonging to any particular parametric family of probability distributions. These include, among others:

  • Methods which aredistribution-free, which do not rely on assumptions that the data are drawn from a given parametric family ofprobability distributions.
  • Statistics defined to be a function on a sample, without dependency on aparameter.

An example isorder statistics, which are based onordinal ranking of observations.

The discussion following is taken fromKendall's Advanced Theory of Statistics.[3]

Statistical hypotheses concern the behavior of observable random variables.... For example, the hypothesis (a) that a normal distribution has a specified mean and variance is statistical; so is the hypothesis (b) that it has a given mean but unspecified variance; so is the hypothesis (c) that a distribution is of normal form with both mean and variance unspecified; finally, so is the hypothesis (d) that two unspecified continuous distributions are identical.

It will have been noticed that in the examples (a) and (b) the distribution underlying the observations was taken to be of a certain form (the normal) and the hypothesis was concerned entirely with the value of one or both of its parameters. Such a hypothesis, for obvious reasons, is calledparametric.

Hypothesis (c) was of a different nature, as no parameter values are specified in the statement of the hypothesis; we might reasonably call such a hypothesisnon-parametric. Hypothesis (d) is also non-parametric but, in addition, it does not even specify the underlying form of the distribution and may now be reasonably termeddistribution-free. Notwithstanding these distinctions, the statistical literature now commonly applies the label "non-parametric" to test procedures that we have just termed "distribution-free", thereby losing a useful classification.

The second meaning ofnon-parametric involves techniques that do not assume that thestructure of a model is fixed. Typically, the model grows in size to accommodate the complexity of the data. In these techniques, individual variablesare typically assumed to belong to parametric distributions, and assumptions about the types of associations among variables are also made. These techniques include, among others:

  • non-parametric regression, which is modeling whereby the structure of the relationship between variables is treated non-parametrically, but where nevertheless there may be parametric assumptions about the distribution of model residuals.
  • non-parametric hierarchical Bayesian models, such as models based on theDirichlet process, which allow the number oflatent variables to grow as necessary to fit the data, but where individual variables still follow parametric distributions and even the process controlling the rate of growth of latent variables follows a parametric distribution.

Applications and purpose

[edit]

Non-parametric methods are widely used for studying populations that have a ranked order (such as movie reviews receiving one to five "stars"). The use of non-parametric methods may be necessary when data have aranking but no clearnumerical interpretation, such as when assessingpreferences. In terms oflevels of measurement, non-parametric methods result inordinal data.

As non-parametric methods make fewer assumptions, their applicability is much more general than the corresponding parametric methods. In particular, they may be applied in situations where less is known about the application in question. Also, due to the reliance on fewer assumptions, non-parametric methods are morerobust.

Non-parametric methods are sometimes considered simpler to use and more robust than parametric methods, even when the assumptions of parametric methods are justified. This is due to their more general nature, which may make them less susceptible to misuse and misunderstanding. Non-parametric methods can be considered a conservative choice, as they will work even when their assumptions are not met, whereas parametric methods can produce misleading results when their assumptions are violated.

The wider applicability and increasedrobustness of non-parametric tests comes at a cost: in cases where a parametric test's assumptions are met, non-parametric tests have lessstatistical power. In other words, a larger sample size can be required to draw conclusions with the same degree of confidence.

Non-parametric models

[edit]

Non-parametric models differ fromparametric models in that the model structure is not specifieda priori but is instead determined from data. The termnon-parametric is not meant to imply that such models completely lack parameters but that the number and nature of the parameters are flexible and not fixed in advance.

Methods

[edit]

Non-parametric (ordistribution-free)inferential statistical methods are mathematical procedures for statistical hypothesis testing which, unlikeparametric statistics, make no assumptions about theprobability distributions of the variables being assessed. The most frequently used tests include

History

[edit]

Early nonparametric statistics include themedian (13th century or earlier, use in estimation byEdward Wright, 1599; seeMedian § History) and thesign test byJohn Arbuthnot (1710) in analyzing thehuman sex ratio at birth (seeSign test § History).[5][6]

See also

[edit]

Notes

[edit]
  1. ^"All of Nonparametric Statistics".Springer Texts in Statistics. 2006.doi:10.1007/0-387-30623-4.ISBN 978-0-387-25145-5.
  2. ^Pearce, J; Derrick, B (2019)."Preliminary testing: The devil of statistics?".Reinvention: An International Journal of Undergraduate Research.12 (2).doi:10.31273/reinvention.v12i2.339.
  3. ^Stuart A., Ord J.K, Arnold S. (1999),Kendall's Advanced Theory of Statistics: Volume 2A—Classical Inference and the Linear Model, sixth edition, §20.2–20.3 (Arnold).
  4. ^Adikaram, K. K. L. B.; Hussein, M. A.; Effenberger, M.; Becker, T. (16 November 2015)."Universal Linear Fit Identification: A Method Independent of Data, Outliers and Noise Distribution Model and Free of Missing or Removed Data Imputation".PLOS ONE.10 (11) e0141486.Bibcode:2015PLoSO..1041486A.doi:10.1371/journal.pone.0141486.ISSN 1932-6203.PMC 4646355.PMID 26571035.
  5. ^Conover, W.J. (1999), "Chapter 3.4: The Sign Test",Practical Nonparametric Statistics (Third ed.), Wiley, pp. 157–176,ISBN 0-471-16068-7
  6. ^Sprent, P. (1989),Applied Nonparametric Statistical Methods (Second ed.), Chapman & Hall,ISBN 0-412-44980-3

General references

[edit]
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Nonparametric_statistics&oldid=1314053986"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp