Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Shapiro–Wilk test

From Wikipedia, the free encyclopedia
Test of normality in frequentist statistics
Not to be confused with thelikelihood-ratio test, which is sometimes referred to as Wilks test.

TheShapiro–Wilk test is atest of normality. It was published in 1965 bySamuel Sanford Shapiro andMartin Wilk.[1]

Theory

[edit]

The Shapiro–Wilk test tests thenull hypothesis that asamplex1, ...,xn came from anormally distributed population. Thetest statistic is

W=(i=1naix(i))2i=1n(xix¯)2,{\displaystyle W={\frac {{\left(\sum \limits _{i=1}^{n}a_{i}x_{(i)}\right)}^{2}}{\sum \limits _{i=1}^{n}{\left(x_{i}-{\overline {x}}\right)}^{2}}},}

where

The coefficientsai{\displaystyle a_{i}} are given by:[1](a1,,an)=mTV1C,{\displaystyle (a_{1},\dots ,a_{n})={m^{\mathsf {T}}V^{-1} \over C},}whereC is avector norm:[2]C=V1m=(mTV1V1m)1/2{\displaystyle C=\left\|V^{-1}m\right\|={\left(m^{\mathsf {T}}V^{-1}V^{-1}m\right)}^{1/2}}and the vectorm,m=(m1,,mn)T{\displaystyle m=(m_{1},\dots ,m_{n})^{\mathsf {T}}}is made of theexpected values of theorder statistics ofindependent and identically distributed random variables sampled from the standard normal distribution; finally,V{\displaystyle V} is thecovariance matrix of those normal order statistics.[3]

There is no name for the distribution ofW{\displaystyle W}. The cutoff values for the statistics are calculated throughMonte Carlo simulations.[2]

Interpretation

[edit]

Thenull-hypothesis of this test is that the population is normally distributed. If thep value is less than the chosenalpha level, then the null hypothesis is rejected and there is evidence that the data tested are not normally distributed.[4]

Like moststatistical significance tests, if the sample size is sufficiently large this test may detect even trivial departures from the null hypothesis (i.e., although there may be somestatistically significant effect, it may be too small to be of any practical significance); thus, additional investigation of theeffect size is typically advisable, e.g., aQ–Q plot in this case.[5]

Power analysis

[edit]

Monte Carlo simulations have demonstrated in practically relevant settings that Shapiro–Wilk has the bestpower for a givensignificance, followed closely byAnderson–Darling when comparing the Shapiro–Wilk,Kolmogorov–Smirnov, andLilliefors.[6][unreliable source?]

Approximation

[edit]

Royston proposed an alternative method of calculating the coefficients vector by providing an algorithm for calculating values that extended the sample size from 50 to 2,000.[7] This technique is used in several software packages includingGraphPad Prism, Stata,[8][9] SPSS and SAS.[10] Rahman and Govidarajulu extended the sample size further up to 5,000.[11]

See also

[edit]

References

[edit]
  1. ^abShapiro, S. S.;Wilk, M. B. (1965). "An analysis of variance test for normality (complete samples)".Biometrika.52 (3–4):591–611.doi:10.1093/biomet/52.3-4.591.JSTOR 2333709.MR 0205384. p. 593
  2. ^abRichard M. Dudley (2015)."The Shapiro-Wilk and related tests for normality"(PDF). Retrieved2022-06-16.
  3. ^Davis, C. S.; Stephens, M. A. (1978).The covariance matrix of normal order statistics(PDF) (Technical report). Department of Statistics, Stanford University, Stanford, California. Technical Report No. 14. Retrieved2022-06-17.
  4. ^"How do I interpret the Shapiro–Wilk test for normality?".JMP. 2004. RetrievedMarch 24, 2012.
  5. ^Field, Andy (2009).Discovering statistics using SPSS (3rd ed.). Los Angeles [i.e. Thousand Oaks, Calif.]: SAGE Publications. p. 143.ISBN 978-1-84787-906-6.
  6. ^Razali, Nornadiah; Wah, Yap Bee (2011)."Power comparisons of Shapiro–Wilk, Kolmogorov–Smirnov, Lilliefors and Anderson–Darling tests".Journal of Statistical Modeling and Analytics.2 (1):21–33. Retrieved30 March 2017.
  7. ^Royston, Patrick (September 1992). "Approximating the Shapiro–WilkW-test for non-normality".Statistics and Computing.2 (3):117–119.doi:10.1007/BF01891203.S2CID 122446146.
  8. ^Royston, Patrick. "Shapiro–Wilk and Shapiro–Francia Tests".Stata Technical Bulletin, StataCorp LP.1 (3).
  9. ^Shapiro–Wilk and Shapiro–Francia tests for normality
  10. ^Park, Hun Myoung (2002–2008)."Univariate Analysis and Normality Test Using SAS, Stata, and SPSS".[working paper].hdl:2022/19742. Retrieved29 July 2023.
  11. ^Rahman und Govidarajulu (1997). "A modification of the test of Shapiro and Wilk for normality".Journal of Applied Statistics.24 (2):219–236.Bibcode:1997JApSt..24..219R.doi:10.1080/02664769723828.

External links

[edit]
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Shapiro–Wilk_test&oldid=1334569005"
Category:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp