Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Fraction of variance unexplained

From Wikipedia, the free encyclopedia
For broader coverage of this topic, seeExplained variation.
Statistical noise
icon
This articlerelies largely or entirely on asingle source. Relevant discussion may be found on thetalk page. Please helpimprove this article byintroducing citations to additional sources.
Find sources: "Fraction of variance unexplained" – news ·newspapers ·books ·scholar ·JSTOR
(June 2020)

Instatistics, thefraction of variance unexplained (FVU) in the context of aregression task is the fraction of variance of theregressand (dependent variable)Y which cannot be explained, i.e., which is not correctly predicted, by theexplanatory variablesX.

Formal definition

[edit]

Suppose we are given a regression functionf{\displaystyle f} yielding for eachyi{\displaystyle y_{i}} an estimatey^i=f(xi){\displaystyle {\widehat {y}}_{i}=f(x_{i})} wherexi{\displaystyle x_{i}} is the vector of theith observations on all the explanatory variables.[1]: 181  We define the fraction of variance unexplained (FVU) as:

FVU=VARerrVARtot=SSerr/NSStot/N=SSerrSStot(=1SSregSStot, only true in some cases such as linear regression)=1R2{\displaystyle {\begin{aligned}{\text{FVU}}&={{\text{VAR}}_{\text{err}} \over {\text{VAR}}_{\text{tot}}}={{\text{SS}}_{\text{err}}/N \over {\text{SS}}_{\text{tot}}/N}={{\text{SS}}_{\text{err}} \over {\text{SS}}_{\text{tot}}}\left(=1-{{\text{SS}}_{\text{reg}} \over {\text{SS}}_{\text{tot}}},{\text{ only true in some cases such as linear regression}}\right)\\[6pt]&=1-R^{2}\end{aligned}}}

whereR2 is thecoefficient of determination andVARerr andVARtot are the variance of the residuals and the sample variance of the dependent variable.SSerr (the sum of squared predictions errors, equivalently theresidual sum of squares),SStot (thetotal sum of squares), andSSreg (the sum of squares of the regression, equivalently theexplained sum of squares) are given by

SSerr=i=1N(yiy^i)2SStot=i=1N(yiy¯)2SSreg=i=1N(y^iy¯)2 andy¯=1Ni=1Nyi.{\displaystyle {\begin{aligned}{\text{SS}}_{\text{err}}&=\sum _{i=1}^{N}\;(y_{i}-{\widehat {y}}_{i})^{2}\\{\text{SS}}_{\text{tot}}&=\sum _{i=1}^{N}\;(y_{i}-{\bar {y}})^{2}\\{\text{SS}}_{\text{reg}}&=\sum _{i=1}^{N}\;({\widehat {y}}_{i}-{\bar {y}})^{2}{\text{ and}}\\{\bar {y}}&={\frac {1}{N}}\sum _{i=1}^{N}\;y_{i}.\end{aligned}}}

Alternatively, the fraction of variance unexplained can be defined as follows:

FVU=MSE(f)var[Y]{\displaystyle {\text{FVU}}={\frac {\operatorname {MSE} (f)}{\operatorname {var} [Y]}}}

where MSE(f) is themean squared error of the regression function ƒ.

Explanation

[edit]

It is useful to consider the second definition to understand FVU. When trying to predictY, the most naive regression function that we can think of is the constant function predicting the mean ofY, i.e.,f(xi)=y¯{\displaystyle f(x_{i})={\bar {y}}}. It follows that the MSE of this function equals the variance ofY; that is,SSerr =SStot, andSSreg = 0. In this case, no variation inY can be accounted for, and the FVU then has its maximum value of 1.

More generally, the FVU will be 1 if the explanatory variablesX tell us nothing aboutY in the sense that the predicted values ofY do notcovary withY. But as prediction gets better and the MSE can be reduced, the FVU goes down. In the case of perfect prediction wherey^i=yi{\displaystyle {\hat {y}}_{i}=y_{i}} for alli, the MSE is 0,SSerr = 0,SSreg =SStot, and the FVU is 0.

See also

[edit]

References

[edit]
  1. ^Achen, C. H. (1990). "'What Does "Explained Variance" Explain?: Reply".Political Analysis.2 (1):173–184.doi:10.1093/pan/2.1.173.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Fraction_of_variance_unexplained&oldid=1221668440"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp