Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Variogram

From Wikipedia, the free encyclopedia
Spatial statistics function

Inspatial statistics the theoreticalvariogram, denoted2γ(s1,s2){\displaystyle 2\gamma (\mathbf {s} _{1},\mathbf {s} _{2})}, is a function describing the degree ofspatial dependence of a spatialrandom field orstochastic processZ(s){\displaystyle Z(\mathbf {s} )}. Thesemivariogramγ(s1,s2){\displaystyle \gamma (\mathbf {s} _{1},\mathbf {s} _{2})} is half the variogram.

Schematisation of a variogram. The points represent the measured data points (observed) and the curve represents the model function used (empirical). Range stands for the range sought, sill for the plateau value reached at maximum range, nugget for the nugget effect.

For example, ingold mining, a variogram will give a measure of how much two samples taken from the mining area will vary in gold percentage depending on the distance between those samples. Samples taken far apart will vary more than samples taken close to each other.

Definition

[edit]
"Semivariance" redirects here. For the measure of downside risk, seeVariance § Semivariance.

Thesemivariogramγ(h){\displaystyle \gamma (h)} was first defined by Matheron (1963) as half the average squared difference between a function and a translated copy of the function separated at distanceh{\displaystyle h}.[1][2] Formally

γ(h)=12V[f(M+h)f(M)]2dM,{\displaystyle \gamma (h)={\frac {1}{2}}\iiint _{V}\left[f(M+h)-f(M)\right]^{2}dM,}

whereM{\displaystyle M} is a point in the geometric fieldV{\displaystyle V}, andf(M){\displaystyle f(M)} is the value at that point. The triple integral is over 3 dimensions.h{\displaystyle h} is the separation distance (e.g., in meters or km) of interest. For example, the valuef(M){\displaystyle f(M)} could represent the iron content in soil, at some locationM{\displaystyle M} (withgeographic coordinates of latitude, longitude, and elevation) over some regionV{\displaystyle V} with element of volumedV{\displaystyle dV}.To obtain the semivariogram for a givenγ(h){\displaystyle \gamma (h)}, all pairs of points at that exact distance would be sampled. In practice it is impossible to sample everywhere, so theempirical variogram is used instead.

The variogram is twice the semivariogram and can be defined, differently, as thevariance of the difference between field values at two locations (s1{\displaystyle \mathbf {s} _{1}} ands2{\displaystyle \mathbf {s} _{2}}, note change of notation fromM{\displaystyle M} tos{\displaystyle \mathbf {s} } andf{\displaystyle f} toZ{\displaystyle Z}) across realizations of the field (Cressie 1993):

2γ(s1,s2)=var(Z(s1)Z(s2))=E[((Z(s1)Z(s2))E[Z(s1)Z(s2)])2].{\displaystyle 2\gamma (\mathbf {s} _{1},\mathbf {s} _{2})={\text{var}}\left(Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2})\right)=E\left[((Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2}))-E[Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2})])^{2}\right].}

If the spatial random field has constant meanμ{\displaystyle \mu }, this is equivalent to the expectation for the squared increment of the values between locationss1{\displaystyle \mathbf {s} _{1}} ands2{\displaystyle s_{2}} (Wackernagel 2003) (wheres1{\displaystyle \mathbf {s} _{1}} ands2{\displaystyle \mathbf {s} _{2}} are points in space and possibly time):

2γ(s1,s2)=E[(Z(s1)Z(s2))2].{\displaystyle 2\gamma (\mathbf {s} _{1},\mathbf {s} _{2})=E\left[\left(Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2})\right)^{2}\right].}

In the case of astationary process, the variogram and semivariogram can be represented as a functionγs(h)=γ(0,0+h){\displaystyle \gamma _{s}(h)=\gamma (0,0+h)} of the differenceh=s2s1{\displaystyle h=\mathbf {s} _{2}-\mathbf {s} _{1}} between locations only, by the following relation (Cressie 1993):

γ(s1,s2)=γs(s2s1).{\displaystyle \gamma (\mathbf {s} _{1},\mathbf {s} _{2})=\gamma _{s}(\mathbf {s} _{2}-\mathbf {s} _{1}).}

If the process is furthermoreisotropic, then the variogram and semivariogram can be represented by a functionγi(h):=γs(he1){\displaystyle \gamma _{i}(h):=\gamma _{s}(he_{1})} of the distanceh=s2s1{\displaystyle h=\|\mathbf {s} _{2}-\mathbf {s} _{1}\|} only (Cressie 1993):

γ(s1,s2)=γi(h).{\displaystyle \gamma (\mathbf {s} _{1},\mathbf {s} _{2})=\gamma _{i}(h).}

The indexesi{\displaystyle i} ors{\displaystyle s} are typically not written. The terms are used for all three forms of the function. Moreover, the term "variogram" is sometimes used to denote the semivariogram, and the symbolγ{\displaystyle \gamma } is sometimes used for the variogram, which brings some confusion.[3]

Properties

[edit]

According to (Cressie 1993, Chiles and Delfiner 1999, Wackernagel 2003) the theoretical variogram has the following properties:

i=1Nj=1Nwiγ(si,sj)wj0{\displaystyle \sum _{i=1}^{N}\sum _{j=1}^{N}w_{i}\gamma (\mathbf {s} _{i},\mathbf {s} _{j})w_{j}\leq 0}
which corresponds to the fact that the variancevar(X){\displaystyle var(X)} ofX=i=1NwiZ(xi){\displaystyle X=\sum _{i=1}^{N}w_{i}Z(x_{i})} is given by the negative of this double sum and must be nonnegative.[disputeddiscuss]

Parameters

[edit]

In summary, the following parameters are often used to describe variograms:

  • nuggetn{\displaystyle n}: The height of the jump of the semivariogram at the discontinuity at the origin.
  • sills{\displaystyle s}: Limit of the variogram tending to infinity lag distances.
  • ranger{\displaystyle r}: The distance in which the difference of the variogram from the sill becomes negligible. In models with a fixed sill, it is the distance at which this is first reached; for models with an asymptotic sill, it is conventionally taken to be the distance when the semivariance first reaches 95% of the sill.

Empirical variogram

[edit]

Generally, anempirical variogram is needed for measured data, because sample informationZ{\displaystyle Z} is not available for every location. The sample information for example could be concentration of iron in soil samples, or pixel intensity on a camera. Each piece of sample information has coordinatess=(x,y){\displaystyle \mathbf {s} =(x,y)} for a 2D sample space wherex{\displaystyle x} andy{\displaystyle y} are geographical coordinates. In the case of the iron in soil, the sample space could be 3 dimensional. If there is temporal variability as well (e.g., phosphorus content in a lake) thens{\displaystyle \mathbf {s} } could be a 4 dimensional vector(x,y,z,t){\displaystyle (x,y,z,t)}. For the case where dimensions have different units (e.g., distance and time) then a scaling factorB{\displaystyle B} can be applied to each to obtain a modified Euclidean distance.[4]

Sample observations are denotedZ(si)=zi{\displaystyle Z(\mathbf {s} _{i})=z_{i}}. Samples may be taken atk{\displaystyle k} total different locations. This would provide as set of samplesz1,,zk{\displaystyle z_{1},\ldots ,z_{k}} at locationss1,,sk{\displaystyle \mathbf {s} _{1},\ldots ,\mathbf {s} _{k}}. Generally, plots show the semivariogram values as a function of sample point separationh{\displaystyle h}. In the case of empirical semivariogram, separation distance binsh±δ{\displaystyle h\pm \delta } are used rather than exact distances, and usually isotropic conditions are assumed (i.e., thatγ{\displaystyle \gamma } is only a function ofh{\displaystyle h} and does not depend on other variables such as center position). Then, the empirical semivariogramγ^(h±δ){\displaystyle {\hat {\gamma }}(h\pm \delta )} can be calculated for each bin:

γ^(h±δ):=12|N(h±δ)|(i,j)N(h±δ)|zizj|2{\displaystyle {\hat {\gamma }}(h\pm \delta ):={\frac {1}{2|N(h\pm \delta )|}}\sum _{(i,j)\in N(h\pm \delta )}|z_{i}-z_{j}|^{2}}

Or in other words, each pair of points separated byh{\displaystyle h} (plus or minus some bin width tolerance rangeδ{\displaystyle \delta }) are found. These form the set of pointsN(h±δ){(si,sj):|si,sj|=h±δ;i,j=1,,N}{\displaystyle N(h\pm \delta )\equiv \{(\mathbf {s} _{i},\mathbf {s} _{j}):|\mathbf {s} _{i},\mathbf {s} _{j}|=h\pm \delta ;i,j=1,\ldots ,N\}}. The number of these points in this bin is|N(h±δ)|{\displaystyle |N(h\pm \delta )|}. Then for each pair of pointsi,j{\displaystyle i,j}, the square of the difference in the observation (e.g., soil sample content or pixel intensity) is found (|zizj|2{\displaystyle |z_{i}-z_{j}|^{2}}). These squared differences are added together and normalized by the natural number|N(h±δ)|{\displaystyle |N(h\pm \delta )|}. By definition the result is divided by 2 for the semivariogram at this separation.

For computational speed, only the unique pairs of points are needed. For example, for 2 observations pairs [(za,zb),(zc,zd){\displaystyle (z_{a},z_{b}),(z_{c},z_{d})}] taken from locations with separationh±δ{\displaystyle h\pm \delta } only [(za,zb),(zc,zd){\displaystyle (z_{a},z_{b}),(z_{c},z_{d})}] need to be considered, as the pairs [(zb,za),(zd,zc){\displaystyle (z_{b},z_{a}),(z_{d},z_{c})}] do not provide any additional information.

Variogram models

[edit]

The empirical variogram cannot be computed at every lag distanceh{\displaystyle h} and due to variation in the estimation it is not ensured that it is a valid variogram, as defined above. However somegeostatistical methods such askriging need valid semivariograms. In applied geostatistics the empirical variograms are thus often approximated by model function ensuring validity (Chiles&Delfiner 1999). Some important models are (Chiles&Delfiner 1999, Cressie 1993):

The parametera{\displaystyle a} has different values in different references, due to the ambiguity in the definition of the range. E.g.a=1/3{\displaystyle a=1/3} is the value used in (Chiles&Delfiner 1999). The1A(h){\displaystyle 1_{A}(h)} function is 1 ifhA{\displaystyle h\in A} and 0 otherwise.

Discussion

[edit]
This sectiondoes notcite anysources. Please helpimprove this section byadding citations to reliable sources. Unsourced material may be challenged andremoved.(May 2015) (Learn how and when to remove this message)

Three functions are used ingeostatistics for describing the spatial or the temporal correlation of observations: these are thecorrelogram, thecovariance, and thesemivariogram. The last is also more simply calledvariogram.

The variogram is the key function in geostatistics as it will be used to fit a model of the temporal/spatial correlation of the observed phenomenon. One is thus making a distinction between theexperimental variogram that is a visualization of a possible spatial/temporal correlation and thevariogram model that is further used to define the weights of thekriging function. Note that the experimental variogram is an empirical estimate of thecovariance of aGaussian process. As such, it may not bepositive definite and hence not directly usable in kriging, without constraints or further processing. This explains why only a limited number of variogram models are used: most commonly, the linear, the spherical, the Gaussian, and the exponential models.

Applications

[edit]

Theempirical variogram is used ingeostatistics as a first estimate of the variogram model needed for spatial interpolation bykriging.

  • Empirical variograms for the spatiotemporal variability of column-averagedcarbon dioxide was used to determine coincidence criteria for satellite and ground-based measurements.[4]
  • Empirical variograms were calculated for the density of a heterogeneous material (Gilsocarbon).[5]
  • Empirical variograms are calculated from observations ofstrong ground motion fromearthquakes.[6] These models are used forseismic risk and loss assessments of spatially-distributed infrastructure.[7]

Related concepts

[edit]

The squared term in the variogram, for instance(Z(s1)Z(s2))2{\displaystyle (Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2}))^{2}}, can be replaced with different powers: Amadogram is defined with theabsolute difference,|Z(s1)Z(s2)|{\displaystyle |Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2})|}, and arodogram is defined with thesquare root of the absolute difference,|Z(s1)Z(s2)|0.5{\displaystyle |Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2})|^{0.5}}.Estimators based on these lower powers are said to be moreresistant tooutliers. They can be generalized as a "variogram of orderα",

2γ(s1,s2)=E[|Z(s1)Z(s2)|α]{\displaystyle 2\gamma (\mathbf {s} _{1},\mathbf {s} _{2})=E\left[\left|Z(\mathbf {s} _{1})-Z(\mathbf {s} _{2})\right|^{\alpha }\right]},

in which a variogram is of order 2, a madogram is a variogram of order 1, and a rodogram is a variogram of order 0.5.[8]

When a variogram is used to describe the correlation of different variables it is calledcross-variogram. Cross-variograms are used inco-kriging.Should the variable be binary or represent classes of values, one is then talking aboutindicator variograms. Indicator variograms are used inindicator kriging.

References

[edit]
  1. ^Matheron, Georges (1963). "Principles of geostatistics".Economic Geology.58 (8):1246–1266.doi:10.2113/gsecongeo.58.8.1246.ISSN 1554-0774.
  2. ^Ford, David."The Empirical Variogram"(PDF).faculty.washington.edu/edford. Retrieved31 October 2017.
  3. ^Bachmaier, Martin; Backes, Matthias (2008-02-24). "Variogram or semivariogram? Understanding the variances in a variogram".Precision Agriculture.9 (3). Springer Science and Business Media LLC:173–175.doi:10.1007/s11119-008-9056-2.ISSN 1385-2256.
  4. ^abNguyen, H.; Osterman, G.; Wunch, D.; O'Dell, C.; Mandrake, L.; Wennberg, P.; Fisher, B.; Castano, R. (2014)."A method for colocating satelliteXCO2 data to ground-based data and its application to ACOS-GOSAT and TCCON".Atmospheric Measurement Techniques.7 (8):2631–2644.Bibcode:2014AMT.....7.2631N.doi:10.5194/amt-7-2631-2014.ISSN 1867-8548.
  5. ^Arregui Mena, J.D.; et al. (2018)."Characterisation of the spatial variability of material properties of Gilsocarbon and NBG-18 using random fields".Journal of Nuclear Materials.511:91–108.Bibcode:2018JNuM..511...91A.doi:10.1016/j.jnucmat.2018.09.008.
  6. ^Schiappapietra, Erika; Douglas, John (April 2020)."Modelling the spatial correlation of earthquake ground motion: Insights from the literature, data from the 2016–2017 Central Italy earthquake sequence and ground-motion simulations".Earth-Science Reviews.203: 103139.Bibcode:2020ESRv..20303139S.doi:10.1016/j.earscirev.2020.103139.
  7. ^Sokolov, Vladimir; Wenzel, Friedemann (2011-07-25). "Influence of spatial correlation of strong ground motion on uncertainty in earthquake loss estimation".Earthquake Engineering & Structural Dynamics.40 (9):993–1009.doi:10.1002/eqe.1074.
  8. ^Olea, Ricardo A. (1991).Geostatistical Glossary and Multilingual Dictionary. Oxford University Press. pp. 47, 67, 81.ISBN 9780195066890.

Further reading

[edit]

External links

[edit]
Wikimedia Commons has media related toVariogram.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Variogram&oldid=1254040134"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp