Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Smoothing

From Wikipedia, the free encyclopedia
Fitting an approximating function to data
This article is about a type of statistical technique for handling data. For other uses, seeSmoothing (disambiguation).
Simple exponential smoothing example. Raw data: mean daily temperatures at the Paris-Montsouris weather station (France) from 1960/01/01 to 1960/02/29. Smoothed data with alpha factor = 0.1.

Instatistics andimage processing, tosmooth adata set is to create an approximatingfunction that attempts to capture importantpatterns in the data, while leaving outnoise or other fine-scale structures/rapid phenomena. In smoothing, the data points of a signal are modified so individual points higher than the adjacent points (presumably because of noise) are reduced, and points that are lower than the adjacent points are increased, leading to a smoother signal.

Reducing noise by smoothing may aid in data analysis in two notable ways:

  1. Help uncover more meaningful information from the underlying data, such as trends.[1]
  2. Provide analyses that are both flexible and robust.[2]

Many differentalgorithms are used in smoothing, most commonlybinning,kernels, andlocal weighted regression.[1]

Compared to curve fitting

[edit]

Smoothing may be distinguished from the related and partially overlapping concept ofcurve fitting in the following ways:

  • curve fitting often involves the use of an explicit function form for the result, whereas the immediate results from smoothing are the "smoothed" values with no later use made of a functional form if there is one;
  • the aim of smoothing is to give a general idea of relatively slow changes of value with little attention paid to the close matching of data values, while curve fitting concentrates on achieving as close a match as possible.
  • smoothing methods often have an associated tuning parameter which is used to control the extent of smoothing. Curve fitting will adjust any number of parameters of the function to obtain the 'best' fit.

Linear smoothers

[edit]

In the case that the smoothed values can be written as alinear transformation of the observed values, the smoothing operation is known as alinear smoother; the matrix representing the transformation is known as asmoother matrix orhat matrix.[citation needed]

The operation of applying such a matrix transformation is calledconvolution. Thus the matrix is also called convolution matrix or aconvolution kernel. In the case of simple series of data points (rather than a multi-dimensional image), the convolution kernel is a one-dimensionalvector.

Algorithms

[edit]

One of the most common algorithms is the "moving average", often used to try to capture important trends in repeatedstatistical surveys. Inimage processing andcomputer vision, smoothing ideas are used inscale space representations. The simplest smoothing algorithm is the "rectangular" or "unweighted sliding-average smooth". This method replaces each point in the signal with the average of "m" adjacent points, where "m" is a positive integer called the "smooth width". Usually m is an odd number. Thetriangular smooth is like therectangular smooth except that it implements a weighted smoothing function.[3]

Some specific smoothing and filter types, with their respective uses, pros and cons are:

AlgorithmOverview and usesProsCons
Additive smoothingused to smoothcategorical data.
Butterworth filterSlowerroll-off than aChebyshev Type I/Type II filter or anelliptic filter
  • More linear phase response in thepassband than Chebyshev Type I/Type II and elliptic filters can achieve.
  • Designed to have afrequency response as flat as possible in the passband.
  • requires a higher order to implement a particularstopband specification
Chebyshev filterHas a steeperroll-off and morepassbandripple (type I) orstopband ripple (type II) thanButterworth filters.
  • Minimizes the error between the idealized and the actual filter characteristic over the range of the filter
Digital filterUsed on asampled,discrete-timesignal to reduce or enhance certain aspects of that signal
Elliptic filter
Exponential smoothing
  • Used to reduce irregularities (random fluctuations) in time series data, thus providing a clearer view of the true underlying behaviour of the series.
  • Also, provides an effective means of predicting future values of the time series (forecasting).[4]
Kalman filterEstimates of unknown variables it produces tend to be more accurate than those based on a single measurement alone, when assumptions are met.

Assumes and therefore requires knowledge of how the system generating the data-points advances in time and how the measurements are acquired.

Kernel smoother
  • used to estimate a real valuedfunction as the weighted average of neighboring observed data.
  • most appropriate when the dimension of thepredictor is low (p < 3), for example for data visualization.
The estimated function is smooth, and the level of smoothness is set by a single parameter.
Kolmogorov–Zurbenko filter
  • robust and nearly optimal
  • performs well in a missing data environment, especially in multidimensional time and space where missing data can cause problems arising from spatial sparseness
  • the two parameters each have clear interpretations so that it can be easily adopted by specialists in different areas
  • Software implementations for time series, longitudinal and spatial data have been developed in the popular statistical packageR, which facilitate the use of the KZ filter and its extensions in different areas.
Laplacian smoothingalgorithm to smooth apolygonal mesh.[5][6]
Local regression also known as "loess" or "lowess"A generalization ofmoving average andpolynomial regression.

Generalizes aSavitzky–Golay smoothing filter to non-regular sampling instances.

  • fitting simple models to localized subsets of the data to build up a function that describes the deterministic part of the variation in the data, point by point
  • one of the chief attractions of this method is that the data analyst is not required to specify a global function of any form to fit a model to the data, only to fit segments of the data.
  • increased computation. Because it is so computationally intensive, LOESS would have been practically impossible to use in the era when least squares regression was being developed.
Low-pass filter
Moving average
  • A calculation to analyze data points by creating a series ofaverages of different subsets of the full data set.
  • a smoothing technique used to make the long term trends of a time series clearer.[4]
  • the first element of the moving average is obtained by taking the average of the initial fixed subset of the number series
  • commonly used withtime series data to smooth out short-term fluctuations and highlight longer-term trends or cycles.
  • has been adjusted to allow for seasonal or cyclical components of a time series
Ramer–Douglas–Peucker algorithmdecimates a curve composed of line segments to a similar curve with fewer points.
Savitzky–Golay smoothing filter

Based on the least-squares fitting of polynomials to segments of the data. A specific case ofLocal regression ("loess" or "lowess") when the sampling instances are regular.

Smoothing spline
Stretched grid method
  • anumerical technique for finding approximate solutions of various mathematical and engineering problems that can be related to an elastic grid behavior
  • meteorologists use the stretched grid method for weather prediction
  • engineers use the stretched grid method to design tents and othertensile structures.

See also

[edit]

References

[edit]
  1. ^abIrizarry, Rafael A.Chapter 28 Smoothing | Introduction to Data Science.
  2. ^Simonoff, Jeffrey S. (1998)Smoothing Methods in Statistics, 2nd edition. SpringerISBN 978-0387947167[page needed]
  3. ^O'Haver, T. (January 2012)."Smoothing".terpconnect.umd.edu.
  4. ^abEaston, V. J.; & McColl, J. H. (1997)"Time series", STEPS Statistics Glossary
  5. ^Herrmann, Leonard R. (1976),"Laplacian-isoparametric grid generation scheme",Journal of the Engineering Mechanics Division,102 (5):749–756,doi:10.1061/JMCEA3.0002158.
  6. ^Sorkine, O., Cohen-Or, D., Lipman, Y.,Alexa, M., Rössl, C., Seidel, H.-P. (2004)."Laplacian Surface Editing".Proceedings of the 2004 Eurographics/ACM SIGGRAPH Symposium on Geometry Processing. SGP '04. Nice, France: ACM. pp. 175–184.doi:10.1145/1057432.1057456.ISBN 3-905673-13-4.S2CID 1980978.{{cite book}}: CS1 maint: multiple names: authors list (link)

Further reading

[edit]
  • Hastie, T.J. and Tibshirani, R.J. (1990),Generalized Additive Models, New York: Chapman and Hall.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Smoothing&oldid=1325340634"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp