Principal component analysis

From Wikipedia, the free encyclopedia

Method of data analysis

PCA of a multivariate Gaussian distribution centered at (1, 3) with a standard deviation of 3 in roughly the (0.866, 0.5) direction and of 1 in the orthogonal direction. The vectors shown are the eigenvectors of the covariance matrix scaled by the square root of the corresponding eigenvalue, and shifted so their tails are at the mean.

Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing.

The data are linearly transformed onto a new coordinate system such that the directions (principal components) capturing the largest variation in the data can be easily identified.

The principal components of a collection of points in a real coordinate space are a sequence of $p$ unit vectors, where the $i$-th vector is the direction of a line that best fits the data while being orthogonal to the first $i-1$ vectors. Here, a best-fitting line is defined as one that minimizes the average squared perpendicular distance from the points to the line. These directions (i.e., principal components) constitute an orthonormal basis in which the individual dimensions of the data are linearly uncorrelated. Many studies use the first two principal components to plot the data in two dimensions and to visually identify clusters of closely related data points.^[1]
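As a minimal sketch of this definition, the following NumPy code draws samples from a two-dimensional Gaussian with the parameters given in the figure caption (mean (1, 3), standard deviation 3 along roughly (0.866, 0.5) and 1 in the orthogonal direction) and recovers the principal components as eigenvectors of the sample covariance matrix. The helper names `principal_components` and `mean_sq_perp_distance`, the sample size, the random seed and the `n - 1` divisor are illustrative assumptions, not part of the article.

```python
import numpy as np


def principal_components(X):
    """Return (components, variances) for a data matrix X with one observation per row.

    The columns of `components` are unit vectors sorted by decreasing variance.
    """
    B = X - X.mean(axis=0)                 # center each column
    C = (B.T @ B) / (X.shape[0] - 1)       # sample covariance matrix (assumed n - 1 divisor)
    eigvals, eigvecs = np.linalg.eigh(C)   # eigh returns ascending eigenvalues for symmetric C
    order = np.argsort(eigvals)[::-1]      # reorder to descending variance
    return eigvecs[:, order], eigvals[order]


def mean_sq_perp_distance(X, unit_vec):
    """Average squared perpendicular distance from the centered points to a line through the mean."""
    B = X - X.mean(axis=0)
    along = B @ unit_vec                   # coordinate of each point along the line
    return np.mean(np.sum(B**2, axis=1) - along**2)


rng = np.random.default_rng(0)             # arbitrary seed, chosen only for reproducibility
direction = np.array([0.866, 0.5])
direction /= np.linalg.norm(direction)
orthogonal = np.array([-direction[1], direction[0]])

# Independent coordinates with std 3 and 1, rotated into place and shifted to (1, 3).
coords = rng.normal(size=(5000, 2)) * np.array([3.0, 1.0])
X = coords @ np.vstack([direction, orthogonal]) + np.array([1.0, 3.0])

V, variances = principal_components(X)

# The components form an orthonormal basis.
assert np.allclose(V.T @ V, np.eye(2))

# In that basis the coordinates are linearly uncorrelated.
T = (X - X.mean(axis=0)) @ V
assert abs(np.cov(T, rowvar=False)[0, 1]) < 1e-8

# The first component is a best-fitting line: no candidate direction on a
# one-degree grid gives a smaller average squared perpendicular distance.
angles = np.linspace(0.0, np.pi, 181)
candidates = np.column_stack([np.cos(angles), np.sin(angles)])
best = min(mean_sq_perp_distance(X, c) for c in candidates)
assert mean_sq_perp_distance(X, V[:, 0]) <= best + 1e-9
```

With these settings the first column of `V` should lie close to the (0.866, 0.5) direction, up to sign, and `variances` should be near 9 and 1, the squares of the two standard deviations.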

Principal component analysis has applications in many fields such as population genetics, microbiome studies, and atmospheric science.^[2]

Symbol	Meaning	Dimensions	Indices
$\mathbf{X} = [X_{ij}]$	data matrix, consisting of the set of all data vectors, one vector per row	$n \times p$	$i=1\ldots n$, $j=1\ldots p$
$n$	the number of row vectors in the data set	$1 \times 1$	scalar
$p$	the number of elements in each row vector (dimension)	$1 \times 1$	scalar
$L$	the number of dimensions in the dimensionally reduced subspace, $1 \leq L \leq p$	$1 \times 1$	scalar
$\mathbf{u} = [u_{j}]$	vector of empirical means, one mean for each column $j$ of the data matrix	$p \times 1$	$j=1\ldots p$
$\mathbf{s} = [s_{j}]$	vector of empirical standard deviations, one standard deviation for each column $j$ of the data matrix	$p \times 1$	$j=1\ldots p$
$\mathbf{h} = [h_{i}]$	vector of all 1's	$1 \times n$	$i=1\ldots n$
$\mathbf{B} = [B_{ij}]$	deviations from the mean of each column $j$ of the data matrix	$n \times p$	$i=1\ldots n$, $j=1\ldots p$
$\mathbf{Z} = [Z_{ij}]$	z-scores, computed using the mean and standard deviation for each column $j$ of the data matrix	$n \times p$	$i=1\ldots n$, $j=1\ldots p$
$\mathbf{C} = [C_{jj'}]$	covariance matrix	$p \times p$	$j=1\ldots p$, $j'=1\ldots p$
$\mathbf{R} = [R_{jj'}]$	correlation matrix	$p \times p$	$j=1\ldots p$, $j'=1\ldots p$
$\mathbf{V} = [V_{jj'}]$	matrix consisting of the set of all eigenvectors of $\mathbf{C}$, one eigenvector per column	$p \times p$	$j=1\ldots p$, $j'=1\ldots p$
$\mathbf{D} = [D_{jj'}]$	diagonal matrix consisting of the set of all eigenvalues of $\mathbf{C}$ along its principal diagonal, and 0 for all other elements (note: $\mathbf{\Lambda}$ is used above)	$p \times p$	$j=1\ldots p$, $j'=1\ldots p$
$\mathbf{W} = [W_{jl}]$	matrix of basis vectors, one vector per column, where each basis vector is one of the eigenvectors of $\mathbf{C}$, and where the vectors in $\mathbf{W}$ are a subset of those in $\mathbf{V}$	$p \times L$	$j=1\ldots p$, $l=1\ldots L$
$\mathbf{T} = [T_{il}]$	matrix consisting of $n$ row vectors, where each vector is the projection of the corresponding data vector from matrix $\mathbf{X}$ onto the basis vectors contained in the columns of matrix $\mathbf{W}$	$n \times L$	$i=1\ldots n$, $l=1\ldots L$
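
To make the roles of these symbols concrete, here is a hedged NumPy sketch that builds each quantity in the table from a synthetic data matrix. Several choices are assumptions made for illustration and are not fixed by the table: the sizes `n`, `p` and `L`, the random data, the `n - 1` divisor for the covariance and correlation matrices, keeping the `L` eigenvectors with the largest eigenvalues in `W`, and projecting the centered data `B` (rather than the raw `X`) onto `W`.

```python
import numpy as np

rng = np.random.default_rng(1)            # arbitrary seed for reproducibility
n, p, L = 200, 4, 2                       # illustrative sizes with 1 <= L <= p

# X: n x p data matrix, one data vector per row (synthetic, correlated columns).
X = rng.normal(size=(n, p)) @ rng.normal(size=(p, p))

u = X.mean(axis=0)                        # empirical mean of each column j
s = X.std(axis=0, ddof=1)                 # empirical standard deviation of each column j
h = np.ones(n)                            # vector of all 1's
B = X - np.outer(h, u)                    # deviations from the column means, n x p
Z = B / s                                 # z-scores, n x p

C = (B.T @ B) / (n - 1)                   # covariance matrix, p x p (assumed n - 1 divisor)
R = (Z.T @ Z) / (n - 1)                   # correlation matrix, p x p

eigvals, V = np.linalg.eigh(C)            # eigenvectors of C, one per column
order = np.argsort(eigvals)[::-1]         # sort by decreasing eigenvalue
V = V[:, order]
D = np.diag(eigvals[order])               # eigenvalues on the principal diagonal, 0 elsewhere

W = V[:, :L]                              # L leading eigenvectors as basis vectors, p x L
T = B @ W                                 # projected (centered) data, n x L

print(T.shape)                            # (200, 2)
```

In this sketch `C @ V` equals `V @ D` up to floating-point error, and the sample covariance of `T` is the leading `L` by `L` block of `D`, which is the sense in which the new coordinates are uncorrelated.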
