Movatterモバイル変換

Support vector machine

From Wikipedia, the free encyclopedia

Set of methods for supervised statistical learning

Machine learning anddata mining
Part of a series on
Paradigms Supervised learning Unsupervised learning Semi-supervised learning Self-supervised learning Reinforcement learning Meta-learning Online learning Batch learning Curriculum learning Rule-based learning Neuro-symbolic AI Neuromorphic engineering Quantum machine learning
Problems Classification Generative modeling Regression Clustering Dimensionality reduction Density estimation Anomaly detection Data cleaning AutoML Association rules Semantic analysis Structured prediction Feature engineering Feature learning Learning to rank Grammar induction Ontology learning Multimodal learning
Supervised learning (classification • regression) Apprenticeship learning Decision trees Ensembles Bagging Boosting Random forest k-NN Linear regression Naive Bayes Artificial neural networks Logistic regression Perceptron Relevance vector machine (RVM) Support vector machine (SVM)
Clustering BIRCH CURE Hierarchical k-means Fuzzy Expectation–maximization (EM) DBSCAN OPTICS Mean shift
Dimensionality reduction Factor analysis CCA ICA LDA NMF PCA PGD t-SNE SDL
Structured prediction Graphical models Bayes net Conditional random field Hidden Markov
Anomaly detection RANSAC k-NN Local outlier factor Isolation forest
Artificial neural network Autoencoder Deep learning Feedforward neural network Recurrent neural network LSTM GRU ESN reservoir computing Boltzmann machine Restricted GAN Diffusion model SOM Convolutional neural network U-Net LeNet AlexNet DeepDream Neural radiance field Transformer Vision Mamba Spiking neural network Memtransistor Electrochemical RAM (ECRAM)
Reinforcement learning Q-learning SARSA Temporal difference (TD) Multi-agent Self-play
Learning with humans Active learning Crowdsourcing Human-in-the-loop RLHF
Model diagnostics Coefficient of determination Confusion matrix Learning curve ROC curve
Mathematical foundations Kernel machines Bias–variance tradeoff Computational learning theory Empirical risk minimization Occam learning PAC learning Statistical learning VC theory Topological deep learning
Journals and conferences ECML PKDD NeurIPS ICML ICLR IJCAI ML JMLR
Related articles Glossary of artificial intelligence List of datasets for machine-learning research List of datasets in computer vision and image processing Outline of machine learning
v t e

Inmachine learning,support vector machines (SVMs, alsosupport vector networks^[1]) aresupervised max-margin models with associated learningalgorithms that analyze data forclassification andregression analysis. Developed atAT&T Bell Laboratories,^[1]^[2] SVMs are one of the most studied models, being based on statistical learning frameworks ofVC theory proposed byVapnik (1982, 1995) andChervonenkis (1974).

In addition to performinglinear classification, SVMs can efficiently perform non-linear classification using thekernel trick, representing the data only through a set of pairwise similarity comparisons between the original data points using a kernel function, which transforms them into coordinates in a higher-dimensionalfeature space. Thus, SVMs use the kernel trick to implicitly map their inputs into high-dimensional feature spaces, where linear classification can be performed.^[3] Being max-margin models, SVMs are resilient to noisy data (e.g., misclassified examples). SVMs can also be used forregression tasks, where the objective becomes $\epsilon$ -sensitive.

The support vector clustering^[4] algorithm, created byHava Siegelmann andVladimir Vapnik, applies the statistics of support vectors, developed in the support vector machines algorithm, to categorize unlabeled data.^{[citation needed]} These data sets requireunsupervised learning approaches, which attempt to find naturalclustering of the data into groups, and then to map new data according to these clusters.

The popularity of SVMs is likely due to their amenability to theoretical analysis, and their flexibility in being applied to a wide variety of tasks, includingstructured prediction problems. It is not clear that SVMs have better predictive performance than other linear models, such aslogistic regression andlinear regression.^[5]

Movatterモバイル変換

Motivation

Applications

History

Linear SVM

Hard-margin

Soft-margin

Nonlinear kernels

Computing the SVM classifier

Primal

Dual

Kernel trick

Modern methods

Sub-gradient descent

Coordinate descent

Empirical risk minimization

Risk minimization

Regularization and stability

SVM and the hinge loss

Target functions

Properties

Parameter selection

Issues

Extensions

Multiclass SVM

Transductive support vector machines

Structured SVM

Regression

Bayesian SVM

Implementation

See also

References

Further reading

External links