
Feature (machine learning)

From Wikipedia, the free encyclopedia
Not to be confused withFeature (computer vision).
In machine learning and pattern recognition, a feature is an individual measurable property or characteristic of a data set.[1] Choosing informative, discriminating, and independent features is crucial to producing effective algorithms for pattern recognition, classification, and regression tasks. Features are usually numeric, but other types such as strings and graphs are used in syntactic pattern recognition, after some pre-processing step such as one-hot encoding. The concept of "features" is related to that of explanatory variables used in statistical techniques such as linear regression.

Feature types


In feature engineering, two types of features are commonly used: numerical and categorical.

Numerical features are continuous values that can be measured on a scale. Examples of numerical features include age, height, weight, and income. Numerical features can be used in machine learning algorithms directly.

Categorical features are discrete values that can be grouped into categories. Examples of categorical features include gender, color, and zip code. Categorical features typically need to be converted to numerical features before they can be used in machine learning algorithms. This can be done using a variety of techniques, such as one-hot encoding, label encoding, and ordinal encoding.
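As an illustration of the conversion step described above, a minimal sketch of one-hot encoding in plain Python (real projects often use library routines such as pandas' `get_dummies` or scikit-learn's `OneHotEncoder`; the color values here are made up):

```python
def one_hot_encode(values):
    """Map each categorical value to a binary indicator vector."""
    categories = sorted(set(values))                 # fixed category order
    index = {cat: i for i, cat in enumerate(categories)}
    encoded = []
    for v in values:
        vec = [0] * len(categories)
        vec[index[v]] = 1                            # 1 in the slot for this value's category
        encoded.append(vec)
    return categories, encoded

colors = ["red", "green", "blue", "green"]
cats, vectors = one_hot_encode(colors)
# cats == ['blue', 'green', 'red']; vectors[0] == [0, 0, 1]
```

Each categorical value becomes a numeric vector with a single 1, which downstream algorithms can treat like any other numerical feature.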

The type of feature that is used in feature engineering depends on the specific machine learning algorithm that is being used. Some machine learning algorithms, such as decision trees, can handle both numerical and categorical features. Other machine learning algorithms, such as linear regression, can only handle numerical features.

Classification


A numeric feature can be conveniently described by a feature vector. One way to achieve binary classification is using a linear predictor function (related to the perceptron) with a feature vector as input. The method consists of calculating the scalar product between the feature vector and a vector of weights, qualifying those observations whose result exceeds a threshold.
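The scalar-product-and-threshold method can be sketched as follows; the weights and threshold are illustrative stand-ins for values that would be learned, e.g. by a perceptron:

```python
def predict(weights, x, threshold=0.0):
    """Classify x as 1 if the scalar product w·x exceeds the threshold, else 0."""
    score = sum(w * xi for w, xi in zip(weights, x))  # scalar product
    return 1 if score > threshold else 0

weights = [0.4, -0.2, 0.1]                  # assumed learned weights
label = predict(weights, [2.0, 1.0, 3.0])   # score = 0.9, above threshold
```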

Algorithms for classification from a feature vector include nearest neighbor classification, neural networks, and statistical techniques such as Bayesian approaches.

Examples

See also: Feature (computer vision)

In character recognition, features may include histograms counting the number of black pixels along horizontal and vertical directions, the number of internal holes, stroke detection, and many others.
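For illustration, the horizontal and vertical black-pixel histograms mentioned above can be computed for a tiny binary image (the 3×3 glyph here is made up, with 1 denoting a black pixel):

```python
def pixel_histograms(image):
    """Count black pixels (1s) per row and per column of a binary image."""
    row_counts = [sum(row) for row in image]        # horizontal histogram
    col_counts = [sum(col) for col in zip(*image)]  # vertical histogram
    return row_counts, col_counts

glyph = [
    [0, 1, 0],
    [1, 1, 1],
    [0, 1, 0],
]
rows, cols = pixel_histograms(glyph)
# rows == [1, 3, 1] and cols == [1, 3, 1]
```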

In speech recognition, features for recognizing phonemes can include noise ratios, length of sounds, relative power, filter matches, and many others.

In spam detection algorithms, features may include the presence or absence of certain email headers, the email structure, the language, the frequency of specific terms, and the grammatical correctness of the text.
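One of these feature types, the frequency of specific terms, can be sketched as follows; the vocabulary and example text are illustrative, not from any real spam filter:

```python
import re

def term_frequencies(text, vocabulary):
    """Relative frequency of each vocabulary term in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    return [words.count(term) / max(len(words), 1) for term in vocabulary]

vocab = ["free", "winner", "meeting"]
features = term_frequencies("Free prize for the lucky winner, totally free", vocab)
# features == [0.25, 0.125, 0.0]
```

Each email is thereby mapped to a fixed-length numeric vector that a classifier can consume.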

In computer vision, there are a large number of possible features, such as edges and objects.

Feature vectors

See also: Word embedding
"Feature space" redirects here. For feature spaces in kernel machines, see Kernel method.

In pattern recognition and machine learning, a feature vector is an n-dimensional vector of numerical features that represent some object. Many algorithms in machine learning require a numerical representation of objects, since such representations facilitate processing and statistical analysis. When representing images, the feature values might correspond to the pixels of an image, while when representing texts the features might be the frequencies of occurrence of textual terms. Feature vectors are equivalent to the vectors of explanatory variables used in statistical procedures such as linear regression. Feature vectors are often combined with weights using a dot product in order to construct a linear predictor function that is used to determine a score for making a prediction.

The vector space associated with these vectors is often called the feature space. In order to reduce the dimensionality of the feature space, a number of dimensionality reduction techniques can be employed.
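One common dimensionality reduction technique, principal component analysis, can be sketched with NumPy; the data matrix here is made up, with each row a feature vector:

```python
import numpy as np

def pca_project(X, k):
    """Project rows of X onto their top-k principal components."""
    Xc = X - X.mean(axis=0)                          # center the feature space
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T                             # coordinates in the reduced space

X = np.array([[2.0, 0.1], [4.0, 0.2], [6.0, 0.3], [8.0, 0.4]])
Z = pca_project(X, 1)
# Z has shape (4, 1): the 2-D feature space reduced to one dimension
```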

Higher-level features can be obtained from already available features and added to the feature vector; for example, for the study of diseases the feature 'Age' is useful and is defined as Age = 'Year of death' minus 'Year of birth'. This process is referred to as feature construction.[2][3] Feature construction is the application of a set of constructive operators to a set of existing features, resulting in the construction of new features. Examples of such constructive operators include checking for the equality conditions {=, ≠}, the arithmetic operators {+, −, ×, /}, the array operators {max(S), min(S), average(S)}, as well as other more sophisticated operators, for example count(S, C),[4] which counts the number of features in the feature vector S satisfying some condition C, or, for example, distances to other recognition classes generalized by some accepting device. Feature construction has long been considered a powerful tool for increasing both accuracy and understanding of structure, particularly in high-dimensional problems.[5] Applications include studies of disease and emotion recognition from speech.[6]
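The constructive operators above can be sketched on the article's own Age example; the record field names are illustrative:

```python
def construct_features(record):
    """Derive new features from existing ones with simple constructive operators."""
    derived = dict(record)
    # arithmetic operator: Age = 'Year of death' - 'Year of birth'
    derived["age"] = record["year_of_death"] - record["year_of_birth"]
    vals = [record["year_of_birth"], record["year_of_death"]]
    derived["max_year"] = max(vals)                       # array operator max(S)
    # count(S, C): number of features in S satisfying a condition C
    derived["count_after_1900"] = sum(1 for v in vals if v > 1900)
    return derived

r = construct_features({"year_of_birth": 1912, "year_of_death": 1954})
# r["age"] == 42
```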

Selection and extraction

Main articles: Feature selection and Feature extraction

The initial set of raw features can be redundant and large enough that estimation and optimization is made difficult or ineffective. Therefore, a preliminary step in many applications of machine learning and pattern recognition consists of selecting a subset of features, or constructing a new and reduced set of features to facilitate learning, and to improve generalization and interpretability.[7]

Extracting or selecting features is a combination of art and science; developing systems to do so is known as feature engineering. It requires experimenting with multiple possibilities and combining automated techniques with the intuition and knowledge of the domain expert. Automating this process is feature learning, where a machine not only uses features for learning, but learns the features itself.
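One simple filter-style selection heuristic, dropping low-variance features (akin to scikit-learn's `VarianceThreshold`), can be sketched as follows; the threshold and data are illustrative:

```python
def select_by_variance(rows, threshold=0.01):
    """Keep only the feature columns whose variance exceeds the threshold."""
    n = len(rows)
    keep = []
    for j in range(len(rows[0])):
        col = [row[j] for row in rows]
        mean = sum(col) / n
        var = sum((v - mean) ** 2 for v in col) / n
        if var > threshold:
            keep.append(j)
    return keep, [[row[j] for j in keep] for row in rows]

data = [[1.0, 5.0], [1.0, 7.0], [1.0, 6.0]]   # first feature is constant
kept, reduced = select_by_variance(data)
# kept == [1]: the constant, uninformative feature is removed
```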

References

  1. ^ Bishop, Christopher (2006). Pattern Recognition and Machine Learning. Berlin: Springer. ISBN 0-387-31073-8.
  2. ^ Liu, H.; Motoda, H. (1998). Feature Selection for Knowledge Discovery and Data Mining. Norwell, MA: Kluwer Academic Publishers.
  3. ^ Piramuthu, S.; Sikora, R. T. (2009). "Iterative feature construction for improving inductive learning algorithms". Expert Systems with Applications. 36 (2): 3401–3406.
  4. ^ Bloedorn, E.; Michalski, R. (1998). "Data-driven constructive induction: a methodology and its applications". IEEE Intelligent Systems, special issue on feature transformation and subset selection: 30–37.
  5. ^ Breiman, L.; Friedman, J.; Olshen, R.; Stone, C. (1984). Classification and Regression Trees. Wadsworth.
  6. ^ Sidorova, J.; Badia, T. (2009). "Syntactic learning for ESEDA.1, tool for enhanced speech emotion detection and analysis". Internet Technology and Secured Transactions Conference (ICITST-2009), London, November 9–12. IEEE.
  7. ^ Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome H. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer. ISBN 978-0-387-84884-6.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Feature_(machine_learning)&oldid=1319341732#Feature_vectors"