precision_score#
- sklearn.metrics.precision_score(y_true,y_pred,*,labels=None,pos_label=1,average='binary',sample_weight=None,zero_division='warn')[source]#
Compute the precision.
The precision is the ratio
tp/(tp+fp)wheretpis the number oftrue positives andfpthe number of false positives. The precision isintuitively the ability of the classifier not to label as positive a samplethat is negative.The best value is 1 and the worst value is 0.
Support beyond term:
binarytargets is achieved by treatingmulticlassandmultilabel data as a collection of binary problems, one for eachlabel. For thebinary case, settingaverage='binary'will returnprecision forpos_label. Ifaverageis not'binary',pos_labelis ignoredand precision for both classes are computed, then averaged or both returned (whenaverage=None). Similarly, formulticlass andmultilabel targets,precision for alllabelsare either returned or averaged depending on theaverageparameter. Uselabelsspecify the set of labels to calculate precisionfor.Read more in theUser Guide.
- Parameters:
- y_true1d array-like, or label indicator array / sparse matrix
Ground truth (correct) target values.
- y_pred1d array-like, or label indicator array / sparse matrix
Estimated targets as returned by a classifier.
- labelsarray-like, default=None
The set of labels to include when
average!='binary', and theirorder ifaverageisNone. Labels present in the data can beexcluded, for example in multiclass classification to exclude a “negativeclass”. Labels not present in the data can be included and will be“assigned” 0 samples. For multilabel targets, labels are column indices.By default, all labels iny_trueandy_predare used in sorted order.Changed in version 0.17:Parameter
labelsimproved for multiclass problem.- pos_labelint, float, bool or str, default=1
The class to report if
average='binary'and the data is binary,otherwise this parameter is ignored.For multiclass or multilabel targets, setlabels=[pos_label]andaverage!='binary'to report metrics for one label only.- average{‘micro’, ‘macro’, ‘samples’, ‘weighted’, ‘binary’} or None, default=’binary’
This parameter is required for multiclass/multilabel targets.If
None, the metrics for each class are returned. Otherwise, thisdetermines the type of averaging performed on the data:'binary':Only report results for the class specified by
pos_label.This is applicable only if targets (y_{true,pred}) are binary.'micro':Calculate metrics globally by counting the total true positives,false negatives and false positives.
'macro':Calculate metrics for each label, and find their unweightedmean. This does not take label imbalance into account.
'weighted':Calculate metrics for each label, and find their average weightedby support (the number of true instances for each label). Thisalters ‘macro’ to account for label imbalance; it can result in anF-score that is not between precision and recall.
'samples':Calculate metrics for each instance, and find their average (onlymeaningful for multilabel classification where this differs from
accuracy_score).
- sample_weightarray-like of shape (n_samples,), default=None
Sample weights.
- zero_division{“warn”, 0.0, 1.0, np.nan}, default=”warn”
Sets the value to return when there is a zero division.
Notes:
If set to “warn”, this acts like 0, but a warning is also raised.
If set to
np.nan, such values will be excluded from the average.
Added in version 1.3:
np.nanoption was added.
- Returns:
- precisionfloat (if average is not None) or array of float of shape (n_unique_labels,)
Precision of the positive class in binary classification or weightedaverage of the precision of each class for the multiclass task.
See also
precision_recall_fscore_supportCompute precision, recall, F-measure and support for each class.
recall_scoreCompute the ratio
tp/(tp+fn)wheretpis the number of true positives andfnthe number of false negatives.PrecisionRecallDisplay.from_estimatorPlot precision-recall curve given an estimator and some data.
PrecisionRecallDisplay.from_predictionsPlot precision-recall curve given binary class predictions.
multilabel_confusion_matrixCompute a confusion matrix for each class or sample.
Notes
When
truepositive+falsepositive==0, precision returns 0 andraisesUndefinedMetricWarning. This behavior can bemodified withzero_division.Examples
>>>importnumpyasnp>>>fromsklearn.metricsimportprecision_score>>>y_true=[0,1,2,0,1,2]>>>y_pred=[0,2,1,0,0,1]>>>precision_score(y_true,y_pred,average='macro')0.22>>>precision_score(y_true,y_pred,average='micro')0.33>>>precision_score(y_true,y_pred,average='weighted')0.22>>>precision_score(y_true,y_pred,average=None)array([0.66, 0. , 0. ])>>>y_pred=[0,0,0,0,0,0]>>>precision_score(y_true,y_pred,average=None)array([0.33, 0. , 0. ])>>>precision_score(y_true,y_pred,average=None,zero_division=1)array([0.33, 1. , 1. ])>>>precision_score(y_true,y_pred,average=None,zero_division=np.nan)array([0.33, nan, nan])
>>># multilabel classification>>>y_true=[[0,0,0],[1,1,1],[0,1,1]]>>>y_pred=[[0,0,0],[1,1,1],[1,1,0]]>>>precision_score(y_true,y_pred,average=None)array([0.5, 1. , 1. ])
Gallery examples#
Post-tuning the decision threshold for cost-sensitive learning
