LeavePOut#
- classsklearn.model_selection.LeavePOut(p)[source]#
Leave-P-Out cross-validator.
Provides train/test indices to split data in train/test sets. This resultsin testing on all distinct samples of size p, while the remaining n - psamples form the training set in each iteration.
Note:
LeavePOut(p)is NOT equivalent toKFold(n_splits=n_samples//p)which creates non-overlapping test sets.Due to the high number of iterations which grows combinatorically with thenumber of samples this cross-validation method can be very costly. Forlarge datasets one should favor
KFold,StratifiedKFoldorShuffleSplit.Read more in theUser Guide.
- Parameters:
- pint
Size of the test sets. Must be strictly less than the number ofsamples.
Examples
>>>importnumpyasnp>>>fromsklearn.model_selectionimportLeavePOut>>>X=np.array([[1,2],[3,4],[5,6],[7,8]])>>>y=np.array([1,2,3,4])>>>lpo=LeavePOut(2)>>>lpo.get_n_splits(X)6>>>print(lpo)LeavePOut(p=2)>>>fori,(train_index,test_index)inenumerate(lpo.split(X)):...print(f"Fold{i}:")...print(f" Train: index={train_index}")...print(f" Test: index={test_index}")Fold 0: Train: index=[2 3] Test: index=[0 1]Fold 1: Train: index=[1 3] Test: index=[0 2]Fold 2: Train: index=[1 2] Test: index=[0 3]Fold 3: Train: index=[0 3] Test: index=[1 2]Fold 4: Train: index=[0 2] Test: index=[1 3]Fold 5: Train: index=[0 1] Test: index=[2 3]
- get_metadata_routing()[source]#
Get metadata routing of this object.
Please checkUser Guide on how the routingmechanism works.
- Returns:
- routingMetadataRequest
A
MetadataRequestencapsulatingrouting information.
- get_n_splits(X,y=None,groups=None)[source]#
Returns the number of splitting iterations in the cross-validator.
- Parameters:
- Xarray-like of shape (n_samples, n_features)
Training data, where
n_samplesis the number of samplesandn_featuresis the number of features.- yobject
Always ignored, exists for compatibility.
- groupsobject
Always ignored, exists for compatibility.
- split(X,y=None,groups=None)[source]#
Generate indices to split data into training and test set.
- Parameters:
- Xarray-like of shape (n_samples, n_features)
Training data, where
n_samplesis the number of samplesandn_featuresis the number of features.- yarray-like of shape (n_samples,)
The target variable for supervised learning problems.
- groupsobject
Always ignored, exists for compatibility.
- Yields:
- trainndarray
The training set indices for that split.
- testndarray
The testing set indices for that split.
