OPTICS

class sklearn.cluster.OPTICS(*, min_samples=5, max_eps=inf, metric='minkowski', p=2, metric_params=None, cluster_method='xi', eps=None, xi=0.05, predecessor_correction=True, min_cluster_size=None, algorithm='auto', leaf_size=30, memory=None, n_jobs=None)

Estimate clustering structure from vector array.

OPTICS (Ordering Points To Identify the Clustering Structure), closely related to DBSCAN, finds core samples of high density and expands clusters from them [1]. Unlike DBSCAN, it keeps cluster hierarchy for a variable neighborhood radius. It is better suited for use on large datasets than the current scikit-learn implementation of DBSCAN.

Clusters are then extracted from the cluster-order using a DBSCAN-like method (cluster_method='dbscan') or an automatic technique proposed in [1] (cluster_method='xi').
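As a minimal sketch of the two extraction modes, the snippet below fits OPTICS on synthetic blobs once with cluster_method='xi' and once with cluster_method='dbscan'. The data set, min_samples=10, and eps=1.0 are illustrative assumptions, not recommended settings.

```python
import numpy as np
from sklearn.cluster import OPTICS
from sklearn.datasets import make_blobs

# Synthetic data (an assumption for illustration): three well-separated blobs.
X, _ = make_blobs(n_samples=150, centers=[[0, 0], [6, 6], [-6, 6]],
                  cluster_std=0.5, random_state=0)

# Automatic extraction based on steepness in the reachability plot.
xi_model = OPTICS(min_samples=10, cluster_method="xi", xi=0.05).fit(X)

# DBSCAN-like extraction at a fixed radius; eps=1.0 is an arbitrary choice.
dbscan_model = OPTICS(min_samples=10, cluster_method="dbscan", eps=1.0).fit(X)

# Noise points, if any, are labeled -1 in both cases.
print(np.unique(xi_model.labels_))
print(np.unique(dbscan_model.labels_))
```

The two methods can label the same data differently, so it may be worth comparing both before settling on one.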

This implementation deviates from the original OPTICS by first performing k-nearest-neighborhood searches on all points to identify core sizes of all points (instead of computing neighbors while looping through points). Reachability distances to only unprocessed points are then computed, to construct the cluster order, similar to the original OPTICS. Note that we do not employ a heap to manage the expansion candidates, so the time complexity will be O(n^2).
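The precomputation step described above can be illustrated with a rough sketch: a k-nearest-neighbor query gives each point's distance to its min_samples-th neighbor (counting the point itself), and the result is compared with the fitted core_distances_ attribute. The random data and the direct use of NearestNeighbors are assumptions for illustration; this is not the library's internal code.

```python
import numpy as np
from sklearn.cluster import OPTICS
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))  # arbitrary illustrative data
min_samples = 5

# Distance from each point to its min_samples-th nearest neighbor.
# Querying with the training data itself returns each point as its own
# first neighbor (distance 0), so the last column is the core distance.
nn = NearestNeighbors(n_neighbors=min_samples).fit(X)
distances, _ = nn.kneighbors(X)
core_sketch = distances[:, -1]

model = OPTICS(min_samples=min_samples).fit(X)

# With the default max_eps=inf, every point has a finite core distance.
print(np.allclose(core_sketch, model.core_distances_))
```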

See also

DBSCAN: A similar clustering for a specified neighborhood radius (eps). Our implementation is optimized for runtime.

References

[1] Ankerst, Mihael, Markus M. Breunig, Hans-Peter Kriegel, and Jörg Sander. “OPTICS: ordering points to identify the clustering structure.” ACM SIGMOD Record 28, no. 2 (1999): 49-60.

[2] Schubert, Erich, and Michael Gertz. “Improving the Cluster Structure Extracted from OPTICS Plots.” Proc. of the Conference “Lernen, Wissen, Daten, Analysen” (LWDA) (2018): 318-329.

Examples

>>> from sklearn.cluster import OPTICS
>>> import numpy as np
>>> X = np.array([[1, 2], [2, 5], [3, 6],
...               [8, 7], [8, 8], [7, 3]])
>>> clustering = OPTICS(min_samples=2).fit(X)
>>> clustering.labels_
array([0, 0, 0, 1, 1, 1])

For a more detailed example, see Demo of OPTICS clustering algorithm.

For a comparison of OPTICS with other clustering algorithms, see Comparing different clustering algorithms on toy datasets.

fit(X, y=None)

Perform OPTICS clustering.

Extracts an ordered list of points and reachability distances, and performs initial clustering using the max_eps distance specified at OPTICS object instantiation.
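A short sketch of inspecting what fit produces, using the fitted attributes ordering_ and reachability_ (these attribute names come from scikit-learn and are not listed in this section). The data is the small array from the class example.

```python
import numpy as np
from sklearn.cluster import OPTICS

X = np.array([[1, 2], [2, 5], [3, 6],
              [8, 7], [8, 8], [7, 3]])
model = OPTICS(min_samples=2).fit(X)

# ordering_ is the cluster order: a permutation of the sample indices.
# Indexing reachability_ with it gives the values of the reachability plot.
reachability_plot = model.reachability_[model.ordering_]

# The first point in the order has no predecessor, so its reachability is inf.
print(model.ordering_)
print(reachability_plot)
```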

Parameters:

X : {ndarray, sparse matrix} of shape (n_samples, n_features), or (n_samples, n_samples) if metric='precomputed'
    A feature array, or array of distances between samples if metric='precomputed'. If a sparse matrix is provided, it will be converted into CSR format.
y : Ignored
    Not used, present for API consistency by convention.

Returns:

self : object
    Returns a fitted instance of self.
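When metric='precomputed', fit takes a square matrix of pairwise distances instead of a feature array. A minimal sketch, assuming Euclidean distances computed with pairwise_distances (any valid distance matrix could be substituted):

```python
import numpy as np
from sklearn.cluster import OPTICS
from sklearn.metrics import pairwise_distances

X = np.array([[1, 2], [2, 5], [3, 6],
              [8, 7], [8, 8], [7, 3]])

# Dense (n_samples, n_samples) distance matrix; the metric is an assumption.
D = pairwise_distances(X, metric="euclidean")

model = OPTICS(min_samples=2, metric="precomputed").fit(D)
print(model.labels_)
```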

fit_predict(X, y=None, **kwargs)

Perform clustering on X and return cluster labels.

Parameters:

X : array-like of shape (n_samples, n_features)
    Input data.
y : Ignored
    Not used, present for API consistency by convention.
**kwargs : dict
    Arguments to be passed to fit.
    Added in version 1.4.

Returns:

labels : ndarray of shape (n_samples,), dtype=np.int64
    Cluster labels.
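A brief usage sketch of fit_predict on the small array from the class example. Counting clusters by excluding the label -1 relies on scikit-learn's convention of marking noise samples with -1, which this section does not state.

```python
import numpy as np
from sklearn.cluster import OPTICS

X = np.array([[1, 2], [2, 5], [3, 6],
              [8, 7], [8, 8], [7, 3]])

# Fits the model and returns the labels in a single call.
labels = OPTICS(min_samples=2).fit_predict(X)

# Samples labeled -1, if any, were treated as noise.
n_clusters = len(set(labels.tolist()) - {-1})
print(labels, n_clusters)
```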

get_metadata_routing()

Get metadata routing of this object.

Please check the User Guide on how the routing mechanism works.

Returns:

routing : MetadataRequest
    A MetadataRequest encapsulating routing information.

get_params(deep=True)

Get parameters for this estimator.

Parameters:

deep : bool, default=True
    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : dict
    Parameter names mapped to their values.
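For illustration, a small sketch of reading parameters back from an estimator. The values min_samples=10 and xi=0.1 are arbitrary choices.

```python
from sklearn.cluster import OPTICS

model = OPTICS(min_samples=10, xi=0.1)
params = model.get_params()

# Keys mirror the constructor arguments; unspecified ones keep their defaults.
print(params["min_samples"], params["xi"], params["cluster_method"])
```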

set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as Pipeline). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Parameters:

**params : dict
    Estimator parameters.

Returns:

self : estimator instance
    Estimator instance.
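The <component>__<parameter> form can be sketched with a two-step Pipeline. The step names "scale" and "optics" and the use of StandardScaler are assumptions made for this example only.

```python
from sklearn.cluster import OPTICS
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Step names "scale" and "optics" are arbitrary labels chosen for this sketch.
pipe = Pipeline([("scale", StandardScaler()), ("optics", OPTICS())])

# Simple estimator: set a parameter directly on the OPTICS instance.
pipe.named_steps["optics"].set_params(min_samples=10)

# Nested object: address the step with <component>__<parameter>.
pipe.set_params(optics__xi=0.1)

print(pipe.get_params()["optics__min_samples"], pipe.get_params()["optics__xi"])
```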

Gallery examples

Comparing different clustering algorithms on toy datasets

Demo of OPTICS clustering algorithm
