TheilSenRegressor

class sklearn.linear_model.TheilSenRegressor(*, fit_intercept=True, copy_X='deprecated', max_subpopulation=10000.0, n_subsamples=None, max_iter=300, tol=0.001, random_state=None, n_jobs=None, verbose=False)

Theil-Sen Estimator: robust multivariate regression model.

The algorithm calculates least squares solutions on subsets of size n_subsamples of the samples in X. Any value of n_subsamples between the number of features and the number of samples leads to an estimator with a compromise between robustness and efficiency. Since the number of least squares solutions is “n_samples choose n_subsamples”, it can be extremely large and can therefore be limited with max_subpopulation. If this limit is reached, the subsets are chosen randomly. In a final step, the spatial median (or L1 median) of all least squares solutions is calculated.
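The procedure described above can be sketched in plain NumPy. This is an illustrative sketch only, not scikit-learn's implementation: the helper names `spatial_median` and `theil_sen_sketch` are hypothetical, every subset is enumerated (there is no `max_subpopulation` cap or random subsampling), and the spatial median is approximated with simple Weiszfeld-style iterations.

```python
from itertools import combinations

import numpy as np


def spatial_median(points, max_iter=1000, tol=1e-8):
    """Approximate the spatial (L1) median with Weiszfeld-style iterations."""
    # Start from the coordinate-wise median (an assumption of this sketch).
    median = np.median(points, axis=0)
    for _ in range(max_iter):
        distances = np.linalg.norm(points - median, axis=1)
        # Guard against division by zero when the estimate hits a data point.
        distances = np.maximum(distances, 1e-12)
        weights = 1.0 / distances
        new_median = (weights[:, None] * points).sum(axis=0) / weights.sum()
        if np.linalg.norm(new_median - median) < tol:
            return new_median
        median = new_median
    return median


def theil_sen_sketch(X, y, n_subsamples):
    """Least squares on every subset of size n_subsamples, then spatial median."""
    n_samples = X.shape[0]
    # Append a column of ones so the intercept is fitted with the slopes.
    X_aug = np.column_stack([X, np.ones(n_samples)])
    solutions = []
    for subset in combinations(range(n_samples), n_subsamples):
        idx = list(subset)
        coef, *_ = np.linalg.lstsq(X_aug[idx], y[idx], rcond=None)
        solutions.append(coef)
    # The result is ordered as [slopes..., intercept].
    return spatial_median(np.array(solutions))


rng = np.random.default_rng(0)
X = rng.uniform(-5, 5, size=(20, 1))
y = 3.0 * X[:, 0] + 1.0 + rng.normal(scale=0.1, size=20)
y[:3] += 50.0  # corrupt a few targets with large outliers

slope, intercept = theil_sen_sketch(X, y, n_subsamples=2)
print(slope, intercept)
```

With 20 samples and subsets of size 2 there are only 190 least squares problems; the count grows combinatorially with the sample size, which is why the real estimator caps it.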

See also

HuberRegressor: Linear regression model that is robust to outliers.
RANSACRegressor: RANSAC (RANdom SAmple Consensus) algorithm.
SGDRegressor: Fitted by minimizing a regularized empirical loss with SGD.

References

Theil-Sen Estimators in a Multiple Linear Regression Model, 2009. Xin Dang, Hanxiang Peng, Xueqin Wang and Heping Zhang. http://home.olemiss.edu/~xdang/papers/MTSE.pdf

Examples

>>> from sklearn.linear_model import TheilSenRegressor
>>> from sklearn.datasets import make_regression
>>> X, y = make_regression(
...     n_samples=200, n_features=2, noise=4.0, random_state=0)
>>> reg = TheilSenRegressor(random_state=0).fit(X, y)
>>> reg.score(X, y)
0.9884
>>> reg.predict(X[:1,])
array([-31.5871])

fit(X, y)

Fit linear model.

Parameters:

X ndarray of shape (n_samples, n_features): Training data.
y ndarray of shape (n_samples,): Target values.

Returns:

self returns an instance of self: Fitted TheilSenRegressor estimator.
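As a rough illustration of what fitting buys you, the sketch below fits both an ordinary least squares model and a TheilSenRegressor on synthetic data with corrupted targets. The data, the true slope of 2.0, and the choice of which points to corrupt are assumptions made for this example.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, TheilSenRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-5, 5, size=(100, 1))
y = 2.0 * X[:, 0] + 1.0 + rng.normal(scale=0.2, size=100)

# Corrupt the targets of the 10 samples with the largest feature value.
outliers = np.argsort(X[:, 0])[-10:]
y[outliers] -= 40.0

ols = LinearRegression().fit(X, y)
theil_sen = TheilSenRegressor(random_state=0).fit(X, y)

# fit returns the estimator itself, so coef_ and intercept_ are available.
print("OLS slope:", ols.coef_[0])
print("Theil-Sen slope:", theil_sen.coef_[0])
```

The Theil-Sen slope should stay much closer to the true value than the least squares slope, which is pulled toward the corrupted points.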

get_metadata_routing()

Get metadata routing of this object.

Please check the User Guide on how the routing mechanism works.

Returns:

routing MetadataRequest: A MetadataRequest encapsulating routing information.

get_params(deep=True)

Get parameters for this estimator.

Parameters:

deep bool, default=True: If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params dict: Parameter names mapped to their values.
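A short usage sketch: the returned dictionary is keyed by constructor argument name, so it reflects both explicitly passed values and defaults. The particular values passed here are arbitrary.

```python
from sklearn.linear_model import TheilSenRegressor

reg = TheilSenRegressor(max_iter=500, random_state=0)
params = reg.get_params()

# Keys mirror the constructor arguments listed in the class signature.
print(sorted(params))
print(params["max_iter"], params["tol"])
```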

predict(X)

Predict using the linear model.

Parameters:

X array-like or sparse matrix, shape (n_samples, n_features): Samples.

Returns:

C array, shape (n_samples,): Returns predicted values.
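Because this is a linear model, the prediction is the linear combination of the features with the fitted `coef_` plus `intercept_`. The sketch below checks this by hand on the same synthetic data used in the class example.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import TheilSenRegressor

X, y = make_regression(n_samples=200, n_features=2, noise=4.0, random_state=0)
reg = TheilSenRegressor(random_state=0).fit(X, y)

predicted = reg.predict(X[:5])
# The same values computed directly from the fitted attributes.
manual = X[:5] @ reg.coef_ + reg.intercept_

print(np.allclose(predicted, manual))
```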

score(X, y, sample_weight=None)

Return the coefficient of determination on test data.

The coefficient of determination, $R^2$, is defined as $(1 - \frac{u}{v})$, where $u$ is the residual sum of squares ((y_true - y_pred) ** 2).sum() and $v$ is the total sum of squares ((y_true - y_true.mean()) ** 2).sum(). The best possible score is 1.0 and it can be negative (because the model can be arbitrarily worse). A constant model that always predicts the expected value of y, disregarding the input features, would get an $R^2$ score of 0.0.

Parameters:

X array-like of shape (n_samples, n_features): Test samples. For some estimators this may be a precomputed kernel matrix or a list of generic objects instead with shape (n_samples, n_samples_fitted), where n_samples_fitted is the number of samples used in the fitting for the estimator.
y array-like of shape (n_samples,) or (n_samples, n_outputs): True values for X.
sample_weight array-like of shape (n_samples,), default=None: Sample weights.

Returns:

score float: $R^2$ of self.predict(X) w.r.t. y.
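The definition of $R^2$ given above can be reproduced by hand. The sketch below computes $u$ and $v$ from the predictions and compares the result with the value returned by `score`; the synthetic data is an assumption of the example.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import TheilSenRegressor

X, y = make_regression(n_samples=100, n_features=2, noise=4.0, random_state=0)
reg = TheilSenRegressor(random_state=0).fit(X, y)

y_pred = reg.predict(X)
u = ((y - y_pred) ** 2).sum()    # residual sum of squares
v = ((y - y.mean()) ** 2).sum()  # total sum of squares
manual_r2 = 1 - u / v

print(np.isclose(manual_r2, reg.score(X, y)))
```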

Notes

The $R^2$ score used when calling score on a regressor uses multioutput='uniform_average' from version 0.23 to stay consistent with the default value of r2_score. This influences the score method of all the multioutput regressors (except for MultiOutputRegressor).

set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as Pipeline). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Parameters:

**params dict: Estimator parameters.

Returns:

self estimator instance: Estimator instance.
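A brief sketch of both forms: setting parameters directly on the estimator, and using the `<component>__<parameter>` form on a nested object. The step names `"scale"` and `"reg"` are arbitrary labels chosen for this example.

```python
from sklearn.linear_model import TheilSenRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Simple estimator: parameters are set by their constructor names.
reg = TheilSenRegressor()
reg.set_params(max_iter=500, tol=1e-4)

# Nested object: prefix the parameter with the step name and two underscores.
pipe = Pipeline([("scale", StandardScaler()), ("reg", TheilSenRegressor())])
pipe.set_params(reg__max_subpopulation=5000)

print(reg.max_iter, pipe.named_steps["reg"].max_subpopulation)
```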

set_score_request(*, sample_weight: bool | None | str = '$UNCHANGED$') → TheilSenRegressor

Configure whether metadata should be requested to be passed to the score method.

Note that this method is only relevant when this estimator is used as a sub-estimator within a meta-estimator and metadata routing is enabled with enable_metadata_routing=True (see sklearn.set_config). Please check the User Guide on how the routing mechanism works.

The options for each parameter are:

True: metadata is requested, and passed to score if provided. The request is ignored if metadata is not provided.
False: metadata is not requested and the meta-estimator will not pass it to score.
None: metadata is not requested, and the meta-estimator will raise an error if the user provides it.
str: metadata should be passed to the meta-estimator with this given alias instead of the original name.

The default (sklearn.utils.metadata_routing.UNCHANGED) retains the existing request. This allows you to change the request for some parameters and not others.

Added in version 1.3.

Parameters:

sample_weight str, True, False, or None, default=sklearn.utils.metadata_routing.UNCHANGED: Metadata routing for sample_weight parameter in score.

Returns:

self object: The updated object.

Gallery examples#

Robust linear estimator fitting

Theil-Sen Regression
