sklearn.dummy.DummyClassifier

class sklearn.dummy.DummyClassifier(*, strategy='prior', random_state=None, constant=None)

DummyClassifier is a classifier that makes predictions using simple rules.

This classifier is useful as a simple baseline to compare with other (real) classifiers. Do not use it for real problems.

Examples

>>> import numpy as np
>>> from sklearn.dummy import DummyClassifier
>>> X = np.array([-1, 1, 1, 1])
>>> y = np.array([0, 1, 1, 1])
>>> dummy_clf = DummyClassifier(strategy="most_frequent")
>>> dummy_clf.fit(X, y)
DummyClassifier(strategy='most_frequent')
>>> dummy_clf.predict(X)
array([1, 1, 1, 1])
>>> dummy_clf.score(X, y)
0.75
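As a rough sketch of the baseline comparison described above, the snippet below fits a DummyClassifier and a LogisticRegression on the same data and compares held-out accuracy. The synthetic dataset (make_classification with an assumed 90/10 class split), the choice of LogisticRegression as the "real" classifier, and all variable names are illustrative assumptions, not part of the original example.

```python
from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic, imbalanced binary problem (assumed for illustration only).
X, y = make_classification(n_samples=500, weights=[0.9, 0.1], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, random_state=0
)

# Baseline: always predict the most frequent training label.
baseline = DummyClassifier(strategy="most_frequent").fit(X_train, y_train)
baseline_acc = baseline.score(X_test, y_test)

# A "real" classifier to compare against the baseline.
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
model_acc = model.score(X_test, y_test)

print(f"baseline accuracy: {baseline_acc:.3f}")
print(f"model accuracy:    {model_acc:.3f}")
```

On imbalanced data the baseline accuracy is already close to the majority-class share, so a model is only informative if it clearly exceeds that number.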

Methods

`fit`(X, y[, sample_weight])	Fit the baseline classifier.
`get_params`([deep])	Get parameters for this estimator.
`predict`(X)	Perform classification on test vectors X.
`predict_log_proba`(X)	Return log probability estimates for the test vectors X.
`predict_proba`(X)	Return probability estimates for the test vectors X.
`score`(X, y[, sample_weight])	Return the mean accuracy on the given test data and labels.
`set_params`(**params)	Set the parameters of this estimator.

fit(X, y, sample_weight=None)

Fit the baseline classifier.

Parameters

X : array-like of shape (n_samples, n_features): Training data.
y : array-like of shape (n_samples,) or (n_samples, n_outputs): Target values.
sample_weight : array-like of shape (n_samples,), default=None: Sample weights.

Returns

self : object
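The effect of sample_weight can be sketched as follows, assuming the "most_frequent" strategy: the weights change which label counts as most frequent. The data and weight values are made up for illustration.

```python
import numpy as np
from sklearn.dummy import DummyClassifier

X = np.array([[-1], [1], [1], [1]])
y = np.array([0, 1, 1, 1])

# Unweighted: label 1 is the most frequent (3 of 4 samples).
unweighted = DummyClassifier(strategy="most_frequent").fit(X, y)

# Weighted: the single sample with label 0 carries weight 10,
# against a total weight of 3 for label 1.
weights = np.array([10.0, 1.0, 1.0, 1.0])
weighted = DummyClassifier(strategy="most_frequent").fit(
    X, y, sample_weight=weights
)

print(unweighted.predict(X))   # [1 1 1 1]
print(weighted.predict(X))     # [0 0 0 0]
print(weighted.class_prior_)   # weighted class frequencies: 10/13 and 3/13
```

Because fit returns the estimator itself, the call can be chained directly after the constructor, as done here.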

get_params(deep=True)

Get parameters for this estimator.

Parameters

deep : bool, default=True: If True, return the parameters for this estimator and for contained subobjects that are estimators.

Returns

params : dict: Parameter names mapped to their values.
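A minimal sketch of what get_params returns for this estimator; the variable names are illustrative.

```python
from sklearn.dummy import DummyClassifier

clf = DummyClassifier(strategy="most_frequent")
params = clf.get_params()

# Keys are the constructor arguments: constant, random_state, strategy.
print(sorted(params))
print(params["strategy"])  # most_frequent

# The returned dict can be used to build an equivalent, unfitted estimator.
same_config = DummyClassifier(**params)
```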

predict(X)

Perform classification on test vectors X.

Parameters

X : array-like of shape (n_samples, n_features): Test data.

Returns

y : array-like of shape (n_samples,) or (n_samples, n_outputs): Predicted target values for X.
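The two possible output shapes can be illustrated with a multi-output target. This is a small sketch with made-up data, assuming the "most_frequent" strategy: each output column gets its own majority label, and the feature values in X play no role.

```python
import numpy as np
from sklearn.dummy import DummyClassifier

X_train = np.zeros((4, 1))           # feature values are ignored by the rule
y_train = np.array([[0, 2],
                    [1, 2],
                    [1, 3],
                    [1, 2]])          # two outputs per sample

clf = DummyClassifier(strategy="most_frequent").fit(X_train, y_train)

# Only the number of rows in X determines the shape of the result.
X_new = np.array([[5.0], [-3.0], [0.5]])
pred = clf.predict(X_new)

print(pred.shape)   # (3, 2)
print(pred)         # every row is [1 2], the per-output majority labels
```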

predict_log_proba(X)

Return log probability estimates for the test vectors X.

Parameters

X : {array-like, object with finite length or shape}: Test data; its length must be n_samples.

Returns

P : ndarray of shape (n_samples, n_classes) or list of such arrays: Returns the log probability of the sample for each class in the model, where classes are ordered arithmetically, for each output.
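A short sketch of the relationship between the two probability methods, assuming the default "prior" strategy and the toy data from the example at the top of the page (reshaped to 2D here):

```python
import numpy as np
from sklearn.dummy import DummyClassifier

X = np.array([[-1], [1], [1], [1]])
y = np.array([0, 1, 1, 1])

clf = DummyClassifier(strategy="prior").fit(X, y)
log_proba = clf.predict_log_proba(X)

# Each row holds the log of the class priors: [log(0.25), log(0.75)].
print(log_proba[0])

# Exponentiating recovers the output of predict_proba.
print(np.allclose(np.exp(log_proba), clf.predict_proba(X)))  # True
```

With a strategy that assigns probability 0 to some class, such as "most_frequent", the corresponding log probability is -inf.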

predict_proba(X)

Return probability estimates for the test vectors X.

Parameters

X : array-like of shape (n_samples, n_features): Test data.

Returns

P : ndarray of shape (n_samples, n_classes) or list of such arrays: Returns the probability of the sample for each class in the model, where classes are ordered arithmetically, for each output.
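The sketch below contrasts the probability estimates of two strategies on the same toy data. "prior" returns the empirical class distribution for every sample, while "most_frequent" returns a one-hot row for the majority class. Variable names are illustrative.

```python
import numpy as np
from sklearn.dummy import DummyClassifier

X = np.array([[-1], [1], [1], [1]])
y = np.array([0, 1, 1, 1])

prior_clf = DummyClassifier(strategy="prior").fit(X, y)
mf_clf = DummyClassifier(strategy="most_frequent").fit(X, y)

prior_proba = prior_clf.predict_proba(X)
mf_proba = mf_clf.predict_proba(X)

print(prior_clf.classes_)  # [0 1]; columns follow this order
print(prior_proba[0])      # [0.25 0.75]
print(mf_proba[0])         # [0. 1.]
```

Both classifiers predict the same labels here; they differ only in the probabilities they report.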

score(X, y, sample_weight=None)

Return the mean accuracy on the given test data and labels.

In multi-label classification, this is the subset accuracy, which is a harsh metric because it requires that the entire label set of each sample be predicted correctly.

Parameters

X : None or array-like of shape (n_samples, n_features): Test samples. Passing None as test samples gives the same result as passing real test samples, since DummyClassifier operates independently of the sampled observations.
y : array-like of shape (n_samples,) or (n_samples, n_outputs): True labels for X.
sample_weight : array-like of shape (n_samples,), default=None: Sample weights.

Returns

score : float: Mean accuracy of self.predict(X) with respect to y.
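The behaviour of X=None and of sample_weight can be checked with a small sketch, assuming the "most_frequent" strategy; the weight values are made up for illustration.

```python
import numpy as np
from sklearn.dummy import DummyClassifier

X = np.array([[-1], [1], [1], [1]])
y = np.array([0, 1, 1, 1])

clf = DummyClassifier(strategy="most_frequent").fit(X, y)

# The predictions do not depend on X, so both calls give the same accuracy.
score_with_X = clf.score(X, y)
score_without_X = clf.score(None, y)
print(score_with_X, score_without_X)  # 0.75 0.75

# With sample weights, each sample counts in proportion to its weight:
# the misclassified sample weighs 10 out of a total of 13.
weights = np.array([10.0, 1.0, 1.0, 1.0])
weighted_score = clf.score(None, y, sample_weight=weights)
print(weighted_score)  # 3/13, about 0.231
```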

set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as Pipeline). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Parameters

**params : dict: Estimator parameters.

Returns

self : estimator instance: Estimator instance.
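Both forms of set_params can be sketched as follows. The pipeline, its step names "scale" and "clf", and the toy data are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.dummy import DummyClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Simple estimator: set_params updates the estimator and returns it.
clf = DummyClassifier()
returned = clf.set_params(strategy="constant", constant=1)
print(returned is clf)  # True

# Nested object: address the step named "clf" using the
# <component>__<parameter> form.
pipe = Pipeline([("scale", StandardScaler()), ("clf", DummyClassifier())])
pipe.set_params(clf__strategy="most_frequent")
print(pipe.named_steps["clf"].strategy)  # most_frequent

X = np.array([[-1.0], [1.0], [1.0], [1.0]])
y = np.array([0, 1, 1, 1])
pipe_pred = pipe.fit(X, y).predict(X)
print(pipe_pred)  # [1 1 1 1]
```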

© 2007–2020 The scikit-learn developers
Licensed under the 3-clause BSD License.
https://scikit-learn.org/0.24/modules/generated/sklearn.dummy.DummyClassifier.html