mds_2025_helper_functions.scores

Functions

compare_model_scores(*args, X[, y, scoring, ...])

Creates a table comparing mean cross-validation scores of multiple models.

Module Contents

mds_2025_helper_functions.scores.compare_model_scores(*args, X, y=None, scoring=None, return_train_scores=False, **kwargs)[source]

Creates a table comparing mean cross-validation scores of multiple models.

Parameters:

*args (sklearn.base.BaseEstimator) – Model objects implementing the fit method. At least two models are required.
X (array-like of shape (n_samples, n_features)) – Training data.
y (array-like of shape (n_samples,) or (n_samples, n_outputs), optional) – Target values for supervised learning tasks.
scoring (str, callable, list, tuple, or dict, optional) – Metrics to evaluate models. Refer to scikit-learn scoring documentation: https://scikit-learn.org/stable/modules/model_evaluation.html#scoring-parameter.
return_train_scores (bool, default=False) – Whether to include training scores in addition to test scores.
**kwargs (dict) – Additional arguments passed to sklearn.model_selection.cross_validate.

Returns:

A DataFrame comparing model performance: - Rows represent different models. - Columns include metrics from cross-validation. - Index contains model names.

Return type:

pd.DataFrame

Examples

>>> from sklearn.linear_model import LogisticRegression
>>> from sklearn.ensemble import RandomForestClassifier
>>> compare_model_scores(LogisticRegression(), RandomForestClassifier(), X=X_train, y=y_train, scoring="accuracy")