Scikit Learn · UCB DS100 数据科学的原理与技巧

# Scikit Learn > 原文：[https://www.bookbookmark.ds100.org/ch/20/ref_sklearn.html](https://www.bookbookmark.ds100.org/ch/20/ref_sklearn.html) ``` # HIDDEN # Clear previously defined variables %reset -f # Set directory for data loading to work properly import os os.chdir(os.path.expanduser('~/notebooks/20')) ``` ## 型号和型号选择 | 进口 | 功能 | 截面 | 说明 | | --- | --- | --- | --- | | `sklearn.model_selection` | [`train_test_split(*arrays, test_size=0.2)`](http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html) | 建模与估计 | 返回传入的每个数组的两个随机子集，其中第一个子集中有 0.8 个数组，第二个子集中有 0.2 个数组 | | `sklearn.linear_model` | [`LinearRegression()`](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html) | Modeling and Estimation | 返回普通最小二乘线性回归模型 | | `sklearn.linear_model` | [`LassoCV()`](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html) | Modeling and Estimation | 返回通过交叉验证选择最佳模型的 Lasso（L1 正则化）线性模型 | | `sklearn.linear_model` | [`RidgeCV()`](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html) | Modeling and Estimation | 返回一个脊线（L2 正则化）线性模型，并通过交叉验证选择最佳模型 | | `sklearn.linear_model` | [`ElasticNetCV()`](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html) | Modeling and Estimation | 返回 ElasticNet（l1 和 l2 正则化）线性模型，并通过交叉验证选择最佳模型 | | `sklearn.linear_model` | [`LogisticRegression()`](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html) | Modeling and Estimation | 返回逻辑回归分类器 | | `sklearn.linear_model` | [`LogisticRegressionCV()`](http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegressionCV.html) | Modeling and Estimation | 返回通过交叉验证选择最佳模型的逻辑回归分类器 | ## 使用模型[¶](#Working-with-a-Model) 假设您有一个`model`变量是`scikit-learn`对象： | Function | Section | Description | | --- | --- | --- | | `model.fit(X, y)` | Modeling and Estimation | 与传入的 X 和 Y 匹配的模型 | | `model.predict(X)` | Modeling and Estimation | 返回根据模型传入的 x 的预测 | | `model.score(X, y)` | Modeling and Estimation | 返回基于 corect 值（y）的 x 预测精度 |