您的位置：首页 > 其它

scikit-learn preprocessing模块

2015-10-16 21:24 190 查看

version:sklearn0.16.1

sklearn.preprocessing模块包括scaling， centering, normalization 和imputation方法。

包括

data.py 处理数据（均值，方差等）

label.py 二值化

_weight.py

imputation.py 缺失值处理

1.data.py

_mean_and_std(X, axis=0, with_mean=True, with_std=True):

计算均值和标准差，标准差为0的话则置为1

scale(X, axis=0, with_mean=True, with_std=True, copy=True)

公式为：(X-mean)/std 计算时对每个属性/每列分别进行。

将数据按期属性（按列进行）减去其均值，并处以其方差。得到的结果是，对于每个属性/每列来说所有数据都聚集在0附近，方差为1。

scale函数不能用来处理稀疏矩阵

3.class MinMaxScaler()

使特征标准化

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航

scikit-learn preprocessing模块

1.data.py

1.
_mean_and_std(X, axis=0, with_mean=True, with_std=True):

2.
scale(X, axis=0, with_mean=True, with_std=True, copy=True)

3.class MinMaxScaler()

scikit-learn preprocessing模块

1.data.py

1._mean_and_std(X, axis=0, with_mean=True, with_std=True):

2.scale(X, axis=0, with_mean=True, with_std=True, copy=True)

3.class MinMaxScaler()

1.
_mean_and_std(X, axis=0, with_mean=True, with_std=True):

2.
scale(X, axis=0, with_mean=True, with_std=True, copy=True)