Fit vs Fit_transform

Have you ever wondered whats the difference between fit() and fit_transform(). you must have came across these 2 functions somewhere while preprocessing your data. So, lets learn the difference between fit and fit_transform. we are going to understand this using an example

whenever you want to perform standardization which is an essential preprocessing step, you typically need to calculate various parameters of the data like mean, min, max, variance. fit_transform calculates these parameters and applies to the dataset, where as fit calculates these parameters but doesn’t apply to the dataset.

Lets assume this small array of data
data = [[1,2,3],[4,5,6],[7,8,9]]

when you apply standard scaler and use fit and transform seperately:

from sklearn.preprocessing import StandardScaler

# step-1
Scaler = StandardScaler()

# step-2
scaled_data = Scaler.fit(data) # no scaling of data takes place here ,just the mean and std deviation are calculated. 

# step-3
scaled_data = Scaler.transform() # now the scaled data contains the data after performing standardization.

Enter fullscreen mode Exit fullscreen mode

when you apply fit_transform instead of fit and transform seperately.

from sklearn.preprocessing import StandardScaler

# step-1
Scaler = StandardScaler()

# step-2
scaled_data = Scaler.fit_transform(data) # scaled_data contains the data after performing standardization.

Enter fullscreen mode Exit fullscreen mode

we can observe that by using fit_transform() we are essentially reducing an extra step

which one to use purely depends upon your usecase. If you want to learn parameters for once and then apply transformations to multiple datasets like training set and testing set, using fit and transform seperately is preferred. but if you want to apply transformation to a single dataset, use fit_transform() which makes the preprocessing pipeline concise.

原文链接：Fit vs Fit_transform

文章版权声明 1、本网站名称：拾光赋
2、本站永久网址：https://www.blogs.ink
3、本网站的文章部分内容可能来源于网络，仅供大家学习与参考，如有侵权，请联系站长QQ：805375623进行删除处理。
4、本站一切资源不代表本站立场，并不代表本站赞同其观点和对其真实性负责。
5、本站一律禁止以任何方式发布或转载任何违法的相关信息，访客发现请向站长举报
6、本站资源大多存储在云盘，如发现链接失效，请联系我们我们会第一时间更新。

THE END

Fit vs Fit_transform

请登录后发表评论