Data splitting using sklearn

Data splitting is the technique of dividing your data into a training set and a test set. The training data is used to train the model, while the test data, as the name implies, is used to measure how accurate the model is on data it has not seen. Splitting is mostly used in supervised learning, where the data has labels attached to it.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# 99 integers, from 1 to 99
a = np.arange(1, 100)

# By default, 25% of the data is held out for testing
a_train, a_test = train_test_split(a)
```
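
Since splitting is mostly used with labelled data, here is a minimal sketch of splitting features and labels together. The arrays `X` and `y` are made-up toy data for illustration, not something from a real dataset.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 10 samples with 2 features each, one label per sample
X = np.arange(20).reshape(10, 2)
y = np.array([0, 1, 0, 1, 0, 1, 0, 1, 0, 1])

# Passing both arrays splits them on the same row indices,
# so every feature row stays paired with its own label
X_train, X_test, y_train, y_test = train_test_split(X, y)

print(X_train.shape, X_test.shape)
print(y_train.shape, y_test.shape)
```

With the default settings, 3 of the 10 samples should end up in the test set and 7 in the training set.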

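Some other options include:

test_size: As a float it must be between 0 and 1 and gives the proportion of the data reserved for testing. It can also be an integer, in which case it is the absolute number of test samples. If neither test_size nor train_size is given, it defaults to 0.25.
shuffle: It is True by default, which shuffles the data before splitting. Set it to False to keep the data in its original order.
random_state: It fixes the seed used for shuffling, so you get the same split no matter how many times you split the data.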
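
The options above can be tried out with a short sketch like the one below. The values `0.2` and `42` are arbitrary choices for illustration.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# 100 integers, from 1 to 100
a = np.arange(1, 101)

# test_size=0.2 reserves 20% of the samples for testing
a_train, a_test = train_test_split(a, test_size=0.2, random_state=42)
print(len(a_train), len(a_test))

# Reusing the same random_state reproduces the same split
a_train2, a_test2 = train_test_split(a, test_size=0.2, random_state=42)
print(np.array_equal(a_test, a_test2))

# shuffle=False keeps the original order, so the test set
# is simply the last 20% of the array
b_train, b_test = train_test_split(a, test_size=0.2, shuffle=False)
print(b_train[-1], b_test[0])
```

Turning off shuffling is useful when the order of the data matters, for example with time series, where the test set should come after the training set.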
