Splitting the data into training and testing sets - R

Question

I am working with the 'beaver1' data-set, below is a sample:

   day time  temp activ
1 346  840   36.33   0
2 346  850   36.34   0
3 346  900   36.35   0
4 346  910   36.42   0
5 346  920   36.55   0
6 346  930   36.69   0

I want to split this data into 'train' and 'test' sets with 65:35 ratio so that i can build a machine learning model on top of it, how can i do it?

Bharani · Answer 1 · May 7, 2018

You can use the sample.split() function from the caTools package for this purpose:

Start off by loading the 'caTools' package:

library(caTools)

Then, use the sample.split() function which takes in two parameters -> the dataset - 'beaver1' and SplitRatio - 0.65

sample.split(beaver1,SplitRatio = 0.65)->mysplit

Following which, use the subset() function and select all those observations where 'mysplit' tag is True and store them in 'train'

subset(beaver1,mysplit==T)->train

Similarly, select all those observations where the 'mysplilt' tag is Fasle, and store them in 'test'

subset(beaver1,mysplit==F)->test

And that's how you split the data into 'train' and 'test' sets.

answered May 7, 2018 by Bharani
• 4,660 points

score 0 · Answer 2 · Aug 21, 2019

Hi,

Try like this.

train  = sample(x = c(TRUE, FALSE), size = nrow(dataframe),replace = TRUE, prob = c(0.65, 0.35))

Get training and test data frame from the main dataset.

data[train,]
data[!train,]

answered Aug 21, 2019 by anonymous
• 33,050 points

Splitting the data into training and testing sets - R

Your comment on this question:

2 answers to this question.

Your answer

Your comment on this answer:

Your comment on this answer:

Related Questions In Data Analytics

How can I list all the data sets available in all R packages?

I am new to R and need to encode the standard DES - Data encription algorithm in R. Can anyone help me?

Hello kindly share the houserates.csv file used in this video,ggplot2 Tutorial | ggplot2 In R Tutorial | Data Visualization In R | R Training | Edureka

How to insert Data into the MySQL Tables using R?

Transforming a key/value string into distinct rows in R

Finding frequency of observations in R

Left Join and Right Join using "dplyr"

Plotting multiple graphs on the same page in R

"Train" and "Test" sets in Data Science

Applying the same function to every row of a data.frame - R

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES