Training and testing data in machine learning

0 votes
I want to train data using K-means algorithm and then test it over another similar kind of data removing only one column. I am new to machine learning, don't understand where does the prediction part take place? We are just giving data and testing the accuracy. How can we apply the algorithm on test data (which obviously will be different) to predict the value of the missing attribute?
Feb 23, 2022 in Machine Learning by Nandini
• 5,480 points
581 views

1 answer to this question.

0 votes
Unsupervised learning is used with the K-means clustering technique. Because you are not attempting to predict something, unsupervised learning does not employ labels. Instead, you're looking for a mechanism to arrange your data into clusters based on similar traits.
In Supervised Learning, the purpose of test (and frequently validation) sets is to verify the generalization properties of your model in order to avoid over-fitting. However, since you don't know the real clusters of the data in unsupervised learning, you can't evaluate this. As a result, employing a test set is pointless.
answered Feb 23, 2022 by Dev
• 6,000 points

Related Questions In Machine Learning

0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

Difference between Regression and classification in Machine Learning?

The goal of regression is to forecast ...READ MORE

answered Mar 4, 2022 in Machine Learning by Dev
• 6,000 points
874 views
0 votes
1 answer

What is the process involved in machine Learning?

Discussing this on a high level, these ...READ MORE

answered May 10, 2019 in Machine Learning by Rhea
1,518 views
0 votes
1 answer

Use different distance formula other than euclidean distance in k means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

answered Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,657 views
0 votes
1 answer

Overfitting vs Underfitting

In statistics and machine learning, one of ...READ MORE

answered Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,189 views
+1 vote
1 answer

How to handle Nominal Data?

Nominal data is basically data which can ...READ MORE

answered Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
653 views
+2 votes
2 answers

How to handle outliers

There are multiple ways to handle outliers ...READ MORE

answered Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
959 views
0 votes
1 answer

Training and testing data in machine learning

Unsupervised learning is used with the K-means ...READ MORE

answered Mar 2, 2022 in Machine Learning by Dev
• 6,000 points
701 views
0 votes
1 answer

What is difference between loss function and RMSE in Machine Learning?

The loss function is a function of ...READ MORE

answered Feb 23, 2022 in Machine Learning by Dev
• 6,000 points
2,881 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP