How does Label Encoder assign the same number?

Question

I have this column in my data frame:

city
London
Paris
New York
I am label encoding the column, and it assigns 0 to London, 1 to Paris, and 2 to New York. But when I pass a single value to the model for prediction, I give the city name New York and it is assigned 0. How can the mapping stay the same? If New York is assigned 2 by the label encoder in the training phase, it should be assigned 2 again at prediction time.

Code
from sklearn.preprocessing import LabelEncoder
labelencoder=LabelEncoder()
df['city']=labelencoder.fit_transform(df['city'])

Nandini · Answer

I am creating a dummy data set using lists and the zip function.

city = ['London', 'Paris', 'New York ']
continent = ['Europe', 'Europe' ,'North America']

data = list(zip(city, continent))
data

Output

[('London', 'Europe'), ('Paris', 'Europe'), ('New York ', 'North America')]

Converting the data set into a data frame:

import pandas as pd
from sklearn.preprocessing import LabelEncoder
labelencoder=LabelEncoder()
df= pd.DataFrame(data, columns=['city', 'continent'])
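df

Output

city        continent
London      Europe
Paris       Europe
New York    North America

Fit the encoder once on the training data with fit_transform. At prediction time, call transform (not fit_transform) on the new value: transform reuses the mapping learned during fitting and does not re-fit the encoder. Calling fit_transform on a single value re-fits the encoder on that value alone, which is why New York was assigned 0.

df['label'] = labelencoder.fit_transform(df['city'])
df

Output

city        continent        label
London      Europe           0
Paris       Europe           2
New York    North America    1

LabelEncoder numbers the classes in sorted order, so London is 0, New York is 1, and Paris is 2.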
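
A minimal sketch of keeping the encoding stable between training and prediction, assuming the same fitted encoder object is available in both phases. The names train_cities, new_city and restored are illustrative and not from the original post, and pickle is only one assumed way to carry the fitted encoder into a separate prediction script.

```python
import pickle

from sklearn.preprocessing import LabelEncoder

# Illustrative training column (hypothetical data mirroring the question)
train_cities = ['London', 'Paris', 'New York']

# Training phase: fit the encoder once; classes_ holds the sorted labels
labelencoder = LabelEncoder()
train_labels = labelencoder.fit_transform(train_cities)
mapping = {str(c): int(i) for i, c in enumerate(labelencoder.classes_)}
print(mapping)

# Prediction phase: reuse the fitted encoder with transform only
new_city = ['New York']
new_label = int(labelencoder.transform(new_city)[0])

# For comparison: a fresh encoder fitted on the single value sees only
# one class, so it always returns 0 and the training mapping is lost
refit_label = int(LabelEncoder().fit_transform(new_city)[0])

# Assumption: if prediction runs in another process, the fitted encoder
# can be serialized and restored; the mapping survives the round trip
restored = pickle.loads(pickle.dumps(labelencoder))
restored_label = int(restored.transform(new_city)[0])

print(new_label, refit_label, restored_label)
```

Note that transform raises an error for a city that was not present when the encoder was fitted, so the prediction input has to match one of the training values exactly, including any trailing spaces.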
