Hadoop MapReduce - Join of two files and Computation on grouped values

0 votes

I am fairly new to Hadoop and MapReduce programming. I want to know whether it is possible to group by another value (not key) after joining of two files.

I have two files which have following data

File1

name    marks
A       Male
B       Male
C       Female


File2

name    marks
A       25
B       28
A       30
C       22

Now is there any method to find the percentage of marks for each gender. I am trying to get the following as output

Male    percentage_of_marks_of_male_students
Female  percentage_of_marks_of_female_students

Is there anyway to do this in a single job. I've tried using two jobs for this, but couldn't find any headway.

Any tips would be appreciated.


Jan 20, 2021 in Big Data Hadoop by anonymous

edited Mar 4, 2025 316 views

No answer to this question. Be the first to respond.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP