As much I understand Reduce phase start with the map phase and keep consuming the record from maps. However, since there are sort and shuffle phase after the map phase all the outputs have to be sorted and sent to the reducer. So logically you can imagine that reduce phase starts only after map phase but actually for performance reason reducers are also initialized with the mappers.