When I start a MapReduce, does C3:
- load all the objects into memory, then split the resulting collection and send the pieces to the map() workers,
- or does each map() worker load a subset of the objects?
I have quite a few includes in my MapReduce spec, so I wouldn’t want the master to load all the objects in a single batch.
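
To make the two alternatives concrete, here is a minimal sketch in plain Python. It does not use any C3 API: `STORE`, `fetch`, and both function names are hypothetical stand-ins, and the map() workers are simulated by running sequentially in one process.

```python
from typing import Callable, List

# Hypothetical stand-in for the persisted objects; not a C3 API.
STORE: List[int] = list(range(10))


def fetch(offset: int, limit: int) -> List[int]:
    """Pretend to load `limit` objects starting at `offset`."""
    return STORE[offset:offset + limit]


def master_loads_all(map_fn: Callable[[List[int]], int], batch_size: int) -> List[int]:
    # Alternative 1: the master loads every object, then splits the
    # collection and hands each piece to a map() worker.
    objects = fetch(0, len(STORE))
    batches = [objects[i:i + batch_size] for i in range(0, len(objects), batch_size)]
    return [map_fn(batch) for batch in batches]


def workers_load_subsets(map_fn: Callable[[List[int]], int], batch_size: int) -> List[int]:
    # Alternative 2: the master only hands out (offset, limit) ranges;
    # each map() worker loads its own subset of the objects.
    ranges = [(i, batch_size) for i in range(0, len(STORE), batch_size)]
    return [map_fn(fetch(offset, limit)) for offset, limit in ranges]
```

Both functions return the same map results; the difference is that in the first one the master holds all the objects at once, while in the second it only ever holds the list of ranges.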