A single-server MapReduce tool for data processing. A blog post about the implementation can be found here
- Map phase and intermediate data generation
- Partitioning the intermediate data into buckets
- Transitioning all workers from the map phase to the reduce phase (each worker waits for the other map workers to complete their tasks)
- Reduce phase
- Generating output files
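
The stages above can be sketched in a few dozen lines. This is a minimal illustration and not the tool's actual code: it assumes a word-count job, threads as workers, CRC32 hashing for bucket assignment, and `part-N.txt` output files. The names `map_fn`, `reduce_fn`, `bucket_of`, `run_job`, and `read_results` are all hypothetical.

```python
import os
import tempfile
import threading
import zlib
from collections import defaultdict

NUM_WORKERS = 3  # assumption: each thread acts as both a map and a reduce worker


def map_fn(line):
    """Map: emit (word, 1) for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def reduce_fn(key, values):
    """Reduce: sum all counts collected for one key."""
    return key, sum(values)


def bucket_of(key, num_buckets):
    """Partition: a stable hash decides which reduce bucket owns a key."""
    return zlib.crc32(key.encode("utf-8")) % num_buckets


def run_job(lines, out_dir, num_workers=NUM_WORKERS):
    # buckets[m][r] holds the pairs map worker m produced for reduce worker r.
    # Each map worker writes only to its own row, so no lock is needed.
    buckets = [[[] for _ in range(num_workers)] for _ in range(num_workers)]
    barrier = threading.Barrier(num_workers)

    def worker(wid):
        # Map phase: each worker handles every num_workers-th input line.
        for line in lines[wid::num_workers]:
            for key, value in map_fn(line):
                buckets[wid][bucket_of(key, num_workers)].append((key, value))

        # Transition: block until every map worker has finished.
        barrier.wait()

        # Reduce phase: gather bucket `wid` from all map workers, group by key.
        grouped = defaultdict(list)
        for m in range(num_workers):
            for key, value in buckets[m][wid]:
                grouped[key].append(value)

        # Output: one file per reduce worker, keys in sorted order.
        path = os.path.join(out_dir, f"part-{wid}.txt")
        with open(path, "w", encoding="utf-8") as f:
            for key in sorted(grouped):
                k, total = reduce_fn(key, grouped[key])
                f.write(f"{k}\t{total}\n")

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()


def read_results(out_dir):
    """Merge all output files back into one dict (for inspection only)."""
    results = {}
    for name in sorted(os.listdir(out_dir)):
        with open(os.path.join(out_dir, name), encoding="utf-8") as f:
            for row in f:
                key, total = row.rstrip("\n").split("\t")
                results[key] = int(total)
    return results


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as out_dir:
        run_job(["the quick fox", "the lazy dog", "The end"], out_dir)
        print(read_results(out_dir))
```

The barrier is what makes the phase transition safe: no worker starts reducing its bucket until every map worker has finished filling it. Because the hash sends each key to exactly one bucket, the output files never share a key and can simply be concatenated.
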
Based on the MapReduce paper, "MapReduce: Simplified Data Processing on Large Clusters" (Dean and Ghemawat, 2004).