CSC352 Problem of the Day
Homework #4, Problem #2
- Conditions:
- Processing of wiki pages with Hadoop on 6-PC cluster
- Same Mapper and same Reducer program to process two different input folders
Number of files | Number of wiki pages | Execution Time (seconds) |
---|---|---|
589 | 589 | 388 |
1 | 117,617 | 30.7 |
Ratio=589/1 | Ratio=1/199 | Ratio=12.6/1 |
- Discuss these results
- Identify the parties responsible for this surprising difference