Difference between revisions of "CSC352 Problem of the Day"
(→Homework #4, Problem #2) |
(→Homework #4, Problem #2) |
||
Line 11: | Line 11: | ||
! Number of files | ! Number of files | ||
! Number of wiki pages | ! Number of wiki pages | ||
+ | ! Number of categories | ||
! Execution Time<br />(seconds) | ! Execution Time<br />(seconds) | ||
|- | |- | ||
| 589 | | 589 | ||
| 589 | | 589 | ||
+ | | 832 | ||
| 388 | | 388 | ||
|- | |- | ||
| 1 | | 1 | ||
| 117,617 | | 117,617 | ||
+ | | 51,120 | ||
| 30.7 | | 30.7 | ||
|- | |- | ||
| Ratio=589/1 | | Ratio=589/1 | ||
| Ratio=1/199 | | Ratio=1/199 | ||
+ | | Ratio=1/61.4 | ||
| Ratio=12.6/1 | | Ratio=12.6/1 | ||
|} | |} |
Revision as of 20:56, 26 April 2010
Homework #4, Problem #2
- Conditions:
- Processing of wiki pages with Hadoop on 6-PC cluster
- Same Mapper and same Reducer program to process two different input folders
Number of files | Number of wiki pages | Number of categories | Execution Time (seconds) |
---|---|---|---|
589 | 589 | 832 | 388 |
1 | 117,617 | 51,120 | 30.7 |
Ratio=589/1 | Ratio=1/199 | Ratio=1/61.4 | Ratio=12.6/1 |
- Discuss these results
- Identify the parties responsible for this surprising difference