Difference between revisions of "CSC352 Project 3"
Line 7: | Line 7: | ||
<onlysmith> | <onlysmith> | ||
=The Big Picture= | =The Big Picture= | ||
+ | {| | ||
<tanbox> | <tanbox> | ||
− | |||
Your project should present your answers to the following three questions: | Your project should present your answers to the following three questions: | ||
* How should one attempt to process 5 Million Wikipedia pages with MapReduce/Hadoop? What parameters control the execution time, and what is the best guess for the values they should be set at? | * How should one attempt to process 5 Million Wikipedia pages with MapReduce/Hadoop? What parameters control the execution time, and what is the best guess for the values they should be set at? | ||
Line 14: | Line 14: | ||
* How does this compare to the execution time of the 5 Million pages on an XGrid system? | * How does this compare to the execution time of the 5 Million pages on an XGrid system? | ||
</tanbox> | </tanbox> | ||
+ | | | ||
+ | [[Image:cherriesXparent.gif|right|50px]] | ||
+ | |} | ||
+ | <br /> | ||
=Assignment (same as for the XGrid Project)= | =Assignment (same as for the XGrid Project)= |
Revision as of 14:59, 17 April 2010
This is the extension of Project #2, which is built on top of the Hadoop/Mapreduce Tutorials. It is due on the last day of Exams, at 4:00 p.m.