Difference between revisions of "CSC352 Project 3"

Line 7:
 <onlysmith>
 =The Big Picture=
+{|
 <tanbox>
-[[Image:cherriesXparent.gif|right|50px]]
 Your project should present your answers to the following three questions:
 * How should one attempt to process 5 Million Wikipedia pages with MapReduce/Hadoop?  What parameters control the execution time, and what is the best guess for the values they should be set at?
Line 14:
 * How does this compare to the execution time of the 5 Million pages on an XGrid system?
 </tanbox>
+|
+[[Image:cherriesXparent.gif|right|50px]]
+|}
+<br />
 
 =Assignment (same as for the XGrid Project)=

Revision as of 13:59, 17 April 2010


This project extends Project #2, which builds on the Hadoop/MapReduce Tutorials. It is due on the last day of exams, at 4:00 p.m.


This section is only visible to computers located at Smith College