Difference between revisions of "CSC352 Project 3"
(11 intermediate revisions by the same user not shown) | |||
Line 2: | Line 2: | ||
<bluebox> | <bluebox> | ||
− | This is the extension of [[CSC352_Project_2 | Project #2]], which is built on top of the [[ | + | This is the extension of [[CSC352_Project_2 | Project #2]], which is built on top of the [[Hadoop/MapReduce_Tutorials| Hadoop/Mapreduce Tutorials]]. It is due on the last day of Exams, at 4:00 p.m. |
</bluebox> | </bluebox> | ||
<onlysmith> | <onlysmith> | ||
=The Big Picture= | =The Big Picture= | ||
+ | {| | ||
+ | | | ||
<tanbox> | <tanbox> | ||
Your project should present your answers to the following three questions: | Your project should present your answers to the following three questions: | ||
Line 13: | Line 15: | ||
* How does this compare to the execution time of the 5 Million pages on an XGrid system? | * How does this compare to the execution time of the 5 Million pages on an XGrid system? | ||
</tanbox> | </tanbox> | ||
+ | | | ||
+ | [[Image:cherriesXparent.gif|right|100px]] | ||
+ | |} | ||
+ | <br /> | ||
=Assignment (same as for the XGrid Project)= | =Assignment (same as for the XGrid Project)= | ||
Line 116: | Line 122: | ||
</pre></code> | </pre></code> | ||
+ | |||
+ | You are free to put additional wiki pages from the local disk of Hadoop6 into HDFS, but if you do so, do it in the '''wikipages''' directory, and update the README_dft.txt file in the HDFS wikipages directory with information about what you have added and how to access it. Thanks! | ||
===Web Server=== | ===Web Server=== | ||
Line 121: | Line 129: | ||
Of course, all the pages are still available on XGridMac, as they were for Project 2. It is up to you to figure out if it is worth exploring writing MapReduce programs that would gather the pages from the Web rather than from HDFS. | Of course, all the pages are still available on XGridMac, as they were for Project 2. It is up to you to figure out if it is worth exploring writing MapReduce programs that would gather the pages from the Web rather than from HDFS. | ||
− | |||
− | + | =Submission= | |
Submit a pdf (and additional files if needed) as follows: | Submit a pdf (and additional files if needed) as follows: | ||
Line 140: | Line 147: | ||
tar -czvf ''yourFirstNameProject3.tgz'' * | tar -czvf ''yourFirstNameProject3.tgz'' * | ||
submit project3 ''yourFirstNameProject3.tgz'' | submit project3 ''yourFirstNameProject3.tgz'' | ||
+ | |||
+ | =Extra Credits= | ||
+ | |||
+ | Extra credits will be given for some work done on AWS. This could be the whole project or sections of it, or just comparison on some of the input sets. | ||
+ | </onlysmith> | ||
<br /> | <br /> | ||
Line 148: | Line 160: | ||
<br /> | <br /> | ||
<br /> | <br /> | ||
− | [[Category:CSC352]][[Category: | + | [[Category:CSC352]][[Category:Project]][[Category:MapReduce]][[Category:XGrid]] |
Latest revision as of 13:07, 18 November 2010
This is the extension of Project #2, which is built on top of the Hadoop/Mapreduce Tutorials. It is due on the last day of Exams, at 4:00 p.m.