Hadoop/MapReduce Tutorials

From dftwiki3
Revision as of 07:25, 1 April 2010 by Thiebaut (talk | contribs)
Jump to: navigation, search
AmazonAWS.jpg


HadoopCartoon.png



These tutorials target the Hadoop/MapReduce Cluster in the CS Dept. at Smith College, as well as Amazon's EC2 and S3.




Tutorial Comments

Tutorial #1

Running WordCount written in Java on the Smith College Hadoop/MapReduce Cluster

Tutorial #2

Running WordCount in Python on the Smith College Hadoop/MapReduce Cluster

Tutorial #3

Running Hadoop jobs on Amazon AWS

Tutorial 3.1

Uploading text to S3 and running Amazon's WordCount Java program on our own data.

Tutorial 3.2

Compiling our own version of the Java WordCount program and uploading it to AWS.

Tutorial 4

Start a server on Amazon's EC2 infrastructure