Difference between revisions of "Hadoop/MapReduce Tutorials"
Line 17: | Line 17: | ||
! Comments | ! Comments | ||
|- | |- | ||
− | | | + | | width="30%" | |
[[Hadoop Tutorial 1 -- Running WordCount | Tutorial #1]] | [[Hadoop Tutorial 1 -- Running WordCount | Tutorial #1]] | ||
| | | | ||
Line 33: | Line 33: | ||
|- | |- | ||
| | | | ||
− | [[Hadoop_Tutorial_3.1_--_Using_Amazon's_WordCount_program | Tutorial 3.1]] | + | [[Hadoop_Tutorial_3.1_--_Using_Amazon's_WordCount_program | Tutorial #3.1]] |
| | | | ||
Uploading text to S3 and running Amazon's WordCount Java program on our own data. | Uploading text to S3 and running Amazon's WordCount Java program on our own data. | ||
|- | |- | ||
| | | | ||
− | [[Hadoop_Tutorial_3.2_--_Using_Your_Own_WordCount_program | Tutorial 3.2]] | + | [[Hadoop_Tutorial_3.2_--_Using_Your_Own_WordCount_program | Tutorial #3.2]] |
| | | | ||
Compiling our own version of the Java WordCount program and uploading it to AWS. | Compiling our own version of the Java WordCount program and uploading it to AWS. | ||
|- | |- | ||
| | | | ||
− | [[Hadoop Tutorial 4: Start an EC2 Instance | Tutorial 4]] | + | [[Hadoop Tutorial 4: Start an EC2 Instance | Tutorial #4]] |
| | | | ||
Start a server on Amazon's EC2 infrastructure | Start a server on Amazon's EC2 infrastructure |
Revision as of 06:26, 1 April 2010
These tutorials target the Hadoop/MapReduce Cluster in the CS Dept. at Smith College, as well as Amazon's EC2 and S3.
Tutorial | Comments |
---|---|
Running WordCount written in Java on the Smith College Hadoop/MapReduce Cluster | |
Running WordCount in Python on the Smith College Hadoop/MapReduce Cluster | |
Running Hadoop jobs on Amazon AWS | |
Uploading text to S3 and running Amazon's WordCount Java program on our own data. | |
Compiling our own version of the Java WordCount program and uploading it to AWS. | |
Start a server on Amazon's EC2 infrastructure |