Difference between revisions of "Hadoop/MapReduce Tutorials"

From dftwiki3
Jump to: navigation, search
Line 17: Line 17:
 
! Comments
 
! Comments
 
|-
 
|-
|
+
| width="30%" |
 
[[Hadoop Tutorial 1 -- Running WordCount | Tutorial #1]]
 
[[Hadoop Tutorial 1 -- Running WordCount | Tutorial #1]]
 
|  
 
|  
Line 33: Line 33:
 
|-
 
|-
 
|
 
|
[[Hadoop_Tutorial_3.1_--_Using_Amazon's_WordCount_program | Tutorial 3.1]]
+
[[Hadoop_Tutorial_3.1_--_Using_Amazon's_WordCount_program | Tutorial #3.1]]
 
|
 
|
 
Uploading text to S3 and running Amazon's WordCount Java program on our own data.
 
Uploading text to S3 and running Amazon's WordCount Java program on our own data.
 
|-
 
|-
 
|
 
|
[[Hadoop_Tutorial_3.2_--_Using_Your_Own_WordCount_program | Tutorial 3.2]]
+
[[Hadoop_Tutorial_3.2_--_Using_Your_Own_WordCount_program | Tutorial #3.2]]
 
|
 
|
 
Compiling our own version of the Java WordCount program and uploading it to AWS.
 
Compiling our own version of the Java WordCount program and uploading it to AWS.
 
|-
 
|-
 
|
 
|
[[Hadoop Tutorial 4: Start an EC2 Instance | Tutorial 4]]
+
[[Hadoop Tutorial 4: Start an EC2 Instance | Tutorial #4]]
 
|
 
|
 
Start a server on Amazon's EC2 infrastructure
 
Start a server on Amazon's EC2 infrastructure

Revision as of 06:26, 1 April 2010

AmazonAWS.jpg


HadoopCartoon.png



These tutorials target the Hadoop/MapReduce Cluster in the CS Dept. at Smith College, as well as Amazon's EC2 and S3.




Tutorial Comments

Tutorial #1

Running WordCount written in Java on the Smith College Hadoop/MapReduce Cluster

Tutorial #2

Running WordCount in Python on the Smith College Hadoop/MapReduce Cluster

Tutorial #3

Running Hadoop jobs on Amazon AWS

Tutorial #3.1

Uploading text to S3 and running Amazon's WordCount Java program on our own data.

Tutorial #3.2

Compiling our own version of the Java WordCount program and uploading it to AWS.

Tutorial #4

Start a server on Amazon's EC2 infrastructure