Difference between revisions of "CSC352 Class Page 2017"

From dftwiki3
Jump to: navigation, search
( )
( )
 
(155 intermediate revisions by the same user not shown)
Line 4: Line 4:
 
<br />
 
<br />
 
<center>[[CSC_352_--_2017 | Home]] | [[CSC352 Syllabus -- Spring 2017 | Syllabus]] | [[CSC352 Class Page 2017 | Schedule]] |
 
<center>[[CSC_352_--_2017 | Home]] | [[CSC352 Syllabus -- Spring 2017 | Syllabus]] | [[CSC352 Class Page 2017 | Schedule]] |
[[CSC352_Class_Page_2017#Links_and_Resources | Links &amp; Resources]]</center><br />
+
[[CSC352_Class_Page_2017#Links_and_Resources | Links &amp; Resources]] | [[CSC352_Project_Page_2017 | Final Project]]</center><br />
 
<br />
 
<br />
 
{| class="mw-collapsible mw-collapsed" style="width:100%" border="1"
 
{| class="mw-collapsible mw-collapsed" style="width:100%" border="1"
Line 128: Line 128:
 
----
 
----
  
::* Introduction to Latex
+
::* '''Newsletter'''
:::* [[Tutorial:_Writing_a_Latex_paper_with_ShareLatex.com|DT's tutorial #1]]
 
:::* [[Latex_Skeleton_for_Simple_Articles_and_Tech_Reports| A page with a Latex skeleton for ''technical reports'']]
 
:::* [[Latex Example: Bib File: Example Bib File| Example of Bibtex file]]
 
:::* Learn how to find BibTex entries. Example: [http://dl.acm.org/citation.cfm?id=1525689 The Unreasonable Effectiveness of Data] (go to ACM and click on BibTex link).
 
----
 
 
 
::* Newsletter
 
 
:::* [[Media:CSC352_ExampleLatexNewsletter.pdf|Example newsletter]]
 
:::* [[Media:CSC352_ExampleLatexNewsletter.pdf|Example newsletter]]
 
::* The format should be similar to the [http://technews.acm.org/ ACM Tech News] format.
 
::* The format should be similar to the [http://technews.acm.org/ ACM Tech News] format.
Line 151: Line 144:
 
::** The Official Google blog
 
::** The Official Google blog
 
::** Review: Tom's Hardware
 
::** Review: Tom's Hardware
::** Some of the  sites listed in [http://tech.blorge.com/Structure:%20/2008/11/15/top-40-technology-news-sites-the-definitive-guide/ blorge.com]'s top-40 list.
+
::** Some of the  sites listed in [https://rohidassanap.wordpress.com/2013/06/18/top-40-best-technology-news-websites-the-definitive-list/ this page's] top 40 list.
 
::* Recommendation for news aggregator:  [http://cloud.feedly.com/#welcome Feedly.com]
 
::* Recommendation for news aggregator:  [http://cloud.feedly.com/#welcome Feedly.com]
 +
 +
----
 +
 +
::* '''Introduction to Latex'''
 +
:::* [[Tutorial:_Writing_a_Latex_paper_with_ShareLatex.com|DT's tutorial #1]]
 +
:::* [[Latex_Skeleton_for_Simple_Articles_and_Tech_Reports| A page with a Latex skeleton for ''technical reports'']]
 +
:::* [[Latex Example: Bib File: Example Bib File| Example of Bibtex file]]
 +
:::* Learn how to find BibTex entries. Example: [http://dl.acm.org/citation.cfm?id=1525689 The Unreasonable Effectiveness of Data] (go to ACM and click on BibTex link).
  
 
----
 
----
Line 164: Line 165:
 
::* [[Python Multithreading/Multiprocessing Examples| Python programs ]] illustrating threads and processes
 
::* [[Python Multithreading/Multiprocessing Examples| Python programs ]] illustrating threads and processes
 
::* Lab (see slides)
 
::* Lab (see slides)
::* Multithreading in Java
+
::* Multithreading in Java.  See this [[Computing_Pi| page on Java threads, and the computation of Pi in Java]]
 
::* [[Media:CSC352_2017_Week2.pdf| Slides (pdf)]]
 
::* [[Media:CSC352_2017_Week2.pdf| Slides (pdf)]]
  
 
* '''Thursday'''
 
* '''Thursday'''
 
+
::* Presenting Technical Papers, [[Media:TheButlerDidIt_2017.pdf | a Presentation]]
 +
::* Multiprocessing in Python: '''in-class lab''' on parallel computation of Pi using the Monte-Carlo approach.  Multi-Processing
 +
::* [[Multiprocessing Python Program Using Monte Carlo to Compute Pi| Lujun's Lab solution program]]
 +
::* [[Media:CSC352_2017_Week2.pdf| Slides (pdf)]]
 
|
 
|
 
*  
 
*  
Line 178: Line 182:
 
** [http://en.wikibooks.org/wiki/LaTeX WikiBooks guide to Latex]
 
** [http://en.wikibooks.org/wiki/LaTeX WikiBooks guide to Latex]
 
** [http://www-biblio.physik.uni-bonn.de/info/downloads/thesis_guide.pdf Users Guide to Writing a Thesis in a Physics/Astronomy Institute of the University of Bonn]  
 
** [http://www-biblio.physik.uni-bonn.de/info/downloads/thesis_guide.pdf Users Guide to Writing a Thesis in a Physics/Astronomy Institute of the University of Bonn]  
 +
* References on Processes and Threads
 +
** [http://docs.oracle.com/javase/tutorial/essential/concurrency/index.html Oracle's lesson on ''Concurrency''], a very good source of information on ''processes'' and ''threads'' for the Java platform.
 
|}
 
|}
 
<br />
 
<br />
 +
 +
  
 
====&nbsp;====
 
====&nbsp;====
Line 197: Line 205:
 
|-valign="top"
 
|-valign="top"
 
|
 
|
* '''Tuesday/Thursday'''
+
* '''Tuesday'''
::*  
+
::* [[CSC352:_Using_Bash,_an_example| Using bash to measure execution time]]
 +
::* Synchronizing access to shared variables and data structures.
 +
::* [[CSC352_Synchronization_and_Java_Threads| DT's notes on Java Synchronization]] (contains programs seen in class)
 +
::* [[Media:CSC352Slides_Week3.pdf| slides (pdf)]]
 +
 
 
* '''Thursday'''
 
* '''Thursday'''
 
::* <font color="magenta">Paper Presentation: Berkeley</font>
 
::* <font color="magenta">Paper Presentation: Berkeley</font>
Line 205: Line 217:
 
*  
 
*  
 
|
 
|
*  
+
* Read Oracle's documentation on [http://docs.oracle.com/javase/tutorial/essential/concurrency/locksync.html locks and synchronized methods, objects, and classes].
 +
* Read [https://en.wikipedia.org/wiki/Compare-and-swap Wikipedia's article on Compare-and-Swap]
 
|}
 
|}
 
<br />
 
<br />
Line 226: Line 239:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* Discussion of View from Berkeley paper.  Many concepts to revisit
 +
:::* Moore's Law
 +
:::* Amdahl's Law
 +
:::* Multi-Core vs Many-Core
 +
:::* Nanometers
 +
:::* Multi-Core Performance as a fn of # of Processors
 +
:::* Programming Lab: Making the Game of Life parallel
 +
:::* [[CSC352:_Using_Bash,_an_example|Using Script Files to Measure Performance of Programs]]
 +
::* [[Media:CSC352_Week4.pdf| Slides (pdf)]]
 
* '''Thursday'''
 
* '''Thursday'''
::* Rocco Piccinino: presentation on Library Resources
+
::* Rocco Piccinino: presentation on Library Resources.  You can find a link to Rocco's presentation [http://libguides.smith.edu/c.php?g=619276&p=4310286 here]
::* <font color="magenta">Paper Presentations: Patterns + Framework, and Throughput Oriented Architectures</font>
+
::* <font color="magenta">Paper Presentations: Patterns + Framework (presented by Riley), and Throughput Oriented Architectures</font>
 +
::* Game of Life?
  
 
|
 
|
 
*  
 
*  
 
|
 
|
*  
+
Two papers to read for Thrusday.  In class discussion.
 +
* [http://www.science.smith.edu/dftwiki/images/a/a6/ParallelCOmputingWithPatternsAndFrameworks2010b.pdf Parallel computing with Patterns and Frameworks]
 +
* [http://www.science.smith.edu/dftwiki/images/0/03/UnderstandingThroughputOrientedArchitectures2010.pdf Understanding Throughput-Oriented Architectures]
 
|}
 
|}
 
<br />
 
<br />
Line 255: Line 279:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* [[CSC352_Game_of_Life_Lab_2017| Game of Life]]
::* <font color="red">Newsletter due</font>
+
::* [[CSC352: Using BlockingQueues| Using blocking queues]]
 +
::* [[CSC352:_Using_Bash_Scripts_to_Measure_Program_Execution_Time | Using Bash to measure execution times]]
 +
::* [[Media:CSC352_slides_Week5.pdf| Slides (pdf)]]
 
* '''Thursday: <font color="magenta">Rally Day</font>'''
 
* '''Thursday: <font color="magenta">Rally Day</font>'''
 
+
::* <font color="red">Newsletter due</font> on Moodle, at 1:00 p.m., just before Rally Day events.
 +
::* '''Requirements for newsletter''': Add an image or figure to your newsletter.  It could be a photo of you at the top, as the author, or a graph, or a logo.  You pick the jpeg or png, and incorporate it in your newsletter.
 
|
 
|
*  
+
* [[CSC352 Game of Life Homework 1| Game of Life Homework]]
 
|
 
|
*  
+
* [https://docs.oracle.com/javase/7/docs/api/java/util/concurrent/BlockingQueue.html Java BlockingQueues]
 +
* [http://tutorials.jenkov.com/java-util-concurrent/blockingqueue.html Simple tutorial on BlockingQueues]
 +
* [https://docs.oracle.com/javase/7/docs/api/java/util/concurrent/ArrayBlockingQueue.html ArrayBlockingQueues]
 
|}
 
|}
 
<br />
 
<br />
Line 283: Line 312:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* Game of Life, revisited
 +
::* Deadlocks, starvation, and the Dining Philosophers problem
 +
::* Thread states
 +
::* Measuring performance: speedup, bash scripts, R graphs
 +
::* Performance measures
 +
::* [[Media:CSC352_week6_2017.pdf | Slides Week 6 (pdf)]]
 +
::* [[Media:CSC352_CTutorial_2017.pdf | Slides for C Tutorial (pdf)]].  [[Solutions_to_Introduction_to_C_Presentation| Solutions for exercises]].
 
* '''Thursday'''
 
* '''Thursday'''
::* <font color="magenta">Paper Presentation: Unreasonable, and Top500</font>
+
::* <font color="magenta">Paper Presentation: Unreasonable (Lujun), and Top500</font>
  
 
|
 
|
*  
+
* [[CSC352 Game of Life Homework 2| Homework 2]], due Thursday after the break, 3/23/17, at 1:00 p.m.
 
|
 
|
 
*  
 
*  
 
|}
 
|}
 
<br />
 
<br />
 +
 
====&nbsp;====
 
====&nbsp;====
 
----
 
----
Line 310: Line 346:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* C & MPI
 +
::* [[CSC352_GCC-Generated_Assembly_ARM_%26_Pentium| Comparing executable C code]] for ARM and for Pentium
 +
::* [[CSC352 MPI pi2.c program | Computing Pi]] with MPI
 +
::* [[Game of Life in C and MPI | Game of Life]] in C
 +
<br />
 +
::* [[Media:CSC352_Week7_MPISlides.pdf| MPI slides (pdf)]]
 +
::* [[Media:CSC352_Week7_CTutorial.pdf| C Tutorial slides (pdf)]]
 
* '''Thursday'''
 
* '''Thursday'''
::* <font color="magenta">Paper Presentation: Success of MPI, and General purpose vs GPU</font>
+
::* A word about the [[CSC352_Project_Page_2017| final project]]
::* <font color="red">Newsletter due</font>
+
::* <font color="magenta">Paper Presentation: Success of MPI (Zainab), and General purpose vs GPU</font>
 +
::* <font color="red">Newsletter due</font>.  This week, add a sentence to each summary that explains why you picked this news item.  Why does the news you reported matters?  How is it important to you?  Does it connect in any way to something you have read or programmed?  Also, this week, add an a section to your newsletter that refers to Amdahl's law.  The purpose of this is to have you create an equation with Latex.  So, you can simply explain what Amdahl's law is, or you can find a bit of news in the past year that mentions Amdahl's Law, and take this opportunity to create an equation: Speedup = T(1)/T(N).
 
|
 
|
*  
+
* Prepare a 1-page description of the project you will present at the end of the semester.  Submit it before March 21, 1:00 p.m. to Moodle.  See this [[CSC352_Project_Page_2017| page]] for information.
 
|
 
|
*  
+
* [https://computing.llnl.gov/tutorials/mpi/ MPI Tutorials]  by Blaise Barney of the Lawrence Livermore National Laboratory
 +
* [http://cslibrary.stanford.edu/101/EssentialC.pdf Essential C] by Nick Parlante, Stanford U.
 
|}
 
|}
 
<br />
 
<br />
Line 325: Line 369:
 
|
 
|
 
'''SPRING BREAK'''
 
'''SPRING BREAK'''
 +
<br />
 +
<!--  [[Image:CalvinDancing.gif | center]] -->
 
|}
 
|}
 
<!-- ============================================================= -->
 
<!-- ============================================================= -->
 
<!-- ============================================================= -->
 
<!-- ============================================================= -->
 
<br />
 
<br />
 +
 
====&nbsp;====
 
====&nbsp;====
 
----
 
----
Line 346: Line 393:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* 1-page project description due on Moodle
 +
::* C &amp; MPI
 +
::* [http://www.mpich.org/static/docs/latest/www3/ All MPI functions (for reference only)]
 +
::* [[Media:CSC352_Week8_CTutorial.pdf| C-Tutorial Slides (pdf)]]
 +
::* [[Media:CSC352_Week8_MPISlides.pdf| MPI slides (pdf)]]
 +
 
 
* '''Thursday'''
 
* '''Thursday'''
::* <font color="magenta">Paper Presentations: Virtualization, and View of Cloud</font>
+
::* <font color="magenta">Paper Presentations: Virtualization, and View of Cloud (Kathleen)</font>
 
+
::* [[Setup_MPI_on_Hadoop_Cluster | Running MPI programs on Local Cluster (Hadoop Cluster)]]
 +
::* [[Media:CSC352_Week8_MPISlides.pdf| MPI slides (pdf)]]
 
|
 
|
 
*  
 
*  
 
|
 
|
*  
+
* [http://cslibrary.stanford.edu/101/EssentialC.pdf Essential C] by Nick Parlante, Stanford U.
 +
* [https://computing.llnl.gov/tutorials/mpi/ MPI Tutorials]  by Blaise Barney of the Lawrence Livermore National Laboratory
 +
 
 +
* [http://star.mit.edu/cluster/docs/latest/overview.html StarCluster overview]: must read.  StarCluster is a set of Python programs that one installs on one's laptop to control clusters on Amazon Web Services (AWS).  StarCluster can easily setup MPI and Hadoop clusters.
 
|}
 
|}
 
<br />
 
<br />
 +
 
====&nbsp;====
 
====&nbsp;====
 
----
 
----
Line 373: Line 430:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* MPI, Continued
 +
::* [[Media:CSC352_Week9_MPI.pdf | Slides (pdf)]]
 
* '''Thursday'''
 
* '''Thursday'''
::* <font color="magenta">Paper Presentations: MapReduce and (Nobody got fired + Beyond Hadoop)</font>
+
::* MPI, Continued
 +
::* [[Media:CSC352_Week9_MPI.pdf | Slides (pdf)]]
 +
::* [[Tutorial:_Create_an_MPI_Cluster_on_the_Amazon_Elastic_Cloud_(EC2)| MPI on AWS, Part 1 (Hello World!)]]
 +
::* <font color="magenta">Paper Presentations: MapReduce (Sam) and (Nobody got fired + Beyond Hadoop) (Grace)</font>
 
::* <font color="red">Newsletter due</font>
 
::* <font color="red">Newsletter due</font>
  
Line 384: Line 445:
 
|}
 
|}
 
<br />
 
<br />
 +
 
====&nbsp;====
 
====&nbsp;====
 
----
 
----
Line 395: Line 457:
 
{| class="mw-collapsible " style="width:100%" border="1"
 
{| class="mw-collapsible " style="width:100%" border="1"
 
|- style="background:#dddddd;"
 
|- style="background:#dddddd;"
|width="60%"| '''Topics''':  
+
|width="60%"| '''Topics''': MPI, AWS
 
|width="15%"| '''Lab/Hw'''
 
|width="15%"| '''Lab/Hw'''
 
|width="25%"| '''Reading'''  
 
|width="25%"| '''Reading'''  
Line 401: Line 463:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* More MPI functions
 +
::* [[Tutorial:_Create_an_MPI_Cluster_on_the_Amazon_Elastic_Cloud_(EC2)| MPI on AWS, Part 1 (Hello World!)]]
 +
::* [[Computing_Pi_on_an_AWS_MPI-Cluster | MPI on AWS, Part 2 (Computing Pi)]]
 +
::* [[media:CSC352_Slides_Week10MPIAWS_2017.pdf| slides (pdf)]]
 
* '''Thursday'''
 
* '''Thursday'''
::* <font color="magenta">Paper Presentations: Spark and Microsoft Bet Future</font>
+
::* <font color="magenta">Paper Presentations: Spark (Youyou), and Microsoft Bet Future (Isaiah & Muriel)</font>
  
 
|
 
|
*  
+
* [[CSC352 Homework 3 2017| Homework 3]]
 
|
 
|
 
*  
 
*  
 
|}
 
|}
 
<br />
 
<br />
 +
 
====&nbsp;====
 
====&nbsp;====
 
----
 
----
Line 428: Line 494:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::*  
+
::* Finish [[Computing_Pi_on_an_AWS_MPI-Cluster | tutorial on MPI on AWS (Computing Pi)]]
 +
::* Map-Reduce
 +
:::* Introduction to Functional Programming
 +
:::* Implementing Map-Reduce in Python
 
* '''Thursday'''
 
* '''Thursday'''
::* <font color="magenta">Paper Presentations: Wikipedia dumps, and Cloud &amp; Top500</font>
+
::* Class cancelled
::* <font color="red">Newsletter due</font>
 
  
 
|
 
|
Line 439: Line 507:
 
|}
 
|}
 
<br />
 
<br />
 +
 
====&nbsp;====
 
====&nbsp;====
 
----
 
----
Line 455: Line 524:
 
|-valign="top"
 
|-valign="top"
 
|
 
|
* '''Tuesday'''
+
* '''Tuesday/Thursday'''
* '''Thursday'''
+
::* <font color="magenta">Paper Presentations: Wikipedia dumps (Angie), and Can Cloud Reach Top500 (Vega)</font>
::* 3 Project Presentations (lunch period included)?
+
::* Map-Reduce
 +
:::* Implementing Map-Reduce in Python: various problems solved with Map-Reduce
 +
::::* [[Map-Reduce_Examples| Example page]]
 +
::::* [[CSC352_Game_of_Life_in_Map-Reduce| Solving Conway's Game of Life with Map-Reduce]]
 +
 
 +
::* [[Media:CSC352_Week12_MapReduce.pdf | Map-Reduce slides (pdf)]]
 +
 
 
|
 
|
 
*  
 
*  
Line 482: Line 557:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::* 3 Project Presentations (Lunch time included?)
+
::* Labs
 +
:::* [[Tutorial:_Creating_a_Hadoop_Cluster_on_Amazon_AWS | Lab 1: Hadoop on AWS]]
 +
:::* [[Hadoop_Tutorial_2.3_--_Running_WordCount_in_Python_on_AWS | Lab 2:Hadoop on AWS: streaming Python]]
 +
:::* [[Hadoop_Tutorial_1.1_--_Generating_Task_Timelines | Lab 3: Creating Task Graphs]]
 +
 
 
* '''Thursday'''
 
* '''Thursday'''
::* 3 Project Presentations (Lunch time included?)
+
::* Project Presentations:
 +
:::* Isaiah
 
|
 
|
*  
+
* [[CSC352 Homework 4 2017| Homework 4, Game of Life using Map-Reduce on AWS]]
 
|
 
|
 
*  
 
*  
Line 509: Line 589:
 
|
 
|
 
* '''Tuesday'''
 
* '''Tuesday'''
::* 3 Project Presentations (Lunch time included?)
+
::* Project Presentations
 +
:::* Grace (lunch time)
 +
:::* Kathleen
 +
:::* Riley
 
* '''Thursday'''
 
* '''Thursday'''
::* 3 Project Presentations (Lunch time included?)
+
::* Project Presentations:
 +
:::* Sam & Angie (lunch time)  
 +
:::* Lujun & Vega
 +
:::* Youyou and Zainab
 
|
 
|
 
*  
 
*  
Line 518: Line 604:
 
|}
 
|}
  
 +
====&nbsp;====
 +
----
 +
{| style="width:100%" border="0"
 +
|width="45%"|'''Exam Week '''  May 9, 12
 +
|width="20%"| '''&nbsp;'''
 +
|width="25%"| '''&nbsp;'''
 +
|-
 +
|}
 +
<br />
 +
{| class="mw-collapsible " style="width:100%" border="1"
 +
|- style="background:#bb8fce;"
 +
|width="100%"| '''Final Exam Week'''
 +
|-valign="top"
 +
|
 +
* '''Friday, May 12, 4:00 p.m.''': pdf of project due on Moodle.
 +
* '''Friday, May 12, 4:00 p.m.''': Hadoop Game of Life due.
 +
|}
 +
<br />
 +
 +
=Links & Resources=
 +
<br />
 +
==Map-Reduce==
 +
<br />
 +
* [https://www.tutorialspoint.com/hadoop/hadoop_mapreduce.htm Map-Reduce Tutorial from tutorialpoint.com]
 +
* [https://www.slideshare.net/andreaiacono/mapreduce-34478449 Good slides on Slide-Share by Andrea Jacono]
 
<br />
 
<br />
  
=Projects=
+
=Final Project=
 
<br />
 
<br />
 
:Go to the [[CSC352 Project Page 2017|project page]] for information on the projects.
 
:Go to the [[CSC352 Project Page 2017|project page]] for information on the projects.
 
<br />
 
<br />
 +
 
=Links and Resources=
 
=Links and Resources=
 
<br />
 
<br />
 +
==Cloud Services==
 +
{|class="wikitable"
 +
!colspan="2"|Cloud Services
 +
|-
 +
! style="width: 30%;" | Name
 +
! Description
 +
|-
 +
|
 +
[http://www.bitfusion.io/ bitfusion.io]
 +
|
 +
Provides AMIs on AWS with GPUs attached.  The AMIs are already loaded with tensorflow and python, as well as Jupyter for remote development of Python programs.  They offer a 5-day free trial.
 +
|-
 +
|
 +
[https://www.digitalocean.com/ Digital Ocean]
 +
|
 +
A whole cloud infrastructure similar to AWS, but (apparently) not running on AWS.  Offers a cluster infrastructure with many packages preloaded, including Node.js, Rails, Redis, MongoDB, Cassandra, Docker, MediaWiki, Django, Drupal, Drone, LAMP, LEMP, Ruby, etc.
 +
Offers a $10 credit when first signing up.
 +
 +
|}
 +
 
==Latex==
 
==Latex==
 
<br />
 
<br />
Line 544: Line 676:
 
===MPI===
 
===MPI===
 
* [https://computing.llnl.gov/tutorials/mpi/ Introduction to MPI], by Blaise Barney, Lawrence Livermore National Laboratory.  Another short but excellent coverage of a topic in parallel processing, this time MPI.
 
* [https://computing.llnl.gov/tutorials/mpi/ Introduction to MPI], by Blaise Barney, Lawrence Livermore National Laboratory.  Another short but excellent coverage of a topic in parallel processing, this time MPI.
 +
* [http://mpitutorial.com/tutorials/launching-an-amazon-ec2-mpi-cluster/ Launching and Amazon EC2 MPI Cluster]
 +
 
===Hadoop===
 
===Hadoop===
 
* [[CSC352 MapReduce/Hadoop Class Notes| Thiebaut's class notes]]
 
* [[CSC352 MapReduce/Hadoop Class Notes| Thiebaut's class notes]]
 +
* [https://blog.cloudera.com/blog/2014/01/how-to-create-a-simple-hadoop-cluster-with-virtualbox/ How to create a simple hadoop cluster with VirtualBox]
 +
* [http://star.mit.edu/cluster/docs/latest/plugins/hadoop.html?highlight=hadoop StarCluster's Hadoop Plugin]
  
 
===Microprocessors===
 
===Microprocessors===
Line 715: Line 851:
 
<br />
 
<br />
 
<br />
 
<br />
[[Category:CSC352]][[Category:Classes]]
+
[[Category:CSC352]][[Category:Classes]][[Category:Hadoop]][[Category:MPI]][[Category:Java]][[Category:Threads]]

Latest revision as of 11:50, 25 April 2017

--D. Thiebaut (talk) 10:37, 2 December 2016 (EST)



Home | Syllabus | Schedule | Links & Resources | Final Project


Topics to be covered: Week (approx)
  • Syllabus
  • Parallelism
  • Interrupts
  • Processes/Threads
  • Latex

Week 1

  • Performance
  • Speedup
  • Amdhal's Law
  • Java threads + Synchronization

Week 2

  • Java Thread-Safe Libraries
  • Static Variables

Week 3

  • Introduction to C

Week 4
Week 5

  • MPI

Week 6
Week 7

  • AWS
  • MySQL + MPI
  • MPI Clusters on AWS

Week 8
Week 9

  • MapReduce
  • Hadoop, Java
  • Hadoop, Python
  • Hadoop AWS

Week 10
Week 11

  • Apache Spark

Week 12

  • GPU

Week 13


 


Week 1 Jan 26, 27    


Topics: Lab/Hw Reading
  • Thursday
  • Syllabus
  • papers
  • Biweekly newsletters in Latex
  • Parallelism: going to the source: Interrupts!
    • 8086 type of interrupts (simplified)
    • Interrupt Vector
    • Interrupt Priority


 


Week 2 Jan 31, Feb 2    


Topics: Lab/Hw Reading
  • Tuesday

  • Newsletter
  • The format should be similar to the ACM Tech News format.
  • The header should contain a title, your name, the class (CSC352) and the date
  • Each paragraph should have a header with a title, the source of news, the date, and possibly a link to the full article.
  • The paragraph describing a news item should be between 3 to 10 lines, give or take.
  • Write 1 full page to 2 pages, depending on the richness of events in the technology field
  • Feel free to present N-1 topics with just 3 lines, and 1 topic which you highlight with a longer paragraph.
  • Topics: anything related to parallelism: computers, mobile platforms, cloud, companies, new software, new algorithms, conferences, people in the field, etc.
  • Good sources of information to get started:
  • Recommendation for news aggregator: Feedly.com

  • Introduction to Latex

  • Thursday



 


Week 3 Feb 7, 9    


Topics: Lab/Hw Reading
  • Tuesday
  • Thursday
  • Paper Presentation: Berkeley
  • Newsletter due


 


Week 4 Feb 14, 16    


Topics: Lab/Hw Reading
  • Tuesday
  • Discussion of View from Berkeley paper. Many concepts to revisit
  • Thursday
  • Rocco Piccinino: presentation on Library Resources. You can find a link to Rocco's presentation here
  • Paper Presentations: Patterns + Framework (presented by Riley), and Throughput Oriented Architectures
  • Game of Life?

Two papers to read for Thrusday. In class discussion.


 


Week 5 Feb 21    


Topics: Lab/Hw Reading
  • Tuesday
  • Thursday: Rally Day
  • Newsletter due on Moodle, at 1:00 p.m., just before Rally Day events.
  • Requirements for newsletter: Add an image or figure to your newsletter. It could be a photo of you at the top, as the author, or a graph, or a logo. You pick the jpeg or png, and incorporate it in your newsletter.


 


Week 6 Feb 28, Mar 2    


Topics: Lab/Hw Reading
  • Tuesday
  • Thursday
  • Paper Presentation: Unreasonable (Lujun), and Top500
  • Homework 2, due Thursday after the break, 3/23/17, at 1:00 p.m.


 


Week 7 Mar 7, 9    


Topics: Lab/Hw Reading
  • Tuesday


  • Thursday
  • A word about the final project
  • Paper Presentation: Success of MPI (Zainab), and General purpose vs GPU
  • Newsletter due. This week, add a sentence to each summary that explains why you picked this news item. Why does the news you reported matters? How is it important to you? Does it connect in any way to something you have read or programmed? Also, this week, add an a section to your newsletter that refers to Amdahl's law. The purpose of this is to have you create an equation with Latex. So, you can simply explain what Amdahl's law is, or you can find a bit of news in the past year that mentions Amdahl's Law, and take this opportunity to create an equation: Speedup = T(1)/T(N).
  • Prepare a 1-page description of the project you will present at the end of the semester. Submit it before March 21, 1:00 p.m. to Moodle. See this page for information.


SPRING BREAK


 


Week 8 Mar 21, 23    


Topics: Lab/Hw Reading
  • Tuesday
  • Thursday
  • StarCluster overview: must read. StarCluster is a set of Python programs that one installs on one's laptop to control clusters on Amazon Web Services (AWS). StarCluster can easily setup MPI and Hadoop clusters.


 


Week 9 Mar 28, 30    


Topics: Lab/Hw Reading
  • Tuesday
  • Thursday


 


Week 10 Apr 4, 6    


Topics: MPI, AWS Lab/Hw Reading
  • Tuesday
  • Thursday
  • Paper Presentations: Spark (Youyou), and Microsoft Bet Future (Isaiah & Muriel)


 


Week 11 Apr 11, 13    


Topics: Lab/Hw Reading
  • Tuesday
  • Introduction to Functional Programming
  • Implementing Map-Reduce in Python
  • Thursday
  • Class cancelled


 


Week 12 Apr 18, 20    


Topics: Lab/Hw Reading
  • Tuesday/Thursday
  • Paper Presentations: Wikipedia dumps (Angie), and Can Cloud Reach Top500 (Vega)
  • Map-Reduce
  • Implementing Map-Reduce in Python: various problems solved with Map-Reduce


 


Week 13 Apr 25, 27    


Topics: Lab/Hw Reading
  • Tuesday
  • Labs
  • Thursday
  • Project Presentations:
  • Isaiah


 


Week 14 May 2, 4    


Topics: Lab/Hw Reading
  • Tuesday
  • Project Presentations:
  • Grace (lunch time)
  • Kathleen
  • Riley
  • Thursday
  • Project Presentations:
  • Sam & Angie (lunch time)
  • Lujun & Vega
  • Youyou and Zainab

 


Exam Week May 9, 12    


Final Exam Week
  • Friday, May 12, 4:00 p.m.: pdf of project due on Moodle.
  • Friday, May 12, 4:00 p.m.: Hadoop Game of Life due.


Links & Resources


Map-Reduce



Final Project


Go to the project page for information on the projects.


Links and Resources


Cloud Services

Cloud Services
Name Description

bitfusion.io

Provides AMIs on AWS with GPUs attached. The AMIs are already loaded with tensorflow and python, as well as Jupyter for remote development of Python programs. They offer a 5-day free trial.

Digital Ocean

A whole cloud infrastructure similar to AWS, but (apparently) not running on AWS. Offers a cluster infrastructure with many packages preloaded, including Node.js, Rails, Redis, MongoDB, Cassandra, Docker, MediaWiki, Django, Drupal, Drone, LAMP, LEMP, Ruby, etc. Offers a $10 credit when first signing up.

Latex


Here are some nice video tutorials on using Latex to write papers.


Smith Elements of Style



On-Line Resources

MPI

Hadoop

Microprocessors



Papers

This is a tentative and non exhaustive list of papers scheduled for reading this semester.

Introduction

Paper Pages

50

2

General/Parallelism

Paper Pages

5

7

5

  • Top500 Versus Sustained Performance, by William Kramer, 21st International Conference On Parallel Architectures And Compilation Techniques (PACT12), 19–23 September 2012, Minneapolis, MN, US

8

MPI

Paper Pages
  • Learning from the Success of MPI, by WIlliam D. Gropp, Argonne National Lab, 2002.                                             

11

GPUs

Paper Pages

6

Virtualization

Paper Pages

5

Cloud

Paper Pages

1.5

  • A View of Cloud Computing, 2010, By Armbrust, Michael and Fox, Armando and Griffith, Rean and Joseph, Anthony D. and Katz, Randy and Konwinski, Andy and Lee, Gunho and Patterson, David and Rabkin, Ariel and Stoica, Ion and Zaharia, Matei.

9

13

5

2

10

19

8

  • Can Cloud Computing Reach the Top500?, by Jeffrey Napper and Paolo Bientinesi, Proceeding of UCHPC-MAW '09 on UnConventional high performance computing workshop plus memory access workshop,

Ischia, Italy — May 18 - 20, 2009.

4