Difference between revisions of "CSC352 Class Page 2013"
(→Weekly Schedule) |
|||
(64 intermediate revisions by the same user not shown) | |||
Line 61: | Line 61: | ||
*** Regroup and gather statistics on the different machines in the classroom | *** Regroup and gather statistics on the different machines in the classroom | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
---- | ---- | ||
** Comments on '''bimonthly newsletter''' | ** Comments on '''bimonthly newsletter''' | ||
Line 85: | Line 77: | ||
**** The Official Google blog | **** The Official Google blog | ||
**** Review: Tom's Hardware | **** Review: Tom's Hardware | ||
− | **** Some of the sites listed in [ | + | **** Some of the sites listed in [https://rohidassanap.wordpress.com/2013/06/18/top-40-best-technology-news-websites-the-definitive-list/ this page's] top 40 list. |
*** Recommendation for news aggregator: [http://cloud.feedly.com/#welcome Feedly.com] | *** Recommendation for news aggregator: [http://cloud.feedly.com/#welcome Feedly.com] | ||
Line 291: | Line 283: | ||
** A project-oriented MPI example. Bring your Mac! | ** A project-oriented MPI example. Bring your Mac! | ||
---- | ---- | ||
− | * [[CSC352 Homework 4 2013| Homework 4]] on C and MPI | + | * [[CSC352 Homework 4 2013| Homework 4]] on C and MPI. [[CSC352 Homework 4 Solutions| Solution programs]] |
|| | || | ||
| | ||
Line 302: | Line 294: | ||
** <font color="goldenrod">Paper presentation</font>: [[Media:ServerVirtualizationArchitectureAndImplementation2009.pdf | Server Virtualization Architecture and Implementation]] presented by Emily | ** <font color="goldenrod">Paper presentation</font>: [[Media:ServerVirtualizationArchitectureAndImplementation2009.pdf | Server Virtualization Architecture and Implementation]] presented by Emily | ||
** A few words about newsletters | ** A few words about newsletters | ||
+ | ** [[CSC352 Where is What on Hadoop0 2013| Where is What on Hadoop0]] | ||
** MySQL Exercises | ** MySQL Exercises | ||
− | ** [[Tutorial:_C_%2B_MySQL_%2B_MPI | Combining C, MySQL and MPI]] | + | ** [[Tutorial:_C_%2B_MySQL_%2B_MPI | Combining C, MySQL and MPI]]: combing through a lot of code |
** Project discussion | ** Project discussion | ||
− | |||
* '''Thursday''' | * '''Thursday''' | ||
− | + | ** [[Tutorial: Create an MPI Cluster on the Amazon Elastic Cloud (EC2) | Creating an MPI Cluster on Amazon]], ([[CSC352 Keynote Presentations 2013| Accompanying keynote]]), followed by a second [[Computing Pi on an AWS MPI-Cluster| tutorial]] on computing Pi on a 10-node AWS cluster. | |
---- | ---- | ||
* | * | ||
Line 317: | Line 309: | ||
|| | || | ||
* '''Tuesday''': <font color="magenta">Otelia Cromwell Day</font> | * '''Tuesday''': <font color="magenta">Otelia Cromwell Day</font> | ||
− | * '''Thursday''': <font color="goldenrod">Paper presentation</font>: [[Media:MapReduceDeanGhemawat_2004.pdf |MapReduce: Simplified Data Processing on Large Clusters]] presented by | + | * '''Thursday''': |
+ | ** <font color="goldenrod">Paper presentation</font>: [[Media:MapReduceDeanGhemawat_2004.pdf |MapReduce: Simplified Data Processing on Large Clusters]] presented by Sharon Pamela | ||
+ | ** <font color="red">Newsletter #4 due today!</font>. Please include at least one image, and at least one news item covering some form of project that could be related or influential for our own wiki-collage project. See [http://cs.smith.edu/dftwiki/index.php/Latex_and_Editing_Tools_to_write_an_Honors_Thesis this document on writing theses] for information about the inclusion of images in Latex. The end section has a good list of sites that have good coverage of Latex topics. There is also plenty of information on the Web about this subject. | ||
+ | ** Preparation for [[CSC352 Homework 5 2013 | Homework 5]]: attaching EBS volumes. We'll do a lab in class to [[Create_an_MPI_Cluster_on_the_Amazon_Elastic_Cloud_(EC2)#Creating_an_EBS_Volume | create ]] and [[Create_an_MPI_Cluster_on_the_Amazon_Elastic_Cloud_(EC2)#Attaching_the_EBS_Volume_to_the_Cluster | attach]] an EBS volume to your AWS cluster. | ||
+ | |||
---- | ---- | ||
− | * | + | * [[CSC352 Homework 5 2013 | Homework 5]] and [[CSC352 Homework 5 Solution 2013| Solution]] |
|| | || | ||
| | ||
Line 330: | Line 326: | ||
* '''Tuesday''' | * '''Tuesday''' | ||
** <font color="goldenrod">Paper presentation</font>: [[Media:GeneralPurposeVsGPU_Comparison_Many_Cores_2010_Caragea.pdf |General-Purpose vs. GPU: Comparisons of Many-Cores on Irregular Workloads]], presented by Yoshie | ** <font color="goldenrod">Paper presentation</font>: [[Media:GeneralPurposeVsGPU_Comparison_Many_Cores_2010_Caragea.pdf |General-Purpose vs. GPU: Comparisons of Many-Cores on Irregular Workloads]], presented by Yoshie | ||
+ | *** '''Questions about the paper:''' | ||
+ | **** What kind of paper is this? Broad distribution? Research? Small group? | ||
+ | **** Organization? Abstract? Introduction? Definition of specialized terms? Early enough in the paper? | ||
+ | **** Are the contributions of paper clear? The section on related research sufficient? | ||
+ | **** What is being compared? Similar machines? Hardware? Software? | ||
+ | **** Are authors partial? Do they have a stake? | ||
+ | **** How does the paper advance the state of research? | ||
+ | **** What does it tell us about the way computer systems evolve? | ||
+ | ** Thinking about the project | ||
+ | *** What do we know better about the overall project. What pieces have we looked at? | ||
+ | *** What is it we don't know? | ||
+ | *** Can we turn any of these questions into a project? | ||
+ | <br /> | ||
* '''Thursday''' | * '''Thursday''' | ||
− | + | ** A few comments on Manager/Worker paradigm in MPI: not the only one. Many logical communication networks do not match star pattern | |
+ | ** Continuation of Project Discussion | ||
+ | ** Quick Introduction to Hadoop/MapReduce ([[CSC352 Keynote Presentations 2013| Accompanying keynote]]) | ||
+ | ** [[Tutorial:_Creating_a_Hadoop_Cluster_on_Amazon_AWS | MapReduce lab on AWS]] | ||
+ | ** [[CSC352 Bash Script to Run Hadoop WordCount| Script to run Hadoop WordCount program]] on AWS | ||
---- | ---- | ||
* | * | ||
|| | || | ||
+ | Yahoo has some very good reading material on Hadoop. One reason is that they may be one of the largest users of AWS and of Hadoop. | ||
+ | ** [http://developer.yahoo.com/hadoop/tutorial/ Yahoo Developers Network]: Tutorial on Hadoop. All the chapters are worth reading! | ||
<!-- ================================================================== --> | <!-- ================================================================== --> | ||
Line 340: | Line 355: | ||
| Week 12 <br /> 11/19<br /> | | Week 12 <br /> 11/19<br /> | ||
|| | || | ||
− | * '''Tuesday''': <font color="magenta"> | + | * '''Tuesday''': |
− | * '''Thursday''': <font color="magenta">Tentative guest lecture: Nick Howe on CUDA and GPUs</font> | + | ** <font color="red">1 month to go (exactly) before the project is due (Dec. 19)!</font> |
+ | ** <font color="magenta">Student-directed work (DT @ INFOCOMP 2013)</font> | ||
+ | ** Finish the [[Tutorial:_Creating_a_Hadoop_Cluster_on_Amazon_AWS | MapReduce lab on AWS]] and make sure you do the [[Tutorial:_Creating_a_Hadoop_Cluster_with_StarCluster_on_Amazon_AWS#Challenge_.23_2 | Challenge 2]] part of the lab. | ||
+ | ** Food for thought: some videos<br />I suggest one of you connects her laptop to the projection system and you all watch these videos together. After each one, discuss it as a group. Take notes and be ready to share your comments during Thursday's class when we resume our regular schedule. | ||
+ | *** The Cave 2 Project at the University of Illinois: Just another hardware solution for presenting the user with a large number of pixels; in this case 27320 x 3072 pixels. ''Short, 3 minutes.'' | ||
+ | <center> | ||
+ | <videoflash>yf0sllpZx3w</videoflash> | ||
+ | </center> | ||
+ | <br /> | ||
+ | *** The Creators Projects video<br /> | ||
+ | ::::This video is not necessarily anything that can work for us, but it's just "food for thought." Just a different way an artist has come up to make still pictures interesting to look at. ''Short, 6 minutes''. | ||
+ | <center> | ||
+ | <videoflash>rKmMaDBoZhs</videoflash> | ||
+ | </center> | ||
+ | <br /> | ||
+ | *** O'Reilly Radar Videos<br /> | ||
+ | [[Image:OReillyPerlBookCover.jpg|100px|right]] | ||
+ | ::: Tim O'Reilly is a visionary who figured out a long time ago that computer technology was an exploding field and he started a very successful line of books to support all new technology projects that were emerging and promising. The books all have animals on them and are uniquely easy to spot. O'Reilly now also has an on-line channel (O'Reilly Radar), and organizes conferences with top researchers and intellectuals in the field of computer science. | ||
+ | ::: The first video is with Doug Cutting, one of the creators of Hadoop. He makes some very good points about what Hadoop is, what it is good at, and what it might not be good at (Homework 5 lesson?). After Cutting you can skip the 2nd interview (about video technology) and zip to the 3rd interview with Jeremy Howard, at time-tag 13:47. Then learn about big data and analytics, and what is said of ''data scientists''. '' About 12 minutes total''. | ||
+ | <center> | ||
+ | <videoflash>BWBGQIq5zow</videoflash> | ||
+ | </center> | ||
+ | <br /> | ||
+ | ::: Good interview of Tim O'Reilly describing Web 2.0, and his view of a data-driven Internet. 8-minute long. You may want to think about how our wikipedia data (images, stats) relate to what is said about data as described in the interview. ''About 8 minutes''. | ||
+ | <center> | ||
+ | <videoflash>FJ3TxeE_tHI</videoflash> | ||
+ | </center> | ||
+ | <br /> | ||
+ | ::: The next video filmed in June 2013 presents Bruno Fernandez-Ruiz of Yahoo, who speaks about Hadoop since 2005, Hadoop today, and what is ahead. An important type of data property Fernandez-Ruiz is interested in is ''timeliness'', which we haven't really looked at for our project, but you will see that it could apply easily to the dynamics of wikipedia. Some interesting statistics about the number of servers, the size of the HDFS they use, the number of processes are given. ''About 17 minutes''. | ||
+ | <center> | ||
+ | [[Image:LookingBeyondHadoop.png | 430px | link=http://fora.tv/2013/06/26/Hadoop_and_Continuous_Computing_Looking_Beyond_MapReduce ]] | ||
+ | </center> | ||
+ | ** If you have at least 25 minutes left before the class time is over, do the [[Tutorial:_Running_a_Python_version_of_WorkCount_on_an_AWS_cluster| MapReduce-Python lab]], without attempting the challenges at the end. We'll do these together. | ||
+ | |||
+ | <br /> | ||
+ | * '''Thursday''': | ||
+ | ** <font color="magenta">Tentative guest lecture: Nick Howe on CUDA and GPUs</font> | ||
+ | ** Some thoughts about INFOCOMP 2013 ([[CSC352 Keynote Presentations 2013| keynote]]) | ||
+ | ** Going over Homework #5 ([[CSC352 Walking a 2-Level Directory in C| Walking a 2-Level Directory in C]]) | ||
+ | |||
---- | ---- | ||
Line 353: | Line 407: | ||
|width="60%"| | |width="60%"| | ||
* '''Tuesday''': | * '''Tuesday''': | ||
− | ** <font color="goldenrod">Paper presentation</font>: [[Media:AViewOfCloudComputing_CACM_Apr2010.pdf| A View of Cloud Computing]] presented by | + | ** <font color="red">No newsletter due</font> |
+ | ** <font color="goldenrod">Paper presentation</font>: [[Media:AViewOfCloudComputing_CACM_Apr2010.pdf| A View of Cloud Computing]] presented by Danaë. | ||
+ | ** 5-minute project presentations (everybody) | ||
+ | ** Instead of a newsletter, you may turn today a [[CSC352 Project Introduction in Latex | draft of an introduction to your final project]]. If you have too much work this week, you can turn this in on 12/3. | ||
+ | ** [[Tutorial: A bit of Bash | A bit of Bash]] | ||
+ | ** The challenges of the [[Tutorial:_Running_a_Python_version_of_WorkCount_on_an_AWS_cluster| MapReducing in Python]] lab | ||
* '''Thursday''': <font color="magenta">Thanksgiving Break</font> | * '''Thursday''': <font color="magenta">Thanksgiving Break</font> | ||
Line 366: | Line 425: | ||
* '''Tuesday''' | * '''Tuesday''' | ||
** <font color="goldenrod">Paper presentation</font>: [[Media:unreasonableEffectivenessOfData2009_HalevyNorvigPereira.pdf | The Unreasonable Effectiveness of Data]] presented by Julia | ** <font color="goldenrod">Paper presentation</font>: [[Media:unreasonableEffectivenessOfData2009_HalevyNorvigPereira.pdf | The Unreasonable Effectiveness of Data]] presented by Julia | ||
+ | ** Instead of a newsletter, you need to turn in a [[CSC352 Project Introduction in Latex | draft of an introduction to your final project]] (unless you submitted it last week). | ||
+ | |||
+ | ** The challenges of the [[Tutorial:_Running_a_Python_version_of_WorkCount_on_an_AWS_cluster| MapReducing in Python]] lab. We have done Challenge #1 last time. We'll look at Challenge #2 and #3. | ||
+ | ** Some feedback on Homework #5 and one [[CSC352 Homework 5 Solution 2013| solution]]. | ||
+ | ** MapReduce task graphs | ||
+ | ---- | ||
+ | ---- | ||
* '''Thursday''' | * '''Thursday''' | ||
− | + | ** [[Hadoop_Tutorial_1.1_--_Generating_Task_Timelines | Distribution of Map and Reduce tasks over time]] | |
+ | ** Project work and discussion | ||
+ | ** 20-minute individual session (in class) to go over project, questions, setup, etc... | ||
---- | ---- | ||
− | * | + | * |
|| | || | ||
| | ||
Line 377: | Line 445: | ||
|width="15%"| Week 15 <br /> 12/10 | |width="15%"| Week 15 <br /> 12/10 | ||
|width="60%"| | |width="60%"| | ||
+ | [[Image:CSC352Row.jpg|150px|right]] | ||
* '''Tuesday''': <font color="lightblue">Last Day of Class</font> | * '''Tuesday''': <font color="lightblue">Last Day of Class</font> | ||
+ | ** 20-minute presentations of projects. Suggested outline: | ||
+ | *** The context: how your project fits in the overall pictures | ||
+ | *** Has other similar work been done and documented before | ||
+ | *** What you decided to do | ||
+ | **** The challenges | ||
+ | **** The choices | ||
+ | **** The target experiments | ||
+ | *** Preliminary results | ||
+ | *** Expected results | ||
+ | *** Possible directions for continuing research after the project | ||
---- | ---- | ||
− | + | An afternoon of packing circular crepes, including some imaginative variations... | |
+ | [[Image:PackingCrepes1.jpg|200px]][[Image:PackingCrepes2.jpg|200px]] | ||
+ | [[Image:PackingCrepes3.jpg|200px]] | ||
+ | [[Image:PackingCrepes4.jpg|200px]] | ||
+ | [[Image:PackingCrepes5.jpg|200px]] | ||
+ | [[Image:PackingCrepes6.jpg|200px]] | ||
|| | || | ||
Line 398: | Line 482: | ||
<br /> | <br /> | ||
+ | ==Smith Elements of Style== | ||
+ | <br /> | ||
+ | * [[media:SmithJacobsonCenterWritingPapers-1.pdf | "Writing Papers" from the Smith College Jacobson Center for writing]] | ||
+ | <br /> | ||
==On-Line Resources== | ==On-Line Resources== | ||
* [https://computing.llnl.gov/tutorials/parallel_comp/ Introduction to Parallel Processing], by Blaise Barney, Lawrence Livermore National Laboratory. A good read. Covers most of the important topics. | * [https://computing.llnl.gov/tutorials/parallel_comp/ Introduction to Parallel Processing], by Blaise Barney, Lawrence Livermore National Laboratory. A good read. Covers most of the important topics. |
Latest revision as of 11:31, 31 January 2017
--D. Thiebaut (talk) 11:15, 9 August 2013 (EDT)
Contents
Weekly Schedule
Week | Topics | Reading |
Week 1 9/3 |
Thread 1 ----------------------|====|-------------------------> time Thread 2 ------------|====|-----------------------------------> time
|
|
Week 2 9/10 |
(last slide taken from [www.cs.utexas.edu/users/mckinley/352/lectures/16.pdf http://www.cs.utexas.edu/users/mckinley/352/lectures/16.pdf])
(Image taken from URL: http://www.altera.com/technology/system-design/articles/2012/multicore-many-core.html)
|
|
Week 3 9/17 |
|
All the data structures of interest (concurrent non-blocking and blocking) can be found in the Oracle documentation. The information is a bit cryptic, but you need to get comfortable with it! |
Week 4 9/24 |
|
|
Week 5 10/1 |
|
|
Week 6 10/8 |
|
|
Week 7 10/15 |
|
|
Week 8 10/22 |
|
|
Week 9 10/29 |
|
|
Week 10 11/5 |
|
|
Week 11 11/12 |
|
Yahoo has some very good reading material on Hadoop. One reason is that they may be one of the largest users of AWS and of Hadoop.
|
Week 12 11/19 |
|
|
Week 13 11/26 |
|
|
Week 14 12/3 |
|
|
Week 15 12/10 |
An afternoon of packing circular crepes, including some imaginative variations... |
|
Links and Resources
Latex
- Latex tutorial (video)
- How to write a thesis in Latex (video)
- Presentations with Beamer (video)
- TikZ examples (video)
Smith Elements of Style
On-Line Resources
- Introduction to Parallel Processing, by Blaise Barney, Lawrence Livermore National Laboratory. A good read. Covers most of the important topics.
- Introduction to MPI, by Blaise Barney, Lawrence Livermore National Laboratory. Another short but excellent coverage of a topic in parallel processing, this time MPI.
- A 90-Minute Guide to Modern Microprocessors
Classics
- Designing and Building Parallel Programs, by Ian Foster. A relatively old reference (1995), with still good information.
Papers
This is a tentative and non exhaustive list of papers scheduled for reading this semester.
Introduction
Paper | Pages |
---|---|
|
50 |
|
2 |
General/Parallelism
Paper | Pages |
---|---|
|
5 |
|
7 |
|
5 |
MPI
Paper | Pages |
---|---|
|
11 |
GPUs
Paper | Pages |
---|---|
6 |
Virtualization
Paper | Pages |
---|---|
5 |
Cloud
Paper | Pages |
---|---|
|
1.5 |
|
9 |
|
13 |
|
5 |
|
2 |
Project-Related
Paper | Pages |
---|---|
|
8 |