CSC352 Project Page 2017

Revision as of 10:32, 26 January 2017

--D. Thiebaut (talk) 10:22, 26 January 2017 (EST)


Projects


In the second half of the semester (or earlier if you wish), you will work on a project with the following components and requirements:
  1. Pick a partner to create a pair.
  2. Pick a topic from the list below.
  3. Write a LaTeX tutorial that introduces somebody with a CS background to the topic you've picked. It must include an introduction that sets the background for your pick, a series of simple examples (all of which you will need to demonstrate you have run), a section on performance, and an assessment of the future of the technology you covered. This document must contain a bibliography.
  4. Give a 1-hour presentation to the class, with slides, introducing the technology you have picked. This presentation should include (if possible) a mini-lab that allows everybody in class to run a few programs that use the technology.


Non-Exhaustive List of Topics


The list below is taken from the Hadoop Ecosystem Table page:

Apache


  • Pig, Hive, JAQL, Storm, Flink, Apex, Pydoop, and others
  • NoSQL: HBase, Cassandra, Accumulo, Kudu, MongoDB
  • Machine Learning: Mahout, H2O
  • Other?

Misc

  • GPU: CUDA

Microsoft

  • Azure, and/or any component of its ecosystem.

Google

  • Google Compute Engine, Google App Engine


Timing


  • You need to have a plan in place the first week after Spring Break. This plan includes:
  1. who your partner is
  2. what topic you have picked