Difference between revisions of "CSC352 Project 2"

From dftwiki3
Line 12: Line 12:
 
* Process N wiki pages, and for each one keep track of the categories contained in the page and find the 5 most frequent words (not including stop words) in the page.
 
* Associate with each category the most frequent words that have been associated with it over the N pages processed
 
− output the result (or a sample of it)
+ * Output the result (or a sample of it)
 
* Measure the execution time of the program
 
 
* Write a summary of the project, as illustrated in the guidelines presented in class (3/9, 3/11).
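
The steps listed above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's reference solution: it assumes each page is available as a string of wiki text, that categories appear as `[[Category:Name]]` tags, and that a word is "associated" with a category each time it is among the top 5 words of a page in that category. The names `STOP_WORDS`, `page_categories`, `top_words`, `process_pages`, and `timed_run` are hypothetical, and the stop-word list is deliberately tiny.

```python
import re
import time
from collections import Counter, defaultdict

# Hypothetical, very short stop-word list; a real run would use a fuller one.
STOP_WORDS = {"the", "a", "an", "and", "of", "in", "is", "to", "it", "on"}

# Assumption: categories are written as [[Category:Name]] or [[Category:Name|key]].
CATEGORY_NAME_RE = re.compile(r"\[\[Category:([^\]|]+)")
CATEGORY_TAG_RE = re.compile(r"\[\[Category:[^\]]*\]\]")
WORD_RE = re.compile(r"[a-z]+")


def page_categories(text):
    """Return the category names found in one page, in order of appearance."""
    return [name.strip() for name in CATEGORY_NAME_RE.findall(text)]


def top_words(text, k=5):
    """Return up to k most frequent non-stop words of one page."""
    body = CATEGORY_TAG_RE.sub(" ", text).lower()  # ignore the category tags
    counts = Counter(w for w in WORD_RE.findall(body) if w not in STOP_WORDS)
    return [word for word, _ in counts.most_common(k)]


def process_pages(pages):
    """Map each category to a Counter of the top words seen on its pages."""
    category_words = defaultdict(Counter)
    for text in pages:
        words = top_words(text)
        for category in page_categories(text):
            category_words[category].update(words)
    return category_words


def timed_run(pages):
    """Run process_pages and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = process_pages(pages)
    return result, time.perf_counter() - start
```

Calling `timed_run(pages)` on a list of N page strings returns the per-category counters together with the elapsed time, and `result[category].most_common(5)` gives a sample of the output for one category. A serial version like this can serve as the baseline against which a parallel version is timed.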

Revision as of 09:03, 23 March 2010

This project is currently under construction...


This section is only visible to computers located at Smith College