Difference between revisions of "CSC352 Project 2"
Line 12: | Line 12: | ||
* Process N wiki pages, and for each one keep track of the categories contained in the page find the 5 most frequent words (not including stop words) in the page. | * Process N wiki pages, and for each one keep track of the categories contained in the page find the 5 most frequent words (not including stop words) in the page. | ||
* Associate with each category the most frequent words that have been associated with it over the N pages processed | * Associate with each category the most frequent words that have been associated with it over the N pages processed | ||
− | + | * Output the result (or a sample of it) | |
* Measure the execution time of the program | * Measure the execution time of the program | ||
* write a summary of it as illustrated in the guidelines presented in class (3/9, 3/11). | * write a summary of it as illustrated in the guidelines presented in class (3/9, 3/11). |
Revision as of 09:03, 23 March 2010
This project is currently under construction...