Difference between revisions of "CSC352 Project 2"
Line 219: | Line 219: | ||
* the links to other wikipedia pages the page contains, in '''<pagelinks>''' and '''<page>''' tags, | * the links to other wikipedia pages the page contains, in '''<pagelinks>''' and '''<page>''' tags, | ||
* the text of the page, with all the html and wiki tags removed, between '''<text>''' tags. | * the text of the page, with all the html and wiki tags removed, between '''<text>''' tags. | ||
+ | |||
+ | The end of the text section always contains foreign characters. The text should be coded in UTF-8, which is the international character set, of which ASCII is a variant. | ||
</onlysmith> | </onlysmith> |
Revision as of 22:16, 23 February 2010
This project is currently under construction...