Using Roget¿s Thesaurus to Determine the Similarity of Texts
Jeremy Ellman
Broschiertes Buch

Using Roget¿s Thesaurus to Determine the Similarity of Texts

A Thesis in Computational Linguistics

Versandkostenfrei!
Versandfertig in 6-10 Tagen
51,99 €
inkl. MwSt.
PAYBACK Punkte
26 °P sammeln!
This thesis addresses the problem of extracting a representation of text's meaning from its content. The solution investigated is based on the use of Roget s thesaurus as an external knowledge source and can be used to analyse texts of any length or complexity. The resulting document representation can then be compared to others, producing a new method for text similarity assessment. All coherent texts contain embedded sequences of words that are related in meaning. These sequences can be detected by identifying simple relationships between the relevant thesaural entries in which the words are...