Computational morphological and syntactic lexicon of Modern Greek
The Computational morphological and syntactic lexicon of Modern Greek, which has been developed by ILSP / R.C. "Athena" in the framework of the LE-PAROLE project, can be used in Human Language Technology applications. It consists of 20,149 lemmas containing morphological and syntactic information, according to the PAROLE model, which has been based on international linguistic standards. This project caters for the compilation of lexicons for 12 European languages (Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Italian, Portuguese, Spanish, Swedish). The lexicons are in SGML format, following a common DTD for all languages. Lexicon contents
More specifically, the lexicon includes
At the morphological level, lemmas encode information with regard to their relation with other lemmas, spelling variations, etc., as well as information concerning their grammatical category (Part of Speech), and their inflection (inflectional paradigm, stems). At the next level, syntactic units are used to encode the syntactic behaviour of a lemma: i.e. the complements a lemma selects, as well as the features required for the characterisation and identification of these complements (e.g. whether it is a subject - noun in nominative case, etc.) Lemma distribution per grammatical category at each level Morphological level
Syntactic level
For more information and lexicon samples, please visit the PAROLE web site. |
|