Δημοσίευση - Theoretical and Practical Issues in the Construction of a Greek Dependency Treebank

ΑΝΑΖΗΤΗΣΗ

Theoretical and Practical Issues in the Construction of a Greek Dependency Treebank

Ερευνητική περιοχή:  
    
Είδος:  
Άρθρο σε πρακτικά

 

Έτος: 2005
Συγγραφείς: Προκόπης Προκοπίδης; Ελίνα Δεσύπρη; Μαρία Κουτσομπόγερα; Χάρης Παπαγεωργίου; Στέλιος Πιπερίδης
Επιμέλεια: Montserrat Civit and Sandra K?bler and Ma. Ant?nia Mart?
Τίτλος βιβλίου: Proceedings of The Fourth Workshop on Treebanks and Linguistic Theories (TLT 2005)
Σελίδες: 149-160
Διεύθυνση: Barcelona, Spain
Οργανισμός: Universitat de Barcelona
Ημερομηνία: December
Περίληψη:
In this paper, we present work in progress for the construction of the Greek Dependency Treebank. GDT currently encompasses annotation at the level of syntax and semantics. The initial GDT dataset comprises 70KW of Greek texts, pertaining mainly to EU politics, with smaller segments from the travel and health domains. The data were extracted from collections compiled to meet the needs of funded research projects focusing on multilingual, multimedia information extraction. Thus, annotation efforts aim at the creation of training and testing material that will aid the development of processing tools in specific application domains. On the other hand, we are trying to build the basis for a reference corpus of Greek that can prove useful in contexts different from the particular application domains as a resource for investigation of linguistic structures in real-life texts, and as training material for developing machine-learning approaches to syntactic parsing and semantic role labeling of unrestricted Greek text.
[Bibtex]