Δημοσίευση - Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation
ΕΡΕΥΝΑ

Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation

Ερευνητική περιοχή:  
    
Είδος:  
Άρθρο σε πρακτικά

 

Έτος: 2011
Συγγραφείς: Pavel Pecina; Antonio Toral; Andy Way; Προκόπης Προκοπίδης; Βασίλης Παπαβασιλείου; Μαρία Γιάγκου
Επιμέλεια: M.L. Forcada; H. Depraetere; V. Vadeghinste
Τίτλος βιβλίου: Proceedings of the 15th Annual conference of the European Association for Machine Translation
Σελίδες: 297-304
Διεύθυνση: Leuven, Belgium
Ημερομηνία: Μάϊος
Περίληψη:
This paper reports on the ongoing work focused on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused crawling of the web. We present a strategy for crawling monolingual and parallel data and their exploitation for testing, language modelling, and system tuning in a phrase-based machine translation framework. The proposed approach is evaluated on the domains of Natural Environment and Labour Legislation and two language pairs: English–French and English–Greek.
[Bibtex]