SEARCH
Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation
| Year: | 2011 | ||||
|---|---|---|---|---|---|
| Authors: | Pavel Pecina; Antonio Toral; Andy Way; Prokopis Prokopidis; Vassilis Papavassiliou; Maria Giagkou | ||||
| Editor: | M.L. Forcada; H. Depraetere; V. Vadeghinste | ||||
| Book title: | Proceedings of the 15th Annual conference of the European Association for Machine Translation | ||||
| Pages: | 297-304 | ||||
| Address: | Leuven, Belgium | ||||
| Date: | May | ||||
| Abstract: | This paper reports on the ongoing work focused on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused crawling of the web. We present a strategy for crawling monolingual and parallel data and their exploitation for testing, language modelling, and system tuning in a phrase-based machine translation framework. The proposed approach is evaluated on the domains of Natural Environment and Labour Legislation and two language pairs: English–French and English–Greek. |
||||
| [Bibtex] | |||||






