PROFILE
Implementing a language-independent MT Methodology
Year: | 2012 | ||||
---|---|---|---|---|---|
Authors: | Sokratis Sofianopoulos; Marina Vassiliou; George Tambouratzis | ||||
Editor: | Jagadeesh Jagarlamudi; Sujith Ravi; Xiaojun Wan; Hal Daume III | ||||
Book title: | Proceedings of the First Workshop on Multilingual Modeling (ACL 2012) | ||||
Pages: | 1-10 | ||||
Address: | Jeju, Republic of Korea | ||||
Organization: | ACL | ||||
Date: | July 9-11 | ||||
ISBN: | 978-1-937284-35-0 | ||||
Abstract: | The current paper presents a language independent methodology, which facilitates the creation of machine translation (MT) systems for various language pairs. This methodology is implemented in the PRESEMT hybrid MT system. PRESEMT has the lowest possible requirements on specialised resources and tools, given that for many languages (especially less widely used ones) only limited linguistic resources are available. In PRESEMT, the main translation process comprises two phases. The first one, Structure selection, determines the overall structure of a target language (TL) sentence, drawing on syntactic information from a small bilingual corpus. The second phase, Translation equivalent selection, relies on models extracted solely from monolingual corpora to implement translation disambiguation, determine intra-phrase word order and handle functional words. This paper proposes extracting information for disambiguation from the monolingual corpus. Experimental results indicate that such information substantially contributes in improving translation quality. |
||||
[Bibtex] |