Publication - Implementing a language-independent MT Methodology
RESEARCH

Implementing a language-independent MT Methodology

Research Area:  
    
Type:  
In Proceedings

 

Year: 2012
Authors: Sokratis Sofianopoulos; Marina Vassiliou; George Tambouratzis
Editor: Jagadeesh Jagarlamudi; Sujith Ravi; Xiaojun Wan; Hal Daume III
Book title: Proceedings of the First Workshop on Multilingual Modeling (ACL 2012)
Pages: 1-10
Address: Jeju, Republic of Korea
Organization: ACL
Date: July 9-11
ISBN: 978-1-937284-35-0
Abstract:
The current paper presents a language independent methodology, which facilitates the creation of machine translation (MT) systems for various language pairs. This methodology is implemented in the PRESEMT hybrid MT system. PRESEMT has the lowest possible requirements on specialised resources and tools, given that for many languages (especially less widely used ones) only limited linguistic resources are available. In PRESEMT, the main translation process comprises two phases. The first one, Structure selection, determines the overall structure of a target language (TL) sentence, drawing on syntactic information from a small bilingual corpus. The second phase, Translation equivalent selection, relies on models extracted solely from monolingual corpora to implement translation disambiguation, determine intra-phrase word order and handle functional words. This paper proposes extracting information for disambiguation from the monolingual corpus. Experimental results indicate that such information substantially contributes in improving translation quality.
[Bibtex]