A novel technique for words reordering based on n-grams

Type: In Proceedings

Year: 2007
Authors: Theologos Athanaselis; Stylianos Bakamidis; Ioannis Dologlou
Book title: Proceedings of the International Symposium on Signal Processing and its Applications in conjunction with the International Conference on Information Sciences, Signal Processing and its Applications
Number: 4555284
Address: Sharjah, United Arab Emirates (U.A.E.)
Abstract:
This paper presents an approach for repairing word order errors in English text by reordering the words in a sentence and choosing the version that maximizes the number of trigram hits according to a language model. The novelty of the method lies in its use of an efficient confusion matrix technique for reordering the words. Unigram probabilities are used to further reduce the number of permutations. The comparative advantage of the method is that it works with a large set of words and avoids the laborious and costly process of collecting word order errors to create error patterns.
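
The core idea in the abstract, picking the reordering with the most trigram hits, can be illustrated with a minimal Python sketch. This is not the authors' implementation: it enumerates every permutation by brute force instead of using the paper's confusion matrix technique or unigram-based pruning, and it stands in for the language model with a set of trigrams drawn from a tiny made-up corpus. All function names and the sample sentences are assumptions chosen for illustration.

```python
from itertools import permutations


def trigrams(words):
    """Return the consecutive word triples of a sequence."""
    return [tuple(words[i:i + 3]) for i in range(len(words) - 2)]


def count_trigram_hits(words, known_trigrams):
    """Count how many trigrams of the sequence occur in the known set."""
    return sum(1 for t in trigrams(words) if t in known_trigrams)


def best_reordering(words, known_trigrams):
    """Brute force (assumption, not the paper's method): score every
    permutation and keep the one with the most trigram hits."""
    return max(permutations(words),
               key=lambda p: count_trigram_hits(p, known_trigrams))


# Toy stand-in for a language model: trigrams from a hypothetical corpus.
corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the rug".split(),
]
known = {t for sentence in corpus for t in trigrams(sentence)}

scrambled = "sat the on cat mat the".split()
repaired = best_reordering(scrambled, known)
print(" ".join(repaired))  # prints "the cat sat on the mat"
```

Exhaustive search grows factorially with sentence length, which is why the abstract stresses reducing the number of permutations; the sketch is only practical for very short sentences.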