A novel technique for words reordering based on n-grams
Year: | 2007
---|---
Authors: | Theologos Athanaselis; Stylianos Bakamidis; Ioannis Dologlou
Book title: | Proceedings of the International Symposium on Signal Processing and its Applications in conjunction with the International Conference on Information Sciences, Signal Processing and its Applications
Number: | 4555284
Address: | Sharjah, United Arab Emirates (U.A.E.)
Abstract: | This paper presents an approach for repairing word order errors in English text by reordering the words in a sentence and choosing the version that maximizes the number of trigram hits according to a language model. The novelty of the method lies in its use of an efficient confusion matrix technique for reordering the words. To further reduce the number of permutations, unigram probabilities are used. The comparative advantage of the method is that it works with a large set of words and avoids the laborious and costly process of collecting word order errors to create error patterns.
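
The core idea in the abstract, scoring candidate word orders by how many of their trigrams are known to a language model, can be illustrated with a minimal Python sketch. This is not the authors' implementation: it enumerates every permutation by brute force instead of using the confusion matrix technique or unigram-based pruning, and it stands in a toy set of trigrams for a real language model. The function names (`build_trigram_set`, `count_hits`, `best_reordering`) and the sample corpus are hypothetical, chosen only for illustration.

```python
from itertools import permutations


def trigrams(words):
    """Return the consecutive word triples of a word sequence."""
    return [tuple(words[i:i + 3]) for i in range(len(words) - 2)]


def build_trigram_set(corpus_sentences):
    """Collect every trigram seen in a toy reference corpus.

    Assumption: a plain set of trigrams stands in for the language model.
    """
    known = set()
    for sentence in corpus_sentences:
        known.update(trigrams(sentence.lower().split()))
    return known


def count_hits(words, known_trigrams):
    """Count how many trigrams of the candidate ordering are known."""
    return sum(1 for t in trigrams(words) if t in known_trigrams)


def best_reordering(sentence, known_trigrams):
    """Return the permutation of the sentence with the most trigram hits.

    Brute force over all n! orderings, so only practical for short
    sentences. Ties are resolved in favour of the first permutation found.
    """
    words = sentence.lower().split()
    best = max(permutations(words), key=lambda p: count_hits(p, known_trigrams))
    return " ".join(best)


# Hypothetical toy corpus used as the reference "language model".
known = build_trigram_set(["she has a dog", "the cat sat on the mat"])
print(best_reordering("has she dog a", known))
```

Because the search space grows factorially with sentence length, exhaustive enumeration quickly becomes impractical, which is the cost the abstract's permutation-reduction steps are meant to address.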