RESEARCH
Optimizing word segmentation tasks using ant colony metaheuristics
Year: | 2014 | ||||
---|---|---|---|---|---|
Authors: | George Tambouratzis | ||||
Journal: | Literary & Linguistic Computing | ||||
Volume: | 29 | ||||
Number: | 2 | ||||
Pages: | 234-254 | ||||
ISSN: | 1477-4615 | ||||
DOI: | (doi:10.1093/llc/fqt026). | ||||
Abstract: | In this article, the application of Ant-Colony Optimization (ACO) to a morphological segmentation task is described, where the aim is to analyse a set of words into their constituent stem and ending. A number of criteria for determining the optimal segmentation are evaluated comparatively while at the same time investigating more comprehensively the effectiveness of the ACO system in defining appropriate values for system parameters. Owing to the characteristics of the task at hand, particular emphasis is placed on studying the ACO process for learning sessions of a limited duration. Morphological segmentation becomes hardest in highly inflectional languages, where each stem is associated with a large number of distinct endings. Consequently, the present article investigates morphological segmentation of words from a highly inflectional language, specifically Ancient Greek, by combining pattern-recognition principles with limited linguistic knowledge. To weigh these sources of knowledge, a set of weights is used as a set of system parameters, to be optimized via ACO. ACO-based experimental results are shown to be of a higher quality than those achieved by manual optimisation or ‘randomised generate and test’ methods. This illustrates the applicability of the ACO-based approach to the morphological segmentation task. |
||||
[Bibtex] |