Publication - Linguistic Resources for CH/SSH. A parallel Greek-Bulgarian corpus: a digital resource of the shared Cultural Heritage.
RESEARCH

Linguistic Resources for CH/SSH. A parallel Greek-Bulgarian corpus: a digital resource of the shared Cultural Heritage.

Research Area:  
    
Type:  
Book chapter

 

Year: 2011
Authors: Voula Giouli; K. Simov; P. Osenova
Editor: Sporlender, C., van de Bosch, A., and Zervanou, K.
Chapter: 4.2
Publisher: Springer
Series: Language Technology for Cultural Heritage: Selected papers from the LaTeCH Workshop Series
Abstract:
There has been a long tradition in the digitization and manual documentation of cultural heritage data, yet the need for indexing and retrieval that goes beyond mere bibliographic information has only recently been recognized. This chapter reports on completed work aimed at highlighting textual cultural resources that, as of yet, remain under-exploited by creating the necessary infrastructure with the support and customization of Language Technologies (LT). The ultimate goal was to promote the study of cultural heritage of the neighboring areas of Greece and Bulgaria and to raise awareness about their common cultural identity, the focus being on literature, folklore and language. To this end, a bilingual collection of literary and folklore texts in Greek and Bulgarian was developed along with a number of accompanying resources. The authors present the methodology adopted for the automatic annotation of the textual data at various levels of linguistic analysis elaborating on the Greek and Bulgarian text processing tools that are integrated in the cross-lingual search and retrieval mechanisms, and discuss issues and problems encountered in the course of the project life-cycle.
[Bibtex]