Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/24292
Title: Resources for Machine Translation of the Macedonian Language
Authors: Stolić, Milosh
Zdravkova, Katerina 
Keywords: Natural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical analysis, Language Resources
Issue Date: 2009
Conference: ICT Innovations 2009 
Abstract: This paper focuses on creating new linguistic resources for the Macedonian language. It presents a new parallel corpus between Macedonian and Serbian language, build around the digitalized version of George Orwell's "1984", developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, published in public domain. The paper describes the retrieval, conversion, preprocessing, filtering and sentence-alignment of the corpus, then discusses and evaluates the alignment results.
URI: http://hdl.handle.net/20.500.12188/24292
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Files in This Item:
File Description SizeFormat 
Resources_for_Machine_Translation_of_the_Macedonia.pdf117.94 kBAdobe PDFView/Open
Show full item record

Page view(s)

31
checked on May 16, 2024

Download(s)

23
checked on May 16, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.