Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/24292
Title: | Resources for Machine Translation of the Macedonian Language | Authors: | Stolić, Milosh Zdravkova, Katerina |
Keywords: | Natural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical analysis, Language Resources | Issue Date: | 2009 | Conference: | ICT Innovations 2009 | Abstract: | This paper focuses on creating new linguistic resources for the Macedonian language. It presents a new parallel corpus between Macedonian and Serbian language, build around the digitalized version of George Orwell's "1984", developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, published in public domain. The paper describes the retrieval, conversion, preprocessing, filtering and sentence-alignment of the corpus, then discusses and evaluates the alignment results. | URI: | http://hdl.handle.net/20.500.12188/24292 |
Appears in Collections: | Faculty of Computer Science and Engineering: Conference papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Resources_for_Machine_Translation_of_the_Macedonia.pdf | 117.94 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.