Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/24292
Title: | Resources for Machine Translation of the Macedonian Language |
Authors: | Stolić, Milosh; Zdravkova, Katerina |
Keywords: | Natural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical analysis, Language Resources |
Issue Date: | 2009 |
Conference: | ICT Innovations 2009 |
Abstract: | This paper focuses on creating new linguistic resources for the Macedonian language. It presents a new parallel corpus between the Macedonian and Serbian languages, built around the digitized version of George Orwell's "1984", developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, published in the public domain. The paper describes the retrieval, conversion, preprocessing, filtering and sentence alignment of the corpus, then discusses and evaluates the alignment results. |
URI: | http://hdl.handle.net/20.500.12188/24292 |
Appears in Collections: | Faculty of Computer Science and Engineering: Conference papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Resources_for_Machine_Translation_of_the_Macedonia.pdf | | 117.94 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.