Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/24292
Наслов: Resources for Machine Translation of the Macedonian Language
Authors: Stolić, Milosh
Zdravkova, Katerina 
Keywords: Natural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical analysis, Language Resources
Issue Date: 2009
Conference: ICT Innovations 2009 
Abstract: This paper focuses on creating new linguistic resources for the Macedonian language. It presents a new parallel corpus between Macedonian and Serbian language, build around the digitalized version of George Orwell's "1984", developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, published in public domain. The paper describes the retrieval, conversion, preprocessing, filtering and sentence-alignment of the corpus, then discusses and evaluates the alignment results.
URI: http://hdl.handle.net/20.500.12188/24292
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Files in This Item:
File Опис SizeFormat 
Resources_for_Machine_Translation_of_the_Macedonia.pdf117.94 kBAdobe PDFView/Open
Прикажи целосна запис

Page view(s)

33
checked on 12.6.2024

Download(s)

27
checked on 12.6.2024

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.