Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/24292
DC FieldValueLanguage
dc.contributor.authorStolić, Miloshen_US
dc.contributor.authorZdravkova, Katerinaen_US
dc.date.accessioned2022-11-08T14:10:47Z-
dc.date.available2022-11-08T14:10:47Z-
dc.date.issued2009-
dc.identifier.urihttp://hdl.handle.net/20.500.12188/24292-
dc.description.abstractThis paper focuses on creating new linguistic resources for the Macedonian language. It presents a new parallel corpus between Macedonian and Serbian language, build around the digitalized version of George Orwell's "1984", developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, published in public domain. The paper describes the retrieval, conversion, preprocessing, filtering and sentence-alignment of the corpus, then discusses and evaluates the alignment results.en_US
dc.subjectNatural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical analysis, Language Resourcesen_US
dc.titleResources for Machine Translation of the Macedonian Languageen_US
dc.typeProceeding articleen_US
dc.relation.conferenceICT Innovations 2009en_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
crisitem.author.deptFaculty of Computer Science and Engineering-
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers
Files in This Item:
File Опис SizeFormat 
Resources_for_Machine_Translation_of_the_Macedonia.pdf117.94 kBAdobe PDFView/Open
Прикажи едноставен запис

Page view(s)

39
checked on 9.11.2024

Download(s)

30
checked on 9.11.2024

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.