Resources for Machine Translation of the Macedonian Language

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/24292

DC Field	Value	Language
dc.contributor.author	Stolić, Milosh	en_US
dc.contributor.author	Zdravkova, Katerina	en_US
dc.date.accessioned	2022-11-08T14:10:47Z	-
dc.date.available	2022-11-08T14:10:47Z	-
dc.date.issued	2009	-
dc.identifier.uri	http://hdl.handle.net/20.500.12188/24292	-
dc.description.abstract	This paper focuses on creating new linguistic resources for the Macedonian language. It presents a new Macedonian-Serbian parallel corpus, built around the digitized version of George Orwell's "1984" developed during the MULTEXT-EAST project. The original corpus is expanded with news articles from the Southeast European Times newspaper, published in the public domain. The paper describes the retrieval, conversion, preprocessing, filtering and sentence alignment of the corpus, then discusses and evaluates the alignment results.	en_US
dc.subject	Natural Language Processing, Computational Linguistics, Bilingual Machine Translation, Statistical analysis, Language Resources	en_US
dc.title	Resources for Machine Translation of the Macedonian Language	en_US
dc.type	Proceeding article	en_US
dc.relation.conference	ICT Innovations 2009	en_US
item.fulltext	With Fulltext	-
item.grantfulltext	open	-
crisitem.author.dept	Faculty of Computer Science and Engineering	-
Appears in Collections:	Faculty of Computer Science and Engineering: Conference papers

File	Description	Size	Format
Resources_for_Machine_Translation_of_the_Macedonia.pdf		117.94 kB	Adobe PDF	View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.