Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/24282
DC FieldValueLanguage
dc.contributor.authorBonchanoski, Martinen_US
dc.contributor.authorZdravkova, Katerinaen_US
dc.date.accessioned2022-11-08T13:22:46Z-
dc.date.available2022-11-08T13:22:46Z-
dc.date.issued2018-
dc.identifier.urihttp://hdl.handle.net/20.500.12188/24282-
dc.description.abstractThis paper presents the creation of machine learning based systems for Part-of-speech tagging of Macedonian language. Four well-known PoS tagger systems implemented for English and Slavic languages: TnT, cyclic dependency network, guided learning framework for bidirectional sequence classification, and dynamic features induction were trained. Orwell’s novel “1984” was manually tagged from the authors and it was used split into training and test set. After the training of the models, a comparison between the models was made. At the end, a POS tagger with an accuracy that reaches 97.5% was achieved, making it very appropriate for the future grammatical tagging of the National corpus of Macedonian language, which is currently in its initial stage. The Part-of-speech tagger that was create is published online and free to use.en_US
dc.relation.ispartofComputer Science and Information Systemsen_US
dc.subjectPart-of-speech tagging, TnT tagger, Cyclic dependency network, Guided learning for bidirectional sequence classification, Dynamic features inductionen_US
dc.titleLearning syntactic tagging of Macedonian languageen_US
dc.typeArticleen_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
crisitem.author.deptFaculty of Computer Science and Engineering-
Appears in Collections:Faculty of Computer Science and Engineering: Journal Articles
Files in This Item:
File Опис SizeFormat 
1820-02141800027B.pdf676.17 kBAdobe PDFView/Open
Прикажи едноставен запис

Page view(s)

40
checked on 2.5.2025

Download(s)

11
checked on 2.5.2025

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.