Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/24282
Title: Learning syntactic tagging of Macedonian language
Authors: Bonchanoski, Martin
Zdravkova, Katerina 
Keywords: Part-of-speech tagging, TnT tagger, Cyclic dependency network, Guided learning for bidirectional sequence classification, Dynamic features induction
Issue Date: 2018
Journal: Computer Science and Information Systems
Abstract: This paper presents the creation of machine learning based systems for Part-of-speech tagging of Macedonian language. Four well-known PoS tagger systems implemented for English and Slavic languages: TnT, cyclic dependency network, guided learning framework for bidirectional sequence classification, and dynamic features induction were trained. Orwell’s novel “1984” was manually tagged from the authors and it was used split into training and test set. After the training of the models, a comparison between the models was made. At the end, a POS tagger with an accuracy that reaches 97.5% was achieved, making it very appropriate for the future grammatical tagging of the National corpus of Macedonian language, which is currently in its initial stage. The Part-of-speech tagger that was create is published online and free to use.
URI: http://hdl.handle.net/20.500.12188/24282
Appears in Collections:Faculty of Computer Science and Engineering: Journal Articles

Files in This Item:
File Description SizeFormat 
1820-02141800027B.pdf676.17 kBAdobe PDFView/Open
Show full item record

Page view(s)

31
checked on Apr 29, 2024

Download(s)

6
checked on Apr 29, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.