Learning rules for morphological analysis and synthesis of Macedonian nouns
Journal
Proceedings of SiKDD 2005 (Conference on Data Mining and Data Warehouses)
Date Issued
2005-10
Author(s)
Ivanovska, Aneta
Džeroski, Sasho
Erjavec, Tomaž
Abstract
This paper presents a machine learning approach to the
morphological analysis and synthesis of Macedonian
nouns. For training and testing we used the nouns
occurring in Orwell’s “1984”. The paper reports
experimental results of applying the learned rules both
to analysis and to noun formation (synthesis).
Training used the whole set of Macedonian nouns from
“1984”, and the rules were tested by 10-fold cross-validation.
All potential noun forms generated by the learned
rules were compared against 275,000 Macedonian noun forms.
The resulting accuracy of 92-97% encourages applying the
same approach to all categories of Macedonian words.
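The 10-fold cross-validation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the learner below is a deliberately simple stand-in that picks, for each final letter of a lemma, the most frequent suffix edit seen in training. All function names (`k_fold_splits`, `edit_of`, `learn_rules`, `synthesize`, `cross_validate`) are hypothetical, and the input is assumed to be a list of (lemma, inflected form) pairs.

```python
import random
from collections import Counter, defaultdict


def k_fold_splits(items, k=10, seed=0):
    """Yield (train, test) pairs; each item appears in exactly one test fold."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    folds = [shuffled[i::k] for i in range(k)]
    for i in range(k):
        train = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield train, folds[i]


def edit_of(lemma, form):
    """Describe lemma -> form as (characters to strip, suffix to append)."""
    i = 0
    while i < min(len(lemma), len(form)) and lemma[i] == form[i]:
        i += 1
    return len(lemma) - i, form[i:]


def learn_rules(pairs):
    """Toy learner (an assumption, not the paper's method):
    most frequent edit per final letter of the lemma."""
    votes = defaultdict(Counter)
    for lemma, form in pairs:
        votes[lemma[-1]][edit_of(lemma, form)] += 1
    return {last: c.most_common(1)[0][0] for last, c in votes.items()}


def synthesize(rules, lemma):
    """Apply the learned edit; leave the lemma unchanged if no rule matches."""
    strip, add = rules.get(lemma[-1], (0, ""))
    return lemma[:len(lemma) - strip] + add


def cross_validate(pairs, k=10):
    """Mean synthesis accuracy over the non-empty test folds."""
    scores = []
    for train, test in k_fold_splits(pairs, k):
        if not test:
            continue
        rules = learn_rules(train)
        correct = sum(synthesize(rules, lemma) == form for lemma, form in test)
        scores.append(correct / len(test))
    return sum(scores) / len(scores)
```

Calling `cross_validate(pairs)` on such a list returns the mean per-fold accuracy. Checking generated forms against a large reference list of noun forms, as the abstract describes, would be a separate step not covered by this sketch.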
File(s)
Name
SIKDD05-mklem.pdf
Size
369.1 KB
Format
Adobe PDF
Checksum
(MD5):b4e40d839949174b1943eb832b7d62bf
