Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/25670
DC FieldValueLanguage
dc.contributor.authorJaneva, Teaen_US
dc.contributor.authorMishev, Kostadinen_US
dc.contributor.authorSimjanoska, Monikaen_US
dc.date.accessioned2023-02-13T09:38:32Z-
dc.date.available2023-02-13T09:38:32Z-
dc.date.issued2022-
dc.identifier.urihttp://hdl.handle.net/20.500.12188/25670-
dc.description.abstractVoice recognition is the ability of a machine to identify a person based on their unique voiceprint. As this task is becoming more important and dominant in everyday people’s lives, this paper is testing different approaches for its implementation. Using a multilanguage database and working with the different frequencies’ characteristics, five machine learning models such as Random Forest, XGBoost, MLP, SVM and Gradient Boosting, along with CNN deep learning model were implemented. The models were trained on three different tasks, gender prediction, age range prediction, and combined gender and age range prediction. These models were evaluated using accuracy, F1-score and MCC score. The results showed that Random Forest outperforms other models by achieving an accuracy of more than 0.9 for all the three classification tasks.en_US
dc.subjectVoice recognition, Deep learning, Machine learning, Explainable Machine learningen_US
dc.titleLanguage Agnostic Voice Recognition Modelen_US
dc.typeProceedingsen_US
dc.relation.conference19th Conference for Informatics and Information Technology 2022 (CIIT) 2022en_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers
Files in This Item:
File Опис SizeFormat 
CIIT_2022_1.pdf259.68 kBAdobe PDFView/Open
Прикажи едноставен запис

Page view(s)

56
checked on 4.5.2025

Download(s)

98
checked on 4.5.2025

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.