Language Agnostic Voice Recognition Model

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/25670

Title:	Language Agnostic Voice Recognition Model
Authors:	Janeva, Tea; Mishev, Kostadin; Simjanoska, Monika
Keywords:	Voice recognition, Deep learning, Machine learning, Explainable Machine learning
Issue Date:	2022
Conference:	19th Conference for Informatics and Information Technology (CIIT 2022)
Abstract:	Voice recognition is the ability of a machine to identify a person based on their unique voiceprint. As this task becomes more important and prevalent in people’s everyday lives, this paper tests different approaches to its implementation. Using a multilanguage database and working with the characteristics of the different frequencies, five machine learning models (Random Forest, XGBoost, MLP, SVM and Gradient Boosting) and a CNN deep learning model were implemented. The models were trained on three different tasks: gender prediction, age range prediction, and combined gender and age range prediction. They were evaluated using accuracy, F1-score and the Matthews correlation coefficient (MCC). The results showed that Random Forest outperformed the other models, achieving an accuracy of more than 0.9 on all three classification tasks.
URI:	http://hdl.handle.net/20.500.12188/25670
Appears in Collections:	Faculty of Computer Science and Engineering: Conference papers
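
Note on the evaluation metrics: the abstract names accuracy, F1-score and MCC but does not say how they were computed. The following is a minimal sketch in plain Python, not the authors' code. It assumes macro-averaged F1 and the multiclass generalisation of MCC; the function names and the toy gender labels are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumption: macro averaging)."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def mcc(y_true, y_pred):
    """Multiclass Matthews correlation coefficient; 0.0 if undefined."""
    n = len(y_true)
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    true_counts = Counter(y_true)   # how often each class truly occurs
    pred_counts = Counter(y_pred)   # how often each class is predicted
    labels = set(true_counts) | set(pred_counts)
    cov_tp = correct * n - sum(true_counts[k] * pred_counts[k] for k in labels)
    cov_tt = n * n - sum(v * v for v in true_counts.values())
    cov_pp = n * n - sum(v * v for v in pred_counts.values())
    denom = sqrt(cov_tt * cov_pp)
    return cov_tp / denom if denom else 0.0


# Hypothetical toy labels for a gender prediction task.
y_true = ["f", "f", "f", "m", "m", "m"]
y_pred = ["f", "f", "m", "m", "m", "m"]
print(accuracy(y_true, y_pred), macro_f1(y_true, y_pred), mcc(y_true, y_pred))
```

Unlike accuracy, MCC accounts for how the errors are distributed across classes, which makes it a useful complement when classes such as age ranges are imbalanced.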

File	Description	Size	Format
CIIT_2022_1.pdf		259.68 kB	Adobe PDF
