Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/27402
Наслов: Representation Learning for Automatic Speech Recognition: A Review of Speech-to-Text Methods
Authors: Mitreska, Maja
Penkova, Blagica
Mishev, Kostadin 
Simjanoska, Monika
Keywords: Speech-to-text, representation learning
Issue Date: јул-2023
Publisher: Ss Cyril and Methodius University in Skopje, Faculty of Computer Science and Engineering, Republic of North Macedonia
Series/Report no.: CIIT 2023 papers;27;
Conference: 20th International Conference on Informatics and Information Technologies - CIIT 2023
Abstract: Representation learning has emerged as a promising approach to overcoming the limitations of discriminative repre sentations from the raw speech signal. In this review, we cover a range of speech-to-text methods that employ representation learning, including deep neural networks (DNNs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformer-based models. The advantages and limitations of each approach are described, as well as recent advances in pretraining techniques such as contrastive predictive coding (CPC) and masked language modelling (MLM). The reviewed papers are divided according to their novelty, their approaches and their type of representation learning models.
URI: http://hdl.handle.net/20.500.12188/27402
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Files in This Item:
File Опис SizeFormat 
CIIT2023_paper_27.pdf9.19 MBAdobe PDFView/Open
Прикажи целосна запис

Page view(s)

114
checked on 17.5.2024

Download(s)

77
checked on 17.5.2024

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.