Overview of Methods for Data Augmentation for Speech-to-Text and Text-to-Speech

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/27404

Title:	Overview of Methods for Data Augmentation for Speech-to-Text and Text-to-Speech
Authors:	Penkova, Blagica Mitreska, Maja Mishev, Kostadin Simjanoska, Monika
Keywords:	Data augmentation, Speech-to-text, Text-tospeech
Issue Date:	Jul-2023
Publisher:	Ss Cyril and Methodius University in Skopje, Faculty of Computer Science and Engineering, Republic of North Macedonia
Series/Report no.:	CIIT 2023 papers;29;
Conference:	20th International Conference on Informatics and Information Technologies - CIIT 2023
Abstract:	In the field of machine learning and deep learning, data augmentation is a widely used technique to expand the amount of training data available. This involves altering existing data instances or generating new synthetic data, with the aim of enhancing the quantity and variability of the training set.It has shown to be especially useful when working with low resource languages and domains, where datasets are limited. This paper provides an overview of the data augmentation methods used for speech-related tasks, specifically for speech to-text and text-to-speech applications.The goal of this paper is to provide researchers and practitioners with a comprehensive understanding of the data augmentation methods available for speech-related tasks, their strengths and potential applications.
URI:	http://hdl.handle.net/20.500.12188/27404
Appears in Collections:	Faculty of Computer Science and Engineering: Conference papers

File	Description	Size	Format
CIIT2023_paper_29.pdf		9.19 MB	Adobe PDF	View/Open

147

checked on May 4, 2025

170

checked on May 4, 2025

Check