Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/27404
Title: Overview of Methods for Data Augmentation for Speech-to-Text and Text-to-Speech
Authors: Penkova, Blagica
Mitreska, Maja
Mishev, Kostadin 
Simjanoska, Monika
Keywords: Data augmentation, Speech-to-text, Text-tospeech
Issue Date: Jul-2023
Publisher: Ss Cyril and Methodius University in Skopje, Faculty of Computer Science and Engineering, Republic of North Macedonia
Series/Report no.: CIIT 2023 papers;29;
Conference: 20th International Conference on Informatics and Information Technologies - CIIT 2023
Abstract: In the field of machine learning and deep learning, data augmentation is a widely used technique to expand the amount of training data available. This involves altering existing data instances or generating new synthetic data, with the aim of enhancing the quantity and variability of the training set.It has shown to be especially useful when working with low resource languages and domains, where datasets are limited. This paper provides an overview of the data augmentation methods used for speech-related tasks, specifically for speech to-text and text-to-speech applications.The goal of this paper is to provide researchers and practitioners with a comprehensive understanding of the data augmentation methods available for speech-related tasks, their strengths and potential applications.
URI: http://hdl.handle.net/20.500.12188/27404
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Files in This Item:
File Description SizeFormat 
CIIT2023_paper_29.pdf9.19 MBAdobe PDFView/Open
Show full item record

Page view(s)

92
checked on May 3, 2024

Download(s)

128
checked on May 3, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.