Repository logo
Communities & Collections
Research Outputs
Fundings & Projects
People
Statistics
User Manual
Have you forgotten your password?
  1. Home
  2. Faculty of Computer Science and Engineering
  3. Faculty of Computer Science and Engineering: Conference papers
  4. Representation Learning for Automatic Speech Recognition: A Review of Speech-to-Text Methods
Details

Representation Learning for Automatic Speech Recognition: A Review of Speech-to-Text Methods

Date Issued
2023-07
Author(s)
Mitreska, Maja
Penkova, Blagica
Simjanoska, Monika
Abstract
Representation learning has emerged as a promising
approach to overcoming the limitations of discriminative repre sentations from the raw speech signal. In this review, we cover
a range of speech-to-text methods that employ representation
learning, including deep neural networks (DNNs), convolutional
neural networks (CNNs), recurrent neural networks (RNNs),
and transformer-based models. The advantages and limitations
of each approach are described, as well as recent advances
in pretraining techniques such as contrastive predictive coding
(CPC) and masked language modelling (MLM). The reviewed
papers are divided according to their novelty, their approaches
and their type of representation learning models.
Subjects

Speech-to-text, repre...

File(s)
Loading...
Thumbnail Image
Name

CIIT2023_paper_27.pdf

Size

8.97 MB

Format

Adobe PDF

Checksum

(MD5):bdc73d0badd18c196562474a464c73e6

⠀

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Accessibility settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
Repository logo COAR Notify