Application of russian language phonemics to generate macedonian speech recognition model using sphinx

Mingov, Riste; Zdravevski, Eftim; Lameski, Petre

Application of russian language phonemics to generate macedonian speech recognition model using sphinx

Date Issued

2016-09

Author(s)

Mingov, Riste

Abstract

Speech-recognition provides possibilities for improved user
experience and new level of features in various applications. Although
there are widely available open-source and proprietary systems for speech
recognition and synthesis for the more widely used languages, there are
no available and robust enough systems for the Macedonian language.
Building a speech recognition system requires a lot of high-quality recordings, which is expensive operation. To overcome this issue we propose
applying some background knowledge and using existing speech synthesis engines to train speech recognition system. Since the Macedonian
language belongs to the group of Slavic languages, there are many similarities between them. In this paper we apply this fact to generate a
speech synthesis module for the Macedonian language based on the Russian language model. Furthermore, we use the speech synthesis module
to build a speech recognition module using the CMUSphinx Toolkit.
Finally the results are presented and they confirm that a system with
substantial quality can be built without the need of manual recordings
in the specific language.

Subjects

speech recognition, a...

File(s)

Name

SpeechRecognition_RM1.pdf

Size

324.09 KB

Format

Adobe PDF

Checksum

(MD5):d199435a56874651f7b42937f80728cc