Articles

HIDDEN MARKOV MODELS BASED INDONESIAN VISEME MODEL FOR NATURAL SPEECH WITH AFFECTION

Main Article Content

Endang Setyati Mauridhi Hery Purnomo Surya Sumpeno Joan Santoso

Abstract

In a communication using texts input, viseme (visual phonemes) is derived from a group of phonemes having similar visual appearances. Hidden Markov model (HMM) has been a popular mathematical approach for sequence classification such as speech recognition. For speech emotion recognition, a HMM is trained for each emotion and an unknown sample is classified according to the model which illustrate the derived feature sequence best. Viterbi algorithm, HMM is used for guessing the most possible state sequence of observable states. In this work, first stage, we defined system of an Indonesian viseme set and the associated mouth shapes, namely system of text input segmentation. The second stage, we defined a choice of one of affection type as input in the system. The last stage, we experimentally using Trigram HMMs for generating the viseme sequence to be used for synchronized mouth shape and lip movements. The whole system is interconnected in a sequence. The final system produced a viseme sequence for natural speech of Indonesian sentences with affection. We show through various experiments that the proposed, the results in about 82,19% relative improvement in classification accuracy.

Article Details

How to Cite
SETYATI, Endang et al. HIDDEN MARKOV MODELS BASED INDONESIAN VISEME MODEL FOR NATURAL SPEECH WITH AFFECTION. Kursor, [S.l.], v. 8, n. 3, p. 102-122, dec. 2016. ISSN 2301-6914. Available at: <https://kursorjournal.org/index.php/kursor/article/view/61>. Date accessed: 12 dec. 2019. doi: https://doi.org/10.28961/kursor.v8i3.61.
Section
Articles