Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems

Eslami, M.; Sayadianii, A.

doi:10.22060/miscj.2011.160

	Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
AUT Journal of Modeling and Simulation
مقاله 2، دوره 43، شماره 2، 2011، صفحه 11-17 اصل مقاله (622.5 K)
نوع مقاله: Research Article
شناسه دیجیتال (DOI): 10.22060/miscj.2011.160
نویسندگان
M. Eslami^* ¹؛ A. Sayadianii²
¹Corresponding Author, Department of Electrical Engineering, Amirkabir University of Technology, Tehran, Iran (e-mail: ee35as@aut.ac.ir).
²Department of Product and Services, Tamin Telecom Co.(3G mobile operator), Tehran, Iran (e-mail: m.eslami@tamintelecom.ir)
چکیده
This article aims to examine methods of optimizing GMM-based voice conversion systems performance in which GMM method is introduced as the basic method for improvement of voice conversion systems performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality reduction. In this paper, after introducing GMM2 method, several GMM models will be used to model each phoneme. Furthermore, in the stage of corresponding the clusters of each state, before applying Dynamic Time Warping algorithm, we use a LMR conversion for further correspondence among the parameters of two corresponding states of two speakers. Another reason for quality reduction in voice conversion system is that the precision of speech signal parameters was underestimated. In order to overcome such a problem, Generalized Harmonic Model is introduced which is replaced by sinusoid harmonic model applied in GMM2 giving another method called GMM3. Finally, we will present GMM4 method, the objective of which is to promote the system performance with limited data and a restricted number of demi-syllables to train conversion functions.
کلیدواژه‌ها
High quality voice conversion؛ Gaussian mixed model (GMM)؛ Generalized Harmonic Model (GHM)؛ spectral conversion
مراجع

آمار تعداد مشاهده مقاله: 2,008 تعداد دریافت فایل اصل مقاله: 1,415

پیوندهای مفید

دانشگاه صنعتی امیرکبیر

آمار

تعداد نشریات	9
تعداد شماره‌ها	466
تعداد مقالات	5,842
تعداد مشاهده مقاله	8,767,243
تعداد دریافت فایل اصل مقاله	7,371,324