Speech Recognition System Using Hilbert Huang Transform and DHMM

S. Dhanalakshmi S.Satish, Dr.C. Venkatesh

How to Cite

Dr.C. Venkatesh, S. D. S. (2013). Speech Recognition System Using Hilbert Huang Transform and DHMM. International Journal of Scientific Research And Education, 1(1). Retrieved from http://ijsae.in/index.php/ijsae/article/view/187

Published: May 1, 2013

Issue

Vol. 1 No. 1 (2013)

Section

Articles

S. Dhanalakshmi S.Satish, Dr.C. Venkatesh

Abstract

This paper presents robust speech
recognition system in the presence of noise.
Discrete Hidden Markov Model (DHMM) is used
for mainly reducing the computation burden of
voice recognition which in turn increases speed.
Hilbert Huang Transform (HHT) is an empirical
approach to decompose any complicated data set
into a finite number of Intrinsic Mode Functions
(IMF) to obtain the instantaneous frequency data.
This Empirical Mode Decomposition (EMD)
method of HHT operates in time domain on the
local characteristic time scale of the data, making
it adaptive and highly efficient to work with any
nonlinear and nonstationary dataâ€™s unlike Fourier
transforms. The Mel Frequency Spectrum
Coefficients (MFCC) is derived from cepstral
coefficients of IMFs. The features are then
weighted and summed to get the original speech
reconstructed signal. Genetic Algorithm (GA) was
designed for each IMF to get better optimal
solution. This results in significant reduction in
time measurement, and thus it improves the
speech recognition rate

##references## ##ver##

J. W. Hung and W. H. Tu, â€œIncorporating
codebook and utterance information in cepstral
statistics normalization techniques for robust
speech recognition in additive noise
environments,â€ IEEE Signal Process. Lett., vol.
16, no. 6, pp. 473â€“476, Jun. 2009.
[2]. L. D. Persia, D. Milone, H. L. Rufiner, and M.
Yanagida, â€œPerceptual evaluation of blind source
separation for robust speech recognition,â€ Signal
Process., vol. 88, no. 10, pp. 2578â€“2583, 2008.
[3]. C. T. Ishi, S. Matsuda, T. Kanda, T. Jitsuhiro,
H. Ishiguro, S. Nakamura, and N. Hagita, â€œA
robust speech recognition system for
communication robots in noisy environments,â€
IEEE Trans. Robot., vol. 24, no. 3, pp. 759â€“763,
Jun. 2008.
[4]. D. Wang, H. Leung, A. P. Kurian, H. J. Kim,
and H. Yoon, â€œA deconvolutive neural network
for speech classification with applications to home
service robot,â€ IEEE Trans. Instrum. Meas., vol.
59, no. 12, pp. 3237â€“ 3243, Dec. 2010.
[5] L. Buera, A. Miguel, O. Saz, A. Ortega, and E.
Lleida, â€œUnsupervised data-driven feature vector
normalization with acoustic model adaptation for
robust speech recognition,â€ IEEE Trans. Audio,
Speech, Lang. Process., vol. 18, no. 2, pp. 296â€“
309, Feb. 2010.
[6] A. Sankar and C. H. Lee, â€œA maximumlikelihood
approach to stochastic matching for
robust speech recognition,â€ IEEE Trans. Speech
Audio Process., vol. 4, no. 3, pp. 190â€“202, May
1996.
[7] C.W. Hsu and L. S. Lee, â€œHigher order
cepstral moment normalization for improved
robust speech recognition,â€ IEEE Trans. Audio,
Speech, Lang. Process., vol. 17, no. 2, pp. 205â€“
220, Feb. 2009.
[8] C. T. Ishi, S. Matsuda, T. Kanda, T. Jitsuhiro,
H. Ishiguro, S. Nakamura, and N. Hagita, â€œA
robust speech recognition system for
communication robots in noisy environments,â€
IEEE Trans. Robot., vol. 24, no. 3, pp. 759â€“763,
Jun. 2008.
[9] Y. Zhan, H. Leung, K. C. Kwak, and H. Yoon,
â€œAutomated speaker recognition for home service
robots using genetic algorithm and
Dempsterâ€“Shafer fusion technique,â€ IEEE Trans.
Instrum. Meas., vol. 58, no. 9, pp. 3058â€“3068,
Sep. 2009.
[10] Y. Tsao and C. H. Lee, â€œAn ensemble
speaker and speaking environment modeling
approach to robust speech recognition,â€ IEEE
Trans. Audio, Speech, Lang. Process., vol. 17, no.
5, pp. 1025â€“1037, Jul. 2009.
[11] A. Sankar and C. H. Lee, â€œA maximumlikelihood
approach to stochastic matching for
robust speech recognition,â€ IEEE Trans. Speech
S. Dhanalakshmi IJSRE volume 1 issue 1 May 2013 Page 9
Audio Process., vol. 4, no. 3, pp. 190â€“202, May
1996
[12] W.Wang, X. Li, and R. Zhang, â€œSpeech
detection based on Hilbertâ€“Huang transform,â€ in
Proc. 1st Int. Multi-Symp. Comput. Comput. Sci.,
Jun. 20â€“24, 2006, vol. 1, pp. 290â€“293
[13] N. E. Huang, â€œThe empirical mode
decomposition and the Hilbert spectrum for
nonlinear and non-stationary time series analysis,â€
Proc. R. Soc. Lond. A, vol. 454, no. 1971, pp.
903â€“995, Mar. 1998
[14] M. K. I. Molla and K. Hirose, â€œSinglemixture
audio source separation by subspace
decomposition of Hilbert spectrum,â€ IEEE Trans.
Acoust., Speech, Signal Process., vol. 15, no. 3,
pp. 893â€“900, Mar. 2007
[15] S. T. Pan, â€œDesign of robust D-stable IIR
filters using genetic algorithms with embedded
stability criterion,â€ IEEE Trans. Signal Process.,
vol. 57, no. 8, pp. 3008â€“3016, Aug. 2009.
[16] M. K. I. Molla and K. Hirose, â€œSinglemixture
audio source separation by subspace
decomposition of Hilbert spectrum,â€ IEEE Trans.
Acoust., Speech, Signal Process., vol. 15, no. 3,
pp. 893â€“900, Mar. 2007
[17] J. A. Rosero, L. Romeral, J. A. Ortega, and E.
Rosero, â€œShort-circuit detection by means of
empirical mode decomposition and Wignerâ€“Ville
distribution for PMSM running under dynamic
condition,â€ IEEE Trans. Ind. Electron., vol. 56,
no. 11, pp. 4534â€“4547, Nov. 2009
[18] S. Windmann and R. Haeb-Umbach,
â€œParameter estimation of a statespace model of
noise for robust speech recognition,â€ IEEE Trans.
Audio, Speech, Lang. Process., vol. 17, no. 8, pp.
1577â€“1590, Nov. 2009
[19] M. J. F. Gales and S. J. Young, â€œRobust
speech recognition in additive and convolutional
noise using parallel model combination,â€ Comput.
Speech Lang., vol. 9, no. 4, pp. 289â€“307, Oct.
1995
[20] C. H. Lee, â€œOn stochastic feature and model
compensation approaches to robust speech
recognition,â€ Speech Commun., vol. 25, no. 1â€“3,
pp. 29â€“47, Aug. 1998.

Speech Recognition System Using Hilbert Huang Transform and DHMM

Speech Recognition System Using Hilbert Huang Transform and DHMM

Abstract

##references## ##ver##

Citado por

Article Sidebar

Main Article Content

Abstract

Article Details

##references## ##ver##

Citado por