Show simple item record

dc.contributor.advisorClements, Mark A.
dc.contributor.authorMushtaq, Aleem
dc.date.accessioned2013-09-19T12:19:03Z
dc.date.available2013-09-19T12:19:03Z
dc.date.created2013-08
dc.date.issued2013-05-15
dc.date.submittedAugust 2013
dc.identifier.urihttp://hdl.handle.net/1853/48982
dc.description.abstractThe performance of automatic speech recognition systems often degrades in adverse conditions where there is a mismatch between training and testing conditions. This is true for most modern systems which employ Hidden Markov Models (HMMs) to decode speech utterances. One strategy is to map the distorted features back to clean speech features that correspond well to the features used for training of HMMs. This can be achieved by treating the noisy speech as the distorted version of the clean speech of interest. Under this framework, we can track and consequently extract the underlying clean speech from the noisy signal and use this derived signal to perform utterance recognition. Particle filter is a versatile tracking technique that can be used where often conventional techniques such as Kalman filter fall short. We propose a particle filters based algorithm to compensate the corrupted features according to an additive noise model incorporating both the statistics from clean speech HMMs and observed background noise to map noisy features back to clean speech features. Instead of using specific knowledge at the model and state levels from HMMs which is hard to estimate, we pool model states into clusters as side information. Since each cluster encompasses more statistics when compared to the original HMM states, there is a higher possibility that the newly formed probability density function at the cluster level can cover the underlying speech variation to generate appropriate particle filter samples for feature compensation. Additionally, a dynamic joint tracking framework to monitor the clean speech signal and noise simultaneously is also introduced to obtain good noise statistics. In this approach, the information available from clean speech tracking can be effectively used for noise estimation. The availability of dynamic noise information can enhance the robustness of the algorithm in case of large fluctuations in noise parameters within an utterance. Testing the proposed PF-based compensation scheme on the Aurora 2 connected digit recognition task, we achieve an error reduction of 12.15% from the best multi-condition trained models using this integrated PF-HMM framework to estimate the cluster-based HMM state sequence information. Finally, we extended the PFC framework and evaluated it on a large-vocabulary recognition task, and showed that PFC works well for large-vocabulary systems also.
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.publisherGeorgia Institute of Technology
dc.subjectParticle filter
dc.subjectHidden Markov model
dc.subjectRobust speech recognition
dc.subjectClustering
dc.subjectMarkov chain Monte Carlo
dc.subject.lcshHidden Markov models
dc.subject.lcshSpeech perception
dc.subject.lcshMonte Carlo method
dc.subject.lcshAlgorithms
dc.titleAn integrated approach to feature compensation combining particle filters and Hidden Markov Models for robust speech recognition
dc.typeDissertation
dc.description.degreePh.D.
dc.contributor.departmentElectrical and Computer Engineering
thesis.degree.levelDoctoral
dc.contributor.committeeMemberLee, Chin-Hui
dc.contributor.committeeMemberZhang, Fumin
dc.contributor.committeeMemberGoldsman, David M.
dc.contributor.committeeMemberAlRegib, Ghassan
dc.date.updated2013-09-19T12:19:03Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record