
dc.contributor.advisor: Lee, Chin-Hui
dc.contributor.advisor: Siniscalchi, Sabato M.
dc.contributor.author: Wang, Sicheng
dc.date.accessioned: 2021-06-10T16:51:54Z
dc.date.available: 2021-06-10T16:51:54Z
dc.date.created: 2021-05
dc.date.issued: 2021-03-18
dc.date.submitted: May 2021
dc.identifier.uri: http://hdl.handle.net/1853/64715
dc.description.abstract: Noise presents a severe challenge in speech communication and processing systems. Speech enhancement aims at removing the interference and restoring speech quality. It is an essential step in the speech processing pipeline of many modern electronic devices, such as mobile phones and smart speakers. Traditionally, speech engineers have relied on signal processing techniques, such as spectral subtraction or Wiener filtering. Since the advent of deep learning, data-driven methods have offered an alternative solution to speech enhancement. Researchers and engineers have proposed various neural network architectures to map noisy speech features into clean ones. In this thesis, we refer to this class of mapping-based data-driven techniques collectively as the direct method of speech enhancement. The output speech from direct mapping methods usually contains residual noise and unpleasant distortion if the speech power is low relative to the noise power or the background noise is very complex. The former adverse condition corresponds to a low signal-to-noise ratio (SNR); the latter implies difficult noise types. To overcome such difficulty, researchers have proposed improving the SNR of the speech signal incrementally during enhancement, an approach known as SNR-progressive speech enhancement. This design breaks the direct mapping problem down into manageable sub-tasks. Inspired by this previous work, we propose a multi-stage indirect approach to speech enhancement in challenging noise conditions. Unlike SNR-progressive speech enhancement, we gradually transform noisy speech in difficult background noise into speech in simple noise types. The thesis focuses on the characterization of background noise, speech transformation techniques, and the integration of an indirect speech enhancement system.
dc.format.mimetype: application/pdf
dc.language.iso: en_US
dc.publisher: Georgia Institute of Technology
dc.subject: Speech enhancement
dc.subject: Curriculum learning
dc.subject: Noise characterization
dc.subject: Supervised learning
dc.subject: Unsupervised learning
dc.title: An Indirect Speech Enhancement Framework Through Intermediate Noisy Speech Targets
dc.type: Dissertation
dc.description.degree: Ph.D.
dc.contributor.department: Electrical and Computer Engineering
thesis.degree.level: Doctoral
dc.contributor.committeeMember: Anderson, David V.
dc.contributor.committeeMember: Moore, Elliot
dc.contributor.committeeMember: Lerch, Alexander
dc.date.updated: 2021-06-10T16:51:55Z

