DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement

DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement

EnglishEbook
Hendriks, Richard C.
Springer International Publishing
EAN: 9783031025648
Available online
CZK 770
Common price CZK 855
Discount 10%
pc

Detailed information

As speech processing devices like mobile phones, voice controlled devices, and hearing aids have increased in popularity, people expect them to work anywhere and at any time without user intervention. However, the presence of acoustical disturbances limits the use of these applications, degrades their performance, or causes the user difficulties in understanding the conversation or appreciating the device. A common way to reduce the effects of such disturbances is through the use of single-microphone noise reduction algorithms for speech enhancement. The field of single-microphone noise reduction for speech enhancement comprises a history of more than 30 years of research. In this survey, we wish to demonstrate the significant advances that have been made during the last decade in the field of discrete Fourier transform domain-based single-channel noise reduction for speech enhancement.Furthermore, our goal is to provide a concise description of a state-of-the-art speech enhancement system, and demonstrate the relative importance of the various building blocks of such a system. This allows the non-expert DSP practitioner to judge the relevance of each building block and to implement a close-to-optimal enhancement system for the particular application at hand. Table of Contents: Introduction / Single Channel Speech Enhancement: General Principles / DFT-Based Speech Enhancement Methods: Signal Model and Notation / Speech DFT Estimators / Speech Presence Probability Estimation / Noise PSD Estimation / Speech PSD Estimation / Performance Evaluation Methods / Simulation Experiments with Single-Channel Enhancement Systems / Future Directions
EAN 9783031025648
ISBN 3031025644
Binding Ebook
Publisher Springer International Publishing
Publication date May 31, 2022
Language English
Authors Gerkmann, Timo; Hendriks, Richard C.; Jensen, Jesper
Series Synthesis Lectures on Speech and Audio Processing