Schädler, Marc Reni (2016) Robust automatic speech recognition and modeling of auditory discrimination experiments with auditory spectro-temporal features. Monografie. Bis-Verlag, ISBN 978-3-8142-2333-9.


Volltext (2567Kb)


Automatic speech recognition (ASR) systems still do not perform as well as human listeners under realistic conditions. The unmatched ability of humans to understand speech in most difficult acoustic conditions originates from the superior properties of their auditory system. The aim of this thesis is to improve the recognition performance of ASR systems in difficult acoustic conditions by carefully integrating auditory signal processing strategies. To this end, the physiologically inspired extraction of spectro-temporal modulation patterns was successfully integrated into the front-end of a standard ASR system. Further, the joint spectro-temporal processing could be separated into independent temporal and spectral processes. To investigate the reason for the remaining "man-machine-gap" in recognition performance, a range of critical auditory discrimination tasks were performed using ASR systems. The comparison with empirical data showed that the separate spectro-temporal modulation front-end provides a suitable auditory model and revealed the importance of across-frequency processing in speech recognition.

Item Type: Monograph (Art der Monografie: Project Report)
Uncontrolled Keywords: Hochschulschrift
Subjects: Science and mathematics > Physics
Divisions: Miscellaneous > BIS Publishing House
Date Deposited: 08 Sep 2016 09:04
Last Modified: 08 Sep 2016 09:04
URN: urn:nbn:de:gbv:715-oops-29259

Actions (login required)

View Item View Item

Document Downloads

More statistics for this item...