Simon ReceveurContributions to Turbo Automatic Speech Recognition | |||||||
| |||||||
ISBN: | 978-3-8440-7756-8 | ||||||
Reihe: | Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig Herausgeber: Prof. Dr.-Ing. U. Reimers, Prof. Dr.-Ing. T. Kürner und Prof. Dr.-Ing. T. Fingscheidt Braunschweig | ||||||
Band: | 63 | ||||||
Schlagwörter: | Speech Recognition; Decoding; Digital communication; Hidden Markov models; Iterative decoding; Convolutional codes; Speech; Acoustics | ||||||
Publikationsart: | Dissertation | ||||||
Sprache: | Englisch | ||||||
Seiten: | 272 Seiten | ||||||
Abbildungen: | 29 Abbildungen | ||||||
Gewicht: | 405 g | ||||||
Format: | 21 x 14,8 cm | ||||||
Bindung: | Paperback | ||||||
Preis: | 49,80 € | ||||||
Erscheinungsdatum: | Dezember 2020 | ||||||
Kaufen: | |||||||
Weiterempfehlung: | Sie möchten diesen Titel weiterempfehlen? | ||||||
Rezensionsexemplar: | Hier können Sie ein Rezensionsexemplar bestellen. | ||||||
Verlinken: | Sie möchten diese Seite verlinken? Hier klicken. | ||||||
Export Zitat: |
|
||||||
Zusammenfassung: | Be it Siri or Amazon Echo - automatic speech recognition is making its way into our lives and despite astonishing improvements in recognition in general, it is still far from being as good as human speech comprehension. In order to open up possible paths for more robust and possibly distributed speech recognition systems, the PhD thesis "Contributions to Turbo Automatic Speech Recognition" deals with a novel method for iterative optimal information fusion. A fusion is always necessary and profitable when different information sources are to be combined in a statistically optimal way. This can be the combination of audio (speech recognition) and video (lip reading), but also the combination of two similar sensors (two microphones, or for humans the right and left ear). The chosen approach represents the consequent application of the turbo code principle known from communications to questions of automatic speech recognition with multiple data streams. As a major innovation, the PhD thesis presents a so-called modified Viterbi algorithm, which provides a novel information representation for iterative feedback. Two individual recognizers repeatedly evaluate their respective input signal of the underlying speech utterance and exchange information from iteration to iteration, thus moving step by step towards a jointly improved recognition result. |