Simon Receveur

Contributions to Turbo Automatic Speech Recognition


Vorderseite	Rückseite

ISBN:

978-3-8440-7756-8

Reihe:

Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig
Herausgeber: Prof. Dr.-Ing. U. Reimers, Prof. Dr.-Ing. T. Kürner und Prof. Dr.-Ing. T. Fingscheidt
Braunschweig

Band:

Schlagwörter:

Speech Recognition; Decoding; Digital communication; Hidden Markov models; Iterative decoding; Convolutional codes; Speech; Acoustics

Publikationsart:

Dissertation

Sprache:

Englisch

Seiten:

272 Seiten

Abbildungen:

29 Abbildungen

Gewicht:

405 g

Format:

21 x 14,8 cm

Bindung:

Paperback

Preis:

49,80 €

Erscheinungsdatum:

Dezember 2020

Kaufen:

Weiterempfehlung:

Sie möchten diesen Titel weiterempfehlen?

Rezensionsexemplar:

Hier können Sie ein Rezensionsexemplar bestellen.

Verlinken:

Sie möchten diese Seite verlinken? Hier klicken.

Export Zitat:

Text
BibTex
RIS

Zusammenfassung:

Be it Siri or Amazon Echo - automatic speech recognition is making its way into our lives and despite astonishing improvements in recognition in general, it is still far from being as good as human speech comprehension. In order to open up possible paths for more robust and possibly distributed speech recognition systems, the PhD thesis "Contributions to Turbo Automatic Speech Recognition" deals with a novel method for iterative optimal information fusion.
A fusion is always necessary and profitable when different information sources are to be combined in a statistically optimal way. This can be the combination of audio (speech recognition) and video (lip reading), but also the combination of two similar sensors (two microphones, or for humans the right and left ear). The chosen approach represents the consequent application of the turbo code principle known from communications to questions of automatic speech recognition with multiple data streams. As a major innovation, the PhD thesis presents a so-called modified Viterbi algorithm, which provides a novel information representation for iterative feedback. Two individual recognizers repeatedly evaluate their respective input signal of the underlying speech utterance and exchange information from iteration to iteration, thus moving step by step towards a jointly improved recognition result.