48,80 €
ISBN 978-3-8440-1174-6
Paperback
188 pages
278 g
21 x 14.8 cm
English
Dissertation
August 2012
Martin Spiertz
Underdetermined Blind Source Separation for Audio Signals
Blind source separation is a topic of ongoing interest, either as a pre-processing step for arbitrary audio analysis frameworks or for re-/up-mixing of audio streams. Many state-of-the-art algorithms are based on non-negative tensor factorization (NTF). This thesis addresses one shortcoming of NTF: it separates only individual notes, not whole melodies consisting of several different notes played by a single instrument.

In this thesis, an algorithm for clustering the separated notes into melodies is developed. To this end, audio features and unsupervised clustering algorithms are discussed along with their strengths and weaknesses. Experiments identify well-performing pairs of audio features and clustering algorithms. To reduce the error rate of these clustering algorithms, strategies for combining different clustering algorithms are developed.
The clustering algorithms discussed in this thesis fulfill the following requirements. They operate unsupervised: no human interaction is necessary up to the signal synthesis step. Their robustness is tested on different sets of mixtures to ensure that the parameters are as universally valid as possible. Finally, the proposed approach achieves separation quality comparable to other state-of-the-art blind source separation algorithms while requiring only a fraction of their computation time.
Keywords: Blind Source Separation; Non-Negative Matrix Factorization
Aachen Series on Multimedia and Communications Engineering
Edited by Univ.-Prof. Dr.-Ing. Jens-Rainer Ohm, Aachen
Volume 10
Shaker Verlag GmbH
Am Langen Graben 15a
52353 Düren
  +49 2421 99011 9
Mon. - Thu. 8:00 to 16:00
Fri. 8:00 to 15:00