Joint Sound Source Separation and Speaker Recognition

Title: Joint Sound Source Separation and Speaker Recognition
Publication Type: Conference Paper
Year of Publication: 2016
Authors: Zegers, J, Van hamme, H
Conference Name: Interspeech 2016
Conference Location: San Francisco, USA

Non-negative Matrix Factorization (NMF) has already been applied to learn speaker characterizations from single or non-simultaneous speech for speaker recognition applications. It is also known for its good performance in (blind) source separation for simultaneous speech. This paper explains how NMF can be used to jointly solve the two problems in a multichannel speaker recognizer for simultaneous speech. It is shown how state-of-the-art multichannel NMF for blind source separation can be easily extended to incorporate speaker recognition. Experiments on the CHiME corpus show that this method outperforms the sequential approach of first applying source separation, followed by speaker recognition that uses state-of-the-art i-vector techniques.
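
For readers unfamiliar with NMF, the basic factorization the abstract builds on can be sketched as follows. This is a minimal, generic single-channel illustration, not the multichannel joint model of the paper: it assumes a non-negative magnitude spectrogram V, approximates it as W times H using the standard multiplicative updates for the Kullback-Leibler divergence, and uses hypothetical names (nmf_kl, kl_divergence, rank, n_iter) chosen here for illustration only.

```python
import numpy as np


def kl_divergence(V, WH, eps=1e-12):
    """Generalized Kullback-Leibler divergence between V and its approximation WH."""
    return float(np.sum(V * np.log((V + eps) / (WH + eps)) - V + WH))


def nmf_kl(V, rank, n_iter=200, eps=1e-12, seed=0):
    """Factorize a non-negative matrix V (freq x frames) as W @ H.

    W (freq x rank) holds spectral basis vectors and H (rank x frames)
    holds their non-negative activations over time. Uses the standard
    multiplicative updates for the KL divergence.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    # Random strictly positive initialization (assumption: no prior dictionaries).
    W = rng.random((n_freq, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    ones = np.ones_like(V)
    for _ in range(n_iter):
        # Update activations with the dictionary held fixed.
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (W.T @ ones + eps)
        # Update dictionary with the activations held fixed.
        WH = W @ H + eps
        W *= ((V / WH) @ H.T) / (ones @ H.T + eps)
    return W, H


if __name__ == "__main__":
    # Synthetic non-negative data standing in for a magnitude spectrogram.
    rng = np.random.default_rng(1)
    V = rng.random((20, 5)) @ rng.random((5, 30))
    W, H = nmf_kl(V, rank=5)
    print("KL divergence after fitting:", kl_divergence(V, W @ H))
```

In a separation or recognition setting, the columns of W would typically be associated with individual sources or speakers; how the paper ties these together across channels is not shown in this sketch.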