Computer Science / Machine Learning Computer Science / Sound Electrical Engineering and Systems Science / Audio Processing

Audio Source Separation Using Variational Autoencoders and Weak Class Supervision

February 10, 2026

Reading time: 1 minute

...

📝 Original Info

Title: Audio Source Separation Using Variational Autoencoders and Weak Class Supervision
ArXiv ID: 1810.13104
Date: 2019-08-06
Authors: Ertuu{g} Karamatl{i}, Ali Taylan Cemgil, Serap K{i}rb{i}z

📝 Abstract

In this paper, we propose a source separation method that is trained by observing the mixtures and the class labels of the sources present in the mixture without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario as weak class supervision. We associate a variational autoencoder (VAE) with each source class within a non-negative (compositional) model. Each VAE provides a prior model to identify the signal from its associated class in a sound mixture. After training the model on mixtures, we obtain a generative model for each source class and demonstrate our method on one-second mixtures of utterances of digits from 0 to 9. We show that the separation performance obtained by source class supervision is as good as the performance obtained by source signal supervision.

📄 Full Content

📄 Read Full PDF on ArXiv

Reference

This content is AI-processed based on open access ArXiv data.

Audio Source Separation Using Variational Autoencoders and Weak Class Supervision

📝 Original Info

📝 Abstract

📄 Full Content

Reference

Table of Contents

Table of Contents

📝 Original Info

📝 Abstract

📄 Full Content

Reference

Start searching

No results found