Singing voice separation with deep U-Net convolutional networks

Jansson, A.; Humphrey, E.; Montecchio, N.; Bittner, R.; Kumar, A.; Weyde, T.

Jansson, A., Humphrey, E., Montecchio, N., Bittner, R., Kumar, A. & Weyde, T. ORCID: 0000-0001-8028-9905 (2017). Singing voice separation with deep U-Net convolutional networks. Paper presented at the 18th International Society for Music Information Retrieval Conference, 23-27 Oct 2017, Suzhou, China.

Abstract

The decomposition of a music audio signal into its vocal and backing track components is analogous to image-to-image translation, where a mixed spectrogram is transformed into its constituent sources. We propose a novel application of the U-Net architecture — initially developed for medical imaging — for the task of source separation, given its proven capacity for recreating the fine, low-level detail required for high-quality audio reproduction. Through both quantitative evaluation and subjective assessment, experiments demonstrate that the proposed algorithm achieves state-of-the-art performance.
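The image-to-image framing above can be illustrated with a small sketch. It assumes one common formulation that this record does not itself state: a network predicts a soft mask with values in [0, 1], which is multiplied element-wise with the mixture's magnitude spectrogram. The function name `separate_with_mask`, the toy arrays, and the use of an ideal ratio mask in place of a network prediction are all hypothetical choices made for illustration.

```python
import numpy as np


def separate_with_mask(mix_mag, vocal_mask):
    """Split a mixture magnitude spectrogram into vocal and backing estimates.

    vocal_mask is assumed to be a soft mask in [0, 1] with the same shape
    as mix_mag (frequency bins x time frames).
    """
    vocal_mask = np.clip(vocal_mask, 0.0, 1.0)
    vocal_est = vocal_mask * mix_mag
    backing_est = (1.0 - vocal_mask) * mix_mag
    return vocal_est, backing_est


rng = np.random.default_rng(0)

# Toy non-negative "spectrograms"; these are random numbers, not real audio.
vocal_mag = rng.random((8, 16))
backing_mag = rng.random((8, 16))

# Simplifying assumption: source magnitudes add up to the mixture magnitude.
mix_mag = vocal_mag + backing_mag

# Stand-in for a network output: the ideal ratio mask from the known sources.
ideal_mask = vocal_mag / (mix_mag + 1e-8)

vocal_est, backing_est = separate_with_mask(mix_mag, ideal_mask)
```

In this toy setting the two estimates sum back to the mixture by construction. In practice the mask would come from a trained model such as a U-Net, and the estimates would be converted back to audio with a phase estimate.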

Publication Type:	Conference or Workshop Item (Paper)
Departments:	School of Communication & Creativity; School of Communication & Creativity > Department of Media, Culture & Creative Industries

Text - Accepted Version
Available under License Creative Commons: Attribution International Public License 4.0.

Official URL: https://ismir2017.smcnus.org/

Metadata

Creators:	Jansson, A.; Humphrey, E.; Montecchio, N.; Bittner, R.; Kumar, A.; Weyde, T. (ORCID: 0000-0001-8028-9905)
Event Title:	18th International Society for Music Information Retrieval Conference
Event Type:	Conference
Event Location:	Suzhou, China
Event Dates:	23-27 Oct 2017
Status:	Unpublished
Refereed:	Yes
URI:	https://openaccess.city.ac.uk/id/eprint/19289
Date available in CRO:	19 Mar 2018 10:47
Date deposited:	19 March 2018
Dates:	27 October 2017 (Published)