City Research Online

Automatic transcription of pitched and unpitched sounds from polyphonic music

Benetos, E., Ewert, S. and Weyde, T. (2014). Automatic transcription of pitched and unpitched sounds from polyphonic music. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3107-3111. doi: 10.1109/ICASSP.2014.6854172

Abstract

Automatic transcription of polyphonic music has been an active research field for several years and is considered by many to be a key enabling technology in music signal processing. However, current transcription approaches either focus on detecting pitched sounds (from pitched musical instruments) or on detecting unpitched sounds (from drum kits). In this paper, we propose a method that jointly transcribes pitched and unpitched sounds from polyphonic music recordings. The proposed model extends the probabilistic latent component analysis algorithm and supports the detection of pitched sounds from multiple instruments as well as the detection of unpitched sounds from drum kit components, including bass drums, snare drums, cymbals, hi-hats, and toms. Our experiments based on polyphonic Western music containing both pitched and unpitched instruments led to very encouraging results in multi-pitch detection and drum transcription tasks.

Publication Type: Article
Additional Information: © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Subjects: M Music and Books on Music > M Music
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: School of Mathematics, Computer Science & Engineering > Computer Science
URI: http://openaccess.city.ac.uk/id/eprint/3268
[img]
Preview
Text (Paper) - Accepted Version
Available under License Creative Commons: Attribution 3.0.

Download (104kB) | Preview
[img] Text (MIDI data & README file) - Accepted Version
Available under License Creative Commons: Attribution 3.0.

Download (134kB)

Export

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login