City Research Online - Big Chord Data Extraction and Mining

Big Chord Data Extraction and Mining

Barthet, M., Plumbley, M. D., Kachkaev, A. , Dykes, J., Wolff, D. & Weyde, T. (2014). Big Chord Data Extraction and Mining. Paper presented at the 9th Conference on Interdisciplinary Musicology – CIM14, 03-12-2014 - 06-12-2014, Staatliches Institut für Musikforschung, Berlin, Germany.

Abstract

Harmonic progression is one of the cornerstones of tonal music composition and is thereby essential to many musical styles and traditions. Previous studies have shown that musical genres and composers could be discriminated based on chord progressions modeled as chord n-grams. These studies were however conducted on small-scale datasets and using symbolic music transcriptions.

In this work, we apply pattern mining techniques to over 200,000 chord progression sequences out of 1,000,000 extracted from the I Like Music (ILM) commercial music audio collection. The ILM collection spans 37 musical genres and includes pieces released between 1907 and 2013. We developed a single program multiple data parallel computing approach whereby audio feature extraction tasks are split up and run simultaneously on multiple cores. An audio-based chord recognition model (Vamp plugin Chordino) was used to extract the chord progressions from the ILM set. To keep low-weight feature sets, the chord data were stored using a compact binary format. We used the CM-SPADE algorithm, which performs a vertical mining of sequential patterns using co-occurence information, and which is fast and efﬁcient enough to be applied to big data collections like the ILM set. In orderto derive key-independent frequent patterns, the transition between chords are modeled by changes of qualities (e.g. major, minor, etc.) and root keys (e.g. fourth, ﬁfth, etc.). The resulting key-independent chord progression patterns vary in length (from 2 to 16) and frequency (from 2 to 19,820) across genres. As illustrated by graphs generated to represent frequent 4-chord progressions, some patterns like circle-of-ﬁfths movements are well represented in most genres but in varying degrees.

These large-scale results offer the opportunity to uncover similarities and discrepancies between sets of musical pieces and therefore to build classiﬁers for search and recommendation. They also support the empirical testing of music theory. It is however more difﬁcult to derive new hypotheses from such dataset due to its size. This can be addressed by using pattern detection algorithms or suitable visualisation which we present in a companion study.

Publication Type:	Conference or Workshop Item (Paper)
Subjects:	M Music and Books on Music Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments:	School of Science & Technology > Computer Science School of Science & Technology > Computer Science > giCentre

[thumbnail of Barthet_et_al_BigChordDataExtractionAndMining_CIM14_proceedings.pdf]

Preview

PDF - Published Version
Download (468kB) | Preview

Official URL: http://www.sim.spk-berlin.de/cim14_919.html

Export

Downloads

Downloads per month over past year

View more statistics

Metadata

Altmetric

CORE (COnnecting REpositories)

Actions (login required)

Admin Login

Creators:	Barthet, M. Plumbley, M. D. Kachkaev, A. Dykes, J. Wolff, D. Weyde, T.
Event Title:	9th Conference on Interdisciplinary Musicology – CIM14
Event Type:	Conference
Event Location:	Staatliches Institut für Musikforschung, Berlin, Germany
Event Dates:	03-12-2014 - 06-12-2014
Status:	Published
Refereed:	Yes
URI:	https://openaccess.city.ac.uk/id/eprint/5803
Date available in CRO:	15 Apr 2015 08:19
Date deposited:	27 July 2017
Dates:	Date Event 2014 Published