Music genre classification: a multilinear approach
Panagakis, I., Benetos, E. & Kotropoulos, C. (2008). Music genre classification: a multilinear approach. In: Bello, JP, Chew, E & Turnbull, D (Eds.), ISMIR. International Symposium Music Information Retrieval, 14 - 18 September 2008, Philadelphia, USA.
Abstract
In this paper, music genre classification is addressed in a multilinear perspective. Inspired by a model of auditory cortical processing, multiscale spectro-temporal modulation features are extracted. Such spectro-temporal modulation features have been successfully used in various content- based audio classification tasks recently, but not yet in music genre classification. Each recording is represented by a third-order feature tensor generated by the auditory model. Thus, the ensemble of recordings is represented by a fourth-order data tensor created by stacking the third-order feature tensors associated to the recordings. To handle large data tensors and derive compact feature vectors suitable for classification, three multilinear subspace techniques are examined, namely the Non-Negative Tensor Factorization (NTF), the High-Order Singular Value Decomposition (HOSVD), and the Multilinear Principal Component Analysis (MPCA). Classification is performed by a Support Vector Machine. Stratified cross-validation tests on the GTZAN dataset and the ISMIR 2004 Genre one demonstrate the advantages of NTF and HOSVD versus MPCA. The best accuracies obtained by the proposed multilinear approach is comparable with those achieved by state-of-the-art music genre classification algorithms.
Publication Type: | Conference or Workshop Item (Paper) |
---|---|
Subjects: | M Music and Books on Music > M Music Q Science > QA Mathematics > QA76 Computer software |
Departments: | School of Science & Technology > Computer Science |
Download (190kB) | Preview
Export
Downloads
Downloads per month over past year