Classification of Music Genres Based on Music Separation into Harmonic and Drum Components

Aldona ROSNER; Björn SCHULLER; Bożena KOSTEK

doi:10.2478/aoa-2014-0068

Authors

Aldona ROSNER Institute of Informatics, Silesian University of Technology, Poland
Björn SCHULLER Technische Universität München, Germany
Bożena KOSTEK Audio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland

Abstract

This article presents a study on music genre classification based on the separation of music into harmonic and drum components. Audio signal separation is used to extend the overall parameter vector with new descriptors extracted from the harmonic and/or drum content. The study uses the ISMIS database of music files, each represented by a vector of parameters containing music features. Genre classification is performed with a Support Vector Machine (SVM) classifier and a co-training method adapted for the standard SVM. Additional experiments with reduced feature vectors improved the overall result. Finally, results and conclusions drawn from the study are presented, and suggestions for further work are outlined.

Keywords:

Music Information Retrieval, musical sound separation, drum separation, music genre classification, Support Vector Machine, co-training, Non-Negative Matrix Factorization.

