ABSTRACT
As there is a growing amount of MP3 music data available on the Internet today, the problems related to music classification and content-based music retrieval are getting more attention recently. In this paper, we propose an approach to automatically classify MP3 music objects according to their singers. First, the coefficients extracted from the output of the polyphase filters are used to compute the MP3 features for segmentation. Based on these features, an MP3 music object can be decomposed into a sequence of notes (or phonemes). Then for each MP3 phoneme in the training set, its MP3 feature is extracted and used to train an MP3 classifier which can identify the singer of an unknown MP3 music object. Experiments are performed and analyzed to show the effectiveness of the proposed method.
- Bakhmutova, V., V. D. Gusev, and T. N. Titkova, "The Search for Adaptations in Song Melodies," Computer Music Journal, Vol. 21, No. 1, pp. 58--67, Spring 1997.Google ScholarCross Ref
- Brandenburg, K., and G. Stoll, "ISO-MPEG-1 Audio: A Generic Standard for Coding of High Quality Digital Audio," Journal of the Audio Engineering Society, Vol. 42, No. 10, Oct 1994, pp. 780--792.Google Scholar
- Campbell, J.P., Jr., "Speaker Recognition: a Tutorial," Proceedings of the IEEE, Vol. 85, No. 9, Sept. 1997 pp. 1437--1462.Google ScholarCross Ref
- Chen, J. C. C. and A. L. P. Chen, "Query by Rhythm: An Approach for Song Retrieval in Music Databases," In Proc. of 8th Intl. Workshop on Research Issues in Data Engineering, pp. 139--146, 1998. Google ScholarDigital Library
- Chibelushi, C.C., F. Deravi, and J. S. D. Mason, "A Review of Speech-Based Bimodal Recognition," IEEE Trans. On Multimedia, Vol. 4, No. 1, pp. 23--37, March 2002. Google ScholarDigital Library
- Chou, T. C., A. L. P. Chen, and C. C. Liu, "Music Databases: Indexing Techniques and Implementation," in Proc. IEEE Intl. Workshop on Multimedia Data Base Management Systems, 1996. Google ScholarDigital Library
- Chou, W., and L. Gu, "Robust Singing Detection in Speech/Music Discriminator Design," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 865--868, 2001. Google ScholarDigital Library
- Foote, J., "Content-Based Retrieval of Music and Audio", Multimedia Storage and Archiving systems II, Proc. SPIE, Vol.3229, pp. 138--147.Google Scholar
- Fukunaga, K., An Introduction to Statistical Pattern Recognition, San Diego, CA, Academic Press, 2nd ed., 1990. Google ScholarDigital Library
- Ghias, A., Logan, H., Chamberlin, D., and Smith, B. C., "Query by Humming: Musical Information Retrieval in an Audio Database," in Proc. of Third ACM International Conference on Multimedia, pp. 231--236, 1995. Google ScholarDigital Library
- Hsu, J.L., C.C. Liu and A.L.P. Chen, "Discovering Non-Trivial Repeating Patterns in Music Data," IEEE Transactions on Multimedia, Vol. 3, No. 3, pp. 311--325, 2001. Google ScholarDigital Library
- ISO/IEC 11172-3:1993, "Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s - Part 3: Audio."Google Scholar
- Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "Music Retrieval by Humming," in Proceedings of IEEE PACRIM'99, pp. 404--407, 1999.Google Scholar
- Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "A Practical Query-By-Humming System for a Large Music Database," In Proc. ACM Multimedia, 2000. Google ScholarDigital Library
- Lambrou, T. et al., "Classification of Audio Signals Using Statistical Features on Time and Wavelet Transform Domains," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3621--3624, 1998.Google Scholar
- Li, S. Z., "Content-Based Audio Classification and Retrieval Using the Nearest Feature Line Method," IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 5, pp. 619--625, Sept. 2000.Google ScholarCross Ref
- Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "Efficient Theme and Non-Trivial Repeating Pattern Discovering in Music Databases," in Proc. of IEEE Intl. Conf. on Data Engineering, pp. 14--21, 1999. Google ScholarDigital Library
- Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "An Approximate String Matching Algorithm for Content-Based Music Data Retrieval," in Proc. of IEEE Intl. Conf. on Multimedia Computing and Systems, 1999. Google ScholarDigital Library
- Liu, C. C., and Wei-Yi Kuo, "Content-Based Segmentation of MP3 Music Objects," in Proc. of the Workshop on the 21st Century Digital Life and Internet Technologies, 2001.Google Scholar
- Liu, C. C. and Po-Jun Tsai, "Content-Based Retrieval of MP3 Music Objects," in Proc. of the ACM Intl. Conf. on Information and Knowledge Management (CIKM 2001), 2001. Google ScholarDigital Library
- Liu, Z. et al., "Audio Feature Extraction and Analysis for Scene Classification," in Proc. IEEE First Workshop on Multimedia Signal Processing, pp. 343--348, 1997.Google Scholar
- Liu, Z. and Q. Huang., "Classification of Audio Events in Broadcast News," in Proc. IEEE Workshop on Multimedia Signal Processing, pp. 364--369, 1998.Google Scholar
- Lu, G.J. and T. Hankinson, "A Technique Towards Automatic Audio Classification and Retrieval," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 1142--1145, 1998.Google Scholar
- Lu, G.J. and T. Hankinson, "An Investigation of Automatic Audio Classification and Segmentation," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 776--781, 2000.Google Scholar
- Martin, K. D., and Y. E. Kim, "2pMU9. Musical instrument identification : A pattern-recognition approach," in the 136th meeting of the Acoustical Society of America, October 13, 1998.Google Scholar
- Melih, K., and R. Gonzalez, "Audio Retrieval Using Perceptually Based Structures", in Proc. of IEEE International Conference on Multimedia Computing and system, pp 338--347, 1998. Google ScholarDigital Library
- Melih, K., and R. Gonzalez, "Audio Source Type Segmentation Using a Perceptually Based Representation," in ISSPA 99, Brisbane, Australia, 22--25 August, 1999.Google Scholar
- Mo, J. S., C. H. Han, and Y. S. Kim, "A Melody-Based Similarity Computation Algorithm for Musical Information," in Proc. of Knowledge and Data Engineering Exchange Workshop ?KDEX '99?, pp. 114--121, 1999. Google ScholarDigital Library
- Moreno, P.J. and R. Rifkin, "Using The Fisher Kernel Method for Web Audio Classification," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 4, pp. 2417--2420, 2000. Google ScholarDigital Library
- Noll, P., "MPEG Digital Audio Coding," IEEE Signal Processing Magazine, Vol. 14, No. 5, pp. 59--81, Sept. 1997.Google ScholarCross Ref
- Painter, T. and A. Spanias, "Perceptual Coding of Digital Audio," Proceedings of the IEEE , Vol. 88, No. 4 , pp. 451--515, April 2000.Google ScholarCross Ref
- Pan, D., "A Tutorial on MPEG/Audio Compression," IEEE Multimedia Magazine, Vol. 2, No. 2, pp. 60--74, Summer 1995. Google ScholarDigital Library
- Rolland, P. Y., G Raskinis, and J. G. Ganascia, "Musical Content-Based Retrieval: an Overview of the Melodiscov Approach and System," In Proc. ACM Multimedia 99, pp. 81--84, 1999. Google ScholarDigital Library
- Saunders, J., "Real-Time Discrimination of Broadcast Speech/Music," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 993--996, 1996. Google ScholarDigital Library
- Scheirer, E. and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 1331--1334, 1997. Google ScholarDigital Library
- Smith, G., H. Murase, H. Kashino, "Quick Audio Retrieval Using Active Search", in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3777--3780, 1998.Google Scholar
- Tsai, Po-Jun and Chih-Chin Liu, "An MP3 Search Engine on the Internet", in Proc. of 2000 Workshop on Internet & Distributed Systems, Vol. 1, pp. 18--27, 2000.Google Scholar
- Tzanetakis, G., G. Essl, and P. Cook, "Automatic Musical Genre Classification of Audio Signals," in Proc. Int. Symposium on Music Information Retrieval (ISMIR), Bloomington, Indiana, 2001.Google Scholar
- Tzanetakis, G., and P. Cook, "A Framework for Audio Analysis Based on Classification and Temporal Segmentation," in Proc. EUROMICRO Conf., Vol. 2, pp. 61--67, 1999.Google Scholar
- Wold, E., T. Blum, D. Keislar, and J. Wheaton, "Contented-Based Classification, Search, and Retrieval of Audio", IEEE Multimedia Vol. 3, No. 3, pp. 27--36, Fall 1996. Google ScholarDigital Library
- Zhang, T. and C.-C.J. Kuo, "Hierarchical Classification of Audio Data for Archiving and Retrieving," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3001--3004, 1999. Google ScholarDigital Library
Index Terms
- A singer identification technique for content-based classification of MP3 music objects
Recommendations
Content-based retrieval of MP3 music objects
CIKM '01: Proceedings of the tenth international conference on Information and knowledge managementIn recent years, the searching and indexing techniques for multimedia data are getting more attention in the area of multimedia databases. As many research works were done on the content-based retrieval of image and video data, less attention was ...
Music Information Retrieval of Carnatic Songs Based on Carnatic Music Singer Identification
ICCEE '08: Proceedings of the 2008 International Conference on Computer and Electrical EngineeringIn this paper, a methodology for Carnatic music singer identification is proposed and implemented. The motive behind identifying the singer is to extend this work for efficient music information retrieval of Carnatic music song based on singer ...
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval
This paper describes a method of modeling the characteristics of a singing voice from polyphonic musical audio signals including sounds of various musical instruments. Because singing voices play an important role in musical pieces with vocals, such ...
Comments