Article

A singer identification technique for content-based classification of MP3 music objects

Authors:
Chih-Chin Liu

Chung Hua University, Hsinchu, Taiwan

Chung Hua University, Hsinchu, Taiwan
View Profile

,
Chuan-Sung Huang

Chung Hua University, Hsinchu, Taiwan

Chung Hua University, Hsinchu, Taiwan
View Profile

CIKM '02: Proceedings of the eleventh international conference on Information and knowledge managementNovember 2002Pages 438–445https://doi.org/10.1145/584792.584864

Published:04 November 2002Publication History

CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management

Pages 438–445

ABSTRACT

As there is a growing amount of MP3 music data available on the Internet today, the problems related to music classification and content-based music retrieval are getting more attention recently. In this paper, we propose an approach to automatically classify MP3 music objects according to their singers. First, the coefficients extracted from the output of the polyphase filters are used to compute the MP3 features for segmentation. Based on these features, an MP3 music object can be decomposed into a sequence of notes (or phonemes). Then for each MP3 phoneme in the training set, its MP3 feature is extracted and used to train an MP3 classifier which can identify the singer of an unknown MP3 music object. Experiments are performed and analyzed to show the effectiveness of the proposed method.

References

Bakhmutova, V., V. D. Gusev, and T. N. Titkova, "The Search for Adaptations in Song Melodies," Computer Music Journal, Vol. 21, No. 1, pp. 58--67, Spring 1997.Google ScholarCross Ref
Brandenburg, K., and G. Stoll, "ISO-MPEG-1 Audio: A Generic Standard for Coding of High Quality Digital Audio," Journal of the Audio Engineering Society, Vol. 42, No. 10, Oct 1994, pp. 780--792.Google Scholar
Campbell, J.P., Jr., "Speaker Recognition: a Tutorial," Proceedings of the IEEE, Vol. 85, No. 9, Sept. 1997 pp. 1437--1462.Google ScholarCross Ref
Chen, J. C. C. and A. L. P. Chen, "Query by Rhythm: An Approach for Song Retrieval in Music Databases," In Proc. of 8th Intl. Workshop on Research Issues in Data Engineering, pp. 139--146, 1998. Google ScholarDigital Library
Chibelushi, C.C., F. Deravi, and J. S. D. Mason, "A Review of Speech-Based Bimodal Recognition," IEEE Trans. On Multimedia, Vol. 4, No. 1, pp. 23--37, March 2002. Google ScholarDigital Library
Chou, T. C., A. L. P. Chen, and C. C. Liu, "Music Databases: Indexing Techniques and Implementation," in Proc. IEEE Intl. Workshop on Multimedia Data Base Management Systems, 1996. Google ScholarDigital Library
Chou, W., and L. Gu, "Robust Singing Detection in Speech/Music Discriminator Design," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 865--868, 2001. Google ScholarDigital Library
Foote, J., "Content-Based Retrieval of Music and Audio", Multimedia Storage and Archiving systems II, Proc. SPIE, Vol.3229, pp. 138--147.Google Scholar
Fukunaga, K., An Introduction to Statistical Pattern Recognition, San Diego, CA, Academic Press, 2nd ed., 1990. Google ScholarDigital Library
Ghias, A., Logan, H., Chamberlin, D., and Smith, B. C., "Query by Humming: Musical Information Retrieval in an Audio Database," in Proc. of Third ACM International Conference on Multimedia, pp. 231--236, 1995. Google ScholarDigital Library
Hsu, J.L., C.C. Liu and A.L.P. Chen, "Discovering Non-Trivial Repeating Patterns in Music Data," IEEE Transactions on Multimedia, Vol. 3, No. 3, pp. 311--325, 2001. Google ScholarDigital Library
ISO/IEC 11172-3:1993, "Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s - Part 3: Audio."Google Scholar
Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "Music Retrieval by Humming," in Proceedings of IEEE PACRIM'99, pp. 404--407, 1999.Google Scholar
Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "A Practical Query-By-Humming System for a Large Music Database," In Proc. ACM Multimedia, 2000. Google ScholarDigital Library
Lambrou, T. et al., "Classification of Audio Signals Using Statistical Features on Time and Wavelet Transform Domains," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3621--3624, 1998.Google Scholar
Li, S. Z., "Content-Based Audio Classification and Retrieval Using the Nearest Feature Line Method," IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 5, pp. 619--625, Sept. 2000.Google ScholarCross Ref
Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "Efficient Theme and Non-Trivial Repeating Pattern Discovering in Music Databases," in Proc. of IEEE Intl. Conf. on Data Engineering, pp. 14--21, 1999. Google ScholarDigital Library
Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "An Approximate String Matching Algorithm for Content-Based Music Data Retrieval," in Proc. of IEEE Intl. Conf. on Multimedia Computing and Systems, 1999. Google ScholarDigital Library
Liu, C. C., and Wei-Yi Kuo, "Content-Based Segmentation of MP3 Music Objects," in Proc. of the Workshop on the 21st Century Digital Life and Internet Technologies, 2001.Google Scholar
Liu, C. C. and Po-Jun Tsai, "Content-Based Retrieval of MP3 Music Objects," in Proc. of the ACM Intl. Conf. on Information and Knowledge Management (CIKM 2001), 2001. Google ScholarDigital Library
Liu, Z. et al., "Audio Feature Extraction and Analysis for Scene Classification," in Proc. IEEE First Workshop on Multimedia Signal Processing, pp. 343--348, 1997.Google Scholar
Liu, Z. and Q. Huang., "Classification of Audio Events in Broadcast News," in Proc. IEEE Workshop on Multimedia Signal Processing, pp. 364--369, 1998.Google Scholar
Lu, G.J. and T. Hankinson, "A Technique Towards Automatic Audio Classification and Retrieval," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 1142--1145, 1998.Google Scholar
Lu, G.J. and T. Hankinson, "An Investigation of Automatic Audio Classification and Segmentation," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 776--781, 2000.Google Scholar
Martin, K. D., and Y. E. Kim, "2pMU9. Musical instrument identification : A pattern-recognition approach," in the 136th meeting of the Acoustical Society of America, October 13, 1998.Google Scholar
Melih, K., and R. Gonzalez, "Audio Retrieval Using Perceptually Based Structures", in Proc. of IEEE International Conference on Multimedia Computing and system, pp 338--347, 1998. Google ScholarDigital Library
Melih, K., and R. Gonzalez, "Audio Source Type Segmentation Using a Perceptually Based Representation," in ISSPA 99, Brisbane, Australia, 22--25 August, 1999.Google Scholar
Mo, J. S., C. H. Han, and Y. S. Kim, "A Melody-Based Similarity Computation Algorithm for Musical Information," in Proc. of Knowledge and Data Engineering Exchange Workshop ?KDEX '99?, pp. 114--121, 1999. Google ScholarDigital Library
Moreno, P.J. and R. Rifkin, "Using The Fisher Kernel Method for Web Audio Classification," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 4, pp. 2417--2420, 2000. Google ScholarDigital Library
Noll, P., "MPEG Digital Audio Coding," IEEE Signal Processing Magazine, Vol. 14, No. 5, pp. 59--81, Sept. 1997.Google ScholarCross Ref
Painter, T. and A. Spanias, "Perceptual Coding of Digital Audio," Proceedings of the IEEE , Vol. 88, No. 4 , pp. 451--515, April 2000.Google ScholarCross Ref
Pan, D., "A Tutorial on MPEG/Audio Compression," IEEE Multimedia Magazine, Vol. 2, No. 2, pp. 60--74, Summer 1995. Google ScholarDigital Library
Rolland, P. Y., G Raskinis, and J. G. Ganascia, "Musical Content-Based Retrieval: an Overview of the Melodiscov Approach and System," In Proc. ACM Multimedia 99, pp. 81--84, 1999. Google ScholarDigital Library
Saunders, J., "Real-Time Discrimination of Broadcast Speech/Music," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 993--996, 1996. Google ScholarDigital Library
Scheirer, E. and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 1331--1334, 1997. Google ScholarDigital Library
Smith, G., H. Murase, H. Kashino, "Quick Audio Retrieval Using Active Search", in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3777--3780, 1998.Google Scholar
Tsai, Po-Jun and Chih-Chin Liu, "An MP3 Search Engine on the Internet", in Proc. of 2000 Workshop on Internet & Distributed Systems, Vol. 1, pp. 18--27, 2000.Google Scholar
Tzanetakis, G., G. Essl, and P. Cook, "Automatic Musical Genre Classification of Audio Signals," in Proc. Int. Symposium on Music Information Retrieval (ISMIR), Bloomington, Indiana, 2001.Google Scholar
Tzanetakis, G., and P. Cook, "A Framework for Audio Analysis Based on Classification and Temporal Segmentation," in Proc. EUROMICRO Conf., Vol. 2, pp. 61--67, 1999.Google Scholar
Wold, E., T. Blum, D. Keislar, and J. Wheaton, "Contented-Based Classification, Search, and Retrieval of Audio", IEEE Multimedia Vol. 3, No. 3, pp. 27--36, Fall 1996. Google ScholarDigital Library
Zhang, T. and C.-C.J. Kuo, "Hierarchical Classification of Audio Data for Archiving and Retrieving," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3001--3004, 1999. Google ScholarDigital Library

Index Terms

A singer identification technique for content-based classification of MP3 music objects
1. Information systems
  1. Information retrieval

Recommendations

Content-based retrieval of MP3 music objects
CIKM '01: Proceedings of the tenth international conference on Information and knowledge management

In recent years, the searching and indexing techniques for multimedia data are getting more attention in the area of multimedia databases. As many research works were done on the content-based retrieval of image and video data, less attention was ...
Read More
Music Information Retrieval of Carnatic Songs Based on Carnatic Music Singer Identification
ICCEE '08: Proceedings of the 2008 International Conference on Computer and Electrical Engineering

In this paper, a methodology for Carnatic music singer identification is proposed and implemented. The motive behind identifying the singer is to extend this work for efficient music information retrieval of Carnatic music song based on singer ...
Read More
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval

This paper describes a method of modeling the characteristics of a singing voice from polyphonic musical audio signals including sounds of various musical instruments. Because singing voices play an important role in musical pieces with vocals, such ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
November 2002
704 pages
ISBN:1581134924
DOI:10.1145/584792
General Chair:
Charles Nicholas
University of Maryland Baltimore County
,
Program Chairs:
David Grossman
Illinois Institute of Technology
,
Konstantinos Kalpakis
University of Maryland Baltimore County
,
Sajda Qureshi
Erasmus University, Rotterdam
,
Han van Dissel
Erasmus University, Rotterdam
,
Len Seligman
The MITRE Corporation
Copyright © 2002 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 November 2002
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
MP3
MP3 classification
MP3 databases
content-based music classification
music classification
music databases
music feature extraction
singer identification
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 11
  Total Citations
  View Citations
- 2,094
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

A singer identification technique for content-based classification of MP3 music objects

CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

Content-based retrieval of MP3 music objects

Music Information Retrieval of Carnatic Songs Based on Carnatic Music Singer Identification

A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval