skip to main content
10.1145/584792.584864acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

A singer identification technique for content-based classification of MP3 music objects

Published:04 November 2002Publication History

ABSTRACT

As there is a growing amount of MP3 music data available on the Internet today, the problems related to music classification and content-based music retrieval are getting more attention recently. In this paper, we propose an approach to automatically classify MP3 music objects according to their singers. First, the coefficients extracted from the output of the polyphase filters are used to compute the MP3 features for segmentation. Based on these features, an MP3 music object can be decomposed into a sequence of notes (or phonemes). Then for each MP3 phoneme in the training set, its MP3 feature is extracted and used to train an MP3 classifier which can identify the singer of an unknown MP3 music object. Experiments are performed and analyzed to show the effectiveness of the proposed method.

References

  1. Bakhmutova, V., V. D. Gusev, and T. N. Titkova, "The Search for Adaptations in Song Melodies," Computer Music Journal, Vol. 21, No. 1, pp. 58--67, Spring 1997.Google ScholarGoogle ScholarCross RefCross Ref
  2. Brandenburg, K., and G. Stoll, "ISO-MPEG-1 Audio: A Generic Standard for Coding of High Quality Digital Audio," Journal of the Audio Engineering Society, Vol. 42, No. 10, Oct 1994, pp. 780--792.Google ScholarGoogle Scholar
  3. Campbell, J.P., Jr., "Speaker Recognition: a Tutorial," Proceedings of the IEEE, Vol. 85, No. 9, Sept. 1997 pp. 1437--1462.Google ScholarGoogle ScholarCross RefCross Ref
  4. Chen, J. C. C. and A. L. P. Chen, "Query by Rhythm: An Approach for Song Retrieval in Music Databases," In Proc. of 8th Intl. Workshop on Research Issues in Data Engineering, pp. 139--146, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chibelushi, C.C., F. Deravi, and J. S. D. Mason, "A Review of Speech-Based Bimodal Recognition," IEEE Trans. On Multimedia, Vol. 4, No. 1, pp. 23--37, March 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chou, T. C., A. L. P. Chen, and C. C. Liu, "Music Databases: Indexing Techniques and Implementation," in Proc. IEEE Intl. Workshop on Multimedia Data Base Management Systems, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Chou, W., and L. Gu, "Robust Singing Detection in Speech/Music Discriminator Design," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 865--868, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Foote, J., "Content-Based Retrieval of Music and Audio", Multimedia Storage and Archiving systems II, Proc. SPIE, Vol.3229, pp. 138--147.Google ScholarGoogle Scholar
  9. Fukunaga, K., An Introduction to Statistical Pattern Recognition, San Diego, CA, Academic Press, 2nd ed., 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ghias, A., Logan, H., Chamberlin, D., and Smith, B. C., "Query by Humming: Musical Information Retrieval in an Audio Database," in Proc. of Third ACM International Conference on Multimedia, pp. 231--236, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hsu, J.L., C.C. Liu and A.L.P. Chen, "Discovering Non-Trivial Repeating Patterns in Music Data," IEEE Transactions on Multimedia, Vol. 3, No. 3, pp. 311--325, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. ISO/IEC 11172-3:1993, "Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s - Part 3: Audio."Google ScholarGoogle Scholar
  13. Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "Music Retrieval by Humming," in Proceedings of IEEE PACRIM'99, pp. 404--407, 1999.Google ScholarGoogle Scholar
  14. Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "A Practical Query-By-Humming System for a Large Music Database," In Proc. ACM Multimedia, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lambrou, T. et al., "Classification of Audio Signals Using Statistical Features on Time and Wavelet Transform Domains," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3621--3624, 1998.Google ScholarGoogle Scholar
  16. Li, S. Z., "Content-Based Audio Classification and Retrieval Using the Nearest Feature Line Method," IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 5, pp. 619--625, Sept. 2000.Google ScholarGoogle ScholarCross RefCross Ref
  17. Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "Efficient Theme and Non-Trivial Repeating Pattern Discovering in Music Databases," in Proc. of IEEE Intl. Conf. on Data Engineering, pp. 14--21, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "An Approximate String Matching Algorithm for Content-Based Music Data Retrieval," in Proc. of IEEE Intl. Conf. on Multimedia Computing and Systems, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Liu, C. C., and Wei-Yi Kuo, "Content-Based Segmentation of MP3 Music Objects," in Proc. of the Workshop on the 21st Century Digital Life and Internet Technologies, 2001.Google ScholarGoogle Scholar
  20. Liu, C. C. and Po-Jun Tsai, "Content-Based Retrieval of MP3 Music Objects," in Proc. of the ACM Intl. Conf. on Information and Knowledge Management (CIKM 2001), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Liu, Z. et al., "Audio Feature Extraction and Analysis for Scene Classification," in Proc. IEEE First Workshop on Multimedia Signal Processing, pp. 343--348, 1997.Google ScholarGoogle Scholar
  22. Liu, Z. and Q. Huang., "Classification of Audio Events in Broadcast News," in Proc. IEEE Workshop on Multimedia Signal Processing, pp. 364--369, 1998.Google ScholarGoogle Scholar
  23. Lu, G.J. and T. Hankinson, "A Technique Towards Automatic Audio Classification and Retrieval," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 1142--1145, 1998.Google ScholarGoogle Scholar
  24. Lu, G.J. and T. Hankinson, "An Investigation of Automatic Audio Classification and Segmentation," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 776--781, 2000.Google ScholarGoogle Scholar
  25. Martin, K. D., and Y. E. Kim, "2pMU9. Musical instrument identification : A pattern-recognition approach," in the 136th meeting of the Acoustical Society of America, October 13, 1998.Google ScholarGoogle Scholar
  26. Melih, K., and R. Gonzalez, "Audio Retrieval Using Perceptually Based Structures", in Proc. of IEEE International Conference on Multimedia Computing and system, pp 338--347, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Melih, K., and R. Gonzalez, "Audio Source Type Segmentation Using a Perceptually Based Representation," in ISSPA 99, Brisbane, Australia, 22--25 August, 1999.Google ScholarGoogle Scholar
  28. Mo, J. S., C. H. Han, and Y. S. Kim, "A Melody-Based Similarity Computation Algorithm for Musical Information," in Proc. of Knowledge and Data Engineering Exchange Workshop ?KDEX '99?, pp. 114--121, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Moreno, P.J. and R. Rifkin, "Using The Fisher Kernel Method for Web Audio Classification," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 4, pp. 2417--2420, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Noll, P., "MPEG Digital Audio Coding," IEEE Signal Processing Magazine, Vol. 14, No. 5, pp. 59--81, Sept. 1997.Google ScholarGoogle ScholarCross RefCross Ref
  31. Painter, T. and A. Spanias, "Perceptual Coding of Digital Audio," Proceedings of the IEEE , Vol. 88, No. 4 , pp. 451--515, April 2000.Google ScholarGoogle ScholarCross RefCross Ref
  32. Pan, D., "A Tutorial on MPEG/Audio Compression," IEEE Multimedia Magazine, Vol. 2, No. 2, pp. 60--74, Summer 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Rolland, P. Y., G Raskinis, and J. G. Ganascia, "Musical Content-Based Retrieval: an Overview of the Melodiscov Approach and System," In Proc. ACM Multimedia 99, pp. 81--84, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Saunders, J., "Real-Time Discrimination of Broadcast Speech/Music," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 993--996, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Scheirer, E. and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 1331--1334, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Smith, G., H. Murase, H. Kashino, "Quick Audio Retrieval Using Active Search", in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3777--3780, 1998.Google ScholarGoogle Scholar
  37. Tsai, Po-Jun and Chih-Chin Liu, "An MP3 Search Engine on the Internet", in Proc. of 2000 Workshop on Internet & Distributed Systems, Vol. 1, pp. 18--27, 2000.Google ScholarGoogle Scholar
  38. Tzanetakis, G., G. Essl, and P. Cook, "Automatic Musical Genre Classification of Audio Signals," in Proc. Int. Symposium on Music Information Retrieval (ISMIR), Bloomington, Indiana, 2001.Google ScholarGoogle Scholar
  39. Tzanetakis, G., and P. Cook, "A Framework for Audio Analysis Based on Classification and Temporal Segmentation," in Proc. EUROMICRO Conf., Vol. 2, pp. 61--67, 1999.Google ScholarGoogle Scholar
  40. Wold, E., T. Blum, D. Keislar, and J. Wheaton, "Contented-Based Classification, Search, and Retrieval of Audio", IEEE Multimedia Vol. 3, No. 3, pp. 27--36, Fall 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Zhang, T. and C.-C.J. Kuo, "Hierarchical Classification of Audio Data for Archiving and Retrieving," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3001--3004, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A singer identification technique for content-based classification of MP3 music objects

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader