With the proliferation of digital audio distribution, audio content analysis is fast becoming a requirement for designers of intelligent signal-adaptive audio processing systems. Written by a well-known expert in the field, this book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike. A review of relevant fundamentals in audio signal processing, psychoacoustics, and music theory is also included, along with downloadable MATLAB files. Please visit the companion website: www.AudioContentAnalysis.org
Cited By
- Sun J, Deng L, Afouras T, Owens A and Davis A (2023). Eventfulness for Interactive Video Alignment, ACM Transactions on Graphics, 42:4, (1-10), Online publication date: 1-Aug-2023.
- Yang P, Kuang S, Wu C and Hsu J Predicting Music Emotion by Using Convolutional Neural Network HCI in Business, Government and Organizations, (266-275)
- Trowitzsch I, Schymura C, Kolossa D and Obermayer K (2019). Joining Sound Event Detection and Localization Through Spatial Segregation, IEEE/ACM Transactions on Audio, Speech and Language Processing, 28, (487-502), Online publication date: 1-Jan-2020.
- Bayle Y, Robine M and Hanna P (2019). SATIN, Multimedia Tools and Applications, 78:3, (2703-2718), Online publication date: 1-Feb-2019.
- Davis A and Agrawala M (2018). Visual rhythm and beat, ACM Transactions on Graphics, 37:4, (1-11), Online publication date: 31-Aug-2018.
- Alam F, Danieli M and Riccardi G (2018). Annotating and modeling empathy in spoken conversations, Computer Speech and Language, 50:C, (40-61), Online publication date: 1-Jul-2018.
- Ordiales H and Bruno M Sound recycling from public databases Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, (1-8)
- Sanchez-Hevia H, Ayllon D, Gil-Pita R and Rosa-Zurera M (2017). Maximum Likelihood Decision Fusion for Weapon Classification in Wireless Acoustic Sensor Networks, IEEE/ACM Transactions on Audio, Speech and Language Processing, 25:6, (1172-1182), Online publication date: 1-Jun-2017.
- Trowitzsch I, Mohr J, Kashef Y and Obermayer K (2017). Robust Detection of Environmental Sounds in Binaural Auditory Scenes, IEEE/ACM Transactions on Audio, Speech and Language Processing, 25:6, (1344-1356), Online publication date: 1-Jun-2017.
- Lu Y, Wu C, Lu C and Lerch A An Unsupervised Approach to Anomaly Detection in Music Datasets Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, (749-752)
- Hupperich T, Hosseini H and Holz T Leveraging Sensor Fingerprinting for Mobile Device Authentication Proceedings of the 13th International Conference on Detection of Intrusions and Malware, and Vulnerability Assessment - Volume 9721, (377-396)
- Zhao H, Chen Y, Wang R and Malik H (2016). Anti-Forensics of Environmental-Signature-Based Audio Splicing Detection and Its Countermeasure via Rich-Features Classification, IEEE Transactions on Information Forensics and Security, 11:7, (1603-1617), Online publication date: 1-Jul-2016.
- Bano S and Cavallaro A (2016). ViComp, Multimedia Tools and Applications, 75:12, (7187-7210), Online publication date: 1-Jun-2016.
- Bretan M and Weinberg G (2016). A survey of robotic musicianship, Communications of the ACM, 59:5, (100-109), Online publication date: 26-Apr-2016.
- Mahesha P and Vinod D Automatic Segmentation and Classification of Dysfluencies in Stuttering Speech Proceedings of the Second International Conference on Information and Communication Technology for Competitive Strategies, (1-6)
- Dimoulas C and Symeonidis A (2015). Syncing Shared Multimedia through Audiovisual Bimodal Segmentation, IEEE MultiMedia, 22:3, (26-42), Online publication date: 1-Jul-2015.
- Abadi M, Abad A, Subramanian R, Rostamzadeh N, Ricci E, Varadarajan J and Sebe N A Multi-task Learning Framework for Time-continuous Emotion Estimation from Crowd Annotations Proceedings of the 2014 International ACM Workshop on Crowdsourcing for Multimedia, (17-23)
- Liu Y, Liu Y, Zhao Y and Hua K What Strikes the Strings of Your Heart? Proceedings of the 22nd ACM international conference on Multimedia, (1069-1072)
- Sturm B A Survey of Evaluation in Music Genre Recognition Adaptive Multimedia Retrieval: Semantics, Context, and Adaptation, (29-66)
Index Terms
- An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics