The “Horse” Inside

Published:2016-12-30 Issue:2 Volume:14 Page:1-32
ISSN:1544-3574
Container-title:Computers in Entertainment
language:en
Short-container-title:Comput. Entertain.

Author:

Sturm Bob L.¹

Affiliation:

1. School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK

Abstract

Building systems that possess the sensitivity and intelligence to identify and describe high-level attributes in music audio signals continues to be an elusive goal but one that surely has broad and deep implications for a wide variety of applications. Hundreds of articles have so far been published toward this goal, and great progress appears to have been made. Some systems produce remarkable accuracies at recognizing high-level semantic concepts, such as music style, genre, and mood. However, it might be that these numbers do not mean what they seem. In this article, we take a state-of-the-art music content analysis system and investigate what causes it to achieve exceptionally high performance in a benchmark music audio dataset. We dissect the system to understand its operation, determine its sensitivities and limitations, and predict the kinds of knowledge it could and could not possess about music. We perform a series of experiments to illuminate what the system has actually learned to do and to what extent it is performing the intended music listening task. Our results demonstrate how the initial manifestation of music intelligence in this state of the art can be deceptive. Our work provides constructive directions toward developing music content analysis systems that can address the music information and creation needs of real-world users.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications

Link

https://dl.acm.org/doi/pdf/10.1145/2967507

References: 83 articles.

1. S. Argamon, K. Burns, and S. Dubnov (Eds.). 2010. The Structure of Style: Algorithmic Approaches to Understanding Manner and Meaning. Springer.

2. Evolution and the Brain: Frontiers in Linguistic Series;Aucouturier J.

3. Aggregate features and ADABOOST for music classification

4. Content-Based Music Information Retrieval: Current Directions and Future Challenges

Cited by 10 articles.

1. Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14

2. ‘We are Not Groupies… We are Band Aids’: Assessment Reliability in the AI Song Contest;Transactions of the International Society for Music Information Retrieval;2021-12-03

3. How to Design a Relevant Corpus for Sleepiness Detection Through Voice?;Frontiers in Digital Health;2021-09-22

4. Beyond the Creative Species;2021-02-23

5. Sociocultural and Design Perspectives on AI-Based Music Production: Why Do We Make Music and What Changes if AI Makes It for Us?;Handbook of Artificial Intelligence for Music;2021