Multimodal Classification: Current Landscape, Taxonomy and Future Directions

Published:2022-12-15
Issue:7
Volume:55
Page:1-31
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Sleeman William C.¹, Kapoor Rishabh¹, Ghosh Preetam¹

Affiliation:

1. Virginia Commonwealth University, Richmond, Virginia, USA

Abstract

Multimodal classification research has been gaining popularity with new datasets in domains such as satellite imagery, biometrics, and medicine. Prior research has shown the benefits of combining data from multiple sources compared to traditional unimodal data that has led to the development of many novel multimodal architectures. However, the lack of consistent terminologies and architectural descriptions makes it difficult to compare different solutions. We address these challenges by proposing a new taxonomy for describing multimodal classification models based on trends found in recent publications. Examples of how this taxonomy could be applied to existing models are presented as well as a checklist to aid in the clear and complete presentation of future models. Many of the most difficult aspects of unimodal classification have not yet been fully addressed for multimodal datasets, including big data, class imbalance, and instance-level difficulty. We also provide a discussion of these challenges and future directions of research.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science, Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3543848

References: 125 articles.

1. MIMETIC: Mobile encrypted traffic classification using multimodal deep learning

2. DISTILLER: Encrypted traffic classification via multimodal multitask deep learning

3. Wavelet-Based Cough Signal Decomposition for Multimodal Classification

4. ECG Heartbeat Classification Using Multimodal Fusion

5. EasyMKL: a scalable multiple kernel learning algorithm

Cited by 41 articles.

1. A review of aquaculture: From single modality analysis to multimodality fusion;Computers and Electronics in Agriculture;2024-11

2. Non-contact multimodal indoor human monitoring systems: A survey;Information Fusion;2024-10

3. Enhanced Feature Representation for Multimodal Fake News Detection Using Localized Fine-Tuning of Improved BERT and VGG-19 Models;Arabian Journal for Science and Engineering;2024-08-07

4. Leveraging small-scale datasets for additive manufacturing process modeling and part certification: Current practice and remaining gaps;Journal of Manufacturing Systems;2024-08

5. Multimodal data integration for oncology in the era of deep neural networks: a review;Frontiers in Artificial Intelligence;2024-07-25