Depthwise Convolution Is All You Need for Learning Multiple Visual Domains-Reference-Cited by-同舟云学术

Depthwise Convolution Is All You Need for Learning Multiple Visual Domains

Published:2019-07-17 Issue: Volume:33 Page:8368-8375
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Guo Yunhui,Li Yandong,Wang Liqiang,Rosing Tajana

Abstract

There is a growing interest in designing models that can deal with images from different visual domains. If there exists a universal structure in different visual domains that can be captured via a common parameterization, then we can use a single model for all domains rather than one model per domain. A model aware of the relationships between different domains can also be trained to work on new domains with less resources. However, to identify the reusable structure in a model is not easy. In this paper, we propose a multi-domain learning architecture based on depthwise separable convolution. The proposed approach is based on the assumption that images from different domains share cross-channel correlations but have domain-specific spatial correlations. The proposed model is compact and has minimal overhead when being applied to new domains. Additionally, we introduce a gating mechanism to promote soft sharing between different domains. We evaluate our approach on Visual Decathlon Challenge, a benchmark for testing the ability of multi-domain models. The experiments show that our approach can achieve the highest score while only requiring 50% of the parameters compared with the state-of-the-art approaches.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 64 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TMPSformer: An Efficient Hybrid Transformer-MLP Network for Polyp Segmentation;Mobile Networks and Applications;2024-09-10

2. Adaptive condition-aware high-dimensional decoupling remote sensing image object detection algorithm;Scientific Reports;2024-08-29

3. Agricultural object detection with You Only Look Once (YOLO) Algorithm: A bibliometric and systematic literature review;Computers and Electronics in Agriculture;2024-08

4. Multiscale PatchTCN-Mixer: A New Method for Extracting Spatial and Temporal Degradation Information in Remaining Useful Life Prognosis;IEEE Sensors Journal;2024-08-01

5. Online inspection of blackheart in potatoes using visible-near infrared spectroscopy and interpretable spectrogram-based modified ResNet modeling;Frontiers in Plant Science;2024-06-07