Mask-Aware Semi-Supervised Object Detection in Floor Plans

Published:2022-09-20 Issue:19 Volume:12 Page:9398
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Shehzadi Tahira, Hashmi Khurram Azeem, Pagani Alain, Liwicki Marcus, Stricker Didier, Afzal Muhammad Zeshan

Abstract

Research on object detection using semi-supervised methods has been growing in the past few years. We examine the intersection of these two areas, object detection and semi-supervised learning, for floor-plan objects, with the goal of detecting objects more accurately with less labeled data. The floor-plan objects include different furniture items with multiple types of the same class, and this high inter-class similarity impacts the performance of prior methods. In this paper, we present a Mask R-CNN-based semi-supervised approach that provides pixel-to-pixel alignment to generate individual annotation masks for each class to mine the inter-class similarity. The semi-supervised approach has a student–teacher network that pulls information from the teacher network and feeds it to the student network. The teacher network uses unlabeled data to form pseudo-boxes, and the student network trains on both unlabeled data with the pseudo-boxes and labeled data with the ground truth. It learns representations of furniture items by combining labeled and unlabeled data. On the Mask R-CNN detector with a ResNet-101 backbone network, the proposed approach achieves a mAP of 98.8%, 99.7%, and 99.8% with only 1%, 5%, and 10% labeled data, respectively. Our experiments affirm the efficiency of the proposed approach, as it outperforms the previous semi-supervised approaches using only 1% of the labels.
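The teacher-student data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the confidence threshold of 0.7, the `Detection` and `Sample` types, and the `toy_teacher` stand-in are all hypothetical names and values chosen for illustration, and a real system would use a Mask R-CNN teacher rather than a hard-coded function.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


@dataclass
class Detection:
    box: Box
    label: str
    score: float  # confidence in [0, 1]; 1.0 for human annotations


@dataclass
class Sample:
    image_id: str
    targets: List[Detection]
    is_pseudo: bool  # True when targets come from the teacher


def make_pseudo_targets(detections: List[Detection],
                        threshold: float = 0.7) -> List[Detection]:
    """Keep only teacher detections confident enough to serve as pseudo-boxes.

    The threshold value is an assumption, not taken from the paper.
    """
    return [d for d in detections if d.score >= threshold]


def build_student_batch(labeled: List[Sample],
                        unlabeled_ids: List[str],
                        teacher: Callable[[str], List[Detection]],
                        threshold: float = 0.7) -> List[Sample]:
    """Combine ground-truth samples with pseudo-labeled unlabeled images."""
    batch = list(labeled)
    for image_id in unlabeled_ids:
        pseudo = make_pseudo_targets(teacher(image_id), threshold)
        if pseudo:  # skip images where the teacher found nothing reliable
            batch.append(Sample(image_id, pseudo, is_pseudo=True))
    return batch


def toy_teacher(image_id: str) -> List[Detection]:
    """Hypothetical stand-in for a trained teacher detector."""
    return [
        Detection((0.0, 0.0, 10.0, 10.0), "sofa", 0.95),
        Detection((5.0, 5.0, 8.0, 8.0), "table", 0.30),
    ]


labeled = [Sample("plan_001", [Detection((1.0, 1.0, 4.0, 4.0), "bed", 1.0)], False)]
batch = build_student_batch(labeled, ["plan_002"], toy_teacher)
```

Here the student's batch holds one ground-truth sample and one pseudo-labeled sample, and the low-confidence "table" detection is discarded before it can act as a training target.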

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/12/19/9398/pdf

References: 69 articles.

1. ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring;Berthelot;arXiv,2019

2. FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence;Sohn;Adv. Neural Inf. Process. Syst.,2020

3. Best practices for convolutional neural networks applied to visual document analysis

4. ImageNet classification with deep convolutional neural networks

5. Self-training with Noisy Student improves ImageNet classification;Xie;arXiv,2019

Cited by 9 articles.

1. Symbol Detection in Mechanical Engineering Sketches: Experimental Study on Principle Sketches with Synthetic Data Generation and Deep Learning;Applied Sciences;2024-07-12

2. End-to-end semi-supervised approach with modulated object queries for table detection in documents;International Journal on Document Analysis and Recognition (IJDAR);2024-07-10

3. Towards End-to-End Semi-supervised Table Detection with Semantic Aligned Matching Transformer;Lecture Notes in Computer Science;2024

4. A Hybrid Approach for Document Layout Analysis in Document Images;Lecture Notes in Computer Science;2024

5. UnSupDLA: Towards Unsupervised Document Layout Analysis;Lecture Notes in Computer Science;2024