A Quality Metric for Semantically Transmitted Images in Machine-to-Machine Communications-Reference-Cited by-同舟云学术

A Quality Metric for Semantically Transmitted Images in Machine-to-Machine Communications

Published:2024-08-07 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Gowrisetty Vishnu¹,Lokumarambage Maheshi¹,Samarathunga Prabath¹,Fernando Thanuj¹,Fernando Anil²

Affiliation:

1. University of Strathclyde

2. University of Surrey

Abstract

Semantic communications focus on transmitting information that encapsulates meaning, enabling both machines and humans to understand the intended message with greater accuracy. Unlike traditional communication systems, which send data without considering its semantic value, this approach prioritises the content's meaning and requires a novel metric to gauge semantic quality. Our framework integrates a specialised Vision Transformer (ViT) for semantic segmentation, named SemExT, at the transmission end and a pre-trained Generative Adversarial Network (GAN) for image reconstruction at the receiving end. The system's effectiveness is evaluated by comparing the semantic content of the reconstructed image with the original, using Deceptron2, an advanced object detection model. This comparison establishes a new metric for assessing the quality of semantic transmission. Empirical evidence shows that the semantic quality metric ranges from 90% to 100% for images containing fewer objects and 80% to 98% for those with more objects. In comparison, an autoencoder-based communication system exhibits a range of 80% to 100% for simpler images and 75% to 95% for more complex ones. These findings highlight the robustness of our proposed metric across different semantic communication frameworks, contributing to the advancement of semantic information transmission and setting a foundation for future research in this field.

Publisher

Springer Science and Business Media LLC

Reference37 articles.

1. Alexey Dosovitskiy and Lucas Beyer and Alexander Kolesnikov and Dirk Weissenborn and Xiaohua Zhai and Thomas Unterthiner and Mostafa Dehghani and Matthias Minderer and Georg Heigold and Sylvain Gelly and Jakob Uszkoreit and Neil Houlsby (2021) An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. https://openreview.net/forum?id=YicbFdNTTy, International Conference on Learning Representations

2. Shannon, C.E. and Weaver, W. (1949) The {Mathematical} {Theory} of {Communication}. University of Illinois Press, Urbana

3. Goodfellow, Ian (2016) Nips 2016 tutorial: Generative adversarial networks. arXiv preprint arXiv:1701.00160

4. Strudel, Robin and Garcia, Ricardo and Laptev, Ivan and Schmid, Cordelia (2021) Segmenter: Transformer for Semantic Segmentation. 10.1109/ICCV48922.2021.00717, 7242-7252, , , 2021 IEEE/CVF International Conference on Computer Vision (ICCV)

5. Yuxin Wu and Alexander Kirillov and Francisco Massa and Wan-Yen Lo and Ross Girshick. Detectron2. 2019, https://github.com/facebookresearch/detectron2