WET-UNet: Wavelet integrated efficient transformer networks for nasopharyngeal carcinoma tumor segmentation-Reference-Cited by-同舟云学术

WET-UNet: Wavelet integrated efficient transformer networks for nasopharyngeal carcinoma tumor segmentation

Published:2024-04 Issue:2 Volume:107 Page:
ISSN:0036-8504
Container-title:Science Progress
language:en
Short-container-title:Science Progress

Author:

Zeng Yan¹²,Li Jun¹²^ORCID,Zhao Zhe¹²,Liang Wei¹²,Zeng Penghui¹²,Shen Shaodong¹²,Zhang Kun¹³^ORCID,Shen Chong¹²

Affiliation:

1. State Key Laboratory of Marine Resource Utilization in South China Sea, Hainan University, Haikou, China

2. School of Information and Communication Engineering, Hainan University, Haikou, China

3. School of Information Science and Technology, Hainan Normal University, Haikou, China

Abstract

Nasopharyngeal carcinoma is a malignant tumor that occurs in the epithelium and mucosal glands of the nasopharynx, and its pathological type is mostly poorly differentiated squamous cell carcinoma. Since the nasopharynx is located deep in the head and neck, early diagnosis and timely treatment are critical to patient survival. However, nasopharyngeal carcinoma tumors are small in size and vary widely in shape, and it is also a challenge for experienced doctors to delineate tumor contours. In addition, due to the special location of nasopharyngeal carcinoma, complex treatments such as radiotherapy or surgical resection are often required, so accurate pathological diagnosis is also very important for the selection of treatment options. However, the current deep learning segmentation model faces the problems of inaccurate segmentation and unstable segmentation process, which are mainly limited by the accuracy of data sets, fuzzy boundaries, and complex lines. In order to solve these two challenges, this article proposes a hybrid model WET-UNet based on the UNet network as a powerful alternative for nasopharyngeal cancer image segmentation. On the one hand, wavelet transform is integrated into UNet to enhance the lesion boundary information by using low-frequency components to adjust the encoder at low frequencies and optimize the subsequent computational process of the Transformer to improve the accuracy and robustness of image segmentation. On the other hand, the attention mechanism retains the most valuable pixels in the image for us, captures the remote dependencies, and enables the network to learn more representative features to improve the recognition ability of the model. Comparative experiments show that our network structure outperforms other models for nasopharyngeal cancer image segmentation, and we demonstrate the effectiveness of adding two modules to help tumor segmentation. The total data set of this article is 5000, and the ratio of training and verification is 8:2. In the experiment, accuracy = 85.2% and precision = 84.9% can show that our proposed model has good performance in nasopharyngeal cancer image segmentation.

Funder

Hainan Province Science and Technology Special Fund

Publisher

SAGE Publications

Link

https://journals.sagepub.com/doi/pdf/10.1177/00368504241232537

Reference36 articles.

1. Analysis of Plasma Epstein–Barr Virus DNA to Screen for Nasopharyngeal Cancer

2. Global trends in incidence and mortality of nasopharyngeal carcinoma

3. NPCNet: Jointly Segment Primary Nasopharyngeal Carcinoma Tumors and Metastatic Lymph Nodes in MR Images

4. Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp.3431–3440.

5. Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer International Publishing, 2015: 234–241.