Reinforced Visual Interaction Fusion Radiology Report Generation

Published: 2024-07-31

Author:

Wang Liya¹, Chen Haipeng¹, Liu Yu¹, Lyu Yingda¹, Qiu Feng²

Affiliation:

1. Jilin University

2. Publicity Department of Jilin Provincial Committee of CPC

Abstract

The explosion in the number of increasingly complex chest X-rays and CT scans in recent years has placed a significant workload on physicians, particularly in radiology departments, who must interpret the images and produce radiology reports. There is therefore a need for more efficient generation of medical reports. In this paper, we propose the Reinforced Visual Interaction Fusion (RVIF) radiology report generation model. It adopts a novel and effective visual interaction fusion module that is more conducive to extracting fused visual features of radiology images with clinical diagnostic significance and to performing subsequent correlation analysis and processing. In addition, a reinforcement learning step adapted from image captioning is introduced to this task to further enhance the aligned diagnosis effect brought by the visual interaction fusion module, so as to generate accurate and highly credible radiology reports. Quantitative experiments and visualization results show that our model performs well on two public medical report generation datasets, IU X-Ray and MIMIC-CXR, surpassing some SOTA methods. Compared with the 2024 SOTA model COMG+RL, BLEU@1, BLEU@2, and BLEU@3 among the NLG metrics increased by 3.9%, 2.8%, and 0.5% respectively, and METEOR increased by 2.2%; among the CE metrics, precision (P) increased by 0.4%, recall (R) by 1.5%, and F1-score by 1.8%. Source code is available at https://github.com/200084/RVIF-Radiology-Report-Generation.

Publisher

Springer Science and Business Media LLC

References (63 articles)

1. Gu, Tiancheng and Liu, Dongnan and Li, Zhiyuan and Cai, Weidong (2024) Complex Organ Mask Guided Radiology Report Generation. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision: 7995--8004

2. Hochreiter, S and Schmidhuber, J (1997) Long Short-Term Memory. Neural Computation 9(8): 1735--1780

3. Ayesha, Hareem and Iqbal, Sajid and Tariq, Mehreen and Abrar, Muhammad and Sanaullah, Muhammad and Abbas, Ishaq and Rehman, Amjad and Niazi, Muhammad Farooq Khan and Hussain, Shafiq (2021) Automatic medical image interpretation: State of the art and future directions. Pattern Recognition 114: 107856 Elsevier

4. Shamshad, Fahad and Khan, Salman and Zamir, Syed Waqas and Khan, Muhammad Haris and Hayat, Munawar and Khan, Fahad Shahbaz and Fu, Huazhu (2023) Transformers in medical imaging: A survey. Medical Image Analysis: 102802 Elsevier

5. Park, Hyeryun and Kim, Kyungmo and Park, Seongkeun and Choi, Jinwook (2021) Medical image captioning model to convey more details: Methodological comparison of feature difference generation. IEEE Access 9: 150560--150568 IEEE