1. Spice: Semantic propositional image caption evaluation;Anderson,2016
2. Bottom-up and top-down attention for image captioning and visual question answering;Anderson,2018
3. ACapMed: Automatic captioning for medical imaging;Beddiar;Applied Sciences,2022
4. Generation of image captions using VGG and ResNet CNN models cascaded with RNN approach;Bhalekar,2020
5. A simple and effective positional encoding for transformers;Chen,2021