Generating Bangla Image Captions with Deep Learning Techniques

Published in 6th International Conference on Sustainable Technologies for Industry 5.0 (STI), Dhaka, Bangladesh, 2024

We introduce a Bangla image captioning model using EfficientNetB4 and ResNet-50 for feature extraction, with EfficientNetB4 achieving a BLEU score of 0.54. This study also presents the BanglaView dataset, fostering advancements in accessibility and Bengali digital communication.[Full paper uploaded soon!]

Recommended citation: MA Hossain, Mirza AFMRH, SK Ray and N Islam, "Generating Bangla Image Captions with Deep Learning Techniques", 2024 6th International Conference on Sustainable Technologies for Industry 5.0 (STI), Dhaka, Bangladesh.