[1] “Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion,” JITI, vol. 2, no. 3, pp. 19–46, Oct. 2024. Accessed: Apr. 03, 2025. [Online]. Available: https://itip-submit.com/index.php/JITI/article/view/65