[1]
“Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion”, JITI, vol. 2, no. 3, pp. 19–46, Oct. 2024, Accessed: Dec. 23, 2024. [Online]. Available: https://itip-submit.com/index.php/JITI/article/view/65