1. Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion. JITI. 2024;2(3):19-46. Accessed April 6, 2025. https://itip-submit.com/index.php/JITI/article/view/65