(1)
Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion. JITI 2024, 2 (3), 19-46.