1.
Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion. JITI [Internet]. 2024 Oct. 21 [cited 2024 Dec. 23];2(3):19-46. Available from: https://itip-submit.com/index.php/JITI/article/view/65