Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion. (2024). Journal of Intelligence Technology and Innovation, 2(3), 19-46. https://itip-submit.com/index.php/JITI/article/view/65