Journal of biomedical informaticsHumansSemanticsAlgorithmsMedical Informatics
Caption-augmented reasoning model with Hierarchical rank LoRA finetuing for medical visual question Answering.
Yong Li, Jianping Man, Yi Zhou, Likeng Liang
Published: 202510.1016/j.jbi.2025.104964
Abstract
OBJECTIVE: Medical Visual Question Answering (VQA) is a quintessential application scenario of biomedical Multimodal Large Language Models (MLLMs). Previous studies mainly focused on input image-question pairs, neglecting the rich medical knowledge o…
Preview only. Read the full abstract at the source