MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images
Authors: Ankan Deria, Komal Kumar, Adinath Madhavrao Dukre, Eran Segal, Salman Khan, Imran Razzak
Deep-Dive Summary:
MedMO:面向医学图像定位与理解的多模态大语…
SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs
Authors: Niccolo Avogaro, Nayanika Debnath, Li Mi, Thomas Frick, Junling Wang, Zexue He, Hang Hua, Konrad Schindler, Mattia Rigotti
Deep-Dive Summary: 这篇文章介绍了 SPARC …