Vision Language Model for Interpretable Medical Image Segmentation

Last updated on Jul 2, 2023 Research

Developed a novel approach utilizing multi-modal vision-language models to extract semantic information from image descriptions and images, enabling accurate segmentation of diverse medical images.
Conducted extensive evaluations of existing vision language models on multiple datasets, assessing their applicability and transferability to the medical domain.
Explored the impact of variations in image descriptions on model performance, revealing valuable insights into the model’s responsiveness to different prompts.