7LFB / QAP

3 stars 0 forks source link

Prompting Vision Foundation Models for Pathology Image Analysis

Chong Yin, Siqi Liu, Kaiyang Zhou, Vincent Wai-Sun Wong, Pong C. Yuen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2024
PDF      Video

Updates

Introduction

This is the github repository of Prompting Vision Foundation Models for Pathology Image Analysis. In this paper, we propose Quantitative Attribute-based Prompting (QAP), a novel prompting method specifically for liver pathology image analysis. QAP is based on two quantitative attributes, namely K-function-based spatial attributes and histogram-based morphological attributes, which are aimed for quantitative assessment of tissue states. Moreover, a conditional prompt generator is designed to turn these instance-specific attributes into visual prompts.

The framework of the proposed QAP model:

Enhanced Interpretability

Image samples with its attention map and attribute significance histogram when identifying specific histological findings. Our method enhances interpretability by visually representing the decision-making process through attention maps and attribute significance histograms. a. The structures the model focuses on; b. The attributes of structures the model focus on.

Score

Boost Diagnosis Performance

We further explore learning various prompts. The tissue structure segments provide more informative cues compared to task-agnostic visual prompts learned from randomly initialized vectors. Additionally, the quantitative attributes obtained from summarizing the statistical information about tissue structures are more explicit. Using prompts learned conditioned on explicit cues can enhance the learning process and improve performance.

Score

Citation

If you find this work useful for your research, please kindly cite our paper:

@inproceedings{yin2024prompting,
  title={Prompting Vision Foundation Models for Pathology Image Analysis},
  author={Yin, Chong and Liu, Siqi and Zhou, Kaiyang and Wong, Vincent Wai-Sun and Yuen, Pong C},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={11292--11301},
  year={2024}
}