hnjzbss / EKAGen

[CVPR 2024]Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation
Apache License 2.0
15 stars 0 forks source link

EKAGen

Code for CVPR2024 paper: "Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation". Shenshen Bu, Taiji Li, Yuedong Yang, Zhiming Dai. [Video]

EKAGen 框架示意图

Abstract: Automatic radiology report generation can provide substantial advantages to clinical physicians by effectively reducing their workload and improving efficiency. Despite the promising potential of current methods, challenges persist in effectively extracting and preventing degradation of prominent features, as well as enhancing attention on pivotal regions. In this paper, we propose an Instance-level Expert Knowledge and Aggregate Discriminative Attention framework for radiology report generation. We convert expert reports into an embedding space and generate comprehensive representations for each disease, which serve as Preliminary Knowledge Support (PKS). To prevent feature disruption, we select the representations in the embedding space with the smallest distances to PKS as Rectified Knowledge Support (RKS). Then, EKAGen diagnoses the diseases and retrieves knowledge from RKS, creating Instance-level Expert Knowledge (IEK) for each query image, boosting generation. Additionally, we introduce Aggregate Discriminative Attention Map (ADM), which uses weak supervision to create maps of discriminative regions that highlight pivotal regions. For training, we propose a Global Information Self-Distillation (GID) strategy, using an iteratively optimized model to distill global knowledge into EKAGen. Extensive experiments and analyses on IU X-Ray and MIMIC-CXR datasets demonstrate that EKAGen outperforms previous state-of-the-art methods.


Get Started

1) Requirement

2) Data Preparation

MIMIC-CXR

IU X-Ray

3) Download Model Weights and Knowledge Base


4) Training

IU X-Ray

bash train_iu.sh

MIMIC-CXR

bash train_mimic.sh

5) Inference

You can download our trained models for inference from IU X-Ray and MIMIC-CXR.

IU X-Ray

bash test_iu.sh

MIMIC-CXR

bash test_mimic.sh

Citation

If you find this work useful in your research, please cite:

@InProceedings{Bu_2024_CVPR,
    author    = {Bu, Shenshen and Li, Taiji and Yang, Yuedong and Dai, Zhiming},
    title     = {Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2024},
    pages     = {14194-14204}
}

Contact Information

If you have any suggestions or questions, you can contact us by: bushsh@mail2.sysu.edu.cn. Thank you for your attention!