HUANGLIZI / LViT

[IEEE Transactions on Medical Imaging/TMI] This repo is the official implementation of "LViT: Language meets Vision Transformer in Medical Image Segmentation"
MIT License
283 stars 26 forks source link

Inquiry About PixLevelModule Usage in LViT Network Structure #35

Closed Studentpengyu closed 9 months ago

Studentpengyu commented 9 months ago

Dear Zihan,

I hope this message finds you well. As I delve deeper into the LViT model, I have encountered a specific aspect of the network structure that I am eager to understand better with your guidance.

In my thorough examination of the LViT's network architecture, I observed that the PixLevelModule, as defined in the code, appears not to be used in the forward pass. To better illustrate my point, I am referencing the following sections of the code from your GitHub repository: image image image

This observation has led to some confusion about its role in the overall network structure as described in your paper. Could you kindly provide clarification on this and, if possible, share the correct code that reflects the implementation used in the paper?

Your insights are invaluable to my understanding and proper implementation of the LViT model. I greatly appreciate your time and the support you've provided thus far.

Thank you once again for your assistance.

Best regards, Pengyu

HUANGLIZI commented 9 months ago

There is some redundancy in the code, but you can use it freely. And the code has been the proper implementation of the LViT model.