Inquiry About PixLevelModule Usage in LViT Network Structure

Dear Zihan,

I hope this message finds you well. As I delve deeper into the LViT model, I have encountered a specific aspect of the network structure that I am eager to understand better with your guidance.

In my thorough examination of the LViT's network architecture, I observed that the PixLevelModule, as defined in the code, appears not to be used in the forward pass. To better illustrate my point, I am referencing the following sections of the code from your GitHub repository:

This observation has led to some confusion about its role in the overall network structure as described in your paper. Could you kindly provide clarification on this and, if possible, share the correct code that reflects the implementation used in the paper?

Your insights are invaluable to my understanding and proper implementation of the LViT model. I greatly appreciate your time and the support you've provided thus far.

Thank you once again for your assistance.

Best regards, Pengyu

HUANGLIZI / LViT

Inquiry About PixLevelModule Usage in LViT Network Structure #35