Closed starsholic closed 6 months ago
Hi! Here in paper 3.4 eq1,
should be output = processed_tokens + x I wonder why only add selected tokens here https://github.com/astramind-ai/Mixture-of-depths/blob/103aa4b6c211346599cc8b853cc3108bf9cb72d0/MoD/MoD.py#L93
output = processed_tokens + x
Hello, Thank you so much for pointing this out. I agree with the issue and the solution you mentioned. Have you proceeded with this solution because I think the implementation still has this issue?
Hi! Here in paper 3.4 eq1,
should be
output = processed_tokens + x
I wonder why only add selected tokens here https://github.com/astramind-ai/Mixture-of-depths/blob/103aa4b6c211346599cc8b853cc3108bf9cb72d0/MoD/MoD.py#L93