fanfanfan-hff closed this issue 1 year ago
Thanks for carefully checking the consistency of Equation (7) and the pseudocode. There is an index mistake in Equation (7): the Cross-Chunk term should be $Q_{[i]}R_{i-1}\odot \xi$. We will fix this in the next version of our paper.
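For anyone who wants to check the corrected index numerically, here is a minimal pure-Python sketch. It is not the torchscale code. It assumes a single head with real-valued inputs and leaves out the rotation, scaling, and group norm. The names `parallel_retention` and `chunkwise_retention` are made up for illustration. It also assumes the paper's 0-indexed decay vectors, $\xi_j = \gamma^{j+1}$ and $\zeta_j = \gamma^{B-j-1}$ for position $j$ inside a chunk of size $B$.

```python
def parallel_retention(q, k, v, gamma):
    """Reference form: out[n] = sum_{m<=n} gamma^(n-m) * (q[n] . k[m]) * v[m]."""
    d_v = len(v[0])
    out = []
    for n in range(len(q)):
        row = [0.0] * d_v
        for m in range(n + 1):
            weight = gamma ** (n - m) * sum(a * b for a, b in zip(q[n], k[m]))
            for c in range(d_v):
                row[c] += weight * v[m][c]
        out.append(row)
    return out


def chunkwise_retention(q, k, v, gamma, chunk):
    """Chunkwise form (illustrative): the cross-chunk term reads R_{i-1}."""
    d_k, d_v = len(k[0]), len(v[0])
    # `state` holds R_{i-1}; it is all zeros before the first chunk.
    state = [[0.0] * d_v for _ in range(d_k)]
    out = []
    for start in range(0, len(q), chunk):
        qc = q[start:start + chunk]
        kc = k[start:start + chunk]
        vc = v[start:start + chunk]
        b = len(qc)  # the last chunk may be shorter than `chunk`
        # Inner-chunk term: the parallel form restricted to this chunk.
        inner = parallel_retention(qc, kc, vc, gamma)
        for j in range(b):
            xi = gamma ** (j + 1)
            # Cross-chunk term: (q_j R_{i-1}) scaled by xi_j.
            cross = [xi * sum(qc[j][a] * state[a][c] for a in range(d_k))
                     for c in range(d_v)]
            out.append([inner[j][c] + cross[c] for c in range(d_v)])
        # Only now update the state: R_i = K^T (V * zeta) + gamma^B R_{i-1}.
        new_state = [[gamma ** b * state[a][c] for c in range(d_v)]
                     for a in range(d_k)]
        for j in range(b):
            zeta = gamma ** (b - j - 1)
            for a in range(d_k):
                for c in range(d_v):
                    new_state[a][c] += kc[j][a] * zeta * vc[j][c]
        state = new_state
    return out
```

Under these assumptions the two functions should agree up to floating-point error for any chunk size. That is because the state is updated only after the chunk's outputs have been computed, so the cross-chunk term always sees $R_{i-1}$.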
When are you planning to release the code? The repo says in two days. Are you adhering to that timeline?
@okpatil4u https://github.com/microsoft/torchscale/commit/bf65397b26469ac9c24d83a9b779b285c1ec640b
Is `cross_retention` different from Cross-Chunk?