Closed PriNova closed 1 year ago
Hey @PriNova yes, I am working on this now with the torchscale repository!
https://github.com/kyegomez/LongNet/blob/main/LongNet/torchscale/torchscale/architecture/decoder.py
I need help integrating it and providing usage examples!
@PriNova I implemented the model architecture and a training script here:
https://github.com/kyegomez/LongNet/blob/0.0.3/LongNet/model.py
Hey kyegomez,
I'm interested in trying out the implementation. Is it already possible to use a basemodel for this?