Open athitten opened 2 months ago
Hi @athitten!
Would it be possible for you to share a small repro to run the NeVa model? I think this is the last operation that is needed to unlock GPT and NeVa from NeMo.
I see there are the examples in the NeMo repo but they don't seem to work even in the NeMo container :(
Would it be possible for you to share a small repro to run the NeVa model? I think this is the last operation that is needed to unlock GPT and NeVa from NeMo.
Thanks for your interest! I pasted some ways to run this in #660.
@athitten going forward let's create a branch and include something similar to the instructions in #660 in the "root" issues (i.e. #343 in this case) so that other people can reproduce.
🚀 Feature
Implement
torch.Tensor.__setitem__
Motivation
NeMo's MegatronNeVaModel
cc @tfogal