Open davidADSP opened 2 weeks ago
And as a follow up question for @dusty-nv - is llama 3.2 90b supported yet through ollama?
@davidADSP that container was just for a special pre-release version of transformers before the model came out, hence it copied the wheel from my drive. However now that is upstreamed into Transformers. IIRC that container doesn't have llama.cpp or ollama, and you would need to check those projects if they support it or not. And if it is supported, it depends on how memory efficient they are with loading the weights/ect if it will fit in AGX Orin 64GB or not (anecdotally I was able to run NVLM-72B on AGX Orin with load_in_4bit=True
, but 90B is substantially larger)
When building like this:
I get the following error at the last step of the build:
I have pulled and installed the latest version of jetson containers repo.