dusty-nv / jetson-containers

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
MIT License

About minimizing local_llm image size #476

Open hardychen1991 opened 2 months ago

hardychen1991 commented 2 months ago

Hi, thanks for such great work! Just wondering if anyone has looked into minimizing the local_llm image size?

I've tried to build a customized image for text-only SLM inference, specifically with Gemma-2B, but the number of base images and packages is a bit overwhelming. Any information or advice would be appreciated. Thanks!

dusty-nv commented 2 months ago

Hi @hardychen1991, yea I feel you. I've been trying to make this smaller and build faster, and in fact we basically re-did most of the containers in this repo for minimization. Perhaps unsurprisingly, considering what it achieves, this one has many big/complex dependencies including MLC/TVM, AWQ, FAISS, ASR/TTS, etc., so it is still quite large. local_llm has also transitioned to NanoLLM for future development, where I hope to continue making progress on issues like this:
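For the text-only use case, something along these lines should work with NanoLLM, which only needs the LLM backend (no ASR/TTS or vector-store pieces at runtime). This is a minimal sketch based on the documented NanoLLM usage; the "google/gemma-2b" model name, its support under the MLC backend, and the quantization setting are assumptions here, not something verified in this thread:

```python
from nano_llm import NanoLLM

# Load the model through the MLC backend.
# NOTE: "google/gemma-2b" and q4f16_ft are illustrative; swap in whatever
# model/quantization combination actually works for your setup.
model = NanoLLM.from_pretrained(
    "google/gemma-2b",          # HuggingFace repo name or local checkpoint path
    api='mlc',                  # supported APIs include mlc, awq, hf
    quantization='q4f16_ft',    # 4-bit weight quantization for MLC
)

# Stream a text-only completion, token by token.
response = model.generate("Once upon a time,", max_new_tokens=128)

for token in response:
    print(token, end='', flush=True)
```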