possee-org / genai-numpy

MIT License
4 stars 6 forks source link

Nebari and Llama 3 introduction #10

Closed bmwoodruff closed 1 month ago

bmwoodruff commented 2 months ago

The nebari server we're using is located at https://possee.openteams.com/.

I'd love a discussion below about any issues you encountered, ideas about how to use this for the project, etc.

bmwoodruff commented 2 months ago

I see the chatbot as a great introductory exploratory tool. As we're trying to figure out good prompts, how much code to include from a method that's missing documentation, how much of an issue with/without comments to include, etc., the 7B chatbot can provide a quick idea of what capabilities we have. The no_chatbot version can be incorporated into a workflow which then applies what we learned via experimentation over a much larger set of issues, methods missing documentation, etc. I'll be creating another page similar to this which examines how we can use RAG.

bmwoodruff commented 1 month ago

If you encounter a VRAM overflow error, make sure you've adjusted the model_split variable to pick the correct 8B version. You may also need to "Shut Down All Kernels" to free up any VRAM that is still being used. Once you start up the chatbot, it will keep using VRAM till you kill the kernel.