HPC: LLM from scratch - on prem preparation for Generative AI work

ObrienlabsDev / blog

Blogs and Wiki

Apache License 2.0

HPC: LLM from scratch - on prem preparation for Generative AI work #6

Open fmichaelobrien opened 1 year ago

fmichaelobrien commented 1 year ago

See https://github.com/ObrienlabsDev/blog/wiki/CUDA-based-%E2%80%90-High-Performance-Computing-%E2%80%90-LLM-Training-%E2%80%90-Ground-to-GCP-Cloud-Hybrid#use-cases

Start with the following site and read it in its entirety: https://jaykmody.com/blog/gpt-from-scratch/

or https://github.com/lm-sys/FastChat

fmichaelobrien commented 11 months ago

review https://github.com/vectara/hallucination-leaderboard

https://f-a.nz/dev/develop-your-own-llm-like-chatgpt-with-tensorflow-and-keras/

obriensystems commented 11 months ago

RTX-A4500 Dual at 40G

Also considering an RTX-A6000 with 48G (NVLink capable) as an alternative to my two RTX-A4500s, which give 20+20 = 40G when joined over NVLink.
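
The 40G vs 48G comparison above can be turned into a quick back-of-the-envelope check. The following is a minimal sketch, not a measurement: it counts model weights only (no activations, KV cache, optimizer state or framework overhead), the model sizes are hypothetical examples, and it treats 20+20 as one 40G pool, which in practice only holds if the framework shards the model across both cards. The helper names `weight_gib` and `fits` are made up for illustration.

```python
# Rough check of whether model weights alone fit in a given amount of VRAM.
# Assumption: weights only; real usage is higher (activations, KV cache, etc.).

# Standard storage widths in bytes per parameter.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_gib(n_params: float, dtype: str) -> float:
    """Size of the weights in GiB for the given precision."""
    return n_params * BYTES_PER_PARAM[dtype] / 2**30


def fits(n_params: float, dtype: str, vram_gib: float) -> bool:
    """True if the weights alone fit in the given total VRAM."""
    return weight_gib(n_params, dtype) <= vram_gib


# Totals as quoted in the comment above; the dual-card figure assumes the
# model is actually split across both GPUs.
SETUPS = {"2x RTX-A4500": 40, "1x RTX-A6000": 48}

for name, vram in SETUPS.items():
    for n_params in (7e9, 13e9):  # hypothetical model sizes
        for dtype in BYTES_PER_PARAM:
            size = weight_gib(n_params, dtype)
            print(f"{name:13s} {n_params / 1e9:4.0f}B {dtype:5s} "
                  f"{size:6.1f} GiB  fits={fits(n_params, dtype, vram)}")
```

Training needs considerably more memory than this estimate, since gradients and optimizer state are stored alongside the weights, so treat a "fits" result as a lower bound only.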

RTX-4090 Dual at 48G

https://www.reddit.com/r/LocalLLaMA/comments/15zx322/ideal_setup_for_dual_4090/

Splitting load