Open hiento09 opened 10 months ago
hi @hiento09 can we spin up the azure instance of AMD GPU and test
hi @hiento09 can you set up an instance with the specs like the link below, azure is not working
https://gist.github.com/cgmb/6ae0d118bf357fc4576a7568b85e1c45
Tutorial to compile nitro on that system above (details in link https://gist.github.com/cgmb/6ae0d118bf357fc4576a7568b85e1c45 ):
First step, install AMD things (for details link: https://gist.github.com/cgmb/6ae0d118bf357fc4576a7568b85e1c45) :
After that just build nitro with below flag:
CC=/opt/rocm/llvm/bin/clang CXX=/opt/rocm/llvm/bin/clang++ \
cmake -H. -Bbuild -DLLAMA_HIPBLAS=ON -DAMDGPU_TARGETS= gfx1011 -DCMAKE_BUILD_TYPE=Release
and
make
I am requesting quota for g4ad ec2 instance on aws, will comeback to this task when they open quota for us
I was able to build nitro with above flag successfully using this docker image https://hub.docker.com/r/rocm/dev-ubuntu-22.04 but did not have AMD GPU for testing the bin file
AWS reject our quota request, we may need to consider purchasing for some AMD GPU, @tikikun , @dan-jan
I have one, let me test tomorrow 🗡️ @hiento09
Do you have file so I can test? Or what do I need to do to built it if need?
@hiento09 can you provide bin file? I cannot make the build run on my WSL 2 windows
my target CPU is: gfx1032 (radeon 6600xt) if you need
rename to ROCm support since vulkan already supported
still have ROCm build left @hiento09 @hiro-v
Experimental feature: 0.4.7 - Vulkan
@louis-jan No it's not This is AMD RoCm (equivalent to NVIDIA CUDA), not Vulkan. I'm moving this one back to Icebox as of now
converting this to epic to close https://github.com/janhq/jan/issues/913
@Van-QA can we queue this up after trtllm for Cam? 🙏
hi @0xSage, if you look at the sprint / status, you can see that it's already in the n‌ext spr‌int
From Jan: I think a UX enhancement is needed. (GPU settings)
Overview
Tasklist
Original Post
Problem We need a new nitro bin file that support AMD GPU for both windows and linux
Success Criteria