Closed by segabor 3 months ago
This is a CPU-only run. The Rockchip SoC has its own neural engine (NPU) rated at 6 TOPS. I will try to unlock it, but that will take some time.
Interesting...
Turing RK1 costs $149 for the 8 GB version, which works out to $24.8 / TOPS. A GeForce RTX 4070 costs ~$550 for 29.15 TFLOPS, so $18.9 / TFLOP. The units are different, but I suppose it's still cheaper to buy a few GeForce cards.
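For anyone who wants to redo the arithmetic above with other prices, here is a minimal sketch. The helper name `cost_per_unit` is made up for illustration, and the prices and throughput figures are just the ones quoted in this comment. As already noted, TOPS and TFLOPS are different units, so the two results are only a rough comparison.

```python
def cost_per_unit(price_usd: float, throughput: float) -> float:
    """Return dollars per unit of rated throughput (hypothetical helper)."""
    return price_usd / throughput

# Figures quoted in the comment above, not independently measured.
rk1_usd_per_tops = cost_per_unit(149, 6)        # Turing RK1, 6 TOPS NPU
rtx_usd_per_tflop = cost_per_unit(550, 29.15)   # RTX 4070, 29.15 TFLOPS

print(f"Turing RK1: ${rk1_usd_per_tops:.1f} / TOPS")
print(f"RTX 4070:   ${rtx_usd_per_tflop:.1f} / TFLOP")
```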
No doubt the GPU wins the GPU vs. NPU contest on performance per dollar. But it's nice to have an NPU on an IoT board.
I promised to share results from the Turing RK1 module. It arrived yesterday, so I took the chance to run distributed llama on it.
Capability: 8 cores, 32 GB RAM
Storage: 1 TB NVMe SSD
OS: custom Ubuntu Server
Model: llama-2-7b
Command
Result