harrisonvanderbyl / rwkv-cpp-accelerated

A torchless C++ RWKV implementation using 8-bit quantization, written in CUDA/HIP/Vulkan for maximum compatibility and minimum dependencies
MIT License
303 stars 19 forks

ACO ERROR: Unsupported opcode #36

Open Jipok opened 11 months ago

Jipok commented 11 months ago
Full log
```
~/.../storygen/realese [main] $ ./storygen-vulkan /home/user/dev/RWKV/rwkv-cpp-accelerated/examples/storygen/realese/model.bin
n_layers: 32
n_embed: 4096
loading: xbuf isBuffer: 1 cuda_mem: 0x7f2777c2c000
loading: embed
loading: layernorms isBuffer: 0 cuda_mem: 0x7f276de9e000
loading: state_xy isBuffer: 1 cuda_mem: 0x7f276dd9e000
loading: state_aa isBuffer: 1 cuda_mem: 0x7f276dc9e000
loading: state_bb isBuffer: 1 cuda_mem: 0x7f276db9e000
loading: state_pp isBuffer: 1 cuda_mem: 0x7f276da9e000
loading: state_dd isBuffer: 1 cuda_mem: 0x7f276d99e000
loading: buffer1 isBuffer: 1 cuda_mem: 0x7f276ee12000
loading: buffer2 isBuffer: 1 cuda_mem: 0x7f276d96c000
loading: buffer3 isBuffer: 1 cuda_mem: 0x7f2777c24000
loading: buffer4 isBuffer: 1 cuda_mem: 0x7f2777568000
loading: mix_k isBuffer: 0 cuda_mem: 0x7f276d86c000
loading: mix_v isBuffer: 0 cuda_mem: 0x7f276d76c000
loading: mix_r isBuffer: 0 cuda_mem: 0x7f276d66c000
loading: km isBuffer: 0 cuda_mem: 0x7f26d6e66000
loading: vm isBuffer: 0 cuda_mem: 0x7f2676e66000
loading: rm isBuffer: 0 cuda_mem: 0x7f2656e66000
loading: kr isBuffer: 0 cuda_mem: 0x7f276d5ec000
loading: vr isBuffer: 0 cuda_mem: 0x7f276d56c000
loading: rr isBuffer: 0 cuda_mem: 0x7f276d4ec000
loading: o1 isBuffer: 0 cuda_mem: 0x7f276d46c000
loading: o2 isBuffer: 0 cuda_mem: 0x7f276d3ec000
loading: o3 isBuffer: 0 cuda_mem: 0x7f276d36c000
loading: att_out isBuffer: 0 cuda_mem: 0x7f2636e66000
loading: att_out_r isBuffer: 0 cuda_mem: 0x7f276d2ec000
loading: att_out_o isBuffer: 0 cuda_mem: 0x7f276d26c000
loading: ffn_mix_k isBuffer: 0 cuda_mem: 0x7f276d16c000
loading: ffn_mix_v isBuffer: 0 cuda_mem: 0x7f276d06c000
loading: ffn_k isBuffer: 0 cuda_mem: 0x7f2536e65000
loading: ffn_v isBuffer: 0 cuda_mem: 0x7f23b6e65000
loading: ffn_r isBuffer: 0 cuda_mem: 0x7f2616e66000
loading: ffn_kr isBuffer: 0 cuda_mem: 0x7f276cfec000
loading: ffn_vr isBuffer: 0 cuda_mem: 0x7f276cdec000
loading: ffn_rr isBuffer: 0 cuda_mem: 0x7f276cd6c000
loading: ffn_ko isBuffer: 0 cuda_mem: 0x7f276ccec000
loading: ffn_vo isBuffer: 0 cuda_mem: 0x7f276caec000
loading: ffn_ro isBuffer: 0 cuda_mem: 0x7f276ca6c000
loading: ffn_k_buffer isBuffer: 1 cuda_mem: 0x7f276ee0a000
loading: ffn_v_buffer isBuffer: 1 cuda_mem: 0x7f276ca64000
loading: ffn_r_buffer isBuffer: 1 cuda_mem: 0x7f276ca54000
loading: decay isBuffer: 0 cuda_mem: 0x7f276c954000
loading: bonus isBuffer: 0 cuda_mem: 0x7f276c854000
loading: head isBuffer: 0 cuda_mem: 0x7f26fe59c000
loading: head_r isBuffer: 0 cuda_mem: 0x7f276f203000
loading: head_o isBuffer: 0 cuda_mem: 0x7f276ee06000
Loaded model
loading context
ACO ERROR:
In file ../src/amd/compiler/aco_assembler.cpp:168
Unsupported opcode: buffer_atomic_add_f32 %18:s[8-11], v1: undef, 0, %54:v[0] disable_wqm storage:buffer semantics:volatile,atomic,rmw
Aborted
```