python3 python/generate_completions.py RWKV-5-World-0.1B-v1-20230803-ctx4096-ggml.bin
System info: AVX=1 AVX2=1 AVX512=0 FMA=1 NEON=0 ARM_FMA=0 F16C=1 FP16_VA=0 WASM_SIMD=0 BLAS=0 SSE3=1 VSX=0
Loading RWKV model
GGML_ASSERT: rwkv.cpp/ggml/src/ggml.c:5316: ggml_can_repeat_rows(b, a)
Aborted (core dumped)
Backtrace from gdb:
Program received signal SIGABRT, Aborted.
__pthread_kill_implementation (no_tid=0, signo=6, threadid=140737350497344) at ./nptl/pthread_kill.c:44
44 ./nptl/pthread_kill.c: No such file or directory.
(gdb) bt
#0 __pthread_kill_implementation (no_tid=0, signo=6, threadid=140737350497344) at ./nptl/pthread_kill.c:44
#1 __pthread_kill_internal (signo=6, threadid=140737350497344) at ./nptl/pthread_kill.c:78
#2 __GI___pthread_kill (threadid=140737350497344, signo=signo@entry=6) at ./nptl/pthread_kill.c:89
#3 0x00007ffff7cca476 in __GI_raise (sig=sig@entry=6) at ../sysdeps/posix/raise.c:26
#4 0x00007ffff7cb07f3 in __GI_abort () at ./stdlib/abort.c:79
#5 0x00007fffd30baea4 in ggml_add_impl (ctx=0x7fffd31d6908 <g_state+104>, a=0x7fffc03ffad0, b=0x7fffb8753e90, inplace=false)
at /work/rwkv.cpp/ggml/src/ggml.c:5316
#6 0x00007fffd30bafdb in ggml_add (ctx=0x7fffd31d6908 <g_state+104>, a=0x7fffc03ffad0, b=0x7fffb8753e90)
at /work/rwkv.cpp/ggml/src/ggml.c:5340
#7 0x00007fffd30f83c0 in rwkv_att_wkv (ctx=0x7fffd31d6908 <g_state+104>, att_time_first=0x7fffc03ffad0, att_time_decay=0x7fffc03ff950,
k=0x7fffb8753e90, v=0x7fffb8753fe0, aa=@0x7fffffffcf20: 0x7fffb87486d0, bb=@0x7fffffffcf28: 0x7fffb8748820,
pp=@0x7fffffffcf30: 0x7fffb8748970) at /work/rwkv.cpp/rwkv_graph.inc:132
#8 0x00007fffd30f86fc in rwkv_att (ctx=0x7fffd31d6908 <g_state+104>, x=0x7fffb8752ae0, layer=..., state=...)
at /work/rwkv.cpp/rwkv_graph.inc:172
#9 0x00007fffd30f90e3 in rwkv_build_serial_graph (model=..., graph=...) at /work/rwkv.cpp/rwkv_graph.inc:274
#10 0x00007fffd30f9552 in rwkv_measure_and_build_serial_context (model=..., graph=...) at /work/rwkv.cpp/rwkv_graph.inc:320
#11 0x00007fffd30fa6db in rwkv_init_from_file (
file_path=0x7fffd37e7770 "/work/models/RWKV-5-World-0.1B-v1-20230803-ctx4096-ggml.bin", n_threads=4)
at /work/rwkv.cpp/rwkv.cpp:65
#12 0x00007ffff7083052 in ffi_call_unix64 () from /.conda/envs/pytorch/lib/python3.11/lib-dynload/../../libffi.so.8
#13 0x00007ffff7081925 in ffi_call_int () from /.conda/envs/pytorch/lib/python3.11/lib-dynload/../../libffi.so.8
#14 0x00007ffff708206e in ffi_call () from /.conda/envs/pytorch/lib/python3.11/lib-dynload/../../libffi.so.8
#15 0x00007ffff70932e4 in _call_function_pointer (argtypecount=<optimized out>, argcount=2, resmem=0x7fffffffd1a0, restype=<optimized out>,
atypes=<optimized out>, avalues=<optimized out>, pProc=0x7fffd30fa471 <rwkv_init_from_file(char const*, uint32_t)>, flags=4353)
at /usr/local/src/conda/python-3.11.4/Modules/_ctypes/callproc.c:923
System: Linux x86_64, Ubuntu 22.04.3 LTS
llama.cpp works fine.
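
For anyone reading the backtrace: frame #5 aborts in ggml_add_impl because ggml_can_repeat_rows(b, a) is false, where a is att_time_first and b is k (frame #7). The sketch below mirrors that shape check in Python as I understand it from the ggml sources of that period (first dimensions must match, and every dimension of a must be a whole multiple of the corresponding dimension of b). The shapes are hypothetical: I am assuming, for illustration only, a 768-channel model whose RWKV-5 checkpoint stores time_first per head (12 heads) rather than per channel. They were not read from the model file.

```python
# Sketch of the shape check behind GGML_ASSERT(ggml_can_repeat_rows(b, a)).
# Shapes are ggml-style "ne" tuples (innermost dimension first, padded to 4).
# The function bodies are an assumption based on ggml sources of that period.

def can_repeat(t0, t1):
    """True if t0 can be tiled to fill t1: each t1 dim is a multiple of t0's."""
    return all(d1 % d0 == 0 for d0, d1 in zip(t0, t1))

def can_repeat_rows(t0, t1):
    """True if t0 and t1 have equal row length and t0 can be tiled into t1."""
    return t0[0] == t1[0] and can_repeat(t0, t1)

# Hypothetical shapes, for illustration only.
k = (768, 1, 1, 1)                  # b in ggml_add: one value per channel
time_first_v4 = (768, 1, 1, 1)      # a: per-channel layout the graph expects
time_first_v5 = (12, 1, 1, 1)       # a: assumed per-head layout in RWKV-5

print(can_repeat_rows(k, time_first_v4))  # True: add proceeds
print(can_repeat_rows(k, time_first_v5))  # False: assertion would fire
```

If this reading is right, the crash is a tensor layout mismatch between the RWKV-5 checkpoint and the graph built in rwkv_graph.inc, not a problem with the machine or the build.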