ggerganov / llama.cpp

LLM inference in C/C++
MIT License

Use-after-free in llama_build_graph #10363

Closed JohannesGaessler closed 2 hours ago

JohannesGaessler commented 2 hours ago

In llama_build_graph:

  1. An instance of llm_build_context is created.
  2. The function calls llm_build_context.init(), which in turn calls ggml_init and stores the resulting ggml_context pointer in llm_build_context.ctx0.
  3. A new graph is constructed using llm_build_context.ctx0.
  4. The function calls llm_build_context.free() which in turn calls ggml_free(llm_build_context.ctx0).
  5. The function returns the created graph.

Unless I'm missing something, this is a use-after-free bug, since the ggml_context in which the graph (and its tensors) were allocated is always freed before the graph is returned.

slaren commented 2 hours ago

The graph and tensors do not hold any references to the ggml_context; they are allocated from buf_compute_meta, which is stored in llama_context.

JohannesGaessler commented 2 hours ago

Thanks, I was missing that the buffer for the graph + tensors is allocated externally.