Like `gptj_eval`, this function appears to rebuild the computational graph on every invocation. Is it possible to store the computational graph once it is built, and simply feed input tensors into it for each computation? In other words, does rebuilding the graph for every prediction add noticeable overhead, especially when I need to perform batch computations? Thanks in advance.