Closed brianyu-nexusflowai closed 1 year ago
I will push a fix for this issue.
Another unrelated question: what's the significance of the ⁇ in the output?
That's the BOS token. We will remove that token from RequestManager's log.
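For readers who hit the same stray ⁇ before the fix lands, a minimal sketch of dropping a leading BOS token from a list of token ids before decoding or logging them. This is not FlexFlow's actual fix: `strip_leading_bos` and `BOS_TOKEN_ID` are hypothetical names, and the id value 1 is an assumption that should be checked against the tokenizer in use.

```python
# Assumption: the BOS token id is 1. Check your tokenizer's configuration;
# this value is illustrative and not taken from FlexFlow.
BOS_TOKEN_ID = 1


def strip_leading_bos(token_ids, bos_token_id=BOS_TOKEN_ID):
    """Return a new list of token ids without a leading BOS token, if present."""
    if token_ids and token_ids[0] == bos_token_id:
        return list(token_ids[1:])
    return list(token_ids)
```

Only the first position is checked, so a token with the same id later in the sequence is left untouched.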
The issue should be fixed now.
Hey FlexFlow team!
For some reason the output of llm.generate is a cffi struct with a single attribute impl of type void *. I can see generation in the RequestManager log, but I can't access the return value.
I'm using the example from https://flexflow.readthedocs.io/en/latest/serve_overview.html#incremental-decoding with the zero_copy_memory_per_gpu fixed to zero_copy_memory_per_node.
Another unrelated question: what's the significance of the ⁇ in the output?
Thanks!