rmihaylov / falcontune

Tune any FALCON in 4-bit
Apache License 2.0
468 stars 51 forks source link

Add contextual generate #25

Closed koonweee closed 1 year ago

koonweee commented 1 year ago

When --contextual flag is passed to the generate command, the input is retained as part of the context. Subsequent outputs from the model are appended to a "running input".