Closed sherzod-hakimov closed 7 months ago
@davidschlangen re issue #26 should this stay a backend property or become a model spec item?
That's related to our discussion about temperature (I believe it was) from a while ago. Under the conception that I mostly have been thinking about, the model spec is more describing the model as a general (the collection of weights and how to run inference on them / access them), and so things like temperature and this do not belong there. But I can see that one can also think about the model spec giving the full specification of a model instance.
Whatever we do, it should be consistent for temperature and max_token.
And my intuition would be that a way to distinguish is "everything that can meaningfully be set differently for each separate call to generate()
does not belong to the model spec". But I'm open for discussions here.
allow the user to set the max_new_token parameter. By default it should be set to 100.