Title generation currently uses too many tokens because it sends the entire chat. Ideally, a chat title should be generated once, right after the response to the first query, using only that first interaction and a cheap model (GPT-3.5, or something cheaper, e.g. via Replicate).
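One way the proposed behavior could look, as a minimal sketch rather than the project's actual code: the `Message` type, the `MAX_CHARS` cap, the prompt wording, and all function names below are hypothetical, and `complete` stands in for whatever cheap model call the project ends up using (it is assumed to take a prompt string and return the model's text).

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

MAX_CHARS = 500  # hypothetical per-message cap to bound token use


@dataclass
class Message:
    role: str     # "user" or "assistant"
    content: str


def first_exchange(messages: List[Message]) -> Optional[List[Message]]:
    """Return [first user message, first assistant reply after it], or None."""
    user = None
    for m in messages:
        if user is None:
            if m.role == "user":
                user = m
        elif m.role == "assistant":
            return [user, m]
    return None  # the first response has not arrived yet


def build_title_prompt(messages: List[Message]) -> Optional[str]:
    """Build a prompt from the first exchange only, never the whole chat."""
    pair = first_exchange(messages)
    if pair is None:
        return None
    user, assistant = pair
    return (
        "Write a short title (at most 6 words) for this conversation.\n"
        f"User: {user.content[:MAX_CHARS]}\n"
        f"Assistant: {assistant.content[:MAX_CHARS]}\n"
        "Title:"
    )


def maybe_generate_title(
    messages: List[Message],
    existing_title: Optional[str],
    complete: Callable[[str], str],  # assumed wrapper around a cheap model
) -> Optional[str]:
    """Generate a title once, after the first response; otherwise keep the old one."""
    if existing_title:
        return existing_title  # already titled: no further model calls
    prompt = build_title_prompt(messages)
    if prompt is None:
        return None
    # Trim whitespace and surrounding quotes that models often add.
    return complete(prompt).strip().strip('"')
```

Passing the model call in as `complete` keeps the sketch independent of any particular provider, so the same logic would apply whether the cheap model is GPT-3.5 or something hosted elsewhere. Because the prompt is built from the first exchange only, its size stays bounded no matter how long the chat grows.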