Closed DKNTZMN closed 1 week ago
Please add a support of in-app flash attention option for the model. Current model is spitting nonsense while running without flash attention. Thank you.
You can enable it in the json chat settings file I’m pretty sure.
Please add a support of in-app flash attention option for the model. Current model is spitting nonsense while running without flash attention. Thank you.