Closed Minami-su closed 8 months ago
Such as top_p,top_k,temperature,repetition_penalty,do_sample,beam_search,etc
Hello! The FastChat Chat CLI has now been integrated. You can check the generation parameters in inf_llm/chat.py. Beam Search requires duplication of the KV Cache, which is not supported at the moment.
inf_llm/chat.py
Such as top_p,top_k,temperature,repetition_penalty,do_sample,beam_search,etc