Closed skhorasganiTT closed 1 month ago
max_model_len
max_num_batched_tokens
scheduler_config.max_num_seqs
max_model_len
andmax_num_batched_tokens
to 128*1024 to allow running larger seq lensscheduler_config.max_num_seqs