The output of my fine-tuned Mistral model ends abruptly, and I would ideally like it to finish the paragraph, sentence, or code block it was in the middle of.
I have set max_new_tokens = 300 and I also ask in the prompt to limit the response to 300 words, but the response is always long and ends abruptly. Is there any way to get a complete output within the desired number of output tokens?
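For context, one post-processing workaround I've been considering (not part of my current setup, just a sketch) is to generate with a slightly larger max_new_tokens and then trim the decoded text back to the last complete sentence. `trim_to_last_sentence` below is a hypothetical helper, not anything from the transformers API:

```python
import re

def trim_to_last_sentence(text: str) -> str:
    """Cut generated text back to the last sentence-ending punctuation mark."""
    # Find every ., !, or ? that is followed by whitespace or end-of-string.
    matches = list(re.finditer(r'[.!?](?=\s|$)', text))
    if not matches:
        return text  # no sentence boundary found; return unchanged
    return text[:matches[-1].end()]

raw = "The model explains the concept fully. Then it starts another idea but is cut"
print(trim_to_last_sentence(raw))  # → The model explains the concept fully.
```

This obviously wastes some tokens and doesn't help when the output is cut mid-code-block, so I'd prefer a generation-side solution if one exists (e.g. a stopping criterion that only fires at a sentence boundary).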
Here is the GenerationConfig I am using: