Generating and Parsing requirements strictness clarification

I think the spec should be strict, i. e. clear and unambiguous. Implementation, i. e. when finetuning the model based on data that's formatted accordingly, it would probably make sense to add a percentage of differently-formatted examples to get the model to be able to deal with that, too.

I'm convinced that LLMs understand even terribly misspelled and grammatically incorrect input because of them being pretrained on the whole Internet where there's all kinds of weird data, but still the majority is proper spelling and grammar, so the model knows what's right and at the same time understands what you mean. I'd expect the same to hold true regarding the finetuning data.

cognitivecomputations / OpenChatML

Generating and Parsing requirements strictness clarification #2