Open lifeiteng opened 1 year ago
The dataset generation pipeline contains some steps that are not 100% reversible, so currently I'm afraid the answer is no.
I would be also interested in this. Has perhaps anybody tried to produce the reverse normalization? Should be easily doable with some LLM.
Can you provide "text_raw" information? text_raw contains richer text information.