nsaef / leichte-sprache

Create a dataset and train an LLM to translate standard German into Leichte Sprache
0 stars 0 forks source link

Define parallel dataset format #5

Closed nsaef closed 5 months ago

nsaef commented 6 months ago

Define the format for the parallel dataset. Minimum per line:

More data, such as the craswling date, may be desirable (if available).