tldr-pages / tldr-translation-pairs-gen

Generates a structured dataset in various formats derived from tldr-pages.
https://opus.nlpl.eu/tldr-pages/corpus/version/tldr-pages
MIT License
4 stars 3 forks source link

Include translatable tokens in dataset #3

Open SethFalco opened 1 year ago

SethFalco commented 1 year ago

In commands, tokens can be translated.

We can index and pull out the tokens from each command string, and include those mappings in the output as well.

Example

- Add a new task to the work list:

`todo new {{thing_to_do}} --list {{work}}`
- Aggiungere un nuovo task alla lista delle cose da fare per lavoro:

`todo new {{cose_da_fare}} --list {{lavoro}}`