Closed mikethetexan closed 7 months ago
You're right, it's a drawback, and tag insertion is one of the priorities for future development. There is actually already a feature in the fine-tuning functionality which can be used to automatically insert tags in Studio. The feature is very simplistic, as it assumes that tags are in the same order in the source as they are in the target, but it's still useful with many documents.
When you fine-tune, there are these two checkboxes:
If you check those boxes, the fine-tuning functionality will include tags as text in the fine-tuning bitext, which means that the fine-tuned model will be trained to place tags present in the source segment to the target segment. I just tested that the feature works with TMX fine-tuning, and seems to have an effect even when trained with just 2000 sentence pairs (but more the better):
I actually added this feature a long time ago for a specific translation job where there was a lot of tags (I do freelance translation when I can find the time), and it was very useful for that job. Once I have the time, I'll develop this into a more sophisticated tag insertion method.
Inline tag restoration has been added to the Trados plugin (long time ago, actually)
See the comparison of the Opus CAT plugin vs the DeepL or Google plugin. Not supporting inline tags is a big drawback for technical documentation (such as DITA based) which is typically full of them.