MicrosoftTranslator / DocumentTranslator-Legacy

Microsoft Document Translator (Archive) - Replaced by the MicrosoftTranslator/DocumentTranslation project in this repository.
Other
409 stars 152 forks source link

Support arbitrary XML translation #73

Open chriswendt1 opened 5 years ago

chriswendt1 commented 5 years ago

Many customers need to translate custom XML. Not following a standardized schema. Translator will need to know which elements break sentences and which elements are intra-sentential, information that won't be in a DTD. In the ago of AI we should be able to make a decent guess by just looking at how text flows around the elements. Or, simpler to implement, allow customer to supply a list of intra-sentential element names, considering the unlisted element names extra-sentential. I think that'll be all that is needed as a starter.