EticaAI / hxltm

HXLTM - Multilingual Terminology in Humanitarian Language Exchange. TBX, TMX, XLIFF, UTX, XML, CSV, Excel XLSX, Google Sheets, (...)
https://hxltm.etica.ai
The Unlicense

Test viability of abstracting even further the mappings between XML-like formats and the HXL tabular format, using XML nested tags (vs HXL attributes) as base and concept/language/term as baseline. #10

Open fititnt opened 3 years ago

fititnt commented 3 years ago

Test what the title says.

I'm not sure this is possible without making things more complicated for the end user, but I think it may be viable to make the mappings a bit less repetitive.

The first problem, how to organize the rows of a table into concepts in XML (a concept could span several rows, with relationships between them), needs a lot of creativity. The second (likely already solved) is how to generalize language code parsing. Then there are the terms.

But beyond these three big groups, I'm starting to think that the additional data attached to them (which today, following the same logic, requires adding more lines of Python) may actually be worth abstracting.

One way to generalize such an idea

Note: it is obviously possible to add more semantics by adding more lines of Python. The point here is to make the ontologia even more powerful.

In its current state, the way things are represented in HXLTM's tabular format (aka HXL, with some extra attributes) could be ported to a direct mapping. For example, when reading from XML (it could be JSON or YAML, but we would need to avoid relying on powerful XML features that are not portable), we already know whether we are at the concept, language or term level. So, in theory, the generalization here would be some XML tag (that could appear at any of the 3 levels) with an attribute that tells how to change the strategy for the inner XML tags (or, for HXL in tabular format, the additional attributes).
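
A rough sketch of that idea in Python, only to make it concrete. Everything here is an assumption made up for illustration: the `<meta>` tag, its `kind` attribute, the `id`/`code`/`value` attribute names and the sample data are not part of HXLTM, TBX, TMX or XLIFF. The point is only that one generic rule can collect the extra data at all three levels, without one more branch of Python per new kind of data.

```python
import xml.etree.ElementTree as ET

# Hypothetical input: <meta kind="..."> is an invented generic tag that may
# appear under <concept>, <language> or <term>. Not a real HXLTM/TBX schema.
SAMPLE = """
<concept id="C1">
  <meta kind="definition">Place to sleep</meta>
  <language code="eng-Latn">
    <meta kind="note">Reviewed</meta>
    <term value="shelter">
      <meta kind="status">preferred</meta>
    </term>
  </language>
  <language code="por-Latn">
    <term value="abrigo"/>
  </language>
</concept>
"""

# The three baseline levels named in this issue.
LEVELS = ("concept", "language", "term")


def walk(node, context=()):
    """Yield (level, context, kind, text) for every <meta> found.

    `context` accumulates the keys of the enclosing levels, so the same
    rule works whether we are at concept, language or term level.
    """
    key = node.get("id") or node.get("code") or node.get("value")
    context = context + (key,)
    # Extra data attached directly to this level.
    for meta in node.findall("meta"):
        yield node.tag, context, meta.get("kind"), (meta.text or "").strip()
    # Recurse only into the known structural levels.
    for child in node:
        if child.tag in LEVELS:
            yield from walk(child, context)


rows = list(walk(ET.fromstring(SAMPLE)))
for row in rows:
    print(row)
```

Each resulting tuple already carries its level and its concept/language/term keys, so in principle it could be turned into an extra column (or extra attribute) on the tabular side. How those columns would be named in HXL is left open here.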

Some disadvantages

Some advantages: