EticaAI / HXL-Data-Science-file-formats

Common file formats used for Data Science and language localization exported from (and to) HXL (The Humanitarian Exchange Language)
https://hdp.etica.ai/
The Unlicense
3 stars 1 forks source link

HXLTabConverter common class #8

Closed fititnt closed 3 years ago

fititnt commented 3 years ago

See:


(As expected) both hxl2tab and hxlquickimporttab are starting to share common code. Also, while the one-liner Orange Data Mining format is actually very similar to HXL itself (sometimes it just add an extra 2 chararacters before the base hashtag, but uses tab instead of comma) if the user is importing back one format already previously saved on the orange data mining, it uses the non-compact format. So some very basic functionality may need to not only export HXL to .tab , but import back.

The initial idea of HXLTabConverter is a move the already existing functionality of export/import to a single class (even if, for sake of simplicity for one-file executable scripts, for now just duplicate the code).

Compared to the base libhxl-python (https://github.com/HXLStandard/libhxl-python), one downside of the HXLTabConverter is that actually may have to implement some of the schema on the code itself instead of use an external schema in special because for sake of simplicity, it would have to make inferences on the type of some hashtags+attributes without explicity attributes understood by the HXLTabConverter

fititnt commented 3 years ago

I will close this issue. Maybe reopen if necessary, but most of the work would be done on the HXLMeta (Usable Class) #9