Niger-Volta-LTI / iranlowo

Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo
MIT License
17 stars 8 forks source link

Preprocessing helper #10

Closed Olamyy closed 5 years ago

Olamyy commented 5 years ago

I don't know how often we'll need to repeat https://github.com/Niger-Volta-LTI/yoruba-text/issues/16 but I wrote a helper for it. The helper is in a preprocessing module with a use case that should be somewhat similar to:

from iranlowo import preprocessing

>>> preprocessing.is_valid_owé_format('text or filepath', sep=4) 
>>> preprocessing.convert_to_owé_format('path. Could be csv or zipped', sep=4, outpath=[yo_path, en_path])
Olamyy commented 5 years ago

The test I wrote for this seem to be failing. I'll look at it once I get in tonight and fix whatever the issue is.