code-for-venezuela / c4v-py


Feature/vocab generator #32

Closed marianelamin closed 3 years ago

marianelamin commented 4 years ago

@dieko95 and @Edilmo please have a look at tests/data/test_data_cleaner.py and feel free to add any edge case that I might be forgetting.

There are two methods I still need to test; they are commented out.

Cheers,

marianelamin commented 3 years ago

Created a new PR with only the data cleaner class. Byte pair encoding will be PR'd if needed. In the meantime, I will go ahead and close this PR.