AndyTheFactory / RO-Diacritics

Python package for Romanian diacritics restoration
MIT License
4 stars 0 forks source link
bert diacritics diacritics-removal diacritics-restoration nlp romanian romanian-bert romanian-diacritics-restoration romanian-language transformers transformers-models

RO Diacritics module

RO Diacritics is a straightforward diacritics restoration module for Romanian Language

from ro_diacritics import restore_diacritics
print(restore_diacritics("fara poezie, viata e pustiu"))

or correcting a pandas dataframe:

from ro_diacritics import restore_diacritics
df['text-diacritice'] = df['text'].apply(restore_diacritics)

Installing

$ python -m pip install ro-diacritics

or

$ pip install ro-diacritics

Requirements

References