anoopkunchukuttan / indic_nlp_library

Resources and tools for Indian language Natural Language Processing
http://anoopkunchukuttan.github.io/indic_nlp_library/
MIT License
546 stars 158 forks source link

Change Romanizer/Indicizer implementation #27

Closed anoopkunchukuttan closed 4 years ago

anoopkunchukuttan commented 4 years ago

The current romanizer/indicizer implementation is based on Alan Little's code. This worked for Devanagari alone and some retrofitting had been done to make it work for other languages. Now doing an implementation from scratch for ITRANS.

anoopkunchukuttan commented 4 years ago

Comparison between old and new romanizer

anoopkunchukuttan commented 4 years ago

Comparing old and new indicizers

anoopkunchukuttan commented 4 years ago

Completed and merged