GDGCVITBHOPAL / ML-Reserve

An Open-Source repository where students could showcase their skills by contributing their ML and DL projects!
MIT License
35 stars 44 forks source link

Hindi Tokenizer-Apoorva57 #110

Closed Apoorva57 closed 2 years ago

Apoorva57 commented 2 years ago

Tokenization is a simple process that takes raw data and converts it into a useful data string. The proposed model turns any Hindi sentence into tokens and also helps to detect usage of any language used other than Hindi. @HemanthSai7 Please review this PR (Pull Request) and label this PR as "hacktoberfest-accepted" and "hacktoberfest-2022".