taranjeet / hindi-tokenizer

This is a package in Python which implements a tokenizer, stemmer for Hindi language
91 stars 75 forks source link

File is written in python 2.x ,so it is not working correctly in python 3.x #5

Open Bipulkumarsingh opened 5 years ago

Bipulkumarsingh commented 5 years ago

i am working on Hindi sentimental analysis , so need to stem Hindi .. but this file have few's issue with python 3.x

pavanKulkarni2000 commented 4 years ago

same here, but just a couple of modifications of print, encode, decode functions will get it to work

swatisharan commented 4 years ago

Hello Can you please explain how to solve this issue. I have downloaded the hindi-tokenizer-master from here but don't know how to proceed further. I am new to this field. Your help will be very much appriciated.

pavanKulkarni2000 commented 4 years ago

I use spider, it suggests you all the errors properly and incase you can't directly understand it, just Google the same

pramod-mamidi commented 4 years ago

Hello Can you please explain how to solve this issue. I have downloaded the hindi-tokenizer-master from here but don't know how to proceed further. I am new to this field. Your help will be very much appriciated.

small changes will make it work since decode function is not defined for python 3. this can be used. tokenizer.zip

and the rss.txt file is the stopwords.txt file