anoopkunchukuttan / indic_nlp_library

Resources and tools for Indian language Natural Language Processing

http://anoopkunchukuttan.github.io/indic_nlp_library/

MIT License

549 stars, 160 forks

issues

Modified sentence_tokenize to handle tokenization of sentences that end with numerics.

#72 varunkatiyar819 opened 3 months ago
0
Sentence tokenizer creates issues when splitting at the end of a sentence.

#71 varunkatiyar819 opened 3 months ago
0
Add '|' as delimiter

#70 rumourscape opened 5 months ago
0
Better tokenization of numbers needed #40: Resolved

#69 prassr opened 6 months ago
0
Integrated UrduHack and IndicNLP Resources directly into the module

#68 VarunGumma opened 9 months ago
2
Add more NLP models list

#67 rajveer43 opened 11 months ago
0
Bad sentence splitting performance on flores 200 hindi language

#66 asusdisciple opened 1 year ago
1
Fixed sentence tokenization, added heuristic for emails, urls.

#65 PranjalChitale closed 1 year ago
3
Added version.txt file to package_data in call to setuptools.setup

#64 Ubadub opened 1 year ago
3
Making indic-nlp-library available via Conda Forge

#63 Ubadub opened 1 year ago
4
Broken "Getting Started" links

#62 Rhitabrat closed 1 year ago
2
Add Indic NLP Resources as submodule

#61 ProgramComputer opened 1 year ago
0
AttributeError: 'NoneType' object has no attribute 'iloc'

#60 AlvinKimata closed 1 year ago
1
Fix issue with Urduhack imports and add punctuations to support sentence tokenizer for Meitei and Ol Chiki script

#59 jaygala24 closed 1 year ago
0
Unable to do transliteration using BrahmiNet REST API

#58 015itachiucchiha closed 1 year ago
1
BrahmiNet is down

#57 ma08 closed 1 year ago
2
make sphinx an optional dependency

#56 gwenzek opened 2 years ago
0
ImportError: No module named indicnlp.common

#55 A-d-DASARE opened 2 years ago
0
Text Normalisation using Indic NLP library not working

#54 lusifer021 closed 1 year ago
2
ABNF grammar rule implementation for Indic language

#53 shubham303 opened 2 years ago
0
Inappropriate Hindi English Transliteration

#52 Sonali210 closed 1 year ago
1
Make a Kaggle dataset to use this library for inference in a Kaggle competition

#51 I-am-sayantan opened 2 years ago
1
Schwa deletion in romanization for Hindi

#50 anilkumar911 opened 3 years ago
0
Fix for Urdu Normalizer imports

#49 neerajchhimwal closed 1 year ago
0
Is translate function available?

#48 udaykumar1998 closed 1 year ago
1
installation of latest version not working correctly

#47 m-sean closed 3 years ago
3
Gokul changes

#46 anoopkunchukuttan closed 3 years ago
0
sentence_split missing all_script_phonetic_data.csv

#45 attardi opened 3 years ago
0
AttributeError: 'NoneType' object has no attribute 'iloc'

#44 arunbaby0 opened 3 years ago
3
Placement of Anuswara

#43 shantanuo opened 3 years ago
2
Script conversion of danda and double danda

#42 anoopkunchukuttan closed 3 years ago
1
Text Normalisation

#41 ShubhamKumarNigam closed 3 years ago
4
Better tokenization of numbers needed

#40 anoopkunchukuttan opened 3 years ago
1
[Tokenizer] Fixes in Sentence Splitter

#39 GokulNC closed 3 years ago
0
Issue in Romanization

#38 Sreelakshmi-k opened 3 years ago
6
Wrong sentence tokenization of sentences with quotes

#37 GokulNC opened 3 years ago
0
Undo wrong Moses tokenization

#36 anoopkunchukuttan opened 3 years ago
1
Transliteration not working

#35 RaviTeja51 closed 3 years ago
6
vectors for SOS and EOS

#34 samyakag closed 3 years ago
3
Detect the language of transliterated text

#33 bnriiitb closed 3 years ago
2
loader.load() fails in latest pandas

#32 sayanb-7c6 closed 4 years ago
4
Bug with new pandas version: .ix replaced with .loc and .iloc

#31 neerajvashistha closed 4 years ago
1
Preserve abbreviation punctuation for Tokenization & adding more abbreviations for Sentence Splitting

#30 rhn19 opened 4 years ago
0
get_normalizer() takes 2 positional arguments but 3 were given

#29 supreet21 opened 4 years ago
2
CLI parser: BrokenPipeError: [Errno 32] Broken pipe

#28 anoopkunchukuttan opened 4 years ago
0
Change Romanizer/Indicizer implementation

#27 anoopkunchukuttan closed 4 years ago
3
Unable to do Machine Translation

#26 aastha19 closed 3 years ago
4
Computing similarity between languages

#25 VP007-py closed 4 years ago
4
stdin/stdout versions of tokenizer and detokenizer. breaks python2 co…

#24 bhaddow closed 4 years ago
2
Enable setup.py script from open-tamil template.

#23 arcturusannamalai closed 4 years ago
2