issues
search
anoopkunchukuttan
/
indic_nlp_library
Resources and tools for Indian language Natural Language Processing
http://anoopkunchukuttan.github.io/indic_nlp_library/
MIT License
549
stars
160
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Modified sentence_tokenize to handle tokeniztion of sentence which ends with numerics.
#72
varunkatiyar819
opened
3 months ago
0
Sentence tokenizer creating issue while splitting for end of the sentence.
#71
varunkatiyar819
opened
3 months ago
0
Add '|' as delimiter
#70
rumourscape
opened
5 months ago
0
Better tokenization of numbers needed #40: Resolved
#69
prassr
opened
6 months ago
0
Integrated UrduHack and IndicNLP Resources directly into the module
#68
VarunGumma
opened
9 months ago
2
Add more NLP models list
#67
rajveer43
opened
11 months ago
0
Bad sentence splitting performance on flores 200 hindi language
#66
asusdisciple
opened
1 year ago
1
Fixed sentence tokenization, added heuristic for emails, urls.
#65
PranjalChitale
closed
1 year ago
3
Added version.txt file to package_data in call to setuptools.setup
#64
Ubadub
opened
1 year ago
3
Making indic-nlp-library available via Conda Forge
#63
Ubadub
opened
1 year ago
4
Broken "Getting Started" links
#62
Rhitabrat
closed
1 year ago
2
Add Indic NLP Resources as submodule
#61
ProgramComputer
opened
1 year ago
0
AttributeError: 'NoneType' object has no attribute 'iloc'
#60
AlvinKimata
closed
1 year ago
1
Fix issue with Urduhack imports and add punctuations to support sentence tokenizer for Meitei and Ol Chiki script
#59
jaygala24
closed
1 year ago
0
Unable to do transliteration using BrahmiNet REST API
#58
015itachiucchiha
closed
1 year ago
1
BrahmiNet is down
#57
ma08
closed
1 year ago
2
make sphinx an optional dependency
#56
gwenzek
opened
2 years ago
0
ImportError: No module named indicnlp.common
#55
A-d-DASARE
opened
2 years ago
0
Text Normalisation using Indic NLP library not working
#54
lusifer021
closed
1 year ago
2
ABNF grammer rule implementation for Indic language
#53
shubham303
opened
2 years ago
0
Inappropriate Hindi English Transliteration
#52
Sonali210
closed
1 year ago
1
Make a kaggle dataset to use this library in the inferece of a kaggle competetion
#51
I-am-sayantan
opened
2 years ago
1
Schwa deletion in romanization for Hindi
#50
anilkumar911
opened
3 years ago
0
Fix for Urdu Normalizer imports
#49
neerajchhimwal
closed
1 year ago
0
Is translate function available?
#48
udaykumar1998
closed
1 year ago
1
installation of latest version not working correctly
#47
m-sean
closed
3 years ago
3
Gokul changes
#46
anoopkunchukuttan
closed
3 years ago
0
sentence_split missing all_script_phonetic_data.csv
#45
attardi
opened
3 years ago
0
AttributeError: 'NoneType' object has no attribute 'iloc'
#44
arunbaby0
opened
3 years ago
3
Placement of Anuswara
#43
shantanuo
opened
3 years ago
2
Script conversion of danda and double danda
#42
anoopkunchukuttan
closed
3 years ago
1
Text Normalisation
#41
ShubhamKumarNigam
closed
3 years ago
4
Better tokenization of numbers needed
#40
anoopkunchukuttan
opened
3 years ago
1
[Tokenizer] Fixes in Sentence Splitter
#39
GokulNC
closed
3 years ago
0
Issue in Romanization
#38
Sreelakshmi-k
opened
3 years ago
6
Wrong sentence tokenization of sentences with quotes
#37
GokulNC
opened
3 years ago
0
Undo wrong Moses tokenization
#36
anoopkunchukuttan
opened
3 years ago
1
Transliteration not working
#35
RaviTeja51
closed
3 years ago
6
vectors for SOS and EOS
#34
samyakag
closed
3 years ago
3
Detect the language of transliterated text
#33
bnriiitb
closed
3 years ago
2
loaderload() fails in latest pandas
#32
sayanb-7c6
closed
4 years ago
4
bug dealing with pandas new version, .ix removed with .loc and .iloc
#31
neerajvashistha
closed
4 years ago
1
Preserve abbreviation punctuation for Tokenization & adding more abbreviations for Sentence Splitting
#30
rhn19
opened
4 years ago
0
get_normalizer() takes 2 positional arguments but 3 were given
#29
supreet21
opened
4 years ago
2
CLI parser: BrokenPipeError: [Errno 32] Broken pipe
#28
anoopkunchukuttan
opened
4 years ago
0
Change Romanizer/Indicizer implementation
#27
anoopkunchukuttan
closed
4 years ago
3
Unable to do Machine Translation
#26
aastha19
closed
3 years ago
4
Computing similarity between languages
#25
VP007-py
closed
4 years ago
4
stdin/stdout versions of tokenizer and detokenizer. breaks python2 co…
#24
bhaddow
closed
4 years ago
2
Enable setup.py script from open-tamil template.
#23
arcturusannamalai
closed
4 years ago
2
Next