EFord36/normalise
A module for normalising text.
GNU General Public License v3.0
173 stars
33 forks
issues
#125: Eas fixes (esrel, closed, 3 years ago, 0 comments)
#124: Module not found 'sklearn.semi_supervised.label_propagation' (dimaelzein, opened, 3 years ago, 6 comments)
#123: Normalizing the text often removes the last word of one sentence (121898, closed, 3 years ago, 2 comments)
#122: Warning: Careful using a custom tokenizer... (PetrochukM, opened, 4 years ago, 0 comments)
#121: IndexError: list index out of range (NouamaneTazi, opened, 4 years ago, 0 comments)
#120: UserWarning re: LabelPropagation (bbookman, opened, 4 years ago, 1 comment)
#119: FutureWarning re: sklearn.semi_supervised.label_propagation (bbookman, opened, 4 years ago, 2 comments)
#118: Add functionality to be able disable modules (mbalatsko, closed, 3 months ago, 0 comments)
#117: wrong normalization (cmatosve, closed, 5 years ago, 2 comments)
#116: Squared/cubed symbols deleted, eg. 20cm² -> 'twenty' (emmaflint27, closed, 7 years ago, 0 comments)
#115: US and international phone numbers, eg. +44 (0)1223 760812, (905) 513-7480 (emmaflint27, opened, 7 years ago, 0 comments)
#114: Roman numerals, eg. Pope Leo X, Henry VIII, Elizabeth II (emmaflint27, closed, 7 years ago, 0 comments)
#113: Unable to expand scientific formats eg. 4.321768×10^3, 10^−27, −5.3×10^4, get deleted (emmaflint27, opened, 7 years ago, 0 comments)
#112: '80km' -> 'eighty andeighty', getting tagged as NUMB not SPLT + incorrect expansion (emmaflint27, closed, 7 years ago, 0 comments)
#111: '°C' gets split and incorrectly expanded to 'degrees century' (emmaflint27, closed, 7 years ago, 0 comments)
#110: Error in expansion of measurement abbrevs with exponents that have not been superscripted, eg. km2, cm3 (emmaflint27, closed, 7 years ago, 0 comments)
#109: User abbreviation not working properly (javidalkaruzi, closed, 5 years ago, 4 comments)
#108: Command line improvements (EFord36, closed, 8 years ago, 0 comments)
#107: Updated version number (EFord36, closed, 8 years ago, 0 comments)
#106: Updated command line tool (EFord36, closed, 8 years ago, 1 comment)
#105: Command line usage could allow multiple files (EFord36, closed, 8 years ago, 1 comment)
#104: Command line usage could allow custom abbrevs in specified file (EFord36, closed, 8 years ago, 1 comment)
#103: 11/04/1996 not tagged as NDATE (EFord36, closed, 8 years ago, 1 comment)
#102: tokenize_basic fails with newline (EFord36, closed, 8 years ago, 0 comments)
#101: Issue with NDATE expansion (EFord36, closed, 8 years ago, 1 comment)
#100: Pickle files won't load because of directory issues (EFord36, closed, 8 years ago, 1 comment)
#99: Tokenizer deletes final word if it ends with '!' (and presumably '?') (EFord36, closed, 8 years ago, 1 comment)
#98: gen_sig speed improvement and rude dict (EFord36, closed, 8 years ago, 0 comments)
#97: Possibility of print statements on running normalisation.py (EFord36, closed, 8 years ago, 1 comment)
#96: Add command line functionality to module (EFord36, closed, 8 years ago, 1 comment)
#95: tokenize_basic fails with brackets (EFord36, closed, 8 years ago, 1 comment)
#94: Ready readme for public release (EFord36, closed, 8 years ago, 0 comments)
#93: Fixes #92 (EFord36, closed, 8 years ago, 0 comments)
#92: crashes when trying to split (EFord36, closed, 8 years ago, 1 comment)
#91: Add print statements to normalisation (EFord36, closed, 8 years ago, 2 comments)
#90: "04:00GMT" tagged as SPLT but doesn't split? (emmaflint27, closed, 8 years ago, 0 comments)
#89: Delete all spyder created strings in files (EFord36, closed, 8 years ago, 0 comments)
#88: Introduce testing (EFord36, opened, 8 years ago, 0 comments)
#87: Add support for emails (EFord36, closed, 8 years ago, 0 comments)
#86: Lack of data for NDATE, NTEL, NSCI (emmaflint27, closed, 8 years ago, 0 comments)
#85: Street vs Saint (EFord36, closed, 8 years ago, 0 comments)
#84: Tidying (EFord36, closed, 8 years ago, 0 comments)
#83: Tokenizer and api (EFord36, closed, 8 years ago, 0 comments)
#82: Added NSCI tag (emmaflint27, closed, 8 years ago, 0 comments)
#81: WDLK expansions frequently very incorrect (EFord36, opened, 8 years ago, 0 comments)
#80: Abbreviations that aren't titlecase are tagged as LSEQ (EFord36, closed, 8 years ago, 0 comments)
#79: class_ALPHA improvements (emmaflint27, closed, 8 years ago, 0 comments)
#78: Fixes #37 (EFord36, closed, 8 years ago, 0 comments)
#77: Bug fixes (EFord36, closed, 8 years ago, 0 comments)
#76: Reorganisation (EFord36, closed, 8 years ago, 0 comments)