Open anoopsingh opened 6 years ago
Believe it or not, I have been working on trying to fix this in my spare time. To fix it generally – not just for these 2 cases – seems harder than one might hope, in part because numbers are very common and symbols are quite rare in the data…. Maybe more later.
Thanks @manning
If any sentence has ????? or **** core\nlp tokenizes it and identifies it as Number, which should not happen.