issues
search
rshchekotov
/
whats-this
Text Classification Tool
GNU General Public License v3.0
1
stars
0
forks
source link
Text analysis
#5
Closed
Madcap3000
closed
1 year ago
Madcap3000
commented
1 year ago
change unknown words to be depicted as \<UNK>
change typical parts to be depicted uniformly (\<IMG>, \<EMAIL>, \<NUMBER>, \<FORMULA>,...)
Text Data Type Fields:
Vector
coefficient (combine unknown ascii signs, amount of modified characters, images)
Text Data Type Fields: