WormBase / wormicloud

Interaction data analysis with word clouds
2 stars 0 forks source link

Create custom tokenizer to extract words without splitting gene names #15

Closed valearna closed 4 years ago

valearna commented 4 years ago

the current word tokenizer treats dashes as word delimiters and gene names are being split (e.g., daf-2)

valearna commented 4 years ago

The new tokenizer works, but it needs some rework. @wbjae can you add some examples?