DataBiosphere / data-explorer-indexers

BSD 3-Clause "New" or "Revised" License

Use simple analyzer on indexes. #135

Closed. wnojopra closed this 5 years ago

wnojopra commented 5 years ago

Using the simple analyzer makes the tokenizer split on any non-letter character, including underscores and digits, and lowercases each token.
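To make the tokenization concrete, here is a rough Python sketch that imitates what Elasticsearch's built-in `simple` analyzer does: break the text at every non-letter character and lowercase the resulting tokens. The function name `simple_analyze` and the sample strings are made up for illustration and are not taken from this PR. The `index_settings` dict shows one way an index could make `simple` its default analyzer. It is an assumption about the setup, not necessarily how this PR configures it.

```python
def simple_analyze(text: str) -> list[str]:
    """Approximate Elasticsearch's `simple` analyzer in pure Python.

    Splits at every non-letter character (underscores, digits, spaces,
    punctuation) and lowercases each token.
    """
    tokens: list[str] = []
    current: list[str] = []
    for ch in text:
        if ch.isalpha():
            current.append(ch.lower())
        elif current:
            # A non-letter ends the current token.
            tokens.append("".join(current))
            current = []
    if current:
        tokens.append("".join(current))
    return tokens


# Hypothetical index settings that make `simple` the default analyzer.
index_settings = {
    "settings": {
        "analysis": {
            "analyzer": {
                "default": {"type": "simple"}
            }
        }
    }
}

if __name__ == "__main__":
    # Hypothetical underscore-joined value, similar in shape to a column name.
    print(simple_analyze("highest_degree_PhD"))  # ['highest', 'degree', 'phd']
```

With tokens like these, a query such as `q=phd` can match a single term inside an underscore-joined name, whereas an analyzer that keeps `highest_degree_PhD` as one token would not match.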

q=phd
Before: https://screenshot.googleplex.com/DZCQMNVCm94
After: https://screenshot.googleplex.com/K0N4abC7Up9

q=female
Before: https://screenshot.googleplex.com/ANF6NQw36E6
After: https://screenshot.googleplex.com/7kiQVTQuCma

wnojopra commented 5 years ago

NHS and UKBB are indexing on the second cluster right now. I'll do the same for 1000_genomes, amp_pd, and both baselines.

wnojopra commented 5 years ago

FYI, the second cluster is up for all data explorers, and indexing is complete on all of them.

wnojopra commented 5 years ago

All api servers are re-deployed, and I'm halfway through deleting all the old clusters.