marvin-roesch / book-search

MIT License

terms lists #4

Open fbstj opened 5 years ago

fbstj commented 5 years ago

I remember you mentioning scraping the books for non-dictionary words. I have done similar things to work out which topics need wiki articles, and I am very interested in building improved tooling around this for the new search.

in particular I'd be interested in a persistent, manually curated term list: we scrape a (new or existing?) book for ALL its words, then curate that list by 'ignoring' any dictionary words and 'accepting' any terms that already exist, and finally link the rest to search results
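
A rough sketch of that triage step, in Python purely for illustration (the project's own language may differ). The names `extract_words`, `triage`, `dictionary`, and `accepted` are hypothetical and not part of book-search, and reading 'already existing' as 'already in the curated list' is an assumption:

```python
import re
from collections import Counter

# Hypothetical tokenizer: a word starts with a letter and may contain
# apostrophes or hyphens. Real book text may need something smarter.
WORD_RE = re.compile(r"[A-Za-z][A-Za-z'-]*")


def extract_words(text):
    """Count every word in the text, lower-cased."""
    return Counter(w.lower().rstrip("'-") for w in WORD_RE.findall(text))


def triage(text, dictionary, accepted):
    """Split a book's words into three buckets.

    dictionary: set of ordinary words to ignore (assumed lower-case)
    accepted:   set of terms already in the curated list (assumed lower-case)
    Anything in neither set is 'pending' and still needs a manual decision.
    """
    buckets = {"ignored": {}, "accepted": {}, "pending": {}}
    for word, count in extract_words(text).items():
        if word in accepted:
            bucket = "accepted"
        elif word in dictionary:
            bucket = "ignored"
        else:
            bucket = "pending"
        buckets[bucket][word] = count
    return buckets
```

The 'pending' bucket is what would get linked to search results for manual review; persisting the curated sets (for example as a JSON file or a database table) is left out here.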

it would also be cool to loosely connect this to the wiki so that we have an up-to-date list of what is missing from the wiki
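
The 'missing from the wiki' list could be as simple as a set difference, sketched below in Python for illustration only. `missing_from_wiki` is a hypothetical helper, and the sketch assumes the wiki's article titles have already been fetched somehow and are passed in as plain strings:

```python
def normalize(title):
    """Compare titles case-insensitively, treating underscores as spaces."""
    return title.replace("_", " ").strip().casefold()


def missing_from_wiki(terms, wiki_titles):
    """Return the curated terms that have no matching wiki article title."""
    existing = {normalize(t) for t in wiki_titles}
    return sorted(t for t in terms if normalize(t) not in existing)
```

Redirects and disambiguation pages would need extra handling that this sketch ignores.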

marvin-roesch commented 5 years ago

I've already implemented the dictionary building as an API endpoint, which yields quite promising results so far. However, we do have to come up with a nice interface for it.