-
There's a "new" project called [Textract](https://github.com/deanmalmgren/textract) which can extract text from many formats. You may want to look into it; it has been getting a lot of attention lately.
-
None of them work now.
-
Reticketed from https://github.com/CivicTechTO/tor-councilmatic/issues/4#issuecomment-196557994
This probably just involves the TMMS committee listing pages, and the agencies website. There might be …
-
Gousto is in some ways an optimal source for recipes. For each recipe (https://www.gousto.co.uk/cookbook/vegetarian-recipes/3-cheese-veg-packed-pasta-bake) they have a public API which provides JSON d…
tboby updated
1 month ago
-
Two top-level tasks here:
- [x] Trawl the internet and find all the available sources.
- [x] Make the scrapers.
I'll develop a list below of all scrapers we want to build.
-
http://elixir-node.cbs.dtu.dk/?page_id=441
There is also this:
http://www.binf.ku.dk/services/
-
http://bioinformaticstraining.pythonanywhere.com/
-
It would be nice to have real-world example JSON files,
together with the directory/file collection that is created by
running a scraper with a given scraperJSON definition file.
That would be helpful to i…
-
### Source information
Novel Updates
### Steps to reproduce
1. Manually refresh library, library category, or individual work.
### Expected behavior
Only new chapters should show up.
### Actua…