epiverse-trace / blueprints

Software development blueprints for epiverse-trace
https://epiverse-trace.github.io/blueprints

Web Scraping & Data access/storage #19

Open juan-umana opened 1 year ago

juan-umana commented 1 year ago

Hi everyone. We'd like to start a discussion on web scraping and data access for official websites, in our case from Colombia. Epidemiological data is stored on the SIVIGILA site, which cannot be reached from some countries abroad (e.g. Canada) because a "connection time out" error occurs. This example motivates us to consider some kind of local server/website to store the data (legal issues must be addressed), or a service that redirects queries and acts as a VPN. We initially thought of shipping preloaded datasets within the library, but they are too large. What are your thoughts on this idea? Or how do you think we can ensure data access for potential users?

Bisaloo commented 1 year ago

Thanks for opening this issue!

As you mention, there is a combination of technical and legal issues. Here are the different options I see, in order of preference: