Open sgsmob opened 4 years ago
Was curious and did an inventory of data fetching methods for reference.
requests
paramiko
paramiko
)covidcast.signal
sodapy
to access Socrata APIimap_tools
aws s3 sync
+subprocess
command@mariajahja do you have details on how claims_hosp
fetches source data? Is it a separate process that pulls from a magic email address? It might be good to normalize that with how quidel
does it.
@krivard It’s a separate python script (using paramiko) that pulls from a private Delphi ftp server. HSP deposits them there directly, and a bigchunk machine downloads it locally, runs the sensor update, then deletes the copy.
If it’s helpful, back when HSP used email @korlaxxalrok created a script to pull that data.
There are many different ways the indicators fetch remote data (email, AWS S3, cURL, sftp, etc.) and each one does so in a different spot. It would be helpful to have some regularized interface for data fetching (say, an
AbstractDataFetcher
class from whichEmailDataFetcher
, etc. inherit) as part of all indicators to consistently and clearly handle data fetching and any associated failures.