SIG: Data Science
Project Name: Engage
Project Description: Forecast COVID-19 impact over time, given new data, by city (primarily US cities).
Project Objective:
Primary: Provide a data science tool to predict how COVID-19 will continue to spread, or how it may die down over time, using weather-related inputs by location.
Secondary: Open source project to train junior data scientists in using tools such as:
Spark (pyspark) for large data handling
Apache Airflow for orchestration
Python visualization using a Python visualization module
Help define a data science pipeline for Headstorm
Create data feed process to read a subset of data from Google public dataset for model training and forecasting #14
Have a process that automatically loads the needed subsets of data from the Google BigQuery public database.
objective:
Anyone running the model should not need to download the data manually; a process should automatically read the data and feed it into the modeling processes.
Have an automated process to load new data into the model's database (PostgreSQL or MySQL); linked to another ticket.
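A minimal sketch of what such a data feed could look like, assuming the `google-cloud-bigquery` client library and credentials configured in the environment. The function names `build_subset_query` and `load_subset`, the `date` column default, and any table or column names passed in are hypothetical and chosen only for illustration; the ticket does not name a specific table.

```python
def build_subset_query(table, columns, date_column="date"):
    """Return a parameterized query selecting `columns` for a date range.

    Assumption: the target table has a DATE column named `date_column`.
    """
    # Reject anything that is not a plain identifier, since column names
    # cannot be passed as query parameters.
    for name in [*columns, date_column]:
        if not name.replace("_", "").isalnum():
            raise ValueError(f"unsafe column name: {name!r}")
    column_list = ", ".join(columns)
    return (
        f"SELECT {column_list} FROM `{table}` "
        f"WHERE {date_column} BETWEEN @start_date AND @end_date"
    )


def load_subset(table, columns, start, end):
    """Run the subset query and return the rows as a pandas DataFrame.

    `start` and `end` are datetime.date values bounding the subset.
    """
    # Imported lazily so the query builder can be used without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("start_date", "DATE", start),
            bigquery.ScalarQueryParameter("end_date", "DATE", end),
        ]
    )
    query = build_subset_query(table, columns)
    return client.query(query, job_config=job_config).to_dataframe()
```

The resulting DataFrame could then be written to PostgreSQL or MySQL with pandas `DataFrame.to_sql` and a SQLAlchemy engine, which would cover the database-loading step tracked in the linked ticket. Restricting the query to named columns and a date range keeps the amount of data scanned small, which matters because BigQuery bills by bytes processed.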