project-engage / engage

SIG: Data Science Project Name: Engage Project Description: forecast COVID19 impact overtime given new data by city primarily US cites Project Objective: Primary: Provide a data science tool to predict how the COVID-19 will continue to infect or how it may die down overtime using weather related inputs by location Secondary: Open source project to train junior data scientists in using tools such as: Spark (pyspark) for large data handling Apache Airflow for orchestration Python Visualization using Python visualization module Help define a data science pipeline for Headstorm
0 stars 0 forks source link

Train-test splitting #7

Open project-engage opened 4 years ago

project-engage commented 4 years ago

Split data in training and test set at the date level - i.e. use the last week of the data period for testing and the data since December 2019 as training set