robperch / IEOR_Capstone

1 stars 0 forks source link

Pipeline module 1: extract #26

Closed robperch closed 5 months ago

robperch commented 6 months ago

The objective of this module will only be to ingest the data from the original source and saving it for the next steps of the process.

robperch commented 6 months ago

Hey @jefwei!

I've added some scripts to the repository on issue 26. Please pull these and try to make the first module work. Don't worry if you get stuck somewhere; it might not be easy at first.

The first module is called 'Extract' and its objective is to query the database and save the results locally as a pickle. We might make adjustments in the future, but for now, let's start with this.

Here are some key ideas:

Take a look and give it a try. Let me know if you encounter any issues and we can work on it together. Or also feel free to send me any questions.

jefwei commented 6 months ago

Currently running into this bug in extract_functions.py. I have attached the pictures for reference. In the second picture, variables utc_tz and mexico_tz are not defined in general_utils.py.

Screen Shot 2024-01-09 at 1 12 46 PM Screen Shot 2024-01-09 at 1 13 23 PM