It is advisable to split the initial script into multiple notebooks. This keeps the code better organized and lets users focus on, and customize, specific sections or functionalities.
Workflow:
Data Ingestion: This step collects and imports data from sources such as databases. In this case, we read the FEC files from "inputs" and store them in "outputs/FEC/raw_data/SIREN/". If errors occur, corrective actions follow.
Data Extraction & Cleaning: This step extracts data from the ingested files and builds a clean database ready for processing. The outputs will be stored in "outputs/FEC/BDD/Init - Clean".
Data Enrichment: This step enriches the data by applying business rules. The outputs will be stored in "outputs/FEC/BDD/Enrich".
Data Visualization: This step creates a dataset to be used in a data visualization tool. The outputs will be stored in "outputs/FEC/BDD/Visualisation/dataset_charges..."
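The folder layout used by the four steps above can be sketched as a small helper that creates each output directory before the notebooks run. This is only an illustration: the folder paths come from the workflow description, while the stage keys, the STAGE_DIRS name, and the prepare_output_dirs function are hypothetical and not part of the original script. The visualization entry stops at the folder level because the dataset file name is not given in full.

```python
from pathlib import Path

# Folder paths are taken from the workflow description.
# The stage keys and this constant's name are hypothetical.
STAGE_DIRS = {
    "ingestion": Path("outputs/FEC/raw_data/SIREN"),
    "cleaning": Path("outputs/FEC/BDD/Init - Clean"),
    "enrichment": Path("outputs/FEC/BDD/Enrich"),
    "visualization": Path("outputs/FEC/BDD/Visualisation"),
}


def prepare_output_dirs(root: Path) -> dict[str, Path]:
    """Create every stage's output folder under `root` and return the paths."""
    resolved = {}
    for stage, relative in STAGE_DIRS.items():
        target = root / relative
        # parents=True builds intermediate folders; exist_ok=True makes reruns safe.
        target.mkdir(parents=True, exist_ok=True)
        resolved[stage] = target
    return resolved
```

Each notebook could then call prepare_output_dirs(Path(".")) at the top and write only to its own entry, for example dirs["enrichment"], so that the steps stay independent of one another.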