Air pollution exposure is known to correlate with Nighttime Light (a proxy for human activity captured through satellite images (Zeng 2021, Zeng 2023). We want to incorporate night time light into our method of feature generation to predict air pollution.
A developer can generate local air pollution predictions from satellite data configurations which capture the geographic relationship between nighttime light and air pollution, which improves the model accuracy.
Environment
The feature will be required to work for the following configurations:
All features from satellites are built in openaq_engine/src/features/build_features.py here:
def _add_ee_features(self, df: pd.DataFrame) -> pd.DataFrame:
"""
Add Earth Engine (EE) features to the DataFrame.
Parameters
----------
df : pd.DataFrame
The DataFrame to which EE features will be added.
Returns
-------
pd.DataFrame
The DataFrame with EE features added.
"""
return EEFeatures.from_dataclass_config(EEConfig()).execute(
df, save_images=False
)
In openaq_engine/config/model_settings.py you will need to update EEConfig.ALL_SATELLITES. You will need to select the relevant bands.
Description
Air pollution exposure is known to correlate with Nighttime Light (a proxy for human activity captured through satellite images (Zeng 2021, Zeng 2023). We want to incorporate night time light into our method of feature generation to predict air pollution.
A developer can generate local air pollution predictions from satellite data configurations which capture the geographic relationship between nighttime light and air pollution, which improves the model accuracy.
Environment
The feature will be required to work for the following configurations:
MLFlow
instance to push the new model to a registry.Screenshots or Log Files
Currently the structure of our satellite configuration is as follows:
After completing the registration of your google account, export your credentials
All features from satellites are built in
openaq_engine/src/features/build_features.py
here:In
openaq_engine/config/model_settings.py
you will need to updateEEConfig.ALL_SATELLITES
. You will need to select the relevant bands.Adjust
TrainerConfig.All_MODEL_FEATURES
to include your selected bandsre-run the pipeline to build the new model with your additional features:
Additional Context
Earth Engine provides many datasets with Nighttime Light, it is your job to find the most suitable for the task!