This repo contains the code used to develop DGML (Data Gouv for Machine Learning), a data repository of datasets from data.gouv.fr for Machine Learning.
An extract_compressed_csv function : this function extracts all the zipped csv files that might be present in the data directory (such as csv_sample) so that the csv files can then be loaded
An EDA category is added in the task column of the main csv file: this adds a new filter in the app, so that the datasets only having a pandas profile are also shown in it
This PR introduces two changes in
dgml.py
:extract_compressed_csv
function : this function extracts all the zipped csv files that might be present in the data directory (such ascsv_sample
) so that the csv files can then be loadedtask
column of the main csv file: this adds a new filter in the app, so that the datasets only having a pandas profile are also shown in it