wri-dssg-omdena / policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Added data loader and model evaluator #24

Closed thefirebanks closed 3 years ago

thefirebanks commented 3 years ago

Main changes:

Bonus:

jordiplanescutxi commented 3 years ago

You have done magnificent work!! Some things that I need to clarify:

  1. If we had the example file "input/sample_model_output.json", it would be easier to execute the code in the notebook.
  2. If I understand it right, you assume that for each document we will have two files with labelled sentences: the sample_dataset and the sample_model_output. They will contain the same sentences in the same order, but with different labels. I'm afraid we may have some mess if we do not check that the sentences are actually the same and at the same positions.
  3. When we go through sample_dataset.json, as we do in the function "labeled_sentences_from_dataset", we assume that all sentences will fall into one of the categories 0 to 5, while most of them will fall into a -1 category, which is "no_incentive". We will talk about it.
thefirebanks commented 3 years ago

> You have done magnificent work!! Some things that I need to clarify:
>
>   1. If we had the example file "input/sample_model_output.json", it would be easier to execute the code in the notebook.
>   2. If I understand it right, you assume that for each document we will have two files with labelled sentences: the sample_dataset and the sample_model_output. They will contain the same sentences in the same order, but with different labels. I'm afraid we may have some mess if we do not check that the sentences are actually the same and at the same positions.
>   3. When we go through sample_dataset.json, as we do in the function "labeled_sentences_from_dataset", we assume that all sentences will fall into one of the categories 0 to 5, while most of them will fall into a -1 category, which is "no_incentive". We will talk about it.

Hi Jordi, thank you for the feedback! Here are my responses:

  1. You are absolutely right, I had completely forgotten that the input folders don't get versioned, so I uploaded the input folder to our google drive (left the link in Slack).
  2. Indeed! To solve this, maybe we can create a unique ID for each sentence and that way it is easier to check for equality. This can be added in the script/process that creates the json files in the first place, and I can add a check to confirm that they are in the same order in the data loader. I will add this once we confirm the mechanism to identify the unique sentence.
  3. Good point, I was actually thinking to make 0 be the "no incentive" label and then 1-6 be the distinct types of incentives. I will correct that now!
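The unique-ID mechanism from point 2 could look something like the sketch below: derive a deterministic ID from the document ID plus the sentence text, then check that the dataset and model-output records line up one-to-one. The function names, field names (`doc_id`, `sentence`), and record shape here are assumptions for illustration, not the repo's actual schema.

```python
import hashlib


def sentence_id(doc_id: str, sentence: str) -> str:
    """Deterministic short ID from the document ID and sentence text."""
    return hashlib.sha1(f"{doc_id}::{sentence}".encode("utf-8")).hexdigest()[:12]


def check_alignment(dataset: list, model_output: list) -> bool:
    """True if both record lists hold the same sentences in the same order.

    Each record is assumed to be a dict with "doc_id" and "sentence" keys
    (labels may differ between the two files and are ignored here).
    """
    dataset_ids = [sentence_id(r["doc_id"], r["sentence"]) for r in dataset]
    output_ids = [sentence_id(r["doc_id"], r["sentence"]) for r in model_output]
    return dataset_ids == output_ids
```

A check like this could run inside the data loader right after both JSON files are read, so a mismatch fails loudly instead of silently pairing wrong labels.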
thefirebanks commented 3 years ago

Updated the input files in the google drive folder, will update the data loader tomorrow before midnight EST!

thefirebanks commented 3 years ago

Tried loading sentences from ElSalvador.json and they loaded successfully!
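A minimal sketch of what that loading step might look like, assuming a JSON layout that maps each sentence to an integer label with 0 as "no incentive" (per the label change agreed above); the repo's real `labeled_sentences_from_dataset` may read a different structure:

```python
import json

# Assumed label scheme from the discussion above:
# 0 = no incentive, 1-6 = distinct incentive types.
NO_INCENTIVE = 0


def labeled_sentences_from_dataset(path: str) -> list:
    """Return (sentence, label) pairs from a dataset JSON file.

    Assumes the file is a JSON object mapping sentence text to an
    integer label; the actual file format in the repo may differ.
    """
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    return [(sentence, int(label)) for sentence, label in data.items()]
```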

[Screenshot (2020-12-10): sentences loaded from ElSalvador.json]