Closed msakarvadia closed 5 months ago
Hey,
Thanks for reaching out. Looking at the code and your predictions, I noticed two things. (1). As you are using a weaker model, the "guess" field is not set for all profiles in the predictions (which is important as this field is used to compare to). Notably, predictions entries with this field also have an entry in your evaluation (e.g., line 22 in your https://github.com/msakarvadia/llmprivacy/blob/main/llama2_synthetic_reddit_eval.jsonl, which has a 1 in the eval for the guess male).
The missing "guess" field is due to the model not being able to adhere to the correct output format and the resulting parsing failing - for our experiments, we reformated answers by weaker model answers using GPT-4 to get them in the correct format.
(2) In the config you posted, you did not set any eval_mode - see, for example, our basic eval_config here: configs/reddit/eval/reddit_eval_human.yaml (which first auto_evaluates and then delegates to a human). Depending on the value you set there, we use either basic_string_matching, with only the model as judge, model and human as judge, or only human as judge. Atm you are using the default (model only). If you are able to debug locally, the corresponding function is here:
Hope I could be of help and best wishes, Robin
I have run an experiment using the synthetic data and Llama2 7b using this config:
Once we get the results in
predicted_synthethic_llama2_7b.jsonl
it is my understanding that we must run the evaluate script via this config file:My question is: How are we supposed to interpret the output of this evaluation step? This is the output of one of the synthetic profiles:
The "evaluations" field (
"evaluations": {"meta-llama/Llama-2-7b-chat-hf": {"synth": {"age": []}}}}
) does not look particularly insightful to me, and I wonder if maybe I am supposed to be looking at a different field to assess whether the model prediction matched the ground truth? Any pointers would be helpful.This was the original predictions and these are the evaluations