Open GregSzopinski opened 9 months ago
Great work! I managed to run the examples and prepare my own "mock", and it works just fine. I'm having some difficulty analyzing the results, though: I'd like to check individual users' predictions to see how they change when the sequences change, go through some examples, etc. Where should I look for this in the repository?
Hi, once wandb_predict.py runs successfully, you will get output files such as qid_test_question_window_predictions.txt. These files contain the prediction results. Please note that we split the original long student interaction sequences into sub-sequences (each sub-sequence contains up to 200 interactions). Therefore, if you want to check each user's predictions, you may need to do some further preprocessing for your own needs.
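To make the sub-sequence point concrete, here is a minimal sketch of how a long interaction sequence could be cut into pieces of at most 200 interactions while remembering where each piece started. It assumes simple non-overlapping chunking; the function name `split_into_subsequences` and the sample data are hypothetical, and the repository's actual splitting logic may differ (for example, it may use sliding windows).

```python
from typing import List, Tuple


def split_into_subsequences(
    interactions: List[int], max_len: int = 200
) -> List[Tuple[int, List[int]]]:
    """Return (start_offset, chunk) pairs that cover `interactions` in order.

    Assumption: plain non-overlapping chunks of at most `max_len` items.
    This is an illustration, not the repository's own preprocessing code.
    """
    return [
        (start, interactions[start:start + max_len])
        for start in range(0, len(interactions), max_len)
    ]


# Hypothetical student with 450 interactions (the values are placeholders).
student = list(range(450))
chunks = split_into_subsequences(student)

for start, chunk in chunks:
    # start + i gives the position of chunk[i] in the original sequence.
    print(f"offset={start} length={len(chunk)}")
```

Keeping the start offset next to each chunk is what lets you map a position inside a sub-sequence back to the position in the student's original sequence.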
Thanks for the reply. Sub-sequences are fine, and I had already found the prediction files; I'm just a bit confused about what each row corresponds to. Hence, a few questions on how I should interpret the results:
- If I understand correctly, each row in the output file is a sequence for which we're making a prediction, right?
- Each "orirow" value in qid_test_question_window_predictions.txt seems to correspond to an individual student; that is at least my impression, since the number of unique values in this column matches the number of students in test_quelevel.csv. How do I go from "orirow" in the predictions file to a student and/or sequence id? In other words, I'd like to know what exactly (e.g. which sequence) the prediction in a particular row is made for.
- Last but not least, since we're evaluating at the question level, how can I check which question is predicted for a given row?
Thanks for the help, and once again, great work. :)
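For anyone wanting to explore the same question, a rough sketch of grouping prediction rows by "orirow" is below. It rests on assumptions that are not confirmed in this thread: that the predictions file is tab-separated with a header row, and that rows sharing an "orirow" value belong to the same original student row. The `pred` column and the sample values are made up purely for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical stand-in for a predictions file. Only the "orirow" column
# name comes from this thread; "pred" and all values are invented.
SAMPLE = "orirow\tpred\n0\t0.71\n0\t0.64\n1\t0.35\n"


def group_by_orirow(handle, key="orirow"):
    """Collect rows from a tab-separated file into lists keyed by `key`."""
    reader = csv.DictReader(handle, delimiter="\t")
    groups = defaultdict(list)
    for row in reader:
        groups[row[key]].append(row)
    return dict(groups)


groups = group_by_orirow(io.StringIO(SAMPLE))

for orirow, rows in groups.items():
    # Each list keeps the rows in file order for one "orirow" value.
    print(orirow, [r["pred"] for r in rows])
```

To try this on a real file, replace `io.StringIO(SAMPLE)` with `open(path, newline="")` and adjust the delimiter and column names to whatever the file actually contains.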
I am very grateful for your recognition of our work. I hope the following explanation helps answer your questions:
Thanks a lot. One more question regarding the columns in the predictions file(s): do I understand them correctly?