Questions about arg_detection and zero-shot task

Dear Author:

I ran into a few problems when trying to run this codebase. Could you help with the following questions? Thanks!

In run_arg_detection.py Line8 and Line14, you imported Write2HtmlArg and FactContainer which are not included in the codebase.
The preprocess code in save_dataset.py only provides the dataset for run_trigger_detection.py, but not for run_arg_detection.py. In run_arg_detection.py, there are 16 values to be unpacked from a batch, but only 7 is preprocessed. So I don't know how to generate the dataset for EAE task.
I think there is a bug when you're preprocessing the arguments for each event. In https://github.com/VT-NLP/Event_Query_Extract/blob/main/preprocess/save_dataset.py#L256, if there is multiple events in a sentence, the arg_list only contains the arguments of the last one. Since the evaluation code of EAE isn't included in this codebase like mentioned in 1., I don't know how you manage to align multiple event mentions with only one arg_list in the end.
The paper mentioned zero-shot EE, but zero-shot related code isn't included in the repository. Since zero-shot requires a different data format (in the paper, you said that the prototype triggers aren't included), I think adding the code for this part may help us reproduce the result.
We need to add train.doc.txt,dev.doc.txt,test.doc.txt in data/splits/ACE05-E/ before running ./setup.sh, each file representing the document split of the corresponding dataset. The format of each file is that every line contains the document name. I think you could state that in README.md to help.
The pos_tag ,time and value wouldn't appear if we stick to your preprocessing procedure in ACE_ERE_scripts repository. So if we want to run save_dataset.py successfully, we need to change _unpack_ace_with_vt like I mentioned in https://github.com/VT-NLP/Event_Query_Extract/issues/4.

VT-NLP / Event_Query_Extract

Questions about arg_detection and zero-shot task #5