Hou-jing closed this issue 2 years ago
Hi Hou-jing,
The preprocessed_KAIROS directory is supposed to be created by the KAIROS_data_module.py file, and it should contain the tokenized results. It is not where you should put your source data. See the prepare_data method for details.
If the preprocessed_KAIROS directory is not being created automatically, you can run prepare_data separately first.
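As a rough illustration of running prepare_data separately, here is a minimal sketch that only triggers the preprocessing step when the output directory is missing or empty. The helper name ensure_preprocessed is hypothetical and not part of the repository. How the data module in KAIROS_data_module.py is constructed is not shown in this thread, so the sketch takes the preparation step as a plain callable.

```python
from pathlib import Path
from typing import Callable


def ensure_preprocessed(out_dir: str, prepare: Callable[[], None]) -> bool:
    """Call `prepare` only if `out_dir` is missing or empty.

    `prepare` stands in for the data module's prepare_data method
    (an assumption: pass the bound method of your own instance).
    Returns True if `prepare` was called, False if it was skipped.
    """
    path = Path(out_dir)
    if path.is_dir() and any(path.iterdir()):
        # Tokenized output already exists, so skip the preprocessing step.
        return False
    prepare()
    return True


# Hypothetical usage, assuming `dm` is an instance of the data module
# defined in KAIROS_data_module.py:
#   ensure_preprocessed("preprocessed_KAIROS", dm.prepare_data)
```

The check deliberately treats an empty directory the same as a missing one, since an empty preprocessed_KAIROS folder created by hand would otherwise hide the fact that tokenization never ran.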
Thanks, I understand.
Sorry, I have another question: I cannot find the Keyword-Based Trigger Extraction Model mentioned in your paper. Did you put it in this repository?
Please check the tapkey branch.
Thank you, I found it.
Is the data you provided correct? When I use the preprocessed_KAIROS files, I cannot find the following items that the dataloader uses: 'input_token_ids': input_token_ids, 'input_attn_mask': input_attn_mask, 'tgt_token_ids': tgt_token_ids, 'tgt_attn_mask': tgt_attn_mask, 'doc_key': doc_keys.
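One way to narrow this down is to inspect the preprocessed files directly and see which of those keys each record actually has. The sketch below assumes the files are JSON Lines (one JSON object per line), which this thread does not confirm. The function name missing_keys and the example path are hypothetical.

```python
import json
from pathlib import Path

# Keys the dataloader is reported to read, as listed in the question above.
EXPECTED_KEYS = {
    "input_token_ids",
    "input_attn_mask",
    "tgt_token_ids",
    "tgt_attn_mask",
    "doc_key",
}


def missing_keys(jsonl_path, limit=5):
    """Return {line_number: sorted missing keys} for the first `limit` records.

    Assumes each non-empty line is a JSON object (an assumption about the
    file format, not something stated in the thread).
    """
    report = {}
    checked = 0
    with Path(jsonl_path).open(encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if not line.strip():
                continue  # ignore blank lines
            if checked >= limit:
                break
            record = json.loads(line)
            missing = EXPECTED_KEYS - set(record)
            if missing:
                report[lineno] = sorted(missing)
            checked += 1
    return report


# Hypothetical usage (the file name is an assumption):
#   print(missing_keys("preprocessed_KAIROS/train.jsonl"))
```

An empty result means the sampled records contain every expected key. If every record is missing all of them, the directory most likely holds source data rather than the tokenized output that prepare_data writes.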