Asking for bi-encoder models

eeehco commented 2 months ago

When I run generated_pairs.py separately with ’python -u src/all_models/generated_pairs.py --config_path configs/retrieved_data/pair_generation.json --dataset ecb --data_split train --out_dir retrieved_data/main‘, an error message will appear as follows: "FileNotFoundError: [Errno 2] No such file or directory: './models/ecb_events_seed5_original/candidate_generator_best_model'". May I ask if you can provide this model file?

Danield21 commented 2 months ago

Hi, thanks for your attention.

This code repository is still under maintenance. We will continue to upload some codes and interpretations. Compared to before, we have made some adjustments to the file layout and some codes in the repo, and we provide links to trained bi-encoder models and retrieved/enhanced pairwise dataset in data_preparation. Please stay tuned for our latest version.

The core code of generated_pairs.py is based on Held's code. You are free to use it to generate data pairs on your own. However, we observe that it still has some indeterminacy so it may yield slightly different pairwise datasets. For more valid and deterministic experiments, we recommend using our fixed training set from the retrieved_data folder which can be downloaded from the provided link.

Best, Daniel

eeehco commented 2 months ago

Hi, thanks for your attention.

This code repository is still under maintenance. We will continue to upload some codes and interpretations. Compared to before, we have made some adjustments to the file layout and some codes in the repo, and we provide links to trained bi-encoder models and retrieved/enhanced pairwise dataset in data_preparation. Please stay tuned for our latest version.

The core code of generated_pairs.py is based on Held's code. You are free to use it to generate data pairs on your own. However, we observe that it still has some indeterminacy so it may yield slightly different pairwise datasets. For more valid and deterministic experiments, we recommend using our fixed training set from the retrieved_data folder which can be downloaded from the provided link.

Best, Daniel

Thanks for your reply.The model can train now.

eeehco commented 2 months ago

Hi, thanks for your attention.

This code repository is still under maintenance. We will continue to upload some codes and interpretations. Compared to before, we have made some adjustments to the file layout and some codes in the repo, and we provide links to trained bi-encoder models and retrieved/enhanced pairwise dataset in data_preparation. Please stay tuned for our latest version.

The core code of generated_pairs.py is based on Held's code. You are free to use it to generate data pairs on your own. However, we observe that it still has some indeterminacy so it may yield slightly different pairwise datasets. For more valid and deterministic experiments, we recommend using our fixed training set from the retrieved_data folder which can be downloaded from the provided link.

Best, Daniel

May I ask if your training code (Python - u src/all_models/crossencoder_trainer. py) is also so slow?

Danield21 commented 2 months ago

Hi, thanks for your attention. This code repository is still under maintenance. We will continue to upload some codes and interpretations. Compared to before, we have made some adjustments to the file layout and some codes in the repo, and we provide links to trained bi-encoder models and retrieved/enhanced pairwise dataset in data_preparation. Please stay tuned for our latest version. The core code of generated_pairs.py is based on Held's code. You are free to use it to generate data pairs on your own. However, we observe that it still has some indeterminacy so it may yield slightly different pairwise datasets. For more valid and deterministic experiments, we recommend using our fixed training set from the retrieved_data folder which can be downloaded from the provided link. Best, Daniel

May I ask if your training code (Python - u src/all_models/crossencoder_trainer. py) is also so slow?

Our experiments were on a single Nvidia Tesla V100. The training will take around a day for the baseline model, and around two days for the enhanced model on ECB+.

eeehco commented 2 months ago

Hi, thanks for your attention. This code repository is still under maintenance. We will continue to upload some codes and interpretations. Compared to before, we have made some adjustments to the file layout and some codes in the repo, and we provide links to trained bi-encoder models and retrieved/enhanced pairwise dataset in data_preparation. Please stay tuned for our latest version. The core code of generated_pairs.py is based on Held's code. You are free to use it to generate data pairs on your own. However, we observe that it still has some indeterminacy so it may yield slightly different pairwise datasets. For more valid and deterministic experiments, we recommend using our fixed training set from the retrieved_data folder which can be downloaded from the provided link. Best, Daniel

May I ask if your training code (Python - u src/all_models/crossencoder_trainer. py) is also so slow?

Our experiments were on a single Nvidia Tesla V100. The training will take around a day for the baseline model, and around two days for the enhanced model on ECB+.

Thanks for your reply. I use an A100, which looks similar and takes about eight hours for the baseline model on ECB+.

Danield21 / Rationale4CDECR

Asking for bi-encoder models #2