psunlpgroup / Summ-N

Code for ACL 2022 Paper "SUMM^N: A Multi-Stage Summarization Framework for Long Input Dialogues and Documents"
MIT License
58 stars 7 forks source link

Bugs preventing reproducing results #5

Open felix-schneider opened 2 years ago

felix-schneider commented 2 years ago

The code is broken in a number of places, making it impossible to reproduce results:

For others who are also trying to reproduce:

Jinhyeong-Lim commented 2 years ago
  • in stage 2, run.py does not actually use the output from stage 1 and will just create the same input/output as stage 1 again

    • the others are one-line fixes but for this one I am not sure how to fix

hi I reproduced almost everything and modified the code to get similar performance.

I think you should modify the data preprocessing part code. And the hyperparameter should be modified according to the details written in the paper.

WeixiangYAN commented 2 years ago

Hello, can you share your modified code? thanks.

xianglous commented 2 years ago
  • in stage 2, run.py does not actually use the output from stage 1 and will just create the same input/output as stage 1 again

I think the at line 100 of run.py, it is getting the output data from stage 1 with args.cur_stage still being 1 at that time:

source_path = os.path.join(args.train.output_path, f"stage_{args.cur_stage}")
cur_source = load_split_aslist(source_path, suffix='hypo')

But the other mentioned problems do exist and also at line 87 and 123 or run.py, the path to the checkpoint should be something like:

os.path.join(training_args.checkpoint_dir, f"stage_{args.cur_stage}/trainer_output")

if we want to use the checkpoints after training without any file/folder movements.

CAH9487 commented 2 years ago

I only achieved around 70% paper rouge score on AMI dataset. I used fairseq 0.12.2 or 0.10.0 + comment out codes about self.bart.cfg.xxxxx in inference.py Is there something except @felix-schneider & @xianglous said should be modified?

heatherzheng commented 1 year ago

Hi @ CAH9487, I would like to know if you produced the result. do we need to change anything about the configure file? Thanks!