RiverTre closed this 3 years ago
Sorry to bother you, but as a beginner in NLP, your help would mean a lot to me.
My question is about https://github.com/CurationCorp/curation-corpus/blob/master/examples/bart/finetuning-bart.ipynb, which I was trying to reproduce.
Hi there. We dropped support for the feather format because CSV was fast enough, so I don't think you'll find that file anymore. If you have the dataset as a CSV, you can change ds = pd.read_feather(args.data_path).iloc[:args.subset] in the next cell to ds = pd.read_csv(args.data_path).iloc[:args.subset]
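In case it helps, here is a minimal sketch of that swap running on its own. The `args` object, the temporary file path, and the toy rows are all stand-ins I made up for illustration; only the `pd.read_csv(args.data_path).iloc[:args.subset]` pattern comes from the notebook.

```python
import tempfile
from pathlib import Path
from types import SimpleNamespace

import pandas as pd

with tempfile.TemporaryDirectory() as tmp:
    # Placeholder CSV written to a temp directory; not the real dataset.
    csv_path = Path(tmp) / "dataset.csv"
    pd.DataFrame(
        {
            "text": ["first article", "second article", "third article"],
            "summary": ["first summary", "second summary", "third summary"],
        }
    ).to_csv(csv_path, index=False)

    # Hypothetical stand-in for the notebook's args; only these two fields are used.
    args = SimpleNamespace(data_path=csv_path, subset=2)

    # Same pattern as the notebook cell, with read_csv in place of read_feather.
    ds = pd.read_csv(args.data_path).iloc[:args.subset]

print(len(ds))
```

If the CSV is large, passing `nrows=args.subset` to `pd.read_csv` should give the same first rows without parsing the whole file.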
This notebook is a bit out of date now, though, as fastai2 has been merged into fastai. If you want to fine-tune BART with fastai, I would recommend looking at the summarisation code here: https://github.com/ohmeow/blurr
Thank you so, so much for the reply. I will check the blurr code today. (And yes, I am trying hard to fine-tune BART to summarize a medical paper dataset so that I can finish my final college paper.)
I see that the dataset for fine-tuning is stored at ../data/private_dataset.file, and the code shows that it has at least the columns "text" and "summary". Could you describe the format of this file or share a small example of it?
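For reference, this is roughly the shape I am guessing at: a CSV with a header row and one "text" and one "summary" field per line. The rows below are invented and the layout is only my assumption, since the real private file is not shown anywhere in this thread. The sketch also checks that both columns are present before going further.

```python
import io

import pandas as pd

# Columns the notebook code appears to rely on.
REQUIRED_COLUMNS = {"text", "summary"}

# Invented sample content; fields with commas or line breaks are quoted.
sample_csv = io.StringIO(
    'text,summary\n'
    '"Full article body, possibly with commas and\nline breaks.","Short summary of the article."\n'
    '"Another article body.","Another summary."\n'
)

ds = pd.read_csv(sample_csv)

# Fail early if the file does not have the expected columns.
missing = REQUIRED_COLUMNS - set(ds.columns)
if missing:
    raise ValueError(f"missing columns: {sorted(missing)}")

print(ds.shape)
```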