fastnlp / CPT

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
481 stars 70 forks source link

can I change the max length of csl #63

Closed Dlinuigh closed 4 months ago

Dlinuigh commented 1 year ago

in finetune/run_gen.py, defines in a dict,

length_map={'lcsts':'30','csl':'50','adgen':'128'}

but I want to process some sentences with max_length 512.

choosewhatulike commented 1 year ago

length_map defines the maximum length of generated tokens. It depends on the dataset you are using. The max_length of input texts is 1024 (in the updated version)

lhz9999 commented 1 year ago

May I inquire if your latest CPT version can support the summarization of input texts with a length of 1,024 characters?