Open mces89 opened 9 months ago
Hi, can you also share the preprocessing script to convert the dataset to the standard format? also why the attention_mask in the dataset is required?
I am looking forward to the script for tokenization, too.
Hi, can you also share the preprocessing script to convert the dataset to the standard format? also why the attention_mask in the dataset is required?