Closed absolute-nil closed 1 year ago
Fixed the missing MAX_LENGTH, corrected a typo in SPECIAL_SEPARATOR, and added truncation to the tokenizer.
Closing this PR since the change is already reflected in d768d81. Instead of truncating long sequences, we split them into multiple blocks.
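For readers comparing the two approaches, here is a minimal sketch of the difference between truncating and splitting into blocks. It is not the code from d768d81: the function names `truncate` and `split_into_blocks` and the value assigned to `MAX_LENGTH` are assumptions made for illustration only.

```python
from typing import List

# Hypothetical value for illustration; the repository's actual MAX_LENGTH is not shown here.
MAX_LENGTH = 8


def truncate(token_ids: List[int], max_length: int = MAX_LENGTH) -> List[int]:
    """Keep only the first max_length tokens; everything after is discarded."""
    return token_ids[:max_length]


def split_into_blocks(token_ids: List[int], max_length: int = MAX_LENGTH) -> List[List[int]]:
    """Split a sequence into consecutive blocks of at most max_length tokens.

    No tokens are dropped; the last block may be shorter than max_length.
    """
    if max_length <= 0:
        raise ValueError("max_length must be positive")
    return [token_ids[i:i + max_length] for i in range(0, len(token_ids), max_length)]
```

With a 20-token input and a limit of 8, truncation keeps only the first 8 tokens, while splitting yields three blocks of 8, 8, and 4 tokens, so no input is lost.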