I want to do incremental training on the pretrained wiki.bin. Could you please tell me the format of the train.txt file that has to be provided? Should only the sentences be provided, or are labels also required? Can the sentences be given as they are, or should they be tokenised?