Open twotwoiscute opened 1 year ago
I also tried training on dataset 505.Chinese_Zhang only, and the training process looked good to me.
However, when it comes to prediction, the model still outputs an empty string, as described above.
Sorry for the late reply; I am on vacation. Sadly, the ACOS task has only been tested on English. Once other languages are enabled, I will release a new version. BTW, Chinese is supported by the ATEPC task.
Thanks for the reply. From it I can tell that you already know what it takes to get this done. Can you tell me the pipeline for achieving it? I would like to try it on my own. Thanks!
Hi, thanks for the great work. I have a question about the ACOS task. I've downloaded the dataset from here, and there's a task called acos which I am interested in.
Observation
I tried a quick training run on all the datasets in acos_datasets and found that the model does not do well on dataset 505.Chinese_Zhang: the fine-tuned model always outputs an empty string, even though both the training and validation loss decrease during training. So I checked the paper and realized that the dataset used in the paper is the one shown below:
What I have done
I am new to NLP. I believe there is always a step that deals with the tokenizer, like the code AutoTokenizer.from_pretrained, before training. Here's how I init the model; I downloaded it here directly.
Question
Is it okay to apply the original pipeline, where the model is trained on English input/output, directly to a model that takes Chinese input/output? If not, what should I do before starting training? Thanks!
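One part of this question can be checked before any training: whether the tokenizer can represent Chinese text at all. Below is a minimal sketch of that check, not the library's own method. It counts how many token ids fall back to the unknown token. The helper name unknown_token_rate and the toy character-level tokenizer are hypothetical and exist only so the example runs without downloading anything; the commented lines show how a real tokenizer might be plugged in, with a placeholder checkpoint name.

```python
from typing import Callable, Iterable, List


def unknown_token_rate(
    encode: Callable[[str], List[int]],
    unk_id: int,
    texts: Iterable[str],
) -> float:
    """Return the fraction of token ids equal to unk_id across all texts."""
    total = 0
    unknown = 0
    for text in texts:
        ids = encode(text)
        total += len(ids)
        unknown += sum(1 for i in ids if i == unk_id)
    return unknown / total if total else 0.0


# Toy stand-in for a real tokenizer (an assumption for illustration only):
# a character-level vocabulary that knows lowercase ASCII letters and space.
# Id 0 plays the role of the unknown token.
TOY_UNK_ID = 0
TOY_VOCAB = {ch: i + 1 for i, ch in enumerate("abcdefghijklmnopqrstuvwxyz ")}


def toy_encode(text: str) -> List[int]:
    return [TOY_VOCAB.get(ch, TOY_UNK_ID) for ch in text.lower()]


english_rate = unknown_token_rate(toy_encode, TOY_UNK_ID, ["the food is great"])
chinese_rate = unknown_token_rate(toy_encode, TOY_UNK_ID, ["这家餐厅的菜很好吃"])
print(english_rate, chinese_rate)

# With Hugging Face transformers (not run here; checkpoint name is a placeholder):
# from transformers import AutoTokenizer
# tok = AutoTokenizer.from_pretrained("<your-checkpoint>")
# rate = unknown_token_rate(
#     lambda t: tok.encode(t, add_special_tokens=False),
#     tok.unk_token_id,
#     ["这家餐厅的菜很好吃"],
# )
```

A rate close to 1.0 on Chinese samples would suggest the tokenizer cannot represent the input, which could be one reason for empty predictions. A low rate does not by itself show that the underlying model was pretrained on Chinese.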