Closed mcormick123 closed 1 year ago
Hello: The default maxlen
is 400 I believe, but running it once to observe the stats and then running it again choosing 250 (which you said was 95% percentile) sounds reasonable.
To predict on unseen data in an application, a Predictor
instance is typically used. To evaluate the test set, you can also use the Learner
instance instead of the Predictor
.
Hi, thanks for your great work! I've been using ktrain to fine tune multilingual bert and got an amazing result! i've looked on several tutorials already but some are vague, and i really want to understand more how ktrain works.
Would like to ask if i'm doing the process correctly to fine tune the mBERT from huggingface using ktrain? Here's my process:
text
andlabel
maxlen
, I thought you should tokenize it first and get themaxlen
from there? What's the default in ktrain ?Please, actually, i'm about to report this to my professor as a class output and imagining my prof questioning me stuffs about my code terrifies me already HAHA! explanation would be fully appreciated! thank you so much!