-
Does the model support Chinese input?
-
Thank you for your great work! But it seems that the latest code didn't implement your special design of attention mask during pre-train?
def generate(
self,
cell_emb: …
-
https://arxiv.org/abs/2207.03208
-
Hello,can you share the pretrain Loss Curve and fine-tune Loss Curve? I have some questions about my reproduction results.Thank you!
-
Hi,
Very simple issue, this error:
"ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group"
Is displayed when I'm trying to load a a pre-trained m…
-
Hi. I'm using my dataset instead of MNIST. My data originally is 188318 rows and 130 coloumns.
After done the pretraining and reconstruction step, the new dataset is eliminated 30 rows to 188288 rows…
-
Hi ShuangXieIrene:
Did you use MS COCO for pretraining before you trained the FSSD MobileNetV1 on VOC2007?
When I using VOC2007 and VOC2012 as training data for FSSD MobileNetV1, my perform…
-
After train the model can we use only target-encoder for down-stream task ?? like- image captioning etc.
-
Hi, since training data matters a lot when comparing different methods, could you clarify which datasets the Fast R-CNN detectors in the two provided example models (RGB and DepthJet) have been pre-tr…
tlind updated
6 years ago
-
it seems pre-train corpus using whole word mask is not support in chinese yet.
even passing --do_whole_word_mask=True using create_pretraining_data.py, nothing happens.
is there someone know ho…