pretraining Search Results

1000+ results
for pretraining


Muennighoff/sgpt #25

chinese support?

Does the model support Chinese input?

Lukangkang123 updated 1 year ago
4
bowang-lab/scGPT #94

Where can I find the code of attention mask for generative p…

Thank you for your great work! But it seems that the latest code doesn't implement your special attention mask design for pre-training? def generate( self, cell_emb: …

Haonan917 updated 2 months ago
4
AkihikoWatanabe/paper_notes #500

Revisiting Pretraining Objectives for Tabular Deep Learning,…

https://arxiv.org/abs/2207.03208

AkihikoWatanabe updated 1 year ago
1
TinyLLaVA/TinyLLaVA_Factory #47

Can you share the Loss Curve?

Hello, can you share the pretraining loss curve and the fine-tuning loss curve? I have some questions about my reproduction results. Thank you!

necrophagists updated 4 months ago
1
jerryji1993/DNABERT #65

Error when loading pretrained model for finetunning from a c…

Hi, very simple issue. This error: "ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group" is displayed when I'm trying to load a pre-trained m…

danarte updated 10 months ago
1
agarant/deep_autoencoders #1

Missing rows after reconstruction?

Hi. I'm using my own dataset instead of MNIST. My data originally has 188318 rows and 130 columns. After the pretraining and reconstruction step, the new dataset has lost 30 rows, down to 188288 rows…

kroscek updated 7 years ago
1
ShuangXieIrene/ssds.pytorch #18

Performance of FSSD MobileNetV1 on VOC2007

Hi ShuangXieIrene: Did you use MS COCO for pretraining before you trained the FSSD MobileNetV1 on VOC2007? When I use VOC2007 and VOC2012 as training data for FSSD MobileNetV1, my perform…

insmod-he updated 6 years ago
1
facebookresearch/ijepa #51

Downstream task

After training the model, can we use only the target encoder for downstream tasks, like image captioning, etc.?

ankan8145 updated 4 months ago
5
andresvasquezv/hospital_people_detector #3

Which datasets have the Fast R-CNN detectors been trained on…

Hi, since training data matters a lot when comparing different methods, could you clarify which datasets the Fast R-CNN detectors in the two provided example models (RGB and DepthJet) have been pre-tr…

tlind updated 6 years ago
1
google-research/bert #780

whole word mask is not support in chinese

It seems that building a pre-training corpus with whole word masking is not supported for Chinese yet. Even when passing --do_whole_word_mask=True to create_pretraining_data.py, nothing happens. Does anyone know ho…

brightmart updated 5 years ago
3
