-
Will the pretraining datasets and corresponding code be open-sourced?
Thanks!
-
While trying out INT8 mixed precision pretraining (#748) with torchtitan, I came across an issue: if the model is FSDP-sharded, `quantize_()` won't work. The fix would be to add extra logic to …
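For anyone hitting the same thing before a fix lands, here is a minimal sketch of the workaround, assuming torchao's prototype `int8_mixed_precision_training` API; the point is that `quantize_()` has to run before FSDP shards the parameters.

```python
import os
import torch.distributed as dist
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torchao.quantization import quantize_
from torchao.prototype.quantized_training import int8_mixed_precision_training

# Single-process group so FSDP can initialize (illustration only).
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
dist.init_process_group("gloo", rank=0, world_size=1)

model = nn.Sequential(nn.Linear(1024, 1024), nn.ReLU(), nn.Linear(1024, 1024))

# Order matters: quantize_() swaps Linear weights for tensor subclasses,
# so it must run BEFORE FSDP shards the parameters; calling it on an
# already-sharded model is what fails.
quantize_(model, int8_mixed_precision_training())
model = FSDP(model)
```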
-
Hi! I'm trying to pretrain VindLU using 5M data, can you provide the pretraining logs for reference? Thanks!
-
I'm getting this error from my training script:
```
  File "/usr/local/lib/python3.10/dist-packages/peft/tuners/lora.py", line 356, in forward
    result = F.linear(x, transpose(self.weight, self.fan_in_…
```
-
In Section 4.1, it seems that only IN1K is used for pretraining, but in Table 1 both SA-1B and IN1K are used. Which one is correct?
-
Hi,
I have about 500 labelled images and 1500 unlabelled images.
I wonder if it is possible to pretrain a model using those unlabelled images and then use the pretrained weights to initialize the …
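In general that is the standard self-supervised workflow: pretrain on the 1,500 unlabelled images, then fine-tune on the 500 labelled ones. A minimal weight-initialization sketch, where the ResNet backbone and the `pretrained.pth` path are assumptions rather than this repo's API:

```python
import torch
from torchvision.models import resnet50

# Model for the supervised task; num_classes=10 is illustrative.
model = resnet50(num_classes=10)

# "pretrained.pth" stands in for the checkpoint produced by
# pretraining on the unlabelled images.
state = torch.load("pretrained.pth", map_location="cpu")

# strict=False loads the matching backbone weights and leaves the
# freshly initialized classifier head untouched.
missing, unexpected = model.load_state_dict(state, strict=False)
print("missing keys:", missing)
print("unexpected keys:", unexpected)
```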
-
Is there a way to resume pretraining from a checkpoint? I'm unable to run it all at once, and I don't see a way to do this.
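The repo may expose its own resume flag; failing that, the generic PyTorch pattern looks like the sketch below, where `ckpt.pt` and the model/optimizer are stand-ins.

```python
import torch
import torch.nn as nn

model = nn.Linear(16, 4)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)

def save_checkpoint(epoch, path="ckpt.pt"):
    # Save everything needed to continue training, not just the weights.
    torch.save({"model": model.state_dict(),
                "optimizer": optimizer.state_dict(),
                "epoch": epoch}, path)

def load_checkpoint(path="ckpt.pt"):
    # Restore model and optimizer state, then return the epoch to resume from.
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["model"])
    optimizer.load_state_dict(ckpt["optimizer"])
    return ckpt["epoch"] + 1
```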
-
Thanks for the great paper and code! I have a question about the MoCo v3 encoder: the paper mentions that the latent representations are regularized on a hyper-sphere. I am fairly new to MoCo v3, can yo…
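For context (a common reading of MoCo-style methods, not a statement about this paper's exact code): the query and key embeddings are L2-normalized, which constrains them to the unit hypersphere, and the contrastive loss then operates on cosine similarities. A minimal sketch:

```python
import torch
import torch.nn.functional as F

# L2-normalizing embeddings puts them on the unit hypersphere, so the
# dot product below is a cosine similarity.
q = F.normalize(torch.randn(8, 256), dim=1)  # query embeddings, ||q|| = 1
k = F.normalize(torch.randn(8, 256), dim=1)  # key embeddings, ||k|| = 1

logits = q @ k.t() / 0.2        # similarities / temperature (0.2 is illustrative)
labels = torch.arange(8)        # matching pairs sit on the diagonal
loss = F.cross_entropy(logits, labels)  # InfoNCE-style contrastive loss
```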
-
Hi, it seems the pretraining scheme is a key part of this work.
How can I run the pre-training?
-
After pretraining the model on KITTI and plotting the loss, I noticed large fluctuations between a minimum of ~0.1 and a maximum of ~0.8; the loss neither stabilizes nor saturates. Is this a usual…
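Not an answer from the authors, but raw per-step losses fluctuate with batch content, so one way to judge the actual trend is to smooth them before plotting, e.g. with a bias-corrected exponential moving average:

```python
import random

def ema(losses, beta=0.98):
    """Bias-corrected exponential moving average of per-step losses."""
    avg, smoothed = 0.0, []
    for step, loss in enumerate(losses, start=1):
        avg = beta * avg + (1 - beta) * loss
        smoothed.append(avg / (1 - beta ** step))  # correct early-step bias
    return smoothed

# Noise around a flat trend still shows up as flat once smoothed.
raw = [0.45 + random.uniform(-0.35, 0.35) for _ in range(1000)]
print(round(ema(raw)[-1], 2))  # ≈ 0.45
```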