-
Hi authors, thank you for this impressive work.
Is it possible to provide a pretraining script and a small sample of the processed data used for pretraining? I would like to try pretraining a model…
-
Following up on https://github.com/huggingface/nanotron/issues/78#issue-2147747937,
I converted the weights as you described, but unfortunately I cannot get the same sane outputs for the pre-tr…
-
After training, I put the .tar checkpoint at vit_load_path, but I get a missing-key error when I try to segment other data (like this: Missing key(s) in state_dict: "image_encoder.pos_embed", "image_encoder.patc…
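Since the full error is truncated, this is only an assumption: missing-key errors like this are often caused by a key-prefix mismatch between the saved checkpoint and the model, e.g. a `module.` prefix added by `nn.DataParallel` at training time. A minimal sketch of stripping such a prefix, with hypothetical stand-in keys and values:

```python
# Hedged sketch: missing keys in load_state_dict are frequently due to a
# wrapper prefix such as "module." (added by nn.DataParallel during training).
# The checkpoint keys and values below are hypothetical stand-ins.
ckpt = {
    "module.image_encoder.pos_embed": "tensor_a",
    "module.image_encoder.patch_embed.proj.weight": "tensor_b",
}

prefix = "module."
# Strip the prefix from every key that carries it, leave others unchanged.
stripped = {
    (k[len(prefix):] if k.startswith(prefix) else k): v
    for k, v in ckpt.items()
}
# The stripped dict can then be passed to model.load_state_dict(...).
```

If the prefixes match but keys are still missing, comparing `ckpt.keys()` against `model.state_dict().keys()` usually pinpoints the mismatch.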
-
In the Masked Pretraining section, there seems to be an issue with the way the CLIP model is loaded. In the `extract.ipynb` notebook, the code `model, _ = clip.load("ViT-B/16", device='cpu')` is used, but…
-
I would like to inquire about how the few-shot approach is specifically incorporated into your pretraining process. For instance, the paper mentions six different few-shot scenarios with 0, 4, 8, 16, …
-
Do you have a pre-trained model available? I would like to save time on training.
Also, how many hours did training take with epochs = 100?
-
Hi, does SwinTransformer V2 support SimMIM pretraining? This is shown in the paper:
https://arxiv.org/pdf/2111.09883.pdf
If not, are there plans to add it, and how difficult would it be to port?
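For context, the core of SimMIM pretraining is masking a random subset of image patches and training the model to reconstruct their raw pixels. A minimal sketch of the masking step only; the patch count and mask ratio here are hypothetical, not taken from any Swin V2 config:

```python
import random

# Hedged sketch of SimMIM-style random patch masking.
# num_patches and mask_ratio are illustrative values, not real config.
random.seed(0)  # deterministic only for illustration
num_patches = 16
mask_ratio = 0.5

# Choose which patches to hide from the encoder.
masked = set(random.sample(range(num_patches), int(num_patches * mask_ratio)))

# mask[i] == 1 marks a patch whose raw pixels the model must reconstruct;
# the reconstruction loss is computed on masked patches only.
mask = [1 if i in masked else 0 for i in range(num_patches)]
```

The rest of the method is a lightweight linear prediction head plus an L1 loss on the masked pixels, so porting is mostly a matter of wiring the mask into the patch embedding.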
-
Thank you for your excellent work. I'm currently training my own CLIP model and have a question. If I use LAION-2B, COYO-700M, and Datacomp datasets simultaneously for training, will it yield better r…
-
Dear GigaPath team,
Thank you for your excellent work!
Could you share how long the pre-training of the tile encoder took? In the paper you mention the time for pretraining the slide-level model b…
-
After fine-tuning the Paraformer long-audio model, the size of the saved .pt file grew from the base model's roughly 800 MB to nearly 2.6 GB.
Running inference on the same wav file then raises an error. The error message is as follows:
Traceback (most recent call last):
File "/wind/aispace/train/source/src/FunASR/examples/industrial_data_pret…
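One plausible cause (an assumption, since the traceback is truncated) is that the fine-tuning checkpoint bundles optimizer and training state alongside the model weights, which can roughly triple the file size and confuse a loader that expects bare weights. A minimal sketch of extracting only the weights; the dict layout and key names are hypothetical, not FunASR's actual checkpoint format:

```python
# Hedged sketch: a training checkpoint often stores optimizer state
# (e.g. Adam's running moments) next to the model weights, inflating
# the file to ~3x the bare-weights size. Layout below is hypothetical.
ckpt = {
    "model": {"encoder.weight": [0.1, 0.2]},
    "optimizer": {"exp_avg": [0.0, 0.0], "exp_avg_sq": [0.0, 0.0]},
    "epoch": 10,
}

# Keep only the model weights for inference; re-saving this sub-dict
# should bring the file back to roughly the base model's size.
weights_only = ckpt["model"]
```

Comparing the top-level keys of the base model's .pt file against the fine-tuned one would confirm or rule this out.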