-
For how many epochs and steps per epoch — and on how many images — was the model trained on the COCO dataset to obtain the mask_rcnn_coco.h5 weights?
-
I am trying to use the framework to continue pretraining llama3-8B. I have converted the HF checkpoint into nanotron format and the generated tokens seem reasonable.
I use the following setting to…
-
I am trying to run the pretraining scripts and encountering the following error while loading the datasets from disk.
```shell
GPU available: True (cuda), used: True
TPU available: False, using: 0 TPU core…
```
-
I use one node with four GPUs (V100, 32 GB) for pretraining, but parallel training behaves oddly: all **four** processes run on **one** GPU (device:0).
Why does this happen? Thanks for everyone's help!
I u…
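For context, a common cause of this symptom is that no process ever selects its own device, so every rank defaults to `cuda:0`. A minimal sketch of pinning each process to its local rank — the `device_for_rank` helper is hypothetical, and a `torchrun`-style launcher exporting `LOCAL_RANK` is assumed:

```python
import os

def device_for_rank(local_rank: int, num_gpus: int) -> str:
    """Map a launcher-assigned local rank to its own GPU id (hypothetical helper)."""
    return f"cuda:{local_rank % num_gpus}"

# With a torchrun-style launcher, each process receives its rank via LOCAL_RANK.
local_rank = int(os.environ.get("LOCAL_RANK", "0"))
device = device_for_rank(local_rank, num_gpus=4)
# In a real script you would call torch.cuda.set_device(local_rank) before
# initializing the process group, so tensors land on distinct GPUs.
print(device)
```

This is a sketch of the usual fix, not a diagnosis of this specific setup; the launcher in use may expose the rank under a different variable.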
-
Hi,
I am implementing fine-tuning of exBERT for sequence classification. I have already done the pretraining on my data. However, since the pre-training Python script you have provided is only f…
-
The repo mentions that the pretraining step should run for some time; please specify after how much time I should interrupt it.
Also, I can't use the pretrained npz file as I'm planning to…
-
Hello,
Thank you for your very interesting model.
I intend to use SparseBEV on a new dataset with only 1 frame, as a baseline (no previous temporal frame).
In the paper's ablation study, y…
-
### System Info
```shell
vault.habana.ai/gaudi-docker/1.17.0/ubuntu22.04/habanalabs/pytorch-installer-2.3.1:latest
```
### Information
- [X] The official example scripts
- [ ] My own modified scri…
-
Are there any appropriate setups or losses in sentence-transformers for pretraining sentence embeddings in cases where I have labels as targets?
(I want to finetune the actual embeddings, not just a…
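For what it's worth, sentence-transformers does ship supervised losses that update the embeddings themselves, such as `CosineSimilarityLoss` (float similarity labels) and `SoftmaxLoss` (class labels). As a library-free illustration of the quantity `CosineSimilarityLoss` optimizes — the squared error between the embeddings' cosine similarity and a target label — here is a minimal sketch:

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def cosine_similarity_loss(u, v, label):
    """Squared error between cos(u, v) and a target similarity in [-1, 1]."""
    return (cosine(u, v) - label) ** 2

# Identical embeddings with target similarity 1.0 give zero loss.
print(cosine_similarity_loss([1.0, 0.0], [1.0, 0.0], 1.0))  # → 0.0
```

Which loss fits best depends on whether the labels are continuous similarities or discrete classes; this sketch only mirrors the continuous case.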