-
Hi,
I am looking for ImageNet-pretrained weights of the YOLOX backbone. I am specifically interested in the largest model, YOLOX-x. In a couple of other issues I've seen that the nano version can be train…
-
Hi, thanks for your work! Do you plan to release the pretraining code, along with the training dataset?
-
Hi Siqi,
Thanks for releasing the great code.
I cannot find the pretraining code for the in-batch negative examples in this repository. Could you point me to the implementation?
And it seem…
-
Hi,
Thank you very much for the great work, and for making your code publicly available.
I am trying to run the code to reproduce the results; however, the pre-training datasets are missing from …
-
Hey,
I am trying to train Funnel Transformer with the following hparams. The CPU usage for my TPUv3-8 has not gone above 4% in the 90 hours the code has been running, and it seems to be very slow, to…
-
In Tables 3 & 4, is the same dataset used during pre-training and fine-tuning? Or does the fine-tuning happen only on the ImageNet-1k dataset?
-
**Describe the bug**
Running the *BERT* pretraining, I encountered two issues:
1. The error "TransformerEngine only supports softmax compute in FP32". I needed to add `--attention-softmax-in-fp32` to the model ar…
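For reference, a minimal sketch of applying that flag, assuming Megatron-LM's `pretrain_bert.py` entry point; the model-size and data arguments shown here are placeholders, not the reporter's actual configuration:

```shell
# Hedged example: append --attention-softmax-in-fp32 to the BERT pretraining
# command so TransformerEngine computes softmax in FP32 as it requires.
# All other flags/paths below are assumed placeholders.
python pretrain_bert.py \
    --num-layers 24 \
    --hidden-size 1024 \
    --num-attention-heads 16 \
    --data-path /path/to/my-bert_text_sentence \
    --attention-softmax-in-fp32
```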
-
I think there could be value in creating a separate dataset for pretraining. It would cover the same chemical space as the standard SPICE dataset, but have many more conformations and be computed at …
-
Recently, I have been conducting applied research on Target Speaker Extraction, but I have encountered many difficulties. I came across your paper titled 'Generative Speech Foundation Model Pretrainin…
-
Unsloth is not supported with CUDA 12.4. Are there any alternate methods to use Unsloth with CUDA 12.4? Also, are there any other frameworks supported with CUDA 12.4 for continual pretraining of llm…