-
I have been trying to follow the steps listed under "reproducing GPT-2" from the README.md. Unfortunately, when I run the model, my training always diverges. I have tried switching up my learning rate…
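In case it helps, this is roughly the kind of stabilization I was about to try next: gradient clipping plus linear LR warmup. A minimal PyTorch sketch, where the model, data, and every hyperparameter are placeholders rather than the repo's actual settings:

```python
import torch

# Placeholder model/optimizer standing in for the actual GPT-2 setup.
model = torch.nn.Linear(768, 768)
optimizer = torch.optim.AdamW(model.parameters(), lr=6e-4, weight_decay=0.1)

warmup_steps = 700  # placeholder value

def lr_scale(step: int) -> float:
    # Linear warmup from 0 to the base LR over warmup_steps, then constant.
    return min(1.0, (step + 1) / warmup_steps)

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_scale)

for step in range(1000):
    x = torch.randn(8, 768)        # placeholder batch
    loss = model(x).pow(2).mean()  # placeholder loss
    optimizer.zero_grad()
    loss.backward()
    # Clip the global grad norm so early updates can't blow up.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    optimizer.step()
    scheduler.step()
```

Is this the right direction, or is the divergence likely coming from somewhere else?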
-
Thanks for your great work! Will the models that have undergone Alignment Pretraining be open-sourced (as opposed to the models after SFT)? When will they be open-sourced?
-
Hello, I'm encountering an issue reproducing your experiment. I attempted to use the provided checkpoint for generation without pretraining or fine-tuning. However, the results are significantly worse…
-
I know the general recommendation is to leave the backbone frozen and train task-specific heads. However, I'm interested in continuing pre-training to better fit the backbone features to my dataset. Is…
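Concretely, what I have in mind is unfreezing the backbone but giving it a much smaller learning rate than the head. A minimal sketch, assuming a generic PyTorch setup (the module names and LR values are hypothetical, not from this repo):

```python
import torch

# Hypothetical stand-ins for the real backbone and task head.
backbone = torch.nn.Sequential(torch.nn.Linear(128, 128), torch.nn.ReLU())
head = torch.nn.Linear(128, 10)

# Unfreeze the backbone instead of keeping it frozen.
for p in backbone.parameters():
    p.requires_grad = True

# Per-group learning rates: a small LR for the backbone so continued
# pre-training nudges the features without destroying them.
optimizer = torch.optim.AdamW([
    {"params": backbone.parameters(), "lr": 1e-5},
    {"params": head.parameters(), "lr": 1e-3},
])
```

Does a per-group LR split like this seem reasonable, or would you recommend a different recipe?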
-
Dear author, you have a pre-trained model on GitHub; on which dataset was this model pre-trained? In your paper, you mentioned using the CATH dataset for pre-training. I think it is an interesting dat…
-
I'm new to deep learning but have some experience with training boosted decision trees.
Is this just for fine-tuning, or pretraining as well? When I look inside train_gpt2.c, I see the first thing it…
-
I appreciate this awesome work, and I am currently going to try finetuning DNTR on my own dataset.
But along the way I ran into trouble with the training configuration.
https://github.com/hoiliu-0801…
-
If I want to generate an embedded representation from a pre-trained model, given some SMILES sequences, how do I modify the code and preprocess the data?
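To frame the question, here is a rough sketch of what I am trying to do, written against a Hugging Face-style interface; the checkpoint path, tokenizer behavior, and mean-pooling choice are all my assumptions, not your actual API:

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Placeholder path; the real pre-trained checkpoint would go here.
ckpt = "path/to/pretrained-checkpoint"
tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = AutoModel.from_pretrained(ckpt)
model.eval()

smiles = ["CCO", "c1ccccc1"]  # example SMILES strings
inputs = tokenizer(smiles, padding=True, return_tensors="pt")

with torch.no_grad():
    out = model(**inputs)

# Mean-pool the last hidden states over non-padding tokens
# to get one embedding vector per SMILES string.
mask = inputs["attention_mask"].unsqueeze(-1)
emb = (out.last_hidden_state * mask).sum(1) / mask.sum(1)
print(emb.shape)  # (batch, hidden_dim)
```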
-
Hi,
a bit of a noob question: should I add some shuffling, or is this expected for homography pretraining?
![IMG_6165](https://github.com/cvg/LightGlue/assets/4803565/ed07e118-1047-4979-aef3-e5471…
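For clarity, by "shuffling" I mean passing shuffle=True to the DataLoader, as in this generic PyTorch sketch (the dataset here is a placeholder, not LightGlue's actual homography loader):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Placeholder dataset standing in for the homography pre-training data.
dataset = TensorDataset(torch.randn(100, 3, 32, 32))

# shuffle=True re-orders samples every epoch, which usually smooths
# the loss curve compared to a fixed iteration order.
loader = DataLoader(dataset, batch_size=16, shuffle=True)
```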
-
Hello, thank you for the research.
Please share more info about the pre-training process.
Data:
- language data (total number of text tokens the model has seen during pretraining)
- images (total amoun…