-
While using the continued pretraining method with the Llama 3.2 1B model, I'm encountering an 'OutOfMemoryError: CUDA out of memory.' I've already set the batch size and other parameters to their lowest…
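For context, the usual levers beyond a smaller batch size are gradient accumulation, gradient checkpointing, and paged 8-bit optimizer states. The sketch below uses Hugging Face `TrainingArguments`; the output directory and step counts are placeholder assumptions, not settings from this report.
```
from transformers import TrainingArguments

# Illustrative memory-saving settings for a ~1B-parameter continued-pretraining run.
# All values are assumptions for the sketch, not the reporter's actual configuration.
training_args = TrainingArguments(
    output_dir = "outputs",              # placeholder path
    per_device_train_batch_size = 1,     # smallest micro-batch
    gradient_accumulation_steps = 16,    # preserves the effective batch size
    gradient_checkpointing = True,       # trades compute for activation memory
    bf16 = True,                         # use fp16=True on GPUs without bfloat16
    optim = "paged_adamw_8bit",          # 8-bit optimizer states (requires bitsandbytes)
    max_steps = 1000,                    # placeholder
    logging_steps = 10,
)
```
If training still runs out of memory at a micro-batch of 1, parameter-efficient approaches such as LoRA or loading the base model in 4-bit are the usual next step.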
-
Thank you for your outstanding work, but I have still run into many problems while reproducing the pre-training results.
I use the following command to pre-train the groundingdino_swint model:
bash …
-
Hi An,
Thanks for open-sourcing the ETPNav code. ETPNav is fascinating work! I am opening this issue to ask where to find the feature-processing code for pretraining. Or do you plan to open this …
-
Hello! I'm very interested in your great work! I have two questions about pretraining.
Does the generalization ability of UMT come from CLIP? With this in mind, regardless of what kind of pre-traini…
-
### Start Date
_No response_
### Implementation PR
_No response_
### Reference Issues
_No response_
### Summary
I would like to do post-pretraining on my own domain data. Could you share your pretraining code…
-
```
from trl import SFTTrainer
from transformers import TrainingArguments, DataCollatorForSeq2Seq
from unsloth import is_bfloat16_supported

trainer = SFTTrainer(
    model = model,
    tokenizer = tokenizer,
    # ... remaining arguments truncated in the original issue
)
```
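The call above is cut off after the `tokenizer` argument. As a hedged sketch of how such an unsloth-style `SFTTrainer` setup is usually completed (the `dataset` variable, sequence length, and step counts are illustrative assumptions, not the poster's values):
```
# Hedged completion sketch; dataset, max_seq_length, and step counts are assumptions.
trainer = SFTTrainer(
    model = model,
    tokenizer = tokenizer,
    train_dataset = dataset,                         # assumed: dataset with a "text" column
    dataset_text_field = "text",
    max_seq_length = 2048,                           # assumed value
    data_collator = DataCollatorForSeq2Seq(tokenizer = tokenizer),
    args = TrainingArguments(
        per_device_train_batch_size = 2,
        gradient_accumulation_steps = 4,
        max_steps = 100,                             # assumed value
        fp16 = not is_bfloat16_supported(),
        bf16 = is_bfloat16_supported(),
        output_dir = "outputs",                      # placeholder path
    ),
)
```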
-
You mentioned in a previous issue that we can load a pretrained model and convert its conv2d layers to partialconv. How would you do this, given that the model structure is fixed in pretrained models? My model is:
```
class …  # (model definition truncated in the original issue)
```
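One common way to handle a fixed architecture is to load the pretrained model as-is and then recursively swap each `nn.Conv2d` for a partial-convolution layer that reuses its weights. The sketch below is a hedged illustration: it assumes a `PartialConv2d` class with an `nn.Conv2d`-compatible constructor (as in NVIDIA's partialconv code), and the import path and helper name are placeholders, not part of this repository.
```
import torch.nn as nn
# Assumption: PartialConv2d takes the same constructor arguments as nn.Conv2d.
# The import path below is a placeholder for wherever that class lives in your setup.
from partialconv2d import PartialConv2d

def convert_conv2d_to_partialconv(module: nn.Module) -> nn.Module:
    """Recursively replace every nn.Conv2d with a PartialConv2d, copying pretrained weights."""
    for name, child in module.named_children():
        if isinstance(child, nn.Conv2d):
            new_conv = PartialConv2d(
                child.in_channels, child.out_channels,
                kernel_size=child.kernel_size, stride=child.stride,
                padding=child.padding, dilation=child.dilation,
                groups=child.groups, bias=child.bias is not None,
            )
            # Reuse the pretrained parameters so behaviour matches the original conv.
            new_conv.weight.data.copy_(child.weight.data)
            if child.bias is not None:
                new_conv.bias.data.copy_(child.bias.data)
            setattr(module, name, new_conv)
        else:
            convert_conv2d_to_partialconv(child)
    return module
```
Calling this after the checkpoint has been loaded keeps `load_state_dict` unchanged while the converted layers start from the original convolution weights.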
-
## Background
The infini-gram paper claims to provide an efficient method for computing pretraining term frequencies.
https://arxiv.org/pdf/2401.173…
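As a rough illustration of the interface involved, the sketch below queries n-gram counts over an indexed pretraining corpus through the public infini-gram API; the endpoint URL, index name, and field names follow my recollection of the infini-gram documentation and should be treated as assumptions to verify, not as part of this task.
```
import requests

# Assumption: the public infini-gram API accepts a JSON payload with
# "index", "query_type", and "query" fields; verify against the current docs.
def term_count(term: str, index: str = "v4_rpj_llama_s4") -> int:
    resp = requests.post(
        "https://api.infini-gram.io/",
        json={"index": index, "query_type": "count", "query": term},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()["count"]

# Example: frequency of a term in the indexed pretraining corpus.
# print(term_count("gradient descent"))
```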
-
Hi,
My team and I are trying to reproduce the results of your paper, but cannot. Would it be possible to get access to the pretraining code? That would help us a lot. Thank you.
-
- Only depend on standard ImageNet files
- Facilitate swapping in a different dataset
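
A minimal sketch of what these two goals could look like together, assuming a torchvision-style image-classification pipeline; the `build_dataset` helper and directory layout are illustrative assumptions, not code from this repository:
```
import os
from torchvision import datasets, transforms

def build_dataset(root: str, split: str = "train"):
    """Illustrative loader: any dataset laid out like standard ImageNet
    (root/<split>/<class_name>/*.JPEG) can be swapped in unchanged."""
    if split == "train":
        tfm = transforms.Compose([
            transforms.RandomResizedCrop(224),
            transforms.RandomHorizontalFlip(),
            transforms.ToTensor(),
        ])
    else:
        tfm = transforms.Compose([
            transforms.Resize(256),
            transforms.CenterCrop(224),
            transforms.ToTensor(),
        ])
    return datasets.ImageFolder(os.path.join(root, split), transform=tfm)
```
Anything arranged in the standard ImageNet folder format drops in without code changes; a different dataset would only require replacing this one helper.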
Related issues:
- #233
- #126
- #100
- #72