-
I want to know: are you pretraining the teacher on ImageNet?
In the paper, they mention that the teacher is pretrained on ImageNet. Does your repo follow that?
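For reference, here is a minimal sketch of how an ImageNet-pretrained teacher is typically obtained in PyTorch, assuming a torchvision backbone; this illustrates the usual pattern, not necessarily what this repo actually does:

```python
# Hypothetical sketch: loading an ImageNet-pretrained teacher via torchvision.
# The backbone (ResNet-50) is an assumption, not this repo's confirmed choice.
import torchvision.models as models

teacher = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
teacher.eval()  # inference mode: freezes batch-norm statistics
for p in teacher.parameters():
    p.requires_grad_(False)  # the teacher is not updated during distillation
```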
-
### Describe the bug
We pretrain large models with [fairseq](https://github.com/pytorch/fairseq) and log progress with wandb. During the run, wandb stops logging and the run is shown as crashed (eve…
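A common workaround when the wandb backend marks a still-running job as crashed is to pin an explicit run id so logging resumes under the same run after a restart. Note that fairseq initializes wandb internally, so the sketch below only illustrates the underlying resume mechanism; the project name and run id are placeholders:

```python
# Hedged workaround sketch: reuse a fixed run id across restarts so wandb
# resumes the same run instead of leaving it marked as crashed.
import wandb

run_id = "fairseq-pretrain-01"  # hypothetical fixed id, reused on every restart
wandb.init(project="fairseq-pretraining", id=run_id, resume="allow")
wandb.log({"loss": 0.0})  # subsequent logs attach to the resumed run
```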
-
It would be best to create a branch from the development branch and work there on fixing the problem.
-
Hello!
I am trying to pretrain an adapter using the `4_pretrain_adapter.sh` script.
I have a GeForce RTX 2080 SUPER installed (~8GB VRAM), with NVIDIA Driver Version: 440.33.01, CUDA Version: 10.…
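Since driver 440.33.01 supports at most a CUDA 10.2 toolkit, it may be worth confirming what the training process actually sees before debugging the script itself. A quick sanity check using standard PyTorch calls (assuming the script runs on PyTorch):

```python
# Environment sanity check: PyTorch build, its CUDA toolkit, and visible VRAM.
import torch

print(torch.__version__, torch.version.cuda)  # PyTorch version + bundled CUDA
print(torch.cuda.is_available())              # driver/runtime visibility
props = torch.cuda.get_device_properties(0)
print(props.name, props.total_memory / 1024**3, "GiB")  # ~8 GiB on a 2080 SUPER
```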
-
## Title: FACMIC: Federated Adaptive CLIP Model for Medical Image Classification
## Link: https://arxiv.org/abs/2410.14707
## Abstract:
Federated learning (FL) has attracted attention as an approach that enables training deep learning models on decentralized data while ensuring data privacy. However, in FL, communication cost is an important factor when evaluating model performance…
-
Hello,
Thanks for the super interesting paper. I actually came across your poster at ACL, and after reading the whole paper I have a few questions regarding experimental details:
1. During pretr…
-
Hi, could you please tell me how DeepRT+ does the calibration using a certain ratio of the test-group peptides after pretraining on the big dataset?
Thanks.
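For context, retention-time calibration is often done by fitting a simple linear (least-squares) map from predicted to observed retention times on a small held-out fraction of peptides. A generic sketch of that idea, not necessarily DeepRT+'s exact procedure:

```python
# Generic RT-calibration sketch (NOT the confirmed DeepRT+ method): fit a
# linear map on a fraction of test-group peptides, then apply it to all.
import numpy as np

def calibrate(pred_rt: np.ndarray, obs_rt: np.ndarray, ratio: float = 0.1):
    """Fit y = a*x + b on the first `ratio` of peptides, apply everywhere."""
    n_cal = max(2, int(len(pred_rt) * ratio))
    a, b = np.polyfit(pred_rt[:n_cal], obs_rt[:n_cal], deg=1)
    return a * pred_rt + b  # calibrated predictions for the full set
```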
-
There seems to be an issue with the Gesture Dataset: it has repetitive channels. The output below shows the training dataset.
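One quick way to confirm the duplication, assuming the data loads as an array shaped `(samples, channels, length)`:

```python
# Duplicate-channel check; assumes array shape (N, channels, length).
import numpy as np

def find_duplicate_channels(x: np.ndarray):
    """Return index pairs of channels whose values are numerically identical."""
    n_ch = x.shape[1]
    return [(i, j) for i in range(n_ch) for j in range(i + 1, n_ch)
            if np.allclose(x[:, i, :], x[:, j, :])]
```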
-
Hi,
I am repeatedly facing an error saying that a parameter of a random checkpoint does not exist.
This happens whenever I pretrain the model from scratch.
Whenever I run the code, the iterations (epochs)…
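In PyTorch, the usual way to see exactly which parameter names mismatch is to load the checkpoint non-strictly and inspect the returned keys. A minimal sketch, with a stand-in model and a placeholder checkpoint path:

```python
# Diagnostic sketch (standard PyTorch): list mismatched parameter names
# instead of failing on the first one. Model and path are placeholders.
import torch
import torch.nn as nn

model = nn.Linear(10, 10)  # stand-in for the actual network
state = torch.load("checkpoint.pt", map_location="cpu")  # placeholder path
incompat = model.load_state_dict(state, strict=False)
print("missing keys:   ", incompat.missing_keys)     # in model, not in checkpoint
print("unexpected keys:", incompat.unexpected_keys)  # in checkpoint, not in model
```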
-
Hi @danielhanchen
I tried fine-tuning the Llama 3.2-1B base model for two of my tasks, following the example notebook below:
https://colab.research.google.com/drive/1tEd1FrOXWMnCU9UIvdYhs61tkxdMuKZu?usp=sha…
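For reference, the loading pattern in those Unsloth notebooks typically looks roughly like the sketch below; the model name and LoRA hyperparameters here are assumptions, not values from this issue:

```python
# Rough sketch of the usual Unsloth fine-tuning setup; names and
# hyperparameters are illustrative assumptions.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-1B",  # base (not instruct) model
    max_seq_length=2048,
    load_in_4bit=True,  # 4-bit quantization to fit small GPUs
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,            # LoRA rank
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```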