-
I use one node with four GPUs (V100, 32 GB) for pretraining, but parallel training behaves strangely: all **four** processes run on **one** GPU (device:0).
Why does this happen? Thanks for everyone's help!
I u…
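A minimal sketch of the most common cause and fix, assuming the processes are launched with `torchrun` and PyTorch DistributedDataParallel (the model below is a placeholder): each process has to pin itself to its local rank's device, otherwise every rank defaults to `cuda:0`.

```python
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets LOCAL_RANK for each of the four processes.
    local_rank = int(os.environ["LOCAL_RANK"])

    # Without this call, every process allocates on cuda:0,
    # which matches the symptom described above.
    torch.cuda.set_device(local_rank)
    dist.init_process_group(backend="nccl")

    model = torch.nn.Linear(512, 512).cuda(local_rank)  # placeholder model
    model = DDP(model, device_ids=[local_rank])
    # ... training loop ...

if __name__ == "__main__":
    main()
```

Launched as `torchrun --nproc_per_node=4 train.py`, each rank should then show up on its own device in `nvidia-smi`.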
-
Optimus/doc/optimus_finetune_language_models.md
beta=0, latent size = 32 https://chunylcus.blob.core.windows.net/machines/msrdl/optimus/output/pretrain/philly_rr3_vc4_g8_base_vae_wikipedia_pretrain…
-
## ❓ Questions and Help
#### What is your question?
I am trying to replicate HuBERT base pretraining (iteration 1) on LibriSpeech 960h. However, the training curve looks odd, as the unmask co…
-
Hi,
I am implementing fine-tuning of exBERT for sequence classification. I have already done the pretraining on my data. However, since the pre-training Python script that you have provided is only f…
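For clarity, a rough sketch of the kind of classification wrapper being asked about, assuming the pretrained exBERT encoder can be loaded as a PyTorch module that returns token-level hidden states (the encoder interface, hidden size, and pooling choice are assumptions, not from the exBERT code):

```python
import torch
import torch.nn as nn

class ExBertForSequenceClassification(nn.Module):
    """Hypothetical wrapper: pretrained exBERT encoder + pooled linear head."""

    def __init__(self, encoder, hidden_size=768, num_labels=2, dropout=0.1):
        super().__init__()
        self.encoder = encoder                      # pretrained encoder (placeholder)
        self.dropout = nn.Dropout(dropout)
        self.classifier = nn.Linear(hidden_size, num_labels)

    def forward(self, input_ids, attention_mask, labels=None):
        # Assumes the encoder returns hidden states of shape
        # (batch, seq_len, hidden_size); pool on the first ([CLS]) position.
        hidden_states = self.encoder(input_ids, attention_mask)
        pooled = self.dropout(hidden_states[:, 0])
        logits = self.classifier(pooled)
        if labels is not None:
            loss = nn.functional.cross_entropy(logits, labels)
            return loss, logits
        return logits
```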
-
Following the script referenced here: https://qwen.readthedocs.io/zh-cn/latest/training/SFT/example.html
Using 24 A100s for SFT of the 7B model, I get OOM once model_max_length exceeds 20k.
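A minimal sketch of the memory-saving settings that usually apply in this situation, assuming the HuggingFace `Trainer` path used by that example (the output directory and DeepSpeed config filename are placeholders, not from the Qwen script):

```python
from transformers import TrainingArguments

# Assumption: gradient checkpointing plus ZeRO-3 is the usual way to push
# model_max_length higher before hitting OOM on fixed GPU memory.
training_args = TrainingArguments(
    output_dir="output_qwen_sft",          # placeholder
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    gradient_checkpointing=True,           # trades compute for activation memory
    bf16=True,
    deepspeed="ds_config_zero3.json",      # placeholder ZeRO-3 config
)
```

Even with parameters and optimizer states sharded by ZeRO-3, activation memory typically becomes the limiting factor at 20k+ tokens, which is why gradient checkpointing matters here.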
-
It is mentioned in the repo that the pretraining step should run for some time; please mention after how much time I should interrupt it.
Also, I can't use the pretrained npz file as I'm planning to…
-
Thank you for open-sourcing and maintaining this project.
I want to cite your paper and reproduce your experimental results. However, I find that the region-level files for pretraining (e.g. 201511_…
-
Are there any appropriate setups or losses in sentence-transformers for pretraining sentence embeddings in cases where I have labels as targets?
(I want to finetune the actual embeddings, not just a…
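A minimal sketch of one label-based option, assuming the pre-3.0 `model.fit` API and a placeholder base model: `BatchAllTripletLoss` mines triplets from integer class labels inside each batch, so it updates the embeddings themselves rather than just a head on top.

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

model = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder base model

# Each example is a single sentence with an integer class label.
train_examples = [
    InputExample(texts=["great build quality"], label=0),
    InputExample(texts=["solid and well made"], label=0),
    InputExample(texts=["battery died after a week"], label=1),
    InputExample(texts=["stopped working quickly"], label=1),
]

train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=4)
train_loss = losses.BatchAllTripletLoss(model=model)

model.fit(train_objectives=[(train_dataloader, train_loss)],
          epochs=1,
          warmup_steps=10)
```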
-
Can you please let me know how we can do unsupervised training for a T5 model? This [link](https://www.sbert.net/examples/unsupervised_learning/MLM/README.html) and this [link](https://github.com/huggingfa…
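For reference, a minimal sketch of T5's unsupervised denoising (span-corruption) objective in plain `transformers`, with a toy example; the model name is only illustrative. Spans are dropped from the input, marked with sentinel tokens, and the model is trained to regenerate them.

```python
from transformers import T5TokenizerFast, T5ForConditionalGeneration

tokenizer = T5TokenizerFast.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Corrupted input: dropped spans are replaced by sentinel tokens.
corrupted = "The <extra_id_0> walks in <extra_id_1> park"
# Target: the dropped spans, each prefixed by its sentinel token.
targets = "<extra_id_0> cute dog <extra_id_1> the <extra_id_2>"

input_ids = tokenizer(corrupted, return_tensors="pt").input_ids
labels = tokenizer(targets, return_tensors="pt").input_ids

loss = model(input_ids=input_ids, labels=labels).loss
loss.backward()
```

A real pretraining run would generate the corrupted/target pairs automatically from raw text instead of writing them by hand as above.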
-
As mentioned in issue [https://github.com/google-research/big_transfer/issues/26], the loss is sigmoid binary cross-entropy for each label. I have a few more questions about the loss:
1) How is the obje…
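To pin down the per-label sigmoid cross-entropy part, a small numeric sketch of what that usually means (PyTorch with multi-hot targets; this is my reading, not necessarily the repo's exact implementation):

```python
import torch
import torch.nn.functional as F

# One image, three candidate labels; the ground truth is multi-hot,
# so each label gets its own independent sigmoid + binary cross-entropy.
logits = torch.tensor([[2.0, -1.0, 0.5]])
targets = torch.tensor([[1.0, 0.0, 1.0]])

loss = F.binary_cross_entropy_with_logits(logits, targets)
print(loss)  # the per-label losses are averaged into a single scalar
```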