-
## Paper link
https://arxiv.org/abs/2103.00020
## Publication date (yyyy/mm/dd)
2021/01/05
## Summary
The paper on CLIP (Contrastive Language-Image Pre-training), which OpenAI announced and which was also used for reranking in DALL·E.
From text on the Web, without special a…
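The core idea is a symmetric contrastive objective between image and text embeddings. A minimal PyTorch sketch of that loss (the encoder outputs and the fixed temperature here are placeholders; the paper learns the temperature and gives similar pseudocode):

```python
import torch
import torch.nn.functional as F

def clip_loss(image_features, text_features, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of (image, text) pairs."""
    # L2-normalize so the dot product is cosine similarity
    image_features = F.normalize(image_features, dim=-1)
    text_features = F.normalize(text_features, dim=-1)

    # [batch, batch] similarity matrix; matching pairs sit on the diagonal
    logits = image_features @ text_features.t() / temperature
    labels = torch.arange(logits.size(0), device=logits.device)

    # Cross-entropy in both directions (image->text and text->image)
    loss_i = F.cross_entropy(logits, labels)
    loss_t = F.cross_entropy(logits.t(), labels)
    return (loss_i + loss_t) / 2
```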
-
Thank you very much for your work. Could you please share your pre-training dataset? I would like to use a model with a longer max_length in place of CodeBERT to handle longer input data; in the meantime,…
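In case it is useful, a hedged sketch of swapping in a longer-context encoder (allenai/longformer-base-4096 is only one candidate, not the repository's recommendation, and the surrounding training code would still need adapting):

```python
from transformers import AutoModel, AutoTokenizer

# Longformer accepts up to 4096 tokens vs. the 512 of codebert-base
tokenizer = AutoTokenizer.from_pretrained("allenai/longformer-base-4096")
model = AutoModel.from_pretrained("allenai/longformer-base-4096")

long_source_code = "def f(x):\n    return x + 1\n" * 200  # stand-in for a long input
inputs = tokenizer(long_source_code, truncation=True, max_length=4096,
                   return_tensors="pt")
outputs = model(**inputs)
```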
-
Hello,
I hope this message finds you well. I have successfully executed the process_GDSCv2.py script as per the provided instructions. However, when I attempted to run the main.py script to train…
-
Hello, could you explain how the pre-trained model loaded during the training of DA-CLIP was obtained? I noticed that the model “laion2b_s34b_b79k” is used in your training command, but in…
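For reference while waiting for an answer: laion2b_s34b_b79k is a pretrained tag in open_clip for the ViT-B-32 architecture (trained by LAION on LAION-2B, not by the DA-CLIP authors), and loading it looks roughly like this:

```python
import open_clip

# 'laion2b_s34b_b79k': LAION-2B data, 34B samples seen, 79k batch size
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")
```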
-
Will the training code be released?
-
### The model to consider
https://huggingface.co/tencent/Tencent-Hunyuan-Large
Tencent released a 389B-parameter MoE with only 52B activated parameters, which beats Llama 3.1 405B.
There are three chec…
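A hedged loading sketch (assuming the checkpoint follows the usual transformers custom-code pattern; verify the exact kwargs against the model card):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tencent/Tencent-Hunyuan-Large"

# trust_remote_code is assumed because MoE releases often ship custom
# modeling code; device_map="auto" shards the large weights across GPUs.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto", trust_remote_code=True
)
```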
-
### General
- [x] Prepare scaling plots by the end of February. Y-axis: the speedup we get when running one epoch through the model on 2, 4, 6, 8, and 10 GPUs (see the sketch after this list).
- [x] Find out how many samples we have in the …
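A minimal sketch of how the speedup plot could be produced (the timings below are placeholder values, not measurements, and the 2-GPU run is assumed as the baseline):

```python
import matplotlib.pyplot as plt

# Placeholder wall-clock times (seconds) for one epoch; replace with
# the measured values for 2, 4, 6, 8, and 10 GPUs.
gpus = [2, 4, 6, 8, 10]
epoch_seconds = [520.0, 270.0, 190.0, 150.0, 125.0]

# Speedup relative to the smallest measured configuration (2 GPUs);
# use a 1-GPU run as the baseline instead if one is available.
baseline = epoch_seconds[0]
speedup = [baseline / t for t in epoch_seconds]

plt.plot(gpus, speedup, marker="o", label="measured")
plt.plot(gpus, [g / gpus[0] for g in gpus], "--", label="ideal (linear)")
plt.xlabel("GPUs")
plt.ylabel("speedup (one epoch)")
plt.legend()
plt.savefig("scaling.png")
```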
-
We would like to evaluate model performance for various LLM fine-tuning approaches and compare them against standard benchmarks; a setup sketch follows the list below. An experiment we would like to try is:
- **Compare the full car…
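A minimal setup sketch for such a comparison (assuming Hugging Face peft; the base model and LoRA hyperparameters are placeholders, not part of the original plan):

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model_id = "Qwen/Qwen2.5-0.5B"  # placeholder base model

# Variant A: full fine-tuning -- every parameter stays trainable.
full_ft = AutoModelForCausalLM.from_pretrained(model_id)

# Variant B: LoRA -- freeze the base weights, train low-rank adapters only.
lora_cfg = LoraConfig(task_type="CAUSAL_LM", r=16, lora_alpha=32,
                      lora_dropout=0.05, target_modules=["q_proj", "v_proj"])
lora_ft = get_peft_model(AutoModelForCausalLM.from_pretrained(model_id), lora_cfg)
lora_ft.print_trainable_parameters()  # sanity check: far fewer trainables
```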
-
Hi.
I want to predict additional 3D keypoints, such as the hands in halpe136, or the small toe and big toe in halpe26.
What else do I need to do?
Should I do pre-training, or just fine-tuning?
I…
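If it helps frame the question: one generic approach (a hypothetical PyTorch sketch, not AlphaPose's actual code) is to widen the heatmap head to the new keypoint count, keep the pretrained backbone, and fine-tune on data that has the extra annotations:

```python
import torch.nn as nn

NUM_OLD_KPTS = 26   # e.g. halpe26
NUM_NEW_KPTS = 136  # e.g. halpe136 with face/hand/toe points

class PoseHead(nn.Module):
    """Stand-in for the final layer of a heatmap-based pose model."""
    def __init__(self, in_channels=256, num_kpts=NUM_OLD_KPTS):
        super().__init__()
        self.out = nn.Conv2d(in_channels, num_kpts, kernel_size=1)

    def forward(self, feats):
        return self.out(feats)  # one heatmap per keypoint

head = PoseHead()
# Swap only the output conv so the pretrained backbone is kept;
# the new keypoints then need fine-tuning on annotated data.
head.out = nn.Conv2d(256, NUM_NEW_KPTS, kernel_size=1)
```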
-
I want to know the exact splits of AudioSet and VGGSound used to train CLAP. Many audio-related datasets for downstream tasks were collected from these two large-scale datasets, so if all thei…
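To make the leakage concern concrete: both AudioSet and VGGSound index clips by YouTube ID, so overlap between a training split and a downstream evaluation set can be checked by intersecting ID sets (a hypothetical sketch; the file names are placeholders):

```python
def load_ids(path):
    """One YouTube ID per line; the files here are hypothetical."""
    with open(path) as f:
        return {line.strip() for line in f if line.strip()}

clap_train = load_ids("clap_train_ids.txt")       # CLAP training clips
downstream = load_ids("downstream_eval_ids.txt")  # a downstream eval set

overlap = clap_train & downstream
print(f"{len(overlap)} of {len(downstream)} eval clips also appear in training")
```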