tri-training Search Results

731 results
for tri-training


Jinghe-mel/UFEN-SLAM #4

Hello, could you provide the training code for UFEN's binary…

bumblebee15138 updated 4 months ago
8
AbrahamYabo/Cascade-Zero123 #3

How to train on a custom dataset?

Dear authors, thank you for the great work. I can't fully download the dataset. Can you give me some advice on how to create my own dataset? What data structure should I prepare? Thanks

Gaohhy updated 6 months ago
4
Lightning-AI/litgpt #620

Sample packing for pretraining/fine-tuning

I was wondering if there are sample packing approaches defined somewhere for preprocessing and tokenization of datasets? I looked through different prepare_*.py, but couldn't find anything related to …

alitirmizi23 updated 4 weeks ago
16
state-spaces/mamba #6

bfloat16 overflow during training session

1. I tried a vanilla PyTorch training loop using bfloat16, and the loss overflowed: https://github.com/mesolitica/malaya/blob/5.1/pretrained-model/mamba/causallm-130m-bf16.ipynb 2. So I tried vanilla pyt…

huseinzol05 updated 4 months ago
15
takiyu/portrait_matting #18

Invalid output and model

kkkmax updated 5 years ago
4
srcn-ivl/UPR-Net #5

Problem of the training on vimeo_triplet

Hello, there was an issue during the training. Is this a data reading issue? Thanks!! The error is as follows: upr-base => val step: 1: 104/119; time: 0.00+0.27 upr-base => val step: 1: 105/119; ti…

Mapzzone updated 4 months ago
13
facebookresearch/xformers #918

What is the difference between the 4 implementations of FMHA…

Hi all, I'm new to xformers, I'm learning the `examples/llama_inference/generate.py` file. I traced it here: ```python def _memory_efficient_attention_forward( inp: Inputs, op: Optional[Type…

sleepwalker2017 updated 4 months ago
7
facebookresearch/fairseq #4177

[data2vec] Loss goes down and up again

I try to train data2vec on music data (the FMA dataset). I've made some modifications to the feature extractor ConvNet (I've made it a small ResNet essentially), and reduced the size of the transforme…

treasan updated 1 year ago
10
mlfoundations/open_lm #126

Figure out why AdamW + gradient accumulation leads to differ…

In https://github.com/mlfoundations/open_lm/pull/125, we had to switch our gradient accumulation tests from SGD to AdamW to make gradient accumulation tests pass. It's unclear why this is the case; an…

achalddave updated 10 months ago
6
TRI-ML/vidar #17

Saving self-calibration models.

Hi. I have several questions about saving models to a local folder. 1, When I run the self-calibration code, in which folders are trained models saved to? 2, Which part of config files should …

YasuThompson updated 1 year ago
5
