bsz Search Results - Githubissues

1000+ results
for bsz

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mistyreed63849/Graph-LLM #1

Questions about some implementation details

Hi! Thanks for your wonderful work. I have some questions about the minor implementation details. For example, ``` adapter_key, adapter_value = adapter adapter_le…

CurryTang updated 10 months ago
2
huggingface/autotrain-advanced #228

DreamBooth: Using both "--train-text-encoder" and "--prior-p…

Both of the following commands work: Not training the text encoder: ``` autotrain dreambooth \ --model stabilityai/stable-diffusion-xl-base-1.0 \ --output output/ \ --image-path images…

Xargonus updated 9 months ago
10
TencentARC/T2I-Adapter #97

Training Color Adapter does not learn

Hi, I would like some help and ideas surrounding training an Adapter conditioned on spatial palette. So far I have the following code, using current diffusers code. Could anyone give some insights…

paudom updated 7 months ago
6
microsoft/DeepSpeed #3472

[BUG]Traning multiple model with deepspeed

**Describe the bug** I am currently attempting to train a txt2img model (both encoder and unet) using deepspeed. I have made some modifications to the code, but I am encountering an error. The error …

uygnef updated 8 months ago
15
unslothai/unsloth #20

Question about triton kernel for changed input shape

Excellent work for training optimization! I have a question for a long time and can an expert like you help me with it? test code is as below: ``` import torch import triton.ops import time …

AlvL1225 updated 9 months ago
12
FreedomIntelligence/HuatuoGPT #37

调用finetune.py发生错误

accelerate launch --config_file ./scripts/sft.yaml --num_processes 8 --num_machines 1 --machine_rank 0 --deepspeed_multinode_launcher standard scripts/finetune.py --experiment_name HuatuoGPT --model_p…

yky3489 updated 9 months ago
1
hiyouga/LLaMA-Factory #1274

aquila-sql微调，推理后乱码

使用aquila-sql微调2000条sql问答，然后训练损失正常下降，但是推理时bleu只有1，结果一堆无意义的字符串，下面是训练损失： ![training_loss](https://github.com/hiyouga/LLaMA-Factory/assets/68314259/101f2ecd-13d0-4fbd-a360-e9f0ace36aed) 输出结果： ![d…

jiaohuix updated 10 months ago
1
mistralai/mistral-inference #38

one_file_ref.py attention has an O(seqlen^2) matrix multipli…

Lines 129-143 in `one_file_ref.py` multiplies the complete query-key matrices with each other, if we are prefilling the key-value cache. The sliding window mask is applied only after this multiplicati…

Aniruddha-Deb updated 11 months ago
1
vgel/repeng #30

RuntimeError: "addmm_impl_cpu_" not implemented for 'Half'

I'm running through the `emotion.ipynb` notebook, running on the CPU. At cell ``` model.reset() # make sure you always reset the model before training a new vector control_vector = ControlVect…

keviddles updated 5 months ago
2
OPUS4/application #1134

Neuen Identifier-Typ ISMN in Application integrieren

Es wurde ein neuer Identifier-Typ ISMN hinzugefügt. Dieser ist für ein neuen Dokumenttyp notwendig (siehe #1131). Für den neuen Typ müssen Übersetzungen usw. hinzugefügt werden.

j3nsch updated 10 months ago
5

上一页 1...89 90 91 92 93 94 95...100 下一页

1000+ results for bsz

1000+ results
for bsz