batch-scheduler Search Results

1000+ results
for batch-scheduler

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

chemprop/chemprop #1068

[v2 FEATURE]: Add option for adaptive annealing coefficient …

In the CLI we set `args.v_kl` to 0 for evidential. This flag is also used for the `v_kl` in dirichlet, but in the dirichlet paper, they suggest using a `v_kl` that changes with epoch: ![image](https:…

KnathanM updated 1 month ago
1
vllm-project/vllm #6913

[RFC]: Asynchronous Output Processor

### Motivation. Each decoding step inside LLMEngine does the following: schedules the sequences to be executed in the next iteration, executes the model and process model outputs. GPU remains larg…

megha95 updated 3 weeks ago
2
aigc-apps/CogVideoX-Fun #56

OOM on H100 when finetuning the 5b inpaint model

Hello, Currently, I've been facing issues with finetuning the 5b-inpaint model on an H100. Using deepspeed with your provided config will cause the trainer to require 90gb of VRAM. Not using deepspee…

Closertodeath updated 4 days ago
12
fe1ixxu/ALMA #67

Some questions about alma.

I want to reproduce this work. Currently, I am in the first stage (monolingual training). My script is as follows: OUTPUT_DIR=${1:-"./saves/llama-2-7b-oscar-ft"} # random port between 30000 and 5…

yuanzhiyong1999 updated 1 week ago
6
pytorch/xla #8402

Kaggle Notebook: model return loss None on TPU

## ❓ Questions and Help Hi, I recieved loss None when training model. Anyone can help? Simple reproduct kaggle notebook [link](https://www.kaggle.com/code/liondude/notebook548442067d) ``` im…

manh3152924 updated 2 days ago
1
unslothai/unsloth #1101

Getting CUDA OOM on training gemma-2-2b with "lm_head" and "…

Hi @danielhanchen I am trying to fine-tune gemma2-2b for my task following the guidelines of the continued finetuning in unsloth. Howver, I am facing OOM while doing so. My intent is to train gemm…

InderjeetVishnoi updated 1 month ago
6
unslothai/unsloth #1019

No Validation Loss logged (possibly related to train_on_resp…

Evaluations are being run, _but no validation loss is logged or sent to WandB_ The console shows that eval is running, but displays a table along the lines of: | eval loss | validation loss | |…

selalipop updated 1 week ago
6
Kosinkadink/ComfyUI-AnimateDiff-Evolved #433

Error occurred when executing KSampler (Efficient): 'NoneTy…

When I run the workflow after updating ComfyUI-Advanced-ControlNet, the following error occurs. How can I solve it? Error occurred when executing KSampler (Efficient): 'NoneType' object has no a…

sundaxiong6 updated 1 month ago
4
KaiyangZhou/deep-person-reid #576

mAP using osnet_x1_0 and resnet50 is weird

Hello, I just runned Get started code as below using pretrained model 'osnet_x1_0' and even 'resnet50' too. However, the result was weird. mAP was just 3.9%.... and when I used resnet50, it was 2.x…

a2082761 updated 3 months ago
2
hiyouga/LLaMA-Factory #5512

How to train the mm_proj and the LLM part with lora of Qwen2…

### Reminder - [X] I have read the README and searched the existing issues. ### System Info Have installed all the requirements for Qwen2-vl ### Reproduction train_mm_proj_only:True Hello, I wan…

leoozy updated 1 week ago
5

上一页 1...9 10 11 12 13 14 15...100 下一页

1000+ results for batch-scheduler

1000+ results
for batch-scheduler