batch-scheduler Search Results

1000+ results
for batch-scheduler

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vllm-project/vllm #6854

[RFC]: Multi-Step Scheduling

### Motivation. TLDR; There is high CPU overhead associated with each decode batch due to the processing and generation of input/output. Multi-step decoding will be able to amortize all these overh…

SolitaryThinker updated 2 weeks ago
14
unslothai/unsloth #1291

Extremely long context finetuning

Hi all, I am trying to fine-tune models in extremely long contexts. I've tested the training setup below, and I managed to finetune: - llama3.1-1B with a max_sequence_length of 128 * 1024 tokens …

GianlucaDeStefano updated 1 week ago
2
mindspore-lab/mindnlp #1780

动态图模式下，分布式训练报错

**Describe the bug/ 问题描述 (Mandatory / 必填)** IA3微调Qwen2-7b-instruct模型，在mindnlp.core.nn.modules.container.py处raise了一个错误： ![image](https://github.com/user-attachments/assets/5ef35812-ef13-4b51-8e95-a00…

dayunyan updated 3 weeks ago
2
muslehal/xLSTMTime #13

how to run

i don't know how to run,if you can teach me ,i'd appreciate. ![image](https://github.com/user-attachments/assets/3fde4f94-0c4d-49e0-8dd2-77b9cce22630)

fengh318 updated 6 days ago
2
hiyouga/LLaMA-Factory #5964

微调Qwen2-VL-2B报错

### Reminder - [X] I have read the README and searched the existing issues. ### System Info LLaMA Factory, version 0.9.1.dev0 torch,verison 2.4+cuda12.1 ########################################…

Liwx1014 updated 1 week ago
5
paninski-lab/lightning-pose #210

pandas incorrectly reads labels file when first row is empty

Hi, I faced the following error when trying to run multi-stream models: using dlc image augmentation pipeline Error executing job with overrides: [] Traceback (most recent call last): File "/…

AarushShintre updated 2 weeks ago
4
hashicorp/nomad #12109

System scheduler exceeds MaxParallel

### Nomad version Nomad v1.1.3 (8c0c8140997329136971e66e4c2337dfcf932692) ### Operating system and Environment details Linux and macOS, different versions. ### Issue We run 1000+ sy…

dubadub updated 2 weeks ago
7
hiyouga/LLaMA-Factory #6111

多机训练的训练速度和单机一样

### Reminder - [X] I have read the README and searched the existing issues. ### System Info 在total_batch_size相同的情况下，单机（8卡）训练速度和多机（16卡）一样。对于想使用这个仓库scale数据规模成了阻碍 ### Reproduction 使用的torchrun调用脚本为…

Wiselnn570 updated 6 hours ago
1
bmaltais/kohya_ss #2970

Google Colab Notebook: ImportError: cannot import name 'spli…

I'm trying to train the data: accelerate launch --num_cpu_threads_per_process=2 "./sdxl_train_network.py" --pretrained_model_name_or_path="/content/civitai/realismEngineSDXL…

davdotsol updated 3 days ago
1
i365dev/event_radar #1

EventRadar

# EventRadar V1 Design Specification ## 1. Overview EventRadar is a lightweight, extensible framework for building event monitoring and processing pipelines in Elixir. ### 1.1 Core Features - Dynam…

madawei2699 updated 2 weeks ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for batch-scheduler

1000+ results
for batch-scheduler