hybrid-parallelism Search Results

482 results for hybrid-parallelism


microsoft/DeepSpeedExamples #760

Why not just use zero3 inference to generate sequence in Dee…

DeepSpeed Chat uses tensor parallelism via the hybrid engine to generate sequences in stage3 training. I wonder whether just using zero3 inference for generation would be OK, so that we don't need to transform model pa…

LSC527 updated 1 year ago
3 comments

run-llama/llama_index #16770

[Question]: Query can't find specific items

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question Hi there! **Background:** ```python from llama_index.core import VectorStoreInde…

martinb-ai updated 1 month ago
1 comment

microsoft/DeepSpeed #5114

[REQUEST] ZeRO - introduce replicas to keep GBS from getting…

Currently the GBS blows up to thousands if MBS is more than 1, which is counter-productive to training. And as clusters become larger and the training needs to happen faster this is becoming more and …

stas00 updated 9 months ago
4 comments

openxla/xla #18090

Inadequate memory consumption when using HSDP without gradie…

Hi, I'm training a transformer model with Hybrid Sharded Data Parallelism. This setup is similar to FSDP/ZeRO-3, where params are all-gathered for each layer's forward/backward pass and dropped afterwards. …

qGentry updated 1 month ago
2 comments

SimpleSSD/SimpleSSD #4

Release SimpleSSD v2.1

Plan/progress for SimpleSSD version 2.1 in our internal repo. Version 2.1 is fully event-driven (v2.0 is functional simulator except HIL). **SimpleSSD** - [ ] Revise all source code - [ ] Host…

kukdh1 updated 2 months ago
1 comment

jhclark/ducttape #163

Smarter traversal with -j flag

ducttape-0.3 defaults to depth-first traversal of the realization graph in order to try different kinds of tasks quickly (and fail fast). But when the user elects to run multiple processes, this orderi…

nschneid updated 10 years ago
2 comments

typelevel/cats #983

Hybrid free monad / free applicative

Hello, I am new to the cats project & open source in general - cats is a great project & I learnt a lot from it :) I've been exploring free monads recently & my understanding is that we can't expres…

bqm updated 6 years ago
22 comments

EDmodel/ED2 #30

SMP Release

Hi All, I put the Shared Memory Parallelism commits on the master. This will allow for the splitting of radiation scattering, photosynthesis and thermodynamics of different patches to different CPU …

rgknox updated 9 years ago
57 comments

hpcaitech/ColossalAI #5056

When I use hybrid_parallel and set the enable_fused_normal…

### 🐛 Describe the bug raise RuntimeError( RuntimeError: Failed to replace input_layernorm of type LlamaRMSNorm with FusedRMSNorm with the exception: Please install apex from source (https://gith…

chensimian updated 1 year ago
9 comments

trinodb/trino #13040

Pinot aggregation not pushdown for "with clause" subqueries

When we query Pinot using "with clause" subqueries (as in attached query1), the "with clause" subqueries are not pushed down into Pinot, which is bad for performance. If we remove the with clause (as in attached query2), it can…

ellieshen updated 2 years ago
7 comments
