hybrid-parallelism Search Results

482 results for hybrid-parallelism

microsoft/DeepSpeed #780

Can a 3D parallelism example for BERT or GPT-2 be provided?

Hi, I noticed that DeepSpeed’s training engine already provides hybrid data and pipeline parallelism and can be further combined with model parallelism such as Megatron-LM. Can you provide a 3D parall…

gongjingcs updated 3 years ago
2
microsoft/DeepSpeed #444

How does Megatron work with the pipeline module?

The tutorial says "DeepSpeed’s training engine provides hybrid data and pipeline parallelism and can be further combined with model parallelism such as Megatron-LM." Is there any example/tutorial…

gongwei-130 updated 4 years ago
2
changh95/WeeklySpatialAI #11

2024.10.01 - #9 - MASt3R-SfM, Hyperion, latentSplat, RL meet…

# Academic papers ## MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion - [Paper link](https://arxiv.org/pdf/2409.19152) - Algorithm flow - Results - Performance when using 200 images…

changh95 updated 2 months ago
1
run-llama/llama_index #16483

[Question]: Node Ingestion in batches

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question When I try to ingest a singular node using simple search everything works fine, but when…

OvaisTariq95 updated 1 month ago
3
xdit-project/xDiT #324

FLUX Hopper benchmarking

**Target:** Measure the scalability of FLUX.1 on NVIDIA Hopper architecture (both H100 & H200) using different model parallelism strategies (see [Flux.1 Performance Overview](https://github.com/xdit-p…

antferdom updated 1 month ago
2
WICG/proposals #47

A faster, parallelizable querySelectorAll

Speeding up DOM accesses would eliminate bottlenecks for a wide range of applications. I recently discovered that the spec mandates that querySelectorAll() return elements in the [document order.](https…

LifeIsStrange updated 5 months ago
7
microsoft/DeepSpeedExamples #760

Why not just use zero3 inference to generate sequence in Dee…

DeepSpeed Chat uses tensor parallelism via the hybrid engine to generate sequences in stage 3 training. I wonder whether just using zero3 inference for generation is OK, so that we don't need to transform model pa…

LSC527 updated 1 year ago
3
volcengine/verl #20

Are the non-RmPad version model and the RmPad version model interchan…

Hi, thanks for your great work! We are attempting to deploy this framework on Volta GPUs without Flash-Attn support. I noticed there are Llama models without RmPad that don't require flash-at…

yanggthomas updated 1 week ago
5
run-llama/llama_index #16770

[Question]: Query can't find specific items

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question Hi there! **Background:** ```python from llama_index.core import VectorStoreInde…

martinb-ai updated 1 month ago
1
uabrc/uabrc.github.io #496

GROMACS software update with MPI and GPU support on Cheaha

Based on ticket request #[RITM0560743](https://uabprod.service-now.com/nav_to.do?uri=%2Fsc_req_item.do%3Fsys_id%3D34fe17a91bc95d902e00eb93604bcb58), the GROMACS software is updated on Cheaha with supp…

Premas updated 1 week ago
1
