tensor-parallelism Search Results

1000+ results
for tensor-parallelism

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vllm-project/vllm #7201

[Bug]: NCCL gives an error when I use tensor_parallel ：Runti…

### Your current environment Collecting environment information... PyTorch version: 2.3.1+cu118 Is debug build: False CUDA used to build PyTorch: 11.8 ROCM used to build PyTorch: N/A OS: Ubunt…

wlll123456 updated 1 month ago
8
tenstorrent/tt-metal #5395

Multi Device Object Spec

Currently our description of massive parallelism comes from sharding but that is within a single core. Our implementation of multi-device sharding is approximated using a slice op on host followed by…

ntarafdar updated 4 months ago
2
pytorch/pytorch #20822

[Proposal] Data reading framework for PyTorch (Hive, MySQL, …

At Facebook we are building a data reading framework for PyTorch which can efficiently read from data stores like Hive, MySQL, our internal blob store and any other tabular data sources. The framework…

pritamdamania87 updated 8 months ago
46
tweag/linear-base #474

&, T tensor

Is there a reason this isn't present in linear-base? I know you can't make & (co)datatypes directly in Haskell and have to encode them, but it seems like it's worth having. I think I read somewhere th…

Jashweii updated 4 months ago
10
pytorch/torchtitan #277

Question; parallelising convolutional layers?

Hi, I was wondering, is `torchtitan` and/or `DTensor` capable of model parallel training of convolutional neural network layers? Pretty much, we want to train a GAN on very large 2D images (eventua…

jvwilliams23 updated 5 months ago
4
sagemath/sage #33703

Parallelization of Boruvka's algorithm

As per the discussion on https://groups.google.com/g/sage-devel/c/R3r3G_Qrllo, opening this ticket to parallelize Boruvka's algorithm. CC: @kliem Component: **graph theory** Author: **Adarsh Kis…

a5cff67d-07f8-46a5-a0b6-93601583e2f7 updated 1 year ago
27
databricks/megablocks #115

Cloning input `x` in `megablocks.layers.glu.SparseGLU` leads…

I am debugging a data-parallel forward mismatch when using `megablocks` (DP and non-DP give different forward results). During debugging, I tried to reproduce such difference minimally, and found that…

cmsflash updated 3 months ago
2
huggingface/text-generation-inference #376

Improve inference speed of Santacoder and Starcoder (and oth…

I did some extensive investigation, testing and benchmarking, and determined that the following is needed to speedup inference for the Bigcode models (and most of text-gen-inference models: 1. **Use …

jlamypoirier updated 1 month ago
8
vllm-project/vllm #6732

[Bug]: VLLM 0.5.3.post1 [rank0]: RuntimeError: NCCL error: u…

### Your current environment ```text The output of `python collect_env.py` Collecting environment information... PyTorch version: 2.3.1+cu121 Is debug build: False CUDA used to build PyTorch: 12…

jueming0312 updated 1 month ago
22
vllm-project/vllm #5793

[Bug]: Different quality responses using GPTQ / marlin kerne…

### 🐛 Describe the bug Hello, I am running llama3-70b and mixtral with VLLM on a bunch of different kinds of machines. I encountered wildly different quality performance on A10 GPUs vs A100/H…

joe-schwartz-certara updated 1 month ago
8

上一页 1...92 93 94 95 96 97 98...100 下一页

1000+ results for tensor-parallelism

1000+ results
for tensor-parallelism