-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question already answered in the FAQ?
-
- TensorFlow version (you are using): 2.0.0b0
- Are you willing to contribute it (Yes/No): No
**Describe the feature and the current behavior/state.**
Currently, sparse tensors don't seem to be s…
-
**Describe the bug**
Unable to use/test fp6 quantization in DeepSpeed 0.14 in inference mode on a GPT2 model. There is little documentation on usage right now, so I am not sure whether I have the wrong init metho…
-
It makes things difficult when we want to handle tensor parallelism.
-
See https://docs.xarray.dev/en/stable/
If I understand correctly, an `xarray` object is made up of the actual `data` array (np.ndarray), and ~~1-D~~ `coordinates` arrays (dictionaries?) that map `d…
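The layout described above can be mimicked with a toy stand-in, which is only a hedged sketch of the issue's description (an n-D `data` array plus 1-D coordinate arrays keyed by dimension name), not xarray's actual internal representation; the variable names and sample values are illustrative assumptions:

``` python
import numpy as np

# The actual values, analogous to an xarray object's `data`.
data = np.arange(6).reshape(2, 3)

# Dict mapping each dimension name to a 1-D array of coordinate labels,
# analogous to xarray's `coords`.
coords = {
    "x": np.array([10, 20]),
    "y": np.array(["a", "b", "c"]),
}

# Label-based lookup: find the value at x == 20, y == "b",
# loosely analogous to da.sel(x=20, y="b") in xarray.
i = int(np.where(coords["x"] == 20)[0][0])
j = int(np.where(coords["y"] == "b")[0][0])
value = data[i, j]
```

The point of the sketch is only that positional indexing on `data` is recovered by first resolving labels through the per-dimension coordinate arrays.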
-
I am referring to this [Issue](https://github.com/NVIDIA/TensorRT-LLM/issues/394) and want to use my own dataset to obtain the SmoothQuant (SQ) scale values.
The scenario is a 70B model with tp=2; the length of input_ids is not …
-
### 🐛 Describe the bug
I have a setup where I am manually sharding weights for 2D parallelism and then constructing this as a DTensor using DTensor.from_local. Everything seems to work fine, except i…
-
### 🐛 Describe the bug
Code to reproduce
``` python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
path = "gpt2"  # any LM would produce the same result
tokenizer = AutoTok…
-
Thanks for sharing the awesome repo.
I've been utilizing Accelerate for training LLMs. My current setup involves using Deepspeed Zero-3 for training a 70B parameter LLaMA-2 model, with a sequence l…
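A ZeRO-3 setup like the one described is typically driven by a DeepSpeed JSON config. A minimal hedged sketch follows — the key names come from DeepSpeed's documented `zero_optimization` schema, but the specific values are illustrative assumptions, not the poster's actual configuration:

```json
{
  "train_micro_batch_size_per_gpu": 1,
  "gradient_accumulation_steps": 8,
  "bf16": { "enabled": true },
  "zero_optimization": {
    "stage": 3,
    "overlap_comm": true,
    "contiguous_gradients": true,
    "offload_param": { "device": "cpu", "pin_memory": true },
    "offload_optimizer": { "device": "cpu", "pin_memory": true }
  }
}
```

With Accelerate, a config of this shape is usually passed via `accelerate config` (DeepSpeed plugin) or a `--deepspeed_config_file` path; CPU offload trades step time for the memory headroom a 70B model at long sequence lengths tends to require.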
-
### System Info
```Shell
python 3.8
pytorch 1.12
openmpi 4.1.0
cuda 11.3
cudnn8
ubuntu 20.04
accelerate==0.14.0
transformers==4.24.0
bitsandbytes==0.35.4
1 node with 4xT4 GPUs
```
…