-
https://stackoverflow.com/questions/64199384/tf-keras-model-predict-results-in-memory-leak
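The linked question concerns memory growing without bound when `model.predict` is called repeatedly in a loop. A commonly suggested mitigation is to call the model directly for small per-call inputs instead of `predict`; a minimal sketch, assuming a generic Keras model:

```python
import numpy as np
import tensorflow as tf

# Hypothetical toy model, standing in for the one in the report.
model = tf.keras.Sequential([tf.keras.layers.Dense(4, input_shape=(8,))])

x = np.random.rand(1, 8).astype("float32")

# model.predict() sets up fresh prediction machinery on each call,
# which is the usual source of the growth reported when it is
# invoked inside a tight loop; __call__ avoids that overhead.
for _ in range(1000):
    y = model(x, training=False)  # instead of model.predict(x)
```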
-
I am using optimum-neuron's run_qa.py to fine-tune GPT-2, and judging by the output it appears to be doing data parallelism.
Could you confirm what kind of parallelism is used?
If I enter 8 as the batch size, it…
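For context on the batch-size part of the question: under pure data parallelism each worker holds a full model replica and its own slice of the data, so the batch size you pass is per device and the global batch scales with the number of workers. A back-of-the-envelope sketch, all numbers hypothetical:

```python
# Hypothetical figures; the real values come from the launch config.
per_device_batch_size = 8      # the value entered as "batch size"
num_data_parallel_workers = 2  # model replicas running in parallel
gradient_accumulation_steps = 1

global_batch_size = (
    per_device_batch_size
    * num_data_parallel_workers
    * gradient_accumulation_steps
)
print(global_batch_size)  # 16: each replica still sees 8 samples per step
```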
-
### System Info
- `transformers` version: 4.36.2
- Platform: Linux-5.4.0-166-generic-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.20.2
- Safetensors version: 0.4.1…
-
Questions regarding parallelism:
1. If I'm not mistaken, both tensor serialization and deserialization operations should be parallelizable. Is this assumption correct? For example, I was thinking th…
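On the deserialization side, one concrete way this can be parallelized is to load independent checkpoint shards concurrently. A minimal sketch, assuming a sharded safetensors checkpoint (file names hypothetical):

```python
from concurrent.futures import ThreadPoolExecutor

from safetensors.torch import load_file

# Hypothetical shard files of a sharded checkpoint.
shard_paths = [
    "model-00001-of-00002.safetensors",
    "model-00002-of-00002.safetensors",
]

# The shards hold disjoint tensors, so the loads are independent and
# can be issued concurrently; actual speedup depends on storage
# bandwidth and how much of the work happens outside the GIL.
with ThreadPoolExecutor(max_workers=len(shard_paths)) as pool:
    shards = list(pool.map(load_file, shard_paths))

# Merge the per-shard dicts into a single state dict.
state_dict = {k: v for shard in shards for k, v in shard.items()}
```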
-
### Feature request
Enable TGI to load QLoRA fine-tuned models with the optimized architecture on SageMaker. Right now the optimized architecture is active only for certain models on the list. If the features…
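Until the optimized path covers more architectures, a common interim workaround is to merge the QLoRA adapter into the base model and serve the merged checkpoint, which TGI can load like any base model. A minimal sketch using PEFT (all paths hypothetical):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "base-model-id"        # hypothetical base model
adapter_dir = "./qlora-adapter"  # hypothetical adapter checkpoint

# Load the base model in a dtype suitable for serving, attach the
# adapter, then fold the LoRA deltas into the base weights.
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.float16)
model = PeftModel.from_pretrained(base, adapter_dir)
merged = model.merge_and_unload()

# Save a standalone checkpoint for TGI to load.
merged.save_pretrained("./merged-model")
AutoTokenizer.from_pretrained(base_id).save_pretrained("./merged-model")
```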
-
Started full-time thesis around April/May 2023.
Track DST, Q3/4 start. "Seminar course" still to do. Has superapp/MusicDAO experience. Discussed topics as diverse as the digital Euro and a Web3 search engine (un…
-
I am trying to use `meta-llama/Llama-2-13b-chat-hf`, which has a `max_position_embeddings` of 4096 tokens.
I found that the library fails in a non-deterministic way when the input length is between 1790 …
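One way to rule context length in or out while debugging this is to compare the tokenized input length against the model's configured window before the call. A minimal sketch (the prompt is a placeholder):

```python
from transformers import AutoConfig, AutoTokenizer

model_id = "meta-llama/Llama-2-13b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
config = AutoConfig.from_pretrained(model_id)

prompt = "..."  # the input that triggers the failure
n_tokens = len(tokenizer(prompt)["input_ids"])

# max_position_embeddings is 4096 here, so inputs of ~1790 tokens are
# well inside the window; failures there would point elsewhere.
print(n_tokens, config.max_position_embeddings)
assert n_tokens <= config.max_position_embeddings
```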
-
LM head weights get untied during training even when they are supposed to be tied.
This happens when the overlap parameters are set to `true`.
cc: @deepakn94
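A quick check for whether the tie is still intact at any point in training is to verify that the LM head and the input embedding share storage. A minimal sketch, assuming a model that exposes the Hugging Face-style `get_input_embeddings`/`get_output_embeddings` accessors (adapt the attribute paths otherwise):

```python
import torch

def weights_are_tied(model: torch.nn.Module) -> bool:
    # Accessor names assume a Hugging Face-style model; for other
    # codebases, reach into the embedding and head modules directly.
    emb = model.get_input_embeddings().weight
    head = model.get_output_embeddings().weight
    # Tied weights share one underlying storage; equal values with
    # different pointers would mean the tie was replaced by a copy.
    return emb.data_ptr() == head.data_ptr() and torch.equal(emb, head)
```

Calling this before and after the first optimizer step, with the overlap options on and off, should narrow down where the tie is broken.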
-
I am trying to quantize a custom fine-tuned llama2 model using the following code:
```python
from transformers import AutoTokenizer, TextGenerationPipeline
from auto_gptq import AutoGPTQForCausalLM, Ba…
```
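The truncated import is presumably `BaseQuantizeConfig`. For reference, a minimal end-to-end sketch following the usual auto_gptq pattern (paths and calibration text are placeholders, not the reporter's actual setup):

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

model_dir = "./my-finetuned-llama2"           # hypothetical path
quantized_dir = "./my-finetuned-llama2-gptq"  # hypothetical output

tokenizer = AutoTokenizer.from_pretrained(model_dir, use_fast=True)
quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)

# Load the fp16 model with the quantization config attached.
model = AutoGPTQForCausalLM.from_pretrained(model_dir, quantize_config)

# Tokenized calibration examples; real runs should use text that is
# representative of the fine-tuning domain.
examples = [tokenizer("A short calibration sample for GPTQ quantization.")]

model.quantize(examples)
model.save_quantized(quantized_dir)
tokenizer.save_pretrained(quantized_dir)
```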
-
### What happened?
I am running Llama 3 8B Instruct, but the model's output doesn't make sense. I followed the general guidelines of the [main (cli)](https://github.com/ggerganov/llama.cpp/blob/master/e…
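With Llama 3 Instruct in llama.cpp, nonsensical output is very often a prompt-format issue: the Instruct variants expect the Llama 3 header/EOT special tokens rather than a raw prompt. A sketch of the expected template, written out in Python for clarity (message text illustrative):

```python
# Llama 3 Instruct chat format; the special tokens below belong to the
# model's tokenizer and must appear verbatim in the prompt.
def llama3_prompt(user_message: str,
                  system: str = "You are a helpful assistant.") -> str:
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_message}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

print(llama3_prompt("Why is the sky blue?"))
```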