llm-ops Search Results - Githubissues

1000+ results for llm-ops


huggingface/accelerate #2724

Training with PEFT + Accelerate randomly gets stuck with Dee…

Hi, I'm fine tuning an LLM using Soft Prompt Tuning using DeepSpeed via Accelerate implicitly, using the `deepspeed` param in `TrainingArguments`. And all goes well until after the first epoch, whe…

vikram71198 updated 2 months ago
7
microsoft/onnxruntime #18224

[Mobile] How to run model inference with ARM GPU on android …

### Describe the issue We are currently developing a system that involves deploying Large Language Models (LLMs) on Android smartphones. To date, we've managed to execute inference tasks using ONNX R…

zjc664656505 updated 8 months ago
3
vllm-project/vllm #1846

Model Loading Stuck (in ray ?)

python = 3.11.5 torch = 2.1.0 + cu121 vllm = 0.2.2 GPU: L40 * 4 I install vllm by "pip install vllm". It will STUCK when loading vicuna-7b-v1.5 model using the vllm framework, while the fastc…

qy1026 updated 8 months ago
21
tensorflow/tensorflow #62287

Different Behavior of tf.raw_ops.Cos+tf.raw_ops.Erfc with ji…

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? No ### Source source ### TensorFlow version 2.14.0 ### Custom code Yes ### OS platform and distribution Ubuntu 22…

zoux1a updated 9 months ago
4
intel/intel-extension-for-pytorch #437

Failed running GPT-J int8 benchmark

### Describe the bug MODEL_ID="/models/models--EleutherAI--gpt-j-6b" python run_gpt-j_int8.py -m ${MODEL_ID} --quantized-model-path "./saved_results_gptj/best_model.pt" --benchmark --jit \ --token-…

RenyanDiao updated 10 months ago
5
GoogleCloudPlatform/vertex-ai-samples #2622

Mixtral-8x7B-Instruct-v0.1 deployment fails with ImportErro…

Trying to deploy Mixtral-8x7B-Instruct-v0.1 using notebook https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/community/model_garden/model_garden_pytorch_mistral.ipynb. Mo…

jvovk updated 7 months ago
2
tensorflow/tensorflow #62266

Different Behavior of tf.raw_ops.SquareDifference+tf.raw_ops…

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? No ### Source source ### TensorFlow version 2.14.0 ### Custom code Yes ### OS platform and distribution Ubuntu 22…

zoux1a updated 9 months ago
5
vllm-project/vllm #5458

[Bug]: Error when --tensor-parallel-size > 1

### Your current environment PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS: Ubuntu 22.04.4 LTS (x86_64) GCC version: (U…

javi111717 updated 2 months ago
8
tensorflow/tensorflow #62277

Different Behavior of tf.raw_ops.Asin with jit_compile=True

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? No ### Source source ### TensorFlow version 2.14.0 ### Custom code Yes ### OS platform and distribution Ubuntu 22…

zoux1a updated 9 months ago
6
microsoft/DeepSpeed #4774

mpt model fails to run on cpu.

**Describe the bug** When I ran mpt model on the CPU, I encountered the following error. ![image](https://github.com/microsoft/DeepSpeed/assets/97155466/440d34f3-7881-4cba-aa8d-22d1c3207b9b) **To…

KepingYan updated 9 months ago
3
