-
Following the documentation, I run:
```bash
cd llm
make chat -j
```
I get the following error:
```bash
CUDA is unavailable!
src/GPTBigCodeGenerate.cc src/GPTBigCodeTokenizer.cc src/Generate.c…
-
### 🚀 The feature, motivation and pitch
Starting from iOS 18, Core ML supports state, the counterpart of a mutable buffer. As a result, ExecuTorch can now let Core ML handle buffer mutation.
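For context, a minimal PyTorch-side sketch of the mutable-buffer pattern in question (the module name, buffer, and shapes are hypothetical, for illustration only):

```python
import torch

class ToyCache(torch.nn.Module):
    """Hypothetical stateful module: its registered buffer is mutated in place
    on every forward call, which is the pattern that iOS 18 Core ML state can
    now represent directly."""
    def __init__(self, size=4):
        super().__init__()
        self.register_buffer("cache", torch.zeros(size))

    def forward(self, x):
        self.cache.add_(x)        # in-place buffer mutation
        return self.cache.clone()

m = ToyCache()
print(m(torch.ones(4)))  # tensor([1., 1., 1., 1.])
print(m(torch.ones(4)))  # tensor([2., 2., 2., 2.])
```

As far as I understand, with coremltools 8+ such a registered buffer can be exposed to Core ML as a state during conversion (via `ct.StateType`), so the mutation can happen inside Core ML instead of being managed by the caller.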
##…
-
Hello, and thank you for your great work!
In `blip2_vicuna_instruct.py`, the `bos_token` of the LLM has been changed. Originally it is `'<s>'` with index 1, but after the following code:
```
self.llm_tokenize…
-
### Your current environment
```text
PyTorch version: 2.3.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: RED OS release MUROM (7.3.4) Stan…
-
Here's an overview of the features we intend to work on in the near future, across Core Keras, KerasNLP, and KerasCV.
## Core Keras
### Saving & export
- [Open for Contributions] Add utility …
-
Hi
Thank you for the great work you're doing on TensorRT-LLM and the Triton backend. I have some questions about matching versions between the tensorrt-llm Python package, the backend, and the NGC ima…
-
I tried to run the latest (as of today) docker image:
`docker run --gpus all --shm-size 64G -p 8001:80 ghcr.io/collabora/whisperfusion:latest`
I'm getting the error `OSError: /usr/local/lib/pytho…
-
### Your current environment
```text
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Debian GNU/Linux 11 (bullseye) (x86…
-
## 🚀 Feature
Mixtral 8x7B is a mixture-of-experts LLM that splits its parameters into 8 distinct expert groups, and I would like to do both training and inference with Thunder.
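For reference, a toy numpy sketch of the top-2 expert routing such a mixture-of-experts layer performs (the random weights and plain linear experts are my own simplification; Mixtral's real experts are SwiGLU FFNs):

```python
import numpy as np

rng = np.random.default_rng(0)

D, N_EXPERTS, TOP_K = 16, 8, 2  # hidden size, expert count, experts per token

# Hypothetical toy weights: one router matrix plus 8 independent linear experts.
router_w = rng.standard_normal((D, N_EXPERTS)) * 0.1
expert_w = rng.standard_normal((N_EXPERTS, D, D)) * 0.1

def moe_forward(x):
    """Route each token to its top-2 experts and mix their outputs."""
    logits = x @ router_w                          # (tokens, 8) router scores
    top = np.argsort(logits, axis=-1)[:, -TOP_K:]  # indices of the 2 best experts
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        sel = logits[t, top[t]]
        gate = np.exp(sel - sel.max())
        gate /= gate.sum()                         # softmax over the selected experts
        for g, e in zip(gate, top[t]):
            out[t] += g * (x[t] @ expert_w[e])     # gated sum of expert outputs
    return out

tokens = rng.standard_normal((4, D))
y = moe_forward(tokens)
print(y.shape)  # (4, 16)
```

Only the selected 2 of 8 experts run per token, which is why the model's active parameter count is much smaller than its total parameter count.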
### Work items
- [x] Run `t…
-
### Describe the issue
Hi,
I have a quick question regarding LLM inference on CPUs using this extension.
I've been digging into the LLM inference case, and it seems like the kernels written in …