-
**Describe the bug**
I'm trying to build Metal with the profiler enabled, and the build fails with errors. The exact error depends on the script used (either `build_with_profiler_opt.sh` or using…
-
**Describe the bug**
I am attempting to run the LLaMA2 demo at https://github.com/openvinotoolkit/model_server/blob/main/demos/llama_chat/python/README.md. When I run:
```sh
python client.py -…
-
### Willingness to contribute
No. I cannot contribute this feature at this time.
### Proposal Summary
During prompt experimentation you often set system prompts for OpenAI, Azure, and open source m…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
### Describe the bug
I used Runpod to test …
-
**How to do in P1**
Brett Ostwalt commented:
[Eric Robinson (AIA | 21 AS)](https://jira.il2.dso.mil/secure/ViewProfile.jspa?name=erob) Alright, connectivity should be working now.
In order to call th…
-
We are building a voice-interactive chatbot that leverages cutting-edge technologies such as Speech-to-Text (STT), Text-to-Speech (TTS), and local Large Language Models (LLMs), with a focus on Ollama'…
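For context on the intended architecture, the three stages can be wired as a simple turn-taking loop. The following is a minimal sketch; `transcribe`, `query_llm`, and `synthesize` are hypothetical placeholder stubs, not APIs from any particular STT/TTS library or from Ollama itself:

```python
# Minimal sketch of the STT -> LLM -> TTS loop described above.
# All three stage functions are hypothetical stubs: a real implementation
# would call an STT engine (e.g. Whisper), a locally served Ollama model,
# and a TTS engine in their place.

def transcribe(audio: bytes) -> str:
    # Stub: pretend the audio payload is already text.
    return audio.decode("utf-8")

def query_llm(prompt: str) -> str:
    # Stub: a real version would POST the prompt to a local LLM server.
    return f"echo: {prompt}"

def synthesize(text: str) -> bytes:
    # Stub: pretend synthesis just re-encodes the text.
    return text.encode("utf-8")

def voice_turn(audio_in: bytes) -> bytes:
    """One conversational turn: speech in, speech out."""
    user_text = transcribe(audio_in)
    reply_text = query_llm(user_text)
    return synthesize(reply_text)

print(voice_turn(b"hello").decode("utf-8"))  # echo: hello
```

The point of the structure is that each stage is swappable behind a plain function boundary, so the STT, LLM, and TTS backends can be changed independently.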
-
Hey guys,
What is this jaw-dropping nightmare that you put me through?
PS C:\AI\TensorRT\TensorRT-LLM\examples\llama> python build.py --meta_ckpt_dir C:/AI/LLaMA2_Docker_FileSystem/codellama/CodeL…
-
Llama2 (and Llama-based models) time out. Other chat models (Mistral and Mixtral were tested) respond fine. Below is a snippet of the Docker container log capturing when the request is sent from the Refact exte…
-
Hi!
I have a finetuned Llama2 and followed `example/llama.py`. When I build the model in fp16, it works just fine and produces sane results. When we use either `--fp8` or `--fp8-cache`, the…
-
Hi, we have tried to run the speculative inference process on OPT-13B and Llama2-70B-chat, but encountered some issues. Specifically, for Llama2-70B-chat, we obtained performance worse than vLLM, which seem…