neural-chat-7b Search Results

265 results for neural-chat-7b

huggingface/alignment-handbook #16

Memory Issue with 7b Model Fine-Tuning on 6 H100 GPUs

Hello everyone, I'm encountering a memory issue while fine-tuning a 7b model (such as Mistral) using a repository I found. Despite having 6 H100 GPUs at my disposal, I run into out-of-memory errors wh…

apt-team-018 updated 10 months ago
4 comments

Thinking-with-Deep-Learning-Spring-2024/Readings-Responses #10

Week 5. Apr. 19: Transformers and Social Simulation - Possib…

Pose a question about one of the following articles: [“Generative agents: Interactive simulacra of human behavior.”](https://dl.acm.org/doi/abs/10.1145/3586183.3606763) Park, Joon Sung, Joseph O'B…

JunsolKim updated 6 months ago
22 comments

opea-project/GenAIInfra #166

support multiple env-var for ENDPOINT in GMC

TEI_EMBEDDING_ENDPOINT TEI_RERANKING_ENDPOINT TGI_LLM_ENDPOINT ```yaml # Copyright (C) 2024 Intel Corporation # SPDX-License-Identifier: Apache-2.0 apiVersion: gmc.opea.io/v1alpha3 kind: GM…

irisdingbj updated 4 months ago
4 comments

intel-analytics/ipex-llm #10983

Run neural-chat 7b inference with Deepspeed on Flex 140. #10…

Hi, after reviewing the previous issue (https://github.com/intel-analytics/ipex-llm/issues/10507), we tested the same suggestion on Flex 140, and performance is very slow on Flex 140 with both GPU ru…

weiseng-yeap updated 5 months ago
4 comments

swuecho/chat #402

ollama modelfile

``` hwu@hwu-5950:~$ ollama show --modelfile mistral # Modelfile generated by "ollama show" # To build a new Modelfile based on this one, replace the FROM line with: # FROM mistral:latest FROM /…

swuecho updated 4 months ago
1 comment

opea-project/GenAIExamples #704

[Bug] ChatQnA on Xeon errors using Docker compose.yaml or co…

### Priority Undecided ### OS type Ubuntu ### Hardware type Xeon-SPR ### Installation method - [X] Pull docker images from hub.docker.com - [ ] Build docker images from source ### Deploy metho…

lucasmelogithub updated 2 months ago
5 comments

vllm-project/vllm #4917

[Performance]: Automatic Prefix Caching in multi-turn conver…

I'm interested in the automatic prefix caching feature for multi-turn conversations but I can't seem to observe a performance improvement when prefix caching is enabled. [This tweet](https://x.com/vll…

hmellor updated 8 hours ago
16 comments

opea-project/GenAIExamples #706

[Bug] Better developer experience for bringing up TGI-Servic…

### Priority Undecided ### OS type Ubuntu ### Hardware type Xeon-SPR ### Installation method - [X] Pull docker images from hub.docker.com - [ ] Build docker images from source ### Deploy metho…

arun-gupta updated 2 months ago
8 comments

fluid-cloudnative/fluid #4186

[BUG] multiple datasets and multiple alluxioruntimes failed …

**What is your environment(Kubernetes version, Fluid version, etc.)** kubernetes: v1.30.2 fluid: v1.0.0 alluxioruntime: 2.9.0 **Describe the bug** Dear experts. I try to use fluid to deploy…

moting9 updated 5 months ago
1 comment

ThePansmith/Monifactory #1185

[Bug]: Large Fractionating Distillery crashing game

### Issue Summary When trying to craft Sulfuric Naphtha in a Large Fractionating Distillery out of Raw Oil, my server/world crashes. The Server now also crashes every time I try to start it again, an…

TimMayr updated 3 days ago
1 comment
