-
Hello everyone, I'm encountering a memory issue while fine-tuning a 7b model (such as Mistral) using a repository I found. Despite having 6 H100 GPUs at my disposal, I run into out-of-memory errors wh…
-
Pose a question about one of the following articles:
[“Generative agents: Interactive simulacra of human behavior.”](https://dl.acm.org/doi/abs/10.1145/3586183.3606763) Park, Joon Sung, Joseph O'B…
-
TEI_EMBEDDING_ENDPOINT
TEI_RERANKING_ENDPOINT
TGI_LLM_ENDPOINT
```yaml
# Copyright (C) 2024 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
apiVersion: gmc.opea.io/v1alpha3
kind: GM…
-
Hi,
After review the previous issue : https://github.com/intel-analytics/ipex-llm/issues/10507.
We tested on Flex140 same suggestion, we get the performance very slow on Flex140 with both GPU ru…
-
```
hwu@hwu-5950:~$ ollama show --modelfile mistral
# Modelfile generated by "ollama show"
# To build a new Modelfile based on this one, replace the FROM line with:
# FROM mistral:latest
FROM /…
-
### Priority
Undecided
### OS type
Ubuntu
### Hardware type
Xeon-SPR
### Installation method
- [X] Pull docker images from hub.docker.com
- [ ] Build docker images from source
### Deploy metho…
-
I'm interested in the automatic prefix caching feature for multi-turn conversations but I can't seem to observe a performance improvement when prefix caching is enabled. [This tweet](https://x.com/vll…
-
### Priority
Undecided
### OS type
Ubuntu
### Hardware type
Xeon-SPR
### Installation method
- [X] Pull docker images from hub.docker.com
- [ ] Build docker images from source
### Deploy metho…
-
**What is your environment(Kubernetes version, Fluid version, etc.)**
kubernetes: v1.30.2
fluid: v1.0.0
alluxioruntime: 2.9.0
**Describe the bug**
Dear experts. I try to use fluid to deploy…
-
### Issue Summary
When trying to craft Sulfuric Naphtha in a Large Fractionating Distillery out of Raw Oil, my server/world crashes. The Server now also crashes every time I try to start it again, an…