-
### Describe the bug
vllm==0.4.3
minference==0.1.4.post3
flash-attn==2.5.9.post1
triton==2.3.0
run_vllm.py from https://github.com/microsoft/MInference/blob/main/examples/run_vllm.py
###…
-
> Regarding the ChatGPT-like features, that's pretty far out on my roadmap currently; I want to add support for smart devices, wearables, and home medical devices first.
The other thing that makes t…
-
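### Have you read and agreed to the Datawhale Open Source Project Guide?
- [X] I have read and agree to the [Datawhale Open Source Project Guide](https://github.com/datawhalechina/DOPMC/blob/main/GUIDE.md)
### Have you read and agreed to the Datawhale Open Source Project Code of Conduct?
- [X] I have read and agree to the [Datawhale Open Source Project Code of Conduct…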
-
Can the VectorStore collection_name be added to the ConfigurableField?
-
```
import appbuilder
import os
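# Set the TOKEN in the environment. The TOKEN below is a trial TOKEN with restricted access and QPS; for production use, replace it with your personal TOKEN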
os.environ["APPBUILDER_TOKEN"] = "bce-v3/ALTAK-n5AYUIUJMarF7F7iFXVeK/1bf65eed7c8c7efef9b11388524fa1087f90…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
AWS's Bedrock runtime does not accept the following parameters, yet they are still being passed to the LM at some point:
`n` and `max_tokens`
At what point is this being done? I haven't specif…
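
As a stopgap while the source of these parameters is tracked down, the two keys could be dropped before the request is built. This is a minimal sketch, assuming the generation parameters arrive as a plain dict; `UNSUPPORTED_BEDROCK_KEYS` and `strip_unsupported` are hypothetical names for illustration and not part of any library:

```python
# Hypothetical helper: the key set comes from the report above,
# the function itself is illustrative only.
UNSUPPORTED_BEDROCK_KEYS = {"n", "max_tokens"}


def strip_unsupported(kwargs, unsupported=UNSUPPORTED_BEDROCK_KEYS):
    """Return a copy of kwargs without the keys the runtime rejects."""
    return {key: value for key, value in kwargs.items() if key not in unsupported}


# Example parameters as they might look just before the request is sent.
request_kwargs = {"temperature": 0.0, "n": 1, "max_tokens": 150}
cleaned = strip_unsupported(request_kwargs)
```

The helper returns a new dict instead of mutating its input, so the original parameters stay available for logging while the offending call site is located.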
-
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
Running `generate_with_langchain_docs` gets stuck, showing:
…
-
I don't know if it's an appropriate place to submit such proposals, but I'd like to introduce the idea somewhere.
BERT is an encoder-only language model that can extract the meaning of words, se…
-
I am using the code from the [Getting-Started-with-RAG-in-DSPy.ipynb](https://github.com/weaviate/recipes/blob/main/integrations/llm-frameworks/dspy/1.Getting-Started-with-RAG-in-DSPy.ipynb)
```
tes…