-
### š The feature, motivation and pitch
I am workning on 4D attention mask input and LLM generateion process. Huggingface provides an interface for the 4D attention mask. Does vllm have any plan? httā¦
-
### Cortex version
Jan v0.5.7 | Cortex v-
### Describe the Bug
https://discord.com/channels/1107178041848909847/1300098068980568095
A known issue exists with the llama.cpp engineās handling of sā¦
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the [LangGraph](https://langchain-ai.github.io/langgraph/)/LangChain documentation with the integratā¦
-
Hi, i want to do my own chat AI interface for study and personnal project. I find the library interesting, the only problem is that I use Ollama as LLM provider (local server for llms). So Ithink thatā¦
-
would be good to support dify api to handle all the LLM Ops and RAG.
from: https://docs.dify.ai/
Dify is an open-source large language model (LLM) application development platform. It combines tā¦
-
I've been working on an [OpenAI-compatible REST server](https://github.com/guidance-ai/llgtrt), utilizing TensorRT-LLM but not Triton, similar to `openai_server.py` but in Rust and generally productioā¦
-
### Describe the bug
> Draw a sine function
Plan:
1 Import necessary libraries in Python.
2 Generate x values.
3 Calculate corresponding y values using the sin function.
4 Plotā¦
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find aā¦
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorchā¦
-
## š Bug
MLCengine code in quickstart guide on CPU fails with
> 'InternalError: Check failed: (it != n->end()) is false: cannot find the corresponding key in the Map'
followed by
> MLCEā¦