-
Hello, FlexFlow team!
Thank you for your outstanding work! I am attempting to reproduce the experimental results from the paper "SpecInfer: Accelerating Generative Large Language Model Serving with…
-
### Issue description
Somewhat a generalization of https://github.com/biocypher/biochatter/issues/204
Either a permutation of user query or a semantic search approach is necessary to avoid false ne…
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
secbr updated
3 weeks ago
-
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
The [get started section on synthetic data generation](https://docs.ragas.io/en/sta…
-
### Your current environment
```text
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 224.00 MiB. GPU
```
### How would you like to use vllm
I'm running a eval framework …
-
### Project Name
Paper Mentor AI
### Description
**Paper Mentor AI** is an intelligent research assistant designed to help students, researchers, and professionals streamline their research p…
-
**Describe the bug**
I'm trying to apply "W4A16" quantisation to the qwen2-7B model. In particular "cognitivecomputations/dolphin-2.9.2-qwen2-7b" though I've tried with other qwen2 models and had the…
-
in backend.py there is gpt4_compiled_hurricane.json,can u tell about it? when compile hurricane,there any parameters such as metric?
-
Congratulations on your recent solid survey paper and impressive paper list!
We have a related paper on LLM Agents playing Trust Games.
Can Large Language Model Agents Simulate Human Trust Beha…
-
1. [x] go over @ampudia19 's material
2. [x] slides framework
3. [x] notebook framework
4. [x] first draft LLM slides
5. [ ] tutorial for LLMs