-
Hi team,
Is it possible to configure Nemo-Guardrails to avoid sending the actual user input to the LLM? I understand that the actual user input won't be sent if the input rails are triggered. However…
-
Thank you for your excellent work. I have some questions that I hope to receive your answers to. I hope to apply TFVTG to my custom video dataset to test the video temporal grounding function. What sh…
-
**What would you like to be added/modified**:
A benchmark suite for large language models deployed at the edge using KubeEdge-Ianvs:
1. Interface Design and Usage Guidelines Document;
2. Implem…
-
I noticed we are using fastapi to build micro services, but we aren't using async for anything?
For example, llamaindex has full async support on nearly every component. Yet all the methods are def…
-
### Is there an existing issue for the same bug?
- [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting
- [X] I have checked the existing iss…
-
- [ ] [[2408.02442] Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models](https://arxiv.org/abs/2408.02442)
# [Let Me Speak Freely? A Study on the…
-
Hi, thanks for your great work! I have read your MovieChat+ paper and noticed that the Zero-shot QA Evaluation result of MovieChat on EgoSchema is 53.5, while the evaluation result in this CVPR paper(…
-
-
I will list the test results of various open-source models here. You can refer to these data to select models and configure devices. Of course, the evaluation of LLM is quite subjective. I also sugges…
-
RAG Evaluation
1. 100 questions
Types of questions:
- 60 on general trade
- 12 on growth/variation
- 28 on rankings
2. RAG evaluation results
Best combination tested so far: multi-qa-mpne…