-
Dear authors,
Firstly, I would like to express my gratitude for your exceptional work.
Recently, I attempted to utilize Factor for evaluating instruction-tuned models, such as llama2-chat. Howev…
-
[ x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
Faithfullness is not working returning always Nan, sometimes…
-
We use the `ai` package: https://sdk.vercel.ai/docs. And it's easy to switch between LLM's with it. Here's a playground for example: https://sdk.vercel.ai/
Although there may be features that don't…
-
Hello,
Beam later version is V2 and they did drastic changes to their SDK and client that makes most of the training (fine-tuning) and inference code useless. There is no "beam run" and so on...
…
-
We may want to elaborate on the default checklist items, create a new checklist, or add an example that focuses on ethical considerations that are unique to LLMs or that are particularly acute with LL…
-
### 1. Who do you think this talk is for?
(aspiring) public speakers, developers, AI enthusiasts
### 2. What do you think you'll learn from this talk?
You will learn how to build single-purpose app…
-
**Is your feature request related to a problem? Please describe.**
When setting latex=true in `config.toml` it expects mathematical formulas to be encased in `$` signs. All popular LLMs are not train…
-
Harness being one of the general evaluation frameworks for hundreds of tasks and benchmarks on different types of metrics.
- check LM EValuation Harness [here](https://github.com/EleutherAI/lm-eva…
-
### Problem Statement:
At present, Wordflow lacks the capability to utilize self-hosted Language Models (LLMs), limiting users' flexibility in choosing or swapping models according to their specific …
-
### Prize category
Best Contribution
### Overview
### __Introduction__
Our project aims to tell a compelling story of the environmental impact of LLMs (Large Language Models). We feel this…