-
### Willingness to contribute
Yes. I would be willing to contribute a document fix with guidance from the MLflow community.
### URL(s) with the issue
https://mlflow.org/docs/latest/llms/llm-evaluat…
-
Today, the [Python Evaluation building block](https://aka.ms/azai/eval) can be used against a .NET backend that uses the Chat Protocol (Azure Search supports this). However, we know from customer feed…
-
## Date
Thursday, June 20, 2024 - 12pm ET; 5pm UK
## Untracked attendees
| Name | Firm | Comment |
| :--- | :--- | :------ |
## Meeting notices
- FINOS **Project leads** are responsible…
-
WOOD score paper: https://arxiv.org/pdf/2007.06898.pdf
Abstract:
>Models that surpass human performance on several popular benchmarks display significant degradation in performance on exposure…
-
Just a thought, in case you have any plans for a v2 breaking-change release in the future: from everything I’ve read on utility AI and my own (admittedly very limited) experience, should the default evaluation …
-
[http://docs.h2o.ai/h2o/latest-stable/h2o-docs/architecture.html|http://docs.h2o.ai/h2o/latest-stable/h2o-docs/architecture.html]
Some parts of this page are obsolete:
E.g.:
{quote}Python
Python scr…
-
### 🐛 Describe the bug
Hello everyone, I am attempting to replicate the trlx example found on Wandb at this link: https://wandb.ai/carperai/summarize_RLHF/reports/Implementing-RLHF-Learning-to-Summar…
-
Hello, first of all thank you for your work, it's great.
Looking around a bit, I thought there was an error when I saw that the evaluations were unavailable in the list of sensors (present but unavailable)
…
-
- 3 success factors in ML: Velocity | Validate early | Versioning (Source: [YouTube](https://youtu.be/HPU8ttaZc6U))
- 4 Main tasks in production ML : Data Collection | Experimentation | Evaluation & …
-
We had a huge influx of PRs adding AI laws, but AI is not ready for that until we address some basics:
- [ ] Decide how we want laws and law uploads to work in general and which ones to use as a…