-
**Describe the bug**
I encounter 'ValueError: Evaluation LLM outputted an invalid JSON. Please use a better evaluation model.' when using most popular open-source chat models with the DeepEval framework. …
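For context, a minimal sketch of the kind of tolerant parsing that can help when a chat model wraps its JSON in extra prose. This is not DeepEval's implementation or API; `extract_json` is a hypothetical helper that uses only the Python standard library, and it assumes the model's reply contains one JSON object somewhere in the text.

```python
import json


def extract_json(text):
    """Return the first JSON object that parses from `text`, or None.

    Hypothetical helper for illustration; not part of DeepEval.
    """
    decoder = json.JSONDecoder()
    start = text.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value starting at `start` and
            # ignores any trailing text after it.
            obj, _ = decoder.raw_decode(text, start)
            return obj
        except json.JSONDecodeError:
            # This brace did not start valid JSON; try the next one.
            start = text.find("{", start + 1)
    return None


reply = 'Sure! Here is the verdict: {"score": 0.8, "reason": "mostly relevant"} Hope that helps.'
print(extract_json(reply))
```

A helper like this only recovers replies that contain well-formed JSON surrounded by chatter; it does not repair output that is malformed in itself.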
-
I ran into a number of cases where the AI would
vs Jinteki - Personal Evolution
The AI clicked for credits (40+) to the point where it was carelessly discarding high-value agendas into Archives (did not gr…
-
### Project description
This project aims to collect a dataset of production COBOL and associated mainframe languages (JCL, REXX, PL/I) which Large Language Models (LLMs) can be fine-tuned on. It a…
-
**What would you like to be added/modified**:
Based on existing datasets, the issue aims to build a benchmark for domain-specific large models on KubeEdge-Ianvs. Namely, it aims to help all Edge AI a…
-
# Classical Music Tempo Recognition
This task is to recognize the music tempo of classical songs.
## Task Objective
Music tempo is quite important when we classify different types of music or generat…
-
With the increasing capabilities of LLMs, it is only a matter of time before they become powerful and cheap enough to be used inside Dodona. A first step might be to generate draft answers for questions …
-
Dear authors,
Great work, and thanks for releasing the code for ViClip pretraining on InternVid-10M-FLT. First, it would be really great if the pre-training instructions were more detailed, like w…
-
I wonder whether it would be worthwhile to have a checkbox for "comment missing or unsuitable, evaluation impossible" (or equivalent).
As it stands, on a trace I don't really know what to choose if I don't…
-
### Willingness to contribute
Yes. I would be willing to contribute a document fix with guidance from the MLflow community.
### URL(s) with the issue
https://mlflow.org/docs/latest/llms/llm-evaluat…
-
The purpose of this RFC is to get early feedback on this new full-stack AI functionality in Amplify. This functionality is currently in developer preview while we get feedback and iterate on it. There…