ai-evaluation Search Results

1000+ results
for ai-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

confident-ai/deepeval #872

Error while using the most popular opensource chat models

**Describe the bug** I encounter 'ValueError: Evaluation LLM outputted an invalid JSON. Please use a better evaluation model.' while using most popular open source chat models in DeepEval framework. …

adkakne updated 4 months ago
2
bobtheuberfish/chiriboga #156

AI behaviour appears faulty

Ran into a number of times where the AI would vs Jinteki - Personal Evolution AI clicked for creds (40+) to the point where it was carelessly ditching high value agendas into archives (did not gr…

25171275 updated 2 months ago
5
openmainframeproject/tac #642

Zorse

### Project description This project aims to collect a dataset of production COBOL and associated mainframe languages (JCL, REXX, PL/I) which Large Language Models (LLMs) can be fine-tuned on. It a…

slh1109 updated 3 months ago
5
kubeedge/ianvs #95

Domain-Specific Large Model Benchmarking Based on KubeEdge-I…

**What would you like to be added/modified**: Based on existing datasets, the issue aims to build a benchmark for domain-specific large models on KubeEdge-Ianvs. Namely, it aims to help all Edge AI a…

MooreZheng updated 3 months ago
4
dynamic-superb/dynamic-superb #160

[Task] Classic music tempo recognition

# Classical Music Tempo Recognition This task is to recognize Music Tempo for calssical songs ## Task Objective Music Tempo is quite important when we clasify different type of music or generat…

ChenWils updated 4 months ago
10
dodona-edu/dodona #5331

Automatically generate draft answers for student questions

With the increasing capabilities of LLMs, it is only a matter of time before they become powerful/cheap enough to use them inside Dodona. A first step might be to generate draft answers for questions …

bmesuere updated 3 weeks ago
3
OpenGVLab/InternVideo #131

Training and Evaluation Code for ViClip

Dear authors, Great work and thanks for releasing the code for ViClip pretraining on InternVid-10M-FLT. Firstly, It would be really great if the pre-trainning instructions are more detailed, like w…

fmthoker updated 4 months ago
11
CyndelHerolt/UniFolioV2 #32

[Notation] : Idée commentaire

Je me demande s'il ne serait pas pertinent d'avoir une case "commentaire manquant ou inadapté, évaluation impossible" (ou équivalent). Dans l'état sur une trace je sais pas trop quoi choisir si je n'…

Dannebicque updated 7 months ago
1
mlflow/mlflow #10293

[DOC-FIX] LLM RAG Evaluation notebooks are missing informati…

### Willingness to contribute Yes. I would be willing to contribute a document fix with guidance from the MLflow community. ### URL(s) with the issue https://mlflow.org/docs/latest/llms/llm-evaluat…

lz-chen updated 1 year ago
1
aws-amplify/amplify-ui #5773

Amplify AI RFC

The purpose of this RFC is to get early feedback on this new full-stack AI functionality in Amplify. This functionality is currently in developer preview while we get feedback and iterate on it. There…

dbanksdesign updated 1 week ago
9

上一页 1...18 19 20 21 22 23 24...100 下一页

1000+ results for ai-evaluation

1000+ results
for ai-evaluation