ai-evaluation Search Results

1000+ results
for ai-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mnm-matin/ai_alignment_graph #11

Next Release of the AI Alignment Research Graph

Just one issue to track progress till the next release and collaborate on creating the right issues. Feel free to edit this issue/comment on changes. Scope of Next Release: - [ ] For each paper cr…

mnm-matin updated 2 months ago
1
rmusser01/tldw #237

Feature Tracker: Evaluation Benchmarks

Title. Benchmarks: Summarization - [x] G-Eval - [ ] SummHay - https://arxiv.org/abs/2407.01370v1 & https://github.com/salesforce/summary-of-a-haystack - https://arxiv.org/html/2403.19889v1 R…

rmusser01 updated 1 week ago
11
rbroc/echo #2

Datasets

(All these tasks will probably require prompt engineering, model-specific. Consider doing evaluation, either through external metrics or human validation) **Number of examples per dataset: cap at 5…

rbroc updated 2 weeks ago
2
Herb-AI/HerbSearch.jl #72

Differentiate `suboptimal_program` from a program that fails…

Currently, when searching using `allow_evaluation_errors` and all of the programs error out on evaluation (`interpret` errors out), the return value of `synth` is marked as a `suboptimal_program`. Thi…

ReubenJ updated 8 months ago
1
premAI-io/state-of-open-source-ai #113

state-of-open-source-ai/eval-datasets/

# Evaluation & Datasets — State of Open Source AI Book [https://book.premai.io/state-of-open-source-ai/eval-datasets/](https://book.premai.io/state-of-open-source-ai/eval-datasets/)

utterances-bot updated 11 months ago
1
OpenBMB/MiniCPM-V #555

[vllm] -

### 起始日期 | Start Date 9/3/2024 ### 实现PR | Implementation PR _No response_ ### 相关Issues | Reference Issues _No response_ ### 摘要 | Summary When using vLLM to optimally utilize GPU space for faste…

WoutDeRijck updated 2 months ago
1
UTSAVS26/PyVerse #1084

[Code Addition Request]: COVID Detection from CXR Using Expl…

### Have you completed your first issue? - [X] I have completed my first issue ### Guidelines - [X] I have read the guidelines - [X] I have the link to my latest merged PR ### Latest Merged PR Lin…

inkerton updated 1 week ago
1
alibabasglab/D2Former #4

stft requires the return_complex parameter be given for real…

**python3 evaluation.py --test_dir testdata/ --model_path ckpt/D2Former_epoch_77_0.055 --save_dir saved_tracks_best** audio_path: testdata/noisy/p287_001_noisy.wav Traceback (most recent call last):…

Snailgoo updated 4 months ago
1
BeePong/42_transcendence #65

Documentation/Module choice and readiness

# Project Evaluation Checklist ## Minimal Technical Requirements - [x] Ensure the frontend is developed using pure vanilla JavaScript (unless overridden by a module). - [x] Make the website a Single…

liocle updated 2 months ago
1
Manssur94/Manssur-Personal-Tasks #232

Document impact on HCA Language Evaluations

Manssur94 updated 4 days ago
1

上一页 1...9 10 11 12 13 14 15...100 下一页

1000+ results for ai-evaluation

1000+ results
for ai-evaluation