ai-evaluation Search Results

1000+ results
for ai-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/evaluate #589

Can't load exist dataset for evaluation

Hi, I used `dair-ai/emotion` to fine-tune the `bert-base-cased` model to use it for text-classification task. The dataset is loaded when I load it in this script, so there is no problem up to here:…

IsmaelMousa updated 6 months ago
1
rmusser01/tldw #297

General Research

Papers that don't fit somewhere else right now but may be relevant in the future: https://huggingface.co/papers/2409.18943 https://arxiv.org/pdf/2409.16493

rmusser01 updated 3 hours ago
21
ai-forever/MERA #10

[Feature Request] Support for OpenAI ChatCompletion models

- [Поддерживается](https://github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/models/openai_completions.py#L368) в оригинальной lm-evaluation-harness. - Позволяет тестировать неограниченны…

kristaller486 updated 5 months ago
2
secretsauceai/nlu-nlg-engine #4

Plan: Write up results from the test model

# Description We created a model using mostly test data. We should document the results of this, including the analysis of the results. For: * Intents * Entities * Responses we will attempt t…

AmateurAcademic updated 1 week ago
4
zchen0420/nn_papers #13

Competition Platforms

zchen0420 updated 5 months ago
1
cloudflare/cloudflare-docs #17841

Hyperlint Monitor Updates

## CLI Change Report This report covers changes from October 23, 2024 to October 29, 2024 for the `wrangler` CLI monitor source. This is a result of a scan for version `wrangler --version`: `3.82.0` …

hyperlint-ai[bot] updated 2 weeks ago
2
SebLague/Chess-Coding-Adventure #11

KingEnd is in the PieceSquareTable with values but...

You have a kingmiddle and KingEnd... Also the Evaluation section choses when to call the Endgame. What I don't understand is when the AI ever 'evaluates' using the 'kingEnd' numbers. Is this part cu…

robertkjr3d updated 1 year ago
2
ethz-spylab/rlhf-poisoning #8

Evaluation Dataset

Hello, I would like to ask how to create an evaluation dataset. When I directly run `python evaluate_generation_model.py --model_path ../../LLM_Models/poison-7b-SUDO- --token SUDO --report_path ./…

chiayi-hsu updated 4 months ago
5
kounkou/Hedgehog #53

Add support for C# language

### Description Add C# language support for the questions ### Acceptance criteria - User should be able to select C# - Code in C# - Then see answers in C# - We can probably add saving the …

kounkou updated 1 week ago
3
hoon-bari/comments #1

RS/Recommender_evaluation

# 추천시스템 평가지표(Recommendation System Evaluation Metrics) | 이직러의 AI 걸작선 머신러닝 분야는 모델 구축만큼이나 성능평가도 중요합니다. 추천시스템 또한 머신러닝의 비지도학습에 속하기 때문에, 성능평가가 중요합니다. 그래서 추천시스템에는 Recall/Precision@k, MAP@k, NDCG@k 등 다양한 성능…

utterances-bot updated 1 year ago
1

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for ai-evaluation

1000+ results
for ai-evaluation