evaluation Search Results

1000+ results
for evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

GSA/Challenge_platform #181

Evaluation Form Validation

### User story As a challenge manager, in order to ensure that a valid form is available for the evaluators to use, I would like a form to be validated for errors and valid logic before it is saved. …

r-bartlett-gsa updated 1 day ago
7
zorazrw/agent-workflow-memory #4

Evaluation question

Thank you very much for your work @zorazrw !!! Honestly, very impressive! I see that you're using auto eval and it uses gpt-3.5 My questions are: 1. Why did you use auto eval instead of taking t…

Serega6678 updated 1 month ago
1
ZCMax/LLaVA-3D #9

The MMScan evaluation

I noticed that mmscan authors has not released the entire dataset but a beta-version in 2024.9. Is the result in LLAVA-3d on MMScan benchmark evaluated on a partial version like beta-version, or do yo…

SiyuanWei updated 3 weeks ago
1
mmcdermott/MEDS-DEV #17

Fairness evaluation

Here are some thoughts concerning fairness evaluation: ### Protected attributes *Extraction*: As methodologies may implement pre-, in- or post-processing to enhance algorithmic fairness, I think w…

Jeanselme updated 3 weeks ago
3
Kim-Dongjun/ctm-cifar10 #1

Reference Batch for CIFAR Evaluation

Could the author provide the reference batch to evaluate the model on CIFAR?

Schwartz-Zha updated 1 week ago
2
adap/flower #4519

Server stops waiting for client evaluation results when both…

### Describe the bug When evaluating the model both centrally in the server and federated in the clients, after server finishes evaluating the model for the current round and sends the `evaluate mess…

MikeRiz521 updated 48 minutes ago
1
ericyinyzy/VQAttack #2

refTools.evaluation.tokenizer.ptbtokenizer

Hi, thank you for your great work. In VQAttack/ALBEF_VQAttack/ALBEF_attack/refTools/evaluation, it seems that the "tokenizer" folder is missing. So I got a ERROR saying "No module named 'refTools.…

YanGGGL updated 2 weeks ago
2
OpenDriveLab/DriveLM #132

Wrong evaluation reply using ChatGPT

Hi, When I use my `output.json` and the repo's `test_eval.json`, it worked the first two times. However, now I see ChatGPT replies such as: > I would rate your answer as 10. which leads to the…

anirudh-chakravarthy updated 4 days ago
1
run-llama/llama_index #17012

[Bug]: guideline evaluation is throwing error

### Bug Description guideline evaluation is throwing error saying missing model ### Version latest ### Steps to Reproduce run the guideline evaluation example ### Relevant Logs/Tracbacks ```sh…

Rohith-Scalers updated 2 days ago
7
facebookresearch/MobileLLM #14

Evaluation pipeline

Hi Zechun, great work! Could you please share the details about the evaluation code? like which codebase was used to run inference etc. thank you, Kalyani

kalyani7195 updated 1 month ago
1

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for evaluation

1000+ results
for evaluation