-
The following test results in an "unexpected keyword" error:
```
from datasets import load_dataset
from instruction_following_eval import get_examples, evaluate_instruction_following
dataset =…
-
## Describe the bug
Prefacing this by saying I'm pretty new to Nix, so apologies if I've missed something obvious or if this isn't the correct place for this issue.
I'm using nix-darwin. After…
-
Thanks for your brilliant work on Diffree!!!
I am interested in the dataset collection process and the implementation of the proposed evaluation metrics.
Can you share your email so that I …
-
Hi, thanks for your excellent work and the code release. According to the README file, the scenario .json file and the route .xml file need to be updated accordingly. However, there is no folder of lea…
-
I set HOME to an empty string:
```rust
// build.rs:14 — home_dir hard-coded to an empty string
let home_dir = "";
```
Then I run `cargo test`:
```powershell
PS C:\Users\kiwi\rust\tesseract-rs> cargo test
Compiling tesseract-rs v0.1.18 (C:\Users\kiw…
-
Currently WildBench seems to take a long time for evaluation; could we add a variable to set num_workers for the OpenAI calls?
Also, some caching of results is needed, so that I can run different evaluators…
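Not WildBench's actual API, just a minimal sketch of what the request could look like, assuming the official `openai` Python client: judge calls fanned out over a thread pool bounded by `num_workers`, with results cached on disk so that re-running with a different evaluator skips prompts that are already judged. `judge_one`, `judge_all`, and the cache layout are hypothetical names for illustration.

```python
# Hypothetical sketch (not WildBench code): parallel judge calls + disk cache.
import hashlib
import json
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

from openai import OpenAI

client = OpenAI()
CACHE_DIR = Path("eval_cache")
CACHE_DIR.mkdir(exist_ok=True)

def judge_one(prompt: str, model: str = "gpt-4o-mini") -> str:
    """Return the judge's response for one prompt, reusing a cached result if present."""
    key = hashlib.sha256(f"{model}:{prompt}".encode()).hexdigest()
    cache_file = CACHE_DIR / f"{key}.json"
    if cache_file.exists():
        return json.loads(cache_file.read_text())["judgment"]
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    judgment = resp.choices[0].message.content
    cache_file.write_text(json.dumps({"judgment": judgment}))
    return judgment

def judge_all(prompts: list[str], num_workers: int = 8) -> list[str]:
    """Fan the judge calls out over a thread pool; num_workers bounds concurrency."""
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        return list(pool.map(judge_one, prompts))
```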
-
Hi, this is an interesting work! Could you share more details and the code for your evaluation metrics, especially FDS and DS? Thanks.
-
Evaluate Mailchimp compared to other email management systems
-
### Describe the issue
Issue:
Errors occur during the MME evaluation.
Command:
```
CUDA_VISIBLE_DEVICES=0 bash scripts/v1_5/eval/mme.sh
```
Log:
```
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED …
-
Hi,
In 2.Pretrain_regenerator.py, there is only training on the pretraining dataset for a fixed number of epochs (40 by default).
However, without an evaluation step during training, the…
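A minimal sketch of what a per-epoch evaluation pass could look like, assuming a standard PyTorch training setup; `evaluate`, `train_with_eval`, and the checkpoint filename are hypothetical and not taken from this repository.

```python
# Hypothetical sketch (not the repo's code): run a validation pass after every epoch
# and keep the checkpoint with the lowest validation loss instead of only the last one.
import torch

def evaluate(model, val_loader, criterion, device="cuda"):
    """Average validation loss over a held-out set."""
    model.eval()
    total, count = 0.0, 0
    with torch.no_grad():
        for inputs, targets in val_loader:
            inputs, targets = inputs.to(device), targets.to(device)
            total += criterion(model(inputs), targets).item() * len(targets)
            count += len(targets)
    return total / max(count, 1)

def train_with_eval(model, train_loader, val_loader, optimizer, criterion,
                    epochs=40, device="cuda"):
    """Train for a fixed number of epochs (40 by default, matching the script),
    evaluating after each epoch."""
    best_val = float("inf")
    for _ in range(epochs):
        model.train()
        for inputs, targets in train_loader:
            inputs, targets = inputs.to(device), targets.to(device)
            optimizer.zero_grad()
            loss = criterion(model(inputs), targets)
            loss.backward()
            optimizer.step()
        val_loss = evaluate(model, val_loader, criterion, device)
        if val_loss < best_val:
            best_val = val_loss
            torch.save(model.state_dict(), "best_regenerator.pt")
```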