model-evaluation Search Results

1000+ results
for model-evaluation

Best match

google-deepmind/concordia #97

Pass device in launch_concordia_challenge_evaluation.py

If we are using a local model, we need to pass the device to use the GPU for inference. However, in `launch_concordia_challenge_evaluation.py` ``` # Language Model setup model = utils.language_mode…

depshad updated 1 month ago
2
atla-ai/judge-arena #3

Add Flow-Judge-v0.1

Thanks a lot for setting up this leaderboard! Some interesting results arise already. I was looking into specialized judge models, and I found these ones: - [ ] [Flow-Judge-v0.1](https://huggi…

EwoutH updated 8 hours ago
1
mozilla/bugbug #4587

[code_review] Explore different models

Similar to #4582, but across different models. This depends on #4580 for the evaluation.

marco-c updated 5 days ago
1
EleutherAI/lm-evaluation-harness #2422

bbh_zeroshot fails due to a custom filter issue.

I am trying to run this setup: ``` lm_eval --model vllm \ --model_args pretrained="Qwen/Qwen2.5-0.5B-Instruct",tensor_parallel_size=2,dtype=auto,gpu_memory_utilization=0.8 \ --tasks bbh_…

shamanez updated 5 days ago
1
jerpelhan/DAVE #20

Max # of Objects in COCO Evaluation Code

Hi, thanks for the great work! In evaluating the model, I see that the maximum number of objects is 1100 (please see code snippet pasted below). I am, therefore, wondering what happens when this limit…

niki-amini-naieni updated 1 day ago
1
fath0218/channel_dqn #2

Model evaluation

- Outage probability - Number of steps: print accuracy chart - [score function](https://keras.io/getting-started/sequential-model-guide/)

fath0218 updated 6 years ago
1
Shenyi-Z/ToCa #5

The CLIP score in your paper is relatively low.

Hi, thank you for your work. I noticed that the CLIP score reported in your paper is relatively low, around 17 (e.g., Table 1), while other papers (e.g., PixArt-alpha) commonly report scores around 27…

haoweiz23 updated 3 weeks ago
9
UppuluriKalyani/ML-Nexus #860

Feature request: Face-expression recommendation system

- **Is your feature request related to a problem? Please describe:** The current face expression recommendation system uses MobileNet, and there is a need to evaluate a custom CNN model built from s…

PriyanshuLathi updated 3 weeks ago
1
US-EPA-CAMD/easey-ui #6451

Bug: Component changes not saving from the UI

Users are currently able to edit the following component-only fields: Manufacturer, Model or Version, Serial Number, and Hg Converter Indicator. The changes are not saved to camdecmpswks.component af…

esaber76 updated 1 week ago
2
descriptinc/descript-audio-codec #72

Docs on Model Evaluation pipeline

I am trying to reproduce the DAC training and evaluation. I am getting negative values for SI-SDR loss even for the pretrained weights provided in the repo. Could you please provide documentation f…

Sonal-Monteiro updated 3 months ago
1
