evaluate-models Search Results

1000+ results for evaluate-models

ShishirPatil/gorilla #592

[Apibench] How to evaluate a model by openai API?

I have remotely hosted vllm models. How to evaluate them?

djstrong updated 2 weeks ago · 3 comments

industrial-optimization-group/DESDEO #155

[IDEA] Support for simulators and surrogates in the problem …

* **What is the current behavior?** The current problem format does not support simulators and surrogates.
* **Describe the solution you'd like** The problem format should be updated to support s…

light-weaver updated 1 week ago · 3 comments

openml-labs/ai_search #5

Decide on metric(s) with which to evaluate our models

We need to figure out what metrics we can/should use to evaluate our models, and what data is needed to evaluate them. Here we probably will make some distinction between evaluations during prototypi…

PGijsbers updated 1 month ago · 2 comments

irthomasthomas/undecidability #908

[2310.06770] SWE-bench: Can Language Models Resolve Real-Wor…

- [ ] [[2310.06770] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://arxiv.org/abs/2310.06770) # [SWE-bench: Can Language Models Resolve Real-World GitHub Issues?](https://ar…

ShellLM updated 3 weeks ago · 1 comment

RAISEDAL/RAISEReadingList #78

Paper Review: A Survey on Evaluation of Large Language Model…

Publisher: ACM TIST (ACM Transactions on Intelligent Systems and Technology). Link to the paper: https://dl.acm.org/doi/pdf/10.1145/3641289. Authors: Yupeng Chang, Xu Wang, Jin…

mehilshah updated 2 months ago · 2 comments

open-compass/opencompass #1239

[Feature] Difficulty in Evaluating Custom Models with OpenCo…

Describe the feature: Dear OpenCompass Team, I've encountered a challenge with OpenCompass when trying to evaluate a custom model that I developed. Currently, it seems that any action I want to…

jiangjiadi updated 3 months ago · 1 comment

clementchadebec/benchmark_VAE #96

How to evaluate models?

Hi, thanks for the great work. I would like to know how to evaluate the generation performance of models. Specifically, I am interested in how to calculate FID and other metrics such as IS, and whethe…

liang-hou updated 7 months ago · 3 comments

oss-slu/Enhancing-Bioinformatics-Research-through-LLM #1

Research and document the strengths and weaknesses of a sing…

Research and evaluate different LLM models (e.g., BERT, RoBERTa, XLNet) for their suitability in the bioinformatics domain. -> Research and document the strengths and weaknesses of each model. Crea…

AjithAkuthota23 updated 1 week ago · 1 comment

Colin-Codes/IntentClassifier-ML-Project #26

Evaluation of models

Optimal cross-validation optimisation of hyper-parameters; optimising for the confusion matrix, not just accuracy

Colin-Codes updated 5 years ago · 1 comment

FSoft-AI4Code/XMainframe #3

Evaluations and Prompts

Could you please share the evaluation scripts and prompts that were used to generate the reported results in the paper? Various parameters are involved in generating outputs, and it is crucial to …

prince14322 updated 3 weeks ago · 3 comments
