evaluation-method Search Results

OPTML-Group/UnlearnCanvas #10

Evaluation on SEOT method

Thank you very much for providing the valuable benchmark for the MU community. I notice that you report the IRA and CRA metrics for the SEOT method in Table.2 . After I check your script in `sampli…

TtuHamg updated 4 days ago

RTIInternational/teehr #292

Expose PySpark's `persist()` method to the Evaluation class

Wondering if we could make use of the [persist ](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.persist.html)or cache methods in pyspark to load the da…

samlamont updated 4 days ago

QwenLM/Qwen2.5-Math #17

About the evaluation method of CMATH.

As for CMATH's evaluation, the prompting method is 6-shot in the Table 2 of your technical report. However, in your open-source evaluation code, it seems to use 0-shot. Additionally, the official pape…

Eternity666 updated 2 weeks ago

xhd2015/xiangqi-pro #2

Adopting NNUE as an evaluation method and suggesting next mo…

https://www.chessprogramming.org/NNUE

xhd2015 updated 3 weeks ago

AlibabaResearch/DAMO-ConvAI #159

Evaluation Method for BIRD Dataset [Enhancement]

Hello, I encountered an improvement opportunity during the evaluation process for the BIRD dataset. The prediction below is marked as incorrect by the evaluation method, but the only difference is …

lucaordronneau updated 2 months ago

langchain-ai/langsmith-docs #478

Can't use the page of evaluation locally after run the `eval…

I hope to use page of evaluation locally in my langSmith project. But I can only use page of evaluation in the way of online page, so if other developers clone and run my project, they have to sign up…

Makoq updated 3 days ago

wconnell/genplasmid #1

gLM2 Model Fine-Tuning and Embedding Evaluation

### Objectives Train fine-tuned versions of gLM2: - [x] Version 0 (v0): Initial fine-tuning and qualitative eval (completed). - [x] Version 1 (v1): Fine-tuning with augmented training data accoun…

wconnell updated 2 weeks ago

serratus-bio/open-virome #142

[LLM] Define a falsifiable, measurable hypothesis

### Task 3: Define a falsifiable, measurable hypothesis. > Our first hypothesis questions the validity of using an AI model for querying a database > at all, and whether an LLM can effectively retrie…

ababaian updated 2 weeks ago

dmlc/xgboost #10793

Performance regression in fit method with evaluation sets

I have observed a significant performance regression in XGBoost version 1.7 when using the fit method with evaluation sets in sklearn estimators. The issue appears to have been introduced by [this com…

ldesreumaux updated 2 months ago

google/android-fhir #2677

Wrap Fhirpath method evaluation calls to run from a differen…

**Describe the Issue** The initialization of `FHIRPathEngine` and methods of evaluating FhirPath expressions in `FhirPathUtil.kt` currently do not run as suspend functions and their use, especially …

LZRS updated 3 weeks ago

1000+ results for evaluation-method

1000+ results
for evaluation-method