-
https://github.com/soarsmu/BugsInPy
-
# Evaluation
A core part of our *Strategic* work is the evaluation of how proposed work serves the Web. In the "Evaluation" phase at the end of the funnel, we make the case for whether work is ready to…
-
- **Objective:** Ensure RAG components are reliable through evaluation and safeguards.
- **Details:**
- Define performance metrics and monitor system accuracy.
- Implement safeguards to h…
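The "define performance metrics" step can be sketched minimally. The metric names, data, and normalization below are illustrative assumptions, not part of the original issue; a real RAG evaluation would add more metrics (faithfulness, answer relevance, latency):

```python
# Minimal sketch of two common RAG evaluation metrics (illustrative only).

def retrieval_hit_rate(retrieved, relevant):
    """Fraction of queries whose retrieved set contains at least one relevant doc."""
    hits = sum(1 for r, rel in zip(retrieved, relevant) if set(r) & set(rel))
    return hits / len(retrieved)

def exact_match(predictions, references):
    """Fraction of generated answers matching the reference after light normalization."""
    norm = lambda s: " ".join(s.lower().split())
    matches = sum(1 for p, r in zip(predictions, references) if norm(p) == norm(r))
    return matches / len(predictions)

# Tiny hypothetical eval set.
retrieved = [["doc1", "doc3"], ["doc2"]]
relevant  = [["doc3"], ["doc5"]]
print(retrieval_hit_rate(retrieved, relevant))  # 0.5

preds = ["Paris", "42"]
refs  = ["paris", "41"]
print(exact_match(preds, refs))  # 0.5
```

Monitoring would then track these numbers over time and alert when they drop below an agreed threshold.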
-
## Reason
Expensive material models provided by external codes make use of vectorized evaluation on accelerator devices (e.g., GPUs). For these models to operate with optimal efficiency, we need to pro…
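The contrast between per-point and vectorized evaluation can be sketched with NumPy. The material model below (linear elasticity, stress = E · strain) and its parameter value are hypothetical stand-ins for the external code's models; the point is only that the model accepts a whole batch of quadrature-point inputs in one call:

```python
import numpy as np

# Hypothetical material model: linear elastic stress = E * strain.
# Written to accept an entire batch of quadrature-point strains at once,
# so a single vectorized call replaces a per-point Python loop.
def evaluate_model_batched(strains, youngs_modulus=200e9):
    strains = np.asarray(strains, dtype=float)
    return youngs_modulus * strains

# One call evaluates all points; on an accelerator backend the same
# batched shape is what enables efficient device execution.
batch = np.linspace(0.0, 1e-3, 5)
print(evaluate_model_batched(batch))
```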
-
This issue is associated with charter epics #4 and #7.
# Which mapping tool or framework do you want to discuss?
https://worldfair-project.eu/cross-domain-interoperability-framework/
Overview…
-
This issue is intended for tracking milestones for the 2023 NumFocus small development grant for working on `scipy.special` infrastructure.
The original plan for this grant was to work on developin…
-
**Note: this is incomplete; it is intended as an example to kick off discussion.**
An ability to set options based on conditional logic formatted in a consistent way that can be implemented in all potential framew…
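One framework-neutral way to express such conditional option logic is a declarative list of rules, each setting options when all of its conditions match the current context. The rule format, keys, and values below are hypothetical, offered only to make the discussion concrete:

```python
# Hypothetical, framework-neutral rule format: each rule applies its
# "set" options when every key in "when" matches the context.
rules = [
    {"when": {"platform": "gpu"}, "set": {"batch_size": 256}},
    {"when": {"platform": "cpu"}, "set": {"batch_size": 32}},
]

def resolve_options(context, rules, defaults=None):
    """Apply matching rules in order, later matches overriding earlier ones."""
    options = dict(defaults or {})
    for rule in rules:
        if all(context.get(k) == v for k, v in rule["when"].items()):
            options.update(rule["set"])
    return options

print(resolve_options({"platform": "gpu"}, rules))  # {'batch_size': 256}
```

Because the rules are plain data, each target framework could implement its own evaluator for the same shared format.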
-
- [ ] [GPTScore: A Novel Evaluation Framework for Text Generation Models](https://github.com/jinlanfu/GPTScore?tab=readme-ov-file)
# GPTScore: A Novel Evaluation Framework for Text Generation Models
…
-
### Feature Summary
I'd like to contribute to FinVeda by implementing a machine learning module that can predict financial trends, stock prices, and customer behavior. This module will leverage popul…
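As a concrete starting point, trend prediction can be as simple as a least-squares line fit over recent prices. This NumPy-only sketch is an assumption for discussion; FinVeda's actual module structure and the issue's intended libraries are not specified here:

```python
import numpy as np

# Illustrative trend predictor: fit a straight line to the price history
# and extrapolate one step ahead. A real module would use richer models.
def predict_next(prices):
    t = np.arange(len(prices))
    slope, intercept = np.polyfit(t, prices, 1)
    return slope * len(prices) + intercept

print(predict_next([10.0, 11.0, 12.0, 13.0]))  # ~14.0
```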
-
- [ ] [[2308.07201] ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate](https://arxiv.org/abs/2308.07201)
# [ChatEval: Towards Better LLM-based Evaluators through Multi-Agent De…