evaluation-framework Search Results

1000+ results
for evaluation-framework

moj-analytical-services/dmet-cfe #31

Formalise Innovation/Evaluation Approach

- [x] [Evaluation Framework](https://github.com/moj-analytical-services/dmet-cfe/tree/main/investigations/evaluation_methodology)
- [x] #47
- [x] #48
- [x] #39
- [x] #49

SoumayaMauthoorMOJ updated 1 month ago
1
DorisAmoakohene/Thesis #1

Thesis Topic and Questions

@tdhock as I mentioned in our meeting yesterday, I have these topics and questions for my thesis. Could you assist me? Kindly let me know what you think about each and which one is most suitable T…

DorisAmoakohene updated 1 month ago
1
AkihikoWatanabe/paper_notes #1481

Beyond Utility: Evaluating LLM as Recommender, Chumeng Jiang…

# URL - https://arxiv.org/abs/2411.00331 # Authors - Chumeng Jiang - Jiayin Wang - Weizhi Ma - Charles L. A. Clarke - Shuai Wang - Chuhan Wu - Min Zhang # Abstract - With the rapid dev…

AkihikoWatanabe updated 3 weeks ago
1
UKGovernmentBEIS/inspect_ai #704

plot_results(): Are there any frameworks that allow summaris…

Many evaluation tools have frameworks to allow summarising and visualising results. An example is [zeno](https://zenoml.com/docs/integrations/eleuther) for [lm-eval-harness](https://github.com/Eleuthe…

sohaibimran7 updated 1 month ago
2
RobotLocomotion/drake #10383

Support multithreaded evaluation in the System Framework

Per discussion with @RussTedrake at the Ann Arbor offsite and today, we would like to support parallel execution in a system Diagram, likely via asynchronous evaluation of input ports with direct supp…

sherm1 updated 1 year ago
3
the-benchmarker/web-frameworks #7732

Create new types of tests

Hello, Currently, the web-framework benchmark tests an empty endpoint. However, web frameworks are about more than just request delay; they are mainly about processing complex tasks like JSON s…

mmaryo updated 1 month ago
19
collinleiber/ClustPy #96

Use with textual datasets

Hi, I want to use ClustPy to evaluate Clustering Algorithms in Combination with NLP Embeddings. But currently I am unable to get it to run the way I want it. Basically I want to replicate the fo…

randomn4me updated 1 week ago
1
dotnet/wpf #9805

Multiple inputs in quick succession cause an FatalExecutionE…

_This issue has been moved from [a ticket on Developer Community](https://developercommunity.visualstudio.com/t/Multiple-inputs-in-quick-succession-caus/10738795)._ --- [severity:It's more difficult …

vsfeedback updated 2 months ago
9
kubeedge/ianvs #98

Smart Coding benchmark suite: built on KubeEdge-Ianvs

**What would you like to be added/modified:** 1. Build a collaborative code intelligent agent alignment dataset for LLMs: - The dataset should include behavioral trajectories, feedback, and i…

YangBrooksHan updated 3 months ago
1
datagero/pico-scholar #27

Implement Evaluation and Safeguards for RAG Features

- **Objective:** Ensure RAG components are reliable through evaluation and safeguards. - **Details:** - Define performance metrics and monitor system accuracy. - Implement safeguards to h…

datagero updated 3 weeks ago
1
