-
I am trying to evaluate llm4decompile-6.7b-v1.5 using the methods you provided. The model weights were downloaded from the Hugging Face repository of the same name. However, I keep encountering an err…
-
- [ ] Shortlist metrics
- [ ] Shortlist datasets - eval scripts available
- [ ] Write Evaluation script
-
### Description
Overall intent is to ensure the Ruby Agent HTTP instrumentation works properly with the HTTP/2 protocol.
As part of this research spike:
- [ ] Catalog all the instrumentation re…
-
Purpose: Define how T3 participants will be evaluated during and after the training.
How to get started:
1. Review FL exam criteria as reference
2. List key competencies T3 participants must demo…
-
We need an evaluation framework in chapter 10 that goes from actions to objectives to outcomes to impacts to high-level changes, and includes how this will be measured (metrics). The high-level chang…
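One way the actions → objectives → outcomes → impacts → high-level changes chain could be represented, together with its metrics, is sketched below. This is only an illustration of the structure described above; the names `ChainLevel`, `EvaluationChain`, and all example descriptions and metrics are hypothetical, not taken from the chapter.

```python
from dataclasses import dataclass, field

# Hypothetical model of the chapter-10 evaluation framework: each level
# in the chain carries the metrics used to measure it.
@dataclass
class ChainLevel:
    name: str                       # e.g. "objective"
    description: str
    metrics: list[str] = field(default_factory=list)

@dataclass
class EvaluationChain:
    levels: list[ChainLevel]

    def metric_plan(self) -> dict[str, list[str]]:
        """Map each level of the chain to its measurement metrics."""
        return {lvl.name: lvl.metrics for lvl in self.levels}

# Example chain (all content invented for illustration).
chain = EvaluationChain(levels=[
    ChainLevel("action", "Deliver training sessions", ["sessions held"]),
    ChainLevel("objective", "Improve participant skills", ["test scores"]),
    ChainLevel("outcome", "Skills applied on the job", ["supervisor ratings"]),
    ChainLevel("impact", "Team performance improves", ["KPI deltas"]),
    ChainLevel("high-level change", "Organizational capability grows", ["annual review"]),
])
```

The point of the structure is that every level, not just the final one, declares how it will be measured, so `metric_plan()` yields the full measurement table for the chapter.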
-
Create an automated process to run an evaluation on the training data and record the results to a *"database"*.
- [ ] Create N partitions from training data.
- [ ] Run N evaluations
- [ ] Write r…
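A minimal sketch of what that pipeline could look like, using a stand-in `evaluate` function and a local SQLite file as the *"database"*. All function names and the round-robin partitioning scheme are assumptions for illustration, not taken from the issue.

```python
import sqlite3

def make_partitions(data, n):
    """Split the training data into n roughly equal partitions (round-robin)."""
    return [data[i::n] for i in range(n)]

def evaluate(partition):
    """Stand-in evaluation: here just the mean of the partition values."""
    return sum(partition) / len(partition)

def run_and_record(data, n, db_path=":memory:"):
    """Run an evaluation per partition and record each result."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS results (partition_id INTEGER, score REAL)"
    )
    for i, part in enumerate(make_partitions(data, n)):
        conn.execute("INSERT INTO results VALUES (?, ?)", (i, evaluate(part)))
    conn.commit()
    return conn

# Example run over toy "training data".
conn = run_and_record(list(range(100)), n=4)
rows = conn.execute(
    "SELECT partition_id, score FROM results ORDER BY partition_id"
).fetchall()
```

Swapping `:memory:` for a file path makes the results persist between runs, which is all the *"database"* requirement above seems to need.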
-
Hello,
Does R2R support multi-agent collaboration, or do you have any recommendation for integrating an external framework? I'm currently evaluating options to build a system where agents collaborate while leveraging R2R’s ing…
-
I took Eigen 3.4's tensor module for a quick spin to see what it might look like to support batch evaluation in the systems framework. Short story: I'm highly encouraged by it, but I think Eigen alon…
-
I'm opening this issue to discuss what we think the "LLM task" framework should aim to be, and how we could incrementally get there.
## What we have today
Today, what we call the "task framewo…
-
What is the best way to convert a weave dataset to JSON or a native Python sequence type?
At the moment, I use the following snippet as a template:
```python
import json
def export_dataset_as_json(re…