bigbench Search Results

142 results for bigbench

Best match

google/BIG-bench #460

Running test_tasks.py produces SyntaxError

I'm following the instructions on this webpage: https://github.com/google/BIG-bench/blob/main/README.md#how-do-i-create-a-task There's a section that says: > **Testing and evaluating** > > Onc…

githubrandomuser2017 updated 2 years ago
1
google/BIG-bench #738

Results higher than max score in simple_arithmetic

See https://github.com/google/BIG-bench/blob/main/bigbench/benchmark_tasks/simple_arithmetic/results/scores_GPT_GPT-3-200B.json#L33

Randl updated 2 years ago
2
google/BIG-bench #446

Alignment Questionnaire

https://github.com/google/BIG-bench/blob/main/bigbench/benchmark_tasks/alignment_questionnaire/human_eval/210309/survey-iiu4lsyptvkry7ga6q3pmiqhya.xlsx Hi, Some of the questions in this sheet assu…

posttool updated 2 years ago
2
google/BIG-bench #463

Incorrect scores for task with a single example

BLEU and Rouge score are incorrectly reported as 0 when matching the targets with human evaluation on a task with a single example. `foo/task.json` to reproduce: ``` {"canary": "BENCHMARK DATA …

guygurari updated 2 years ago
1
fchollet/ARC-AGI #81

Abstraction and Reasoning Corpus --> bigbench?

Hi There! This is a really cool corpus, @fchollet :-) I'm wondering if a version of this task could be adapted to this ICLR workshop, centered on the construction of a big set of sequence-to-seq…

jmhessel updated 2 years ago
4
renaissance-benchmarks/renaissance #359

Transient NullPointerException and deadlock in the reactors …

At Oracle Labs we have been seeing the reactors benchmark sometimes throw a `NullPointerException` and then deadlock. Here's a stack trace from a recent example: ``` ... ====== reactors (concurrenc…

gergo- updated 2 years ago
6
google/BIG-bench #574

Missing canary string in some auto-generated files could cau…

For each task, `/results/dummy_model.transcript.md` was auto-generated and placed into the task dir. For example, [this one](https://github.com/google/BIG-bench/blob/main/bigbench/benchmark_tasks/ruin…

RomanPlusPlus updated 2 years ago
2
oracle/graal #4297

Java 11 GraalVM 22.0.0.2 EE on Windows 10 pro 21H2 64bit fai…

On Windows 10 21H2 64bit with Java 11 GraalVM 22.0.0.2 **EE** the test "reactors (concurrency)" fails with NullPointerException on the **second** repetition: (...snip...) GC before operation: comp…

Ulibaer updated 2 years ago
3
renaissance-benchmarks/renaissance #353

reactors (concurrency) fails with Java 11 GraalVM 22.0.0.2 E…

On Windows 10 21H2 64bit with Java 11 GraalVM 22.0.0.2 EE the test "reactors (concurrency)" fails with NullPointerException on the **second** repetition: GC before operation: completed in 333.103 m…

Ulibaer updated 2 years ago
1
bigscience-workshop/promptsource #661

Scripts For Evaluating T0 with Prompts

Hello, are there publicly available scripts for how the evaluation was done for the fixed choice tasks in the [T0 Paper](https://arxiv.org/pdf/2110.08207.pdf)? I tried implementing them myself but…

gabeorlanski updated 2 years ago
2
