bigbench Search Results

140 results
for bigbench

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/datasets #5096

Transfer some canonical datasets under an organization names…

As discussed during our @huggingface/datasets meeting, we are planning to move some "canonical" dataset scripts under their corresponding organization namespace (if this does not exist). On the con…

albertvillanova updated 2 months ago
11
XinyuanWangCS/PromptAgent #3

Some replication problem

Hello, I appreciate your kind words and efforts to replicate our results on my computer. Here is the process I followed: First, I installed the necessary environment as per the "Installation" instru…

LuckyAnJooo updated 6 months ago
2
protocolbuffers/protobuf #12882

TypeError: Couldn't build proto file into descriptor pool! I…

**What version of protobuf and what language are you using?** Version: v3.8.0 (NOTE: please try updating to the latest version of protoc/runtime possible beforehand to attempt to resolve your pro…

KawaiiNotHawaii updated 7 months ago
1
huggingface/dataset-viewer #259

Support big-bench

see the thread by @lhoestq on Slack: https://huggingface.slack.com/archives/C034N0A7H09/p1652370311934619?thread_ts=1651846540.985739&cid=C034N0A7H09 ``` pip install "bigbench @ https://storage.go…

severo updated 7 months ago
10
EleutherAI/lm-evaluation-harness #1513

ValueError: Input length of input_ids is 494, but `max_lengt…

Hi, I have tried to run ``` accelerate launch -m lm_eval --model hf \ --model_args pretrained=${MODEL_PATH}/llama2_7b,max_length=2048 \ --tasks bigbench_generate_until \ --batch_si…

ZhengxiangShi updated 6 months ago
2
EleutherAI/lm-evaluation-harness #1134

Running BBH CoT

Hi, I am trying to evaluate my model on the BBH with/without CoT but all task results end up being 0.0. I am quite unexperienced, so please have it in mind when helping me out. Other tasks I've tried …

alexedin updated 6 months ago
3
EleutherAI/lm-evaluation-harness #667

Add Big Bench Hard (BBH) and AGIEval

Hi, Since Microsoft released [Orca paper](https://arxiv.org/pdf/2306.02707.pdf), we are likely to see models instruct-tuned using Orca techniques that'll most likely outperform the current crop of …

abhinavkulkarni updated 7 months ago
5
embeddings-benchmark/mteb #837

Paper segment: Task selection

The goal of this segment is to create meaningful benchmark subsets with a minimal set of tasks. I believe the steps are as follows: 1) construct an experimental subset. If people agree I can con…

KennethEnevoldsen updated 6 days ago
21
huggingface/dataset-viewer #2636

e2e is broken due to KenLM install

We get: ``` Note: This error originates from the build backend, and is likely not a problem with poetry but with kenlm (0.2.0 https://github.com/kpu/kenlm/archive/master.zip) not supporting PEP 51…

severo updated 5 months ago
4
shm007g/LLaMA-Cult-and-More #1

main page

track

shm007g updated 1 year ago
7

上一页 1...3 4 5 6 7 8 9...14 下一页

140 results for bigbench

140 results
for bigbench