-
Hi, I've found that Pythia (from EleutherAI) is better than Cerebras GPT in terms of evaluation results. Pythia is basically an LLM based on the GPT-NeoX architecture, but its parameters ranging f…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Below is my code.
```
import torch
# from transformers import BitsAndBytesConfi…
-
## Sprint Planning - 2024-05-30 Week 6
### Checklist
- [x] #63 (to be done on Monday)
- [x] #91 (waiting for Tiffany's replies)
- [x] Prompt improvement by instructing GPT to distinguish stan…
-
### Current Behavior
```python
from langchain.embeddings import HuggingFaceBgeEmbeddings

model_name = "BAAI/bge-small-en"
model_kwargs = {'device': 'cpu'}
encode_kwargs = {'normalize_embeddings': False}
…
```
-
First off, I'd like to say thank you so much for publishing SWE-bench and SWE-agent. I was wondering, is there anywhere that the logs from running the SWE-bench/SWE-agent evaluation are posted? I am working…
-
https://github.com/bazingagin/npc_gzip/blob/a46991564161023bba3b1267e0e74c69dab8f8eb/experiments.py#L116
It appears that the `calc_acc` method marks a sample correct if ANY of the labels with…
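To make the scoring concern concrete, here is a minimal, hypothetical sketch (not the repository's actual code) contrasting a permissive "correct if any of the top-k neighbour labels matches" score with a stricter majority-vote score; `any_match_accuracy` and `majority_vote_accuracy` are illustrative names:

```python
from collections import Counter

def any_match_accuracy(topk_labels, gold):
    # Permissive scoring: a sample counts as correct if ANY of its
    # top-k neighbour labels equals the gold label.
    hits = sum(1 for labels, g in zip(topk_labels, gold) if g in labels)
    return hits / len(gold)

def majority_vote_accuracy(topk_labels, gold):
    # Stricter scoring: predict the majority label among the top-k
    # neighbours (ties broken by Counter's insertion order).
    hits = 0
    for labels, g in zip(topk_labels, gold):
        pred = Counter(labels).most_common(1)[0][0]
        hits += pred == g
    return hits / len(gold)

# Two samples with k=2 neighbours each; the first is a tie.
topk = [["cat", "dog"], ["dog", "dog"]]
gold = ["dog", "cat"]
```

On the tied first sample, permissive scoring counts a hit (the gold label is among the neighbours) while majority vote does not, which is why the two metrics can diverge.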
-
Hi, when I try to use holmes with a GPT-2 or Llama model, I get the following error:
```
python investigate.py --model_name 'bbunzeck/gpt-wee-regular' --version holmes --parallel_probing --cuda_visibl…
```
-
**Submitting author:** @omlins (Samuel Omlin)
**Repository:** https://github.com/omlins/ImplicitGlobalGrid.jl
**Branch with paper.md** (empty if default branch):
**Version:**
**Editor:** @fcdimitr
…
-
### Feature request
The current Trainer only supports teacher-forcing generation for computing the evaluation loss, not auto-regressive generation for other metrics. Seq2SeqTrainer supports this but seems…
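To illustrate the distinction the request draws, here is a toy sketch of an auto-regressive decoding loop; `toy_next_token` is a hypothetical stand-in for a model, not the Trainer API:

```python
def toy_next_token(prefix):
    # Hypothetical stand-in for a model's next-token prediction:
    # it returns a fixed continuation for each prefix.
    continuation = {(): "the", ("the",): "cat", ("the", "cat"): "<eos>"}
    return continuation.get(tuple(prefix), "<eos>")

def greedy_generate(max_len=10):
    # Auto-regressive decoding: each step feeds the model its OWN
    # previous outputs. Teacher forcing instead feeds the gold target
    # tokens at every step, which is why it can compute a loss but
    # not the free-running generations needed for metrics like BLEU.
    out = []
    for _ in range(max_len):
        tok = toy_next_token(out)
        if tok == "<eos>":
            break
        out.append(tok)
    return out
```

Metrics such as ROUGE or BLEU need the output of a loop like this, whereas the evaluation loss only needs one teacher-forced forward pass.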