gpt-evaluation Search Results

1000+ results
for gpt-evaluation


tianyi-lab/Superfiltering #2

Is there an evaluation of non API models? Such as LLama 7B, …

Is there an evaluation of non API models? Such as LLama 7B, GPT xl, etc.

sev777 updated 5 months ago
1
AkihikoWatanabe/paper_notes #893

Generating User-Engaging News Headlines, ACL'23

https://virtual2023.aclweb.org/paper_P2433.html

AkihikoWatanabe updated 1 year ago
4
baichuan-inc/Baichuan-7B #23

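Implemented LoRA fine-tuning for the baichuan-7B model

Supports SFT and RLHF workflows on instruction datasets such as Alpaca: https://github.com/hiyouga/LLaMA-Efficient-Tuning LoRA fine-tuning runs on a single 3090 GPU, and the QLoRA method is also supported (minimum 12 GB of VRAM). LoRA weights of the fine-tuned model: https://huggingface.co/hiyouga/baichuan-7b-sft Run the following command to…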

hiyouga updated 2 months ago
101
irthomasthomas/undecidability #652

The Bitter Lesson

- [ ] [The Bitter Lesson](http://www.incompleteideas.net/IncIdeas/BitterLesson.html) # The Bitter Lesson **DESCRIPTION:** "The Bitter Lesson Rich Sutton March 13, 2019 The biggest lesson that …

irthomasthomas updated 6 months ago
1
microsoft/DeepSpeed #2736

[BUG] RuntimeError: Tensors must be contiguous error while f…

I am just trying to fine-tune "EleutherAI/gpt-neo-1.3B" for causal LM on Google Colab. Without any changes, it gives an out-of-memory error. I was checking what I could do and found DeepSpeed. I added deepsp…

FahriBilici updated 8 months ago
24
MaartenGr/BERTopic #1450

AttributeError: Can't get attribute 'EuclideanDistance64' on…

When I load the generated BERTopic model, it gives the following error trace: ``` /home/21zz42/Asset-Management-Topic-Modeling/.venv/lib/python3.10/site-packages/umap/distances.py:1063: NumbaDepreca…

zhimin-z updated 1 year ago
15
YJiangcm/FollowBench #3

some questions

Thank you for proposing this interesting benchmark. After finishing the **Model Inference** and **LLM-based Evaluation**, we tried to obtain the results as shown in **Merge Evaluation and Save Res…

AccidM updated 4 months ago
1
run-llama/llama_index #11567

[Bug]: LLM evaluated with Llamaindex don't provide scores fo…

### Bug Description Llamaindex crashes when evaluating some large language models for specific metrics (e.g., answer correctness). This happens because these models don't provide scores in their outp…

bastienpo updated 2 months ago
2
swe-bench/experiments #28

Open Source and Verification steps for AppMap Navie

The purpose of this issue is to provide instructions on how to verify the open source status and benchmark results for AppMap Navie on the Lite and Full benchmarks. ## Navie is open source You c…

kgilpin updated 2 months ago
7
braintrustdata/autoevals #84

(`autoevals` JS): Better support for evaluating based on pre…

Currently, it's less than straightforward to run evals if the answer is pre-generated, or based on case-specific data beyond the input. This is because the `Eval`'s `task()` function only accepts t…

mongodben updated 2 months ago
2
