-
Is there an evaluation of non API models? Such as LLama 7B, GPT xl, etc.
-
https://virtual2023.aclweb.org/paper_P2433.html
-
支持Alpaca等指令数据集的SFT和RLHF流程:https://github.com/hiyouga/LLaMA-Efficient-Tuning
LoRA微调可在单块3090 GPU上运行,同时支持QLoRA方法。(最低12G显存)
微调模型的 LoRA 权重:https://huggingface.co/hiyouga/baichuan-7b-sft
运行以下指令即可实现…
-
- [ ] [The Bitter Lesson](http://www.incompleteideas.net/IncIdeas/BitterLesson.html)
# The Bitter Lesson
**DESCRIPTION:**
"The Bitter Lesson
Rich Sutton
March 13, 2019
The biggest lesson that …
-
I am just trying to fine-tune "EleutherAI/gpt-neo-1.3B" for casualLM on google colab. Without anything, it gives out of memory error. I was checking what can I do and I found deepspeed. I added deepsp…
-
When I load the generated bertopic model, it give the following error traces:
```
/home/21zz42/Asset-Management-Topic-Modeling/.venv/lib/python3.10/site-packages/umap/distances.py:1063: NumbaDepreca…
-
Thank you for proposing this interesting benchmark.
After finishing the **Model Inference** and **LLM-based Evaluation**, we tried to obtain the results as shown in **Merge Evaluation and Save Res…
-
### Bug Description
Llamaindex crashes when evaluating some large language models for specific metrics (e.g., answer correctness). This happens because these models don't provide scores in their outp…
-
The purpose of this issue is to provide instructions on how to verify the open source status and benchmark results for AppMap Navie on the Lite and Full benchmarks.
## Navie is open source
You c…
-
Currently, it's less than straightforward to run evals if the answer is pre-generated, or based on case-specific data beyond the input.
This is because the `Eval`'s `task()` function only accepts t…