Open shm007g opened 1 year ago
PushShift.io Reddit) using metaseq, at 1/7th the carbon footprint of GPT-3, combining Meta’s open source Fully Sharded Data Parallel (FSDP) API and NVIDIA’s tensor parallel abstraction within Megatron-LM. The training data contain predominantly English text and a small amount of non-English data via CommonCrawl. Released under a noncommercial license.

$\color{red}{\textsf{Refactoring, Coming soon}}$

Model | Date | Base | Size (B) | Weights | Data | License | Context Length | Demo |
---|---|---|---|---|---|---|---|---|
Dolly-v2 | 2023/04/12 | Pythia | 3/6.9/12 | databricks/dolly-v2-12b | databricks-dolly-15k | Apache-2.0 | 2048 | |
Dolly-v1-6b | 2023/03/24 | GPT-J | 6 | databricks/dolly-v1-6b | Stanford Alpaca | Apache-2.0 | 2048 | |
RWKV-4-Raven | 2023/04 | RWKV-4-Pile | 1.5/3/7/14 | BlinkDL/rwkv-4-raven | Alpaca, CodeAlpaca, Guanaco, GPT4All, ShareGPT and more | Apache-2.0 | Infinite | space |
BLOOMZ & mT0 | 2022/11 | BLOOM/mT5 | 0.56/1.1/1.7/3/7.1/176 | bigscience/bloomz | xP3 | bigscience-bloom-rail-1.0 | 2048 | |
OpenAssistant | 2023/04/16 | Pythia/LLaMA | 1.4/6.9/12/30 | OpenAssistant | OASST1 | Apache-2.0 | 2048 | site |

Test task | Detailed examples | Number of samples | Chinese-Alpaca-7B | Chinese-Alpaca-13B | Chinese-Alpaca-Plus-7B |
---|---|---|---|---|---|
💯 Overall average | - | 200 | 65.1 | 70.6 | 👍🏻75.3 |
Question answering | QA.md | 20 | 66 | 74 | 👍🏻80 |
Open-ended QA | OQA.md | 20 | 👍🏻79 | 74 | 👍🏻78 |
Numerical computation and reasoning | REASONING.md | 20 | 31 | 👍🏻50 | 45 |
Poetry, literature, philosophy | LITERATURE.md | 20 | 68 | 73 | 👍🏻76 |
Music, sports, entertainment | ENTERTAINMENT.md | 20 | 68 | 74 | 👍🏻79 |
Letter and article writing | GENERATION.md | 20 | 76 | 👍🏻81 | 👍🏻81 |
Translation | TRANSLATION.md | 20 | 76 | 78 | 👍🏻82 |
Multi-turn dialogue | DIALOGUE.md | 20 | 👍🏻83 | 73 | 👍🏻84 |
Coding | CODE.md | 20 | 57 | 👍🏻64 | 59 |
Ethics and refusal | ETHICS.md | 20 | 47 | 65 | 👍🏻89 |
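
The overall-average row at the top of this table appears to be the plain mean of the ten per-task scores. Because every task has 20 samples, a weighted and an unweighted mean coincide. The short check below is a sketch under that assumption, not the authors' evaluation script. The scores are copied from the table, and the `overall_average` helper is a hypothetical name introduced here for illustration.

```python
# Per-task scores copied from the table above, in row order:
# QA, open-ended QA, reasoning, literature, entertainment,
# generation, translation, dialogue, code, ethics.
scores = {
    "Chinese-Alpaca-7B":      [66, 79, 31, 68, 68, 76, 76, 83, 57, 47],
    "Chinese-Alpaca-13B":     [74, 74, 50, 73, 74, 81, 78, 73, 64, 65],
    "Chinese-Alpaca-Plus-7B": [80, 78, 45, 76, 79, 81, 82, 84, 59, 89],
}


def overall_average(task_scores):
    """Unweighted mean of per-task scores, rounded to one decimal place.

    Assumption: each task contributes equally (20 samples per task).
    """
    return round(sum(task_scores) / len(task_scores), 1)


averages = {model: overall_average(s) for model, s in scores.items()}

for model, avg in averages.items():
    print(f"{model}: {avg}")
```

The computed values match the 65.1, 70.6 and 75.3 reported in the overall-average row.
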
<!DOCTYPE html>
<html>
<head>
<style>
body {
background-image: url('https://boson.ai/bg.jpg');
background-attachment: fixed;
background-size: cover;
}
.header {
padding-top: 50px;
margin: auto;
width: 60%;
font-family: Arial,Helvetica,sans-serif;
}
h1 {
font-size: 110px;
color: #fff;
}
h2 {
font-size: 80px;
color: #aaa;
}
</style>
</head>
<body>
<div class="header">
<h1>Large Models for All</h1>
<h2>We're building something big...</h2>
<h2>Stay tuned! </h2>
<a href="https://github.com/shm007g/LLaMA-Cult-and-More">
<picture>
<source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/svg?repos=shm007g/LLaMA-Cult-and-More&type=Date&theme=dark" />
<source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/svg?repos=shm007g/LLaMA-Cult-and-More&type=Date" />
<img alt="Star History Chart" src="https://api.star-history.com/svg?repos=shm007g/LLaMA-Cult-and-More&type=Date" />
</picture>
</a>
<input type="hidden" id="thanks" name="to" value="https://github.com/boson-ai">
<h2></h2>
</div>
</body>
</html>