bert |
bert-base-chinese |
google-bert |
bert-base-chinese |
bert-base-chinese |
|
chinese_L-12_H-768_A-12 |
Google |
TF weights
Tongjilibo/bert-chinese_L-12_H-768_A-12 |
|
|
chinese-bert-wwm-ext |
HFL |
hfl/chinese-bert-wwm-ext |
hfl/chinese-bert-wwm-ext |
|
bert-base-multilingual-cased |
google-bert |
bert-base-multilingual-cased |
bert-base-multilingual-cased |
|
MacBERT |
HFL |
hfl/chinese-macbert-base
hfl/chinese-macbert-large |
hfl/chinese-macbert-base
hfl/chinese-macbert-large |
|
WoBERT |
Zhuiyi Technology |
junnyu/wobert_chinese_base
junnyu/wobert_chinese_plus_base |
junnyu/wobert_chinese_base
junnyu/wobert_chinese_plus_base |
roberta |
chinese-roberta-wwm-ext |
HFL |
hfl/chinese-roberta-wwm-ext
hfl/chinese-roberta-wwm-ext-large (the MLM weights of the large model are randomly initialized) |
hfl/chinese-roberta-wwm-ext
hfl/chinese-roberta-wwm-ext-large |
|
roberta-small/tiny |
Zhuiyi Technology |
Tongjilibo/chinese_roberta_L-4_H-312_A-12
Tongjilibo/chinese_roberta_L-6_H-384_A-12 |
|
|
roberta-base |
FacebookAI |
roberta-base |
roberta-base |
|
guwenbert |
ethanyt |
ethanyt/guwenbert-base |
ethanyt/guwenbert-base |
albert |
albert_zh albert_pytorch |
brightmart |
voidful/albert_chinese_tiny
voidful/albert_chinese_small
voidful/albert_chinese_base
voidful/albert_chinese_large
voidful/albert_chinese_xlarge
voidful/albert_chinese_xxlarge |
voidful/albert_chinese_tiny
voidful/albert_chinese_small
voidful/albert_chinese_base
voidful/albert_chinese_large
voidful/albert_chinese_xlarge
voidful/albert_chinese_xxlarge |
nezha |
NEZHA NeZha_Chinese_PyTorch |
huawei_noah |
sijunhe/nezha-cn-base
sijunhe/nezha-cn-large
sijunhe/nezha-base-wwm
sijunhe/nezha-large-wwm |
sijunhe/nezha-cn-base
sijunhe/nezha-cn-large
sijunhe/nezha-base-wwm
sijunhe/nezha-large-wwm |
|
nezha_gpt_dialog |
bojone |
Tongjilibo/nezha_gpt_dialog |
|
xlnet |
Chinese-XLNet |
HFL |
hfl/chinese-xlnet-base |
hfl/chinese-xlnet-base |
transformer_xl |
huggingface |
transfo-xl/transfo-xl-wt103 |
transfo-xl/transfo-xl-wt103 |
deberta |
Erlangshen-DeBERTa-v2 |
IDEA |
IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinese |
IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinese |
electra |
Chinese-ELECTRA |
HFL |
hfl/chinese-electra-base-discriminator |
hfl/chinese-electra-base-discriminator |
ernie |
ernie |
Baidu Wenxin |
nghuyong/ernie-1.0-base-zh
nghuyong/ernie-3.0-base-zh |
nghuyong/ernie-1.0-base-zh
nghuyong/ernie-3.0-base-zh |
roformer |
roformer |
Zhuiyi Technology |
junnyu/roformer_chinese_base |
junnyu/roformer_chinese_base |
|
roformer_v2 |
Zhuiyi Technology |
junnyu/roformer_v2_chinese_char_base |
junnyu/roformer_v2_chinese_char_base |
simbert |
simbert |
Zhuiyi Technology |
Tongjilibo/simbert-chinese-base
Tongjilibo/simbert-chinese-small
Tongjilibo/simbert-chinese-tiny |
|
|
simbert_v2/roformer-sim |
Zhuiyi Technology |
junnyu/roformer_chinese_sim_char_base
junnyu/roformer_chinese_sim_char_ft_base
junnyu/roformer_chinese_sim_char_small
junnyu/roformer_chinese_sim_char_ft_small |
junnyu/roformer_chinese_sim_char_base
junnyu/roformer_chinese_sim_char_ft_base
junnyu/roformer_chinese_sim_char_small
junnyu/roformer_chinese_sim_char_ft_small |
gau |
GAU-alpha |
Zhuiyi Technology |
Tongjilibo/chinese_GAU-alpha-char_L-24_H-768 |
|
uie |
uie uie_pytorch |
Baidu |
Tongjilibo/uie-base |
|
gpt |
CDial-GPT |
thu-coai |
thu-coai/CDial-GPT_LCCC-base
thu-coai/CDial-GPT_LCCC-large |
thu-coai/CDial-GPT_LCCC-base
thu-coai/CDial-GPT_LCCC-large |
|
cpm_lm (2.6B parameters) |
Tsinghua University |
TsinghuaAI/CPM-Generate |
TsinghuaAI/CPM-Generate |
|
nezha_gen |
huawei_noah |
Tongjilibo/chinese_nezha_gpt_L-12_H-768_A-12 |
|
gpt2-chinese-cluecorpussmall |
UER |
uer/gpt2-chinese-cluecorpussmall |
uer/gpt2-chinese-cluecorpussmall |
|
gpt2-ml |
imcaspar |
torch, BaiduYun (extraction code: 84dh) |
gpt2-ml_15g_corpus
gpt2-ml_30g_corpus |
bart |
bart_base_chinese |
Fudan FNLP |
fnlp/bart-base-chinese v1.0 |
fnlp/bart-base-chinese
fnlp/bart-base-chinese-v1.0 |
t5 |
t5 |
UER |
uer/t5-small-chinese-cluecorpussmall
uer/t5-base-chinese-cluecorpussmall |
uer/t5-base-chinese-cluecorpussmall
uer/t5-small-chinese-cluecorpussmall |
|
mt5 |
Google |
google/mt5-base |
google/mt5-base |
|
t5_pegasus |
Zhuiyi Technology |
Tongjilibo/chinese_t5_pegasus_small
Tongjilibo/chinese_t5_pegasus_base |
|
|
chatyuan |
clue-ai |
ClueAI/ChatYuan-large-v1
ClueAI/ChatYuan-large-v2 |
ClueAI/ChatYuan-large-v1
ClueAI/ChatYuan-large-v2 |
|
PromptCLUE |
clue-ai |
ClueAI/PromptCLUE-base |
ClueAI/PromptCLUE-base |
chatglm |
chatglm-6b |
THUDM |
THUDM/chatglm-6b
THUDM/chatglm-6b-int8
THUDM/chatglm-6b-int4 v0.1.0 |
THUDM/chatglm-6b
THUDM/chatglm-6b-int8
THUDM/chatglm-6b-int4
THUDM/chatglm-6b-v0.1.0 |
|
chatglm2-6b |
THUDM |
THUDM/chatglm2-6b
THUDM/chatglm2-6b-int4
THUDM/chatglm2-6b-32k |
THUDM/chatglm2-6b
THUDM/chatglm2-6b-int4
THUDM/chatglm2-6b-32k |
|
chatglm3-6b |
THUDM |
THUDM/chatglm3-6b
THUDM/chatglm3-6b-32k |
THUDM/chatglm3-6b
THUDM/chatglm3-6b-32k |
|
glm4-9b |
THUDM |
THUDM/glm-4-9b
THUDM/glm-4-9b-chat
THUDM/glm-4-9b-chat-1m |
THUDM/glm-4-9b
THUDM/glm-4-9b-chat
THUDM/glm-4-9b-chat-1m |
llama |
llama |
meta |
|
meta-llama/llama-7b
meta-llama/llama-13b |
|
llama-2 |
meta |
meta-llama/Llama-2-7b-hf
meta-llama/Llama-2-7b-chat-hf
meta-llama/Llama-2-13b-hf
meta-llama/Llama-2-13b-chat-hf |
meta-llama/Llama-2-7b-hf
meta-llama/Llama-2-7b-chat-hf
meta-llama/Llama-2-13b-hf
meta-llama/Llama-2-13b-chat-hf |
|
llama-3 |
meta |
meta-llama/Meta-Llama-3-8B
meta-llama/Meta-Llama-3-8B-Instruct |
meta-llama/Meta-Llama-3-8B
meta-llama/Meta-Llama-3-8B-Instruct |
|
llama-3.1 |
meta |
meta-llama/Meta-Llama-3.1-8B
meta-llama/Meta-Llama-3.1-8B-Instruct |
meta-llama/Meta-Llama-3.1-8B
meta-llama/Meta-Llama-3.1-8B-Instruct |
|
llama-3.2 |
meta |
meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instruct |
meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instruct |
|
Chinese-LLaMA-Alpaca |
HFL |
|
hfl/chinese_alpaca_plus_7b
hfl/chinese_llama_plus_7b |
|
Chinese-LLaMA-Alpaca-2 |
HFL |
|
To be added |
|
Chinese-LLaMA-Alpaca-3 |
HFL |
|
To be added |
|
Belle_llama |
LianjiaTech |
BelleGroup/BELLE-LLaMA-7B-2M-enc |
Weight merge instructions, BelleGroup/BELLE-LLaMA-7B-2M-enc |
|
Ziya |
IDEA-CCNL |
IDEA-CCNL/Ziya-LLaMA-13B-v1
IDEA-CCNL/Ziya-LLaMA-13B-v1.1
IDEA-CCNL/Ziya-LLaMA-13B-Pretrain-v1 |
IDEA-CCNL/Ziya-LLaMA-13B-v1
IDEA-CCNL/Ziya-LLaMA-13B-v1.1 |
|
vicuna |
lmsys |
lmsys/vicuna-7b-v1.5 |
lmsys/vicuna-7b-v1.5 |
Baichuan |
Baichuan |
baichuan-inc |
baichuan-inc/Baichuan-7B
baichuan-inc/Baichuan-13B-Base
baichuan-inc/Baichuan-13B-Chat |
baichuan-inc/Baichuan-7B
baichuan-inc/Baichuan-13B-Base
baichuan-inc/Baichuan-13B-Chat |
|
Baichuan2 |
baichuan-inc |
baichuan-inc/Baichuan2-7B-Base
baichuan-inc/Baichuan2-7B-Chat
baichuan-inc/Baichuan2-13B-Base
baichuan-inc/Baichuan2-13B-Chat |
baichuan-inc/Baichuan2-7B-Base
baichuan-inc/Baichuan2-7B-Chat
baichuan-inc/Baichuan2-13B-Base
baichuan-inc/Baichuan2-13B-Chat |
Yi |
Yi |
01-ai |
01-ai/Yi-6B
01-ai/Yi-6B-200K
01-ai/Yi-9B
01-ai/Yi-9B-200K |
01-ai/Yi-6B
01-ai/Yi-6B-200K
01-ai/Yi-9B
01-ai/Yi-9B-200K |
|
Yi-1.5 |
01-ai |
01-ai/Yi-1.5-6B
01-ai/Yi-1.5-6B-Chat
01-ai/Yi-1.5-9B
01-ai/Yi-1.5-9B-32K
01-ai/Yi-1.5-9B-Chat
01-ai/Yi-1.5-9B-Chat-16K |
01-ai/Yi-1.5-6B
01-ai/Yi-1.5-6B-Chat
01-ai/Yi-1.5-9B
01-ai/Yi-1.5-9B-32K
01-ai/Yi-1.5-9B-Chat
01-ai/Yi-1.5-9B-Chat-16K |
bloom |
bloom |
bigscience |
bigscience/bloom-560m
bigscience/bloomz-560m |
bigscience/bloom-560m
bigscience/bloomz-560m |
Qwen |
Qwen |
Alibaba Cloud |
Qwen/Qwen-1_8B
Qwen/Qwen-1_8B-Chat
Qwen/Qwen-7B
Qwen/Qwen-7B-Chat
Qwen/Qwen-14B
Qwen/Qwen-14B-Chat |
Qwen/Qwen-1_8B
Qwen/Qwen-1_8B-Chat
Qwen/Qwen-7B
Qwen/Qwen-7B-Chat
Qwen/Qwen-14B
Qwen/Qwen-14B-Chat |
Qwen1.5 |
Alibaba Cloud |
Qwen/Qwen1.5-0.5B
Qwen/Qwen1.5-0.5B-Chat
Qwen/Qwen1.5-1.8B
Qwen/Qwen1.5-1.8B-Chat
Qwen/Qwen1.5-7B
Qwen/Qwen1.5-7B-Chat
Qwen/Qwen1.5-14B
Qwen/Qwen1.5-14B-Chat |
Qwen/Qwen1.5-0.5B
Qwen/Qwen1.5-0.5B-Chat
Qwen/Qwen1.5-1.8B
Qwen/Qwen1.5-1.8B-Chat
Qwen/Qwen1.5-7B
Qwen/Qwen1.5-7B-Chat
Qwen/Qwen1.5-14B
Qwen/Qwen1.5-14B-Chat |
Qwen2 |
Alibaba Cloud |
Qwen/Qwen2-0.5B
Qwen/Qwen2-0.5B-Instruct
Qwen/Qwen2-1.5B
Qwen/Qwen2-1.5B-Instruct
Qwen/Qwen2-7B
Qwen/Qwen2-7B-Instruct |
Qwen/Qwen2-0.5B
Qwen/Qwen2-0.5B-Instruct
Qwen/Qwen2-1.5B
Qwen/Qwen2-1.5B-Instruct
Qwen/Qwen2-7B
Qwen/Qwen2-7B-Instruct |
Qwen2-VL |
Alibaba Cloud |
Qwen/Qwen2-VL-2B-Instruct
Qwen/Qwen2-VL-7B-Instruct |
Qwen/Qwen2-VL-2B-Instruct
Qwen/Qwen2-VL-7B-Instruct |
Qwen2.5 |
Alibaba Cloud |
Qwen/Qwen2.5-0.5B
Qwen/Qwen2.5-0.5B-Instruct
Qwen/Qwen2.5-1.5B
Qwen/Qwen2.5-1.5B-Instruct
Qwen/Qwen2.5-3B
Qwen/Qwen2.5-3B-Instruct
Qwen/Qwen2.5-7B
Qwen/Qwen2.5-7B-Instruct
Qwen/Qwen2.5-14B
Qwen/Qwen2.5-14B-Instruct |
Qwen/Qwen2.5-0.5B
Qwen/Qwen2.5-0.5B-Instruct
Qwen/Qwen2.5-1.5B
Qwen/Qwen2.5-1.5B-Instruct
Qwen/Qwen2.5-3B
Qwen/Qwen2.5-3B-Instruct
Qwen/Qwen2.5-7B
Qwen/Qwen2.5-7B-Instruct
Qwen/Qwen2.5-14B
Qwen/Qwen2.5-14B-Instruct |
InternLM |
InternLM |
Shanghai AI Laboratory |
internlm/internlm-7b
internlm/internlm-chat-7b |
internlm/internlm-7b
internlm/internlm-chat-7b |
|
InternLM2 |
Shanghai AI Laboratory |
internlm/internlm2-1_8b
internlm/internlm2-chat-1_8b
internlm/internlm2-7b
internlm/internlm2-chat-7b
internlm/internlm2-20b
internlm/internlm2-chat-20b |
internlm/internlm2-1_8b
internlm/internlm2-chat-1_8b
internlm/internlm2-7b
internlm/internlm2-chat-7b |
|
InternLM2.5 |
Shanghai AI Laboratory |
internlm/internlm2_5-7b
internlm/internlm2_5-7b-chat
internlm/internlm2_5-7b-chat-1m |
internlm/internlm2_5-7b
internlm/internlm2_5-7b-chat
internlm/internlm2_5-7b-chat-1m |
Falcon |
Falcon |
tiiuae |
tiiuae/falcon-rw-1b
tiiuae/falcon-7b
tiiuae/falcon-7b-instruct |
tiiuae/falcon-rw-1b
tiiuae/falcon-7b
tiiuae/falcon-7b-instruct |
DeepSeek |
DeepSeek-MoE |
DeepSeek |
deepseek-ai/deepseek-moe-16b-base
deepseek-ai/deepseek-moe-16b-chat |
deepseek-ai/deepseek-moe-16b-base
deepseek-ai/deepseek-moe-16b-chat |
|
DeepSeek-LLM |
DeepSeek |
deepseek-ai/deepseek-llm-7b-base
deepseek-ai/deepseek-llm-7b-chat |
deepseek-ai/deepseek-llm-7b-base
deepseek-ai/deepseek-llm-7b-chat |
|
DeepSeek-V2 |
DeepSeek |
deepseek-ai/DeepSeek-V2-Lite
deepseek-ai/DeepSeek-V2-Lite-Chat |
deepseek-ai/DeepSeek-V2-Lite
deepseek-ai/DeepSeek-V2-Lite-Chat |
|
DeepSeek-Coder |
DeepSeek |
deepseek-ai/deepseek-coder-1.3b-base
deepseek-ai/deepseek-coder-1.3b-instruct
deepseek-ai/deepseek-coder-6.7b-base
deepseek-ai/deepseek-coder-6.7b-instruct
deepseek-ai/deepseek-coder-7b-base-v1.5
deepseek-ai/deepseek-coder-7b-instruct-v1.5 |
deepseek-ai/deepseek-coder-1.3b-base
deepseek-ai/deepseek-coder-1.3b-instruct
deepseek-ai/deepseek-coder-6.7b-base
deepseek-ai/deepseek-coder-6.7b-instruct
deepseek-ai/deepseek-coder-7b-base-v1.5
deepseek-ai/deepseek-coder-7b-instruct-v1.5 |
|
DeepSeek-Coder-V2 |
DeepSeek |
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct |
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct |
|
DeepSeek-Math |
DeepSeek |
deepseek-ai/deepseek-math-7b-base
deepseek-ai/deepseek-math-7b-instruct
deepseek-ai/deepseek-math-7b-rl |
deepseek-ai/deepseek-math-7b-base
deepseek-ai/deepseek-math-7b-instruct
deepseek-ai/deepseek-math-7b-rl |
MiniCPM |
MiniCPM |
OpenBMB |
openbmb/MiniCPM-2B-sft-bf16
openbmb/MiniCPM-2B-dpo-bf16
openbmb/MiniCPM-2B-128k
openbmb/MiniCPM-1B-sft-bf16 |
openbmb/MiniCPM-2B-sft-bf16
openbmb/MiniCPM-2B-dpo-bf16
openbmb/MiniCPM-2B-128k
openbmb/MiniCPM-1B-sft-bf16 |
|
MiniCPM-V |
OpenBMB |
openbmb/MiniCPM-V-2_6
openbmb/MiniCPM-Llama3-V-2_5 |
openbmb/MiniCPM-V-2_6
openbmb/MiniCPM-Llama3-V-2_5 |
embedding |
text2vec-base-chinese |
shibing624 |
shibing624/text2vec-base-chinese |
shibing624/text2vec-base-chinese |
|
m3e |
moka-ai |
moka-ai/m3e-base |
moka-ai/m3e-base |
|
bge |
BAAI |
BAAI/bge-large-en-v1.5
BAAI/bge-large-zh-v1.5
BAAI/bge-base-en-v1.5
BAAI/bge-base-zh-v1.5
BAAI/bge-small-en-v1.5
BAAI/bge-small-zh-v1.5 |
BAAI/bge-large-en-v1.5
BAAI/bge-large-zh-v1.5
BAAI/bge-base-en-v1.5
BAAI/bge-base-zh-v1.5
BAAI/bge-small-en-v1.5
BAAI/bge-small-zh-v1.5 |
|
gte |
thenlper |
thenlper/gte-large-zh
thenlper/gte-base-zh |
thenlper/gte-base-zh
thenlper/gte-large-zh |