deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
https://chat.deepseek.com/
MIT License
1.32k stars
87 forks
issues
Is the compute calculation wrong for Chinchilla in the paper?
#48
yzlnew
opened
2 months ago
1
Will your team release an upgraded long-context version?
#47
edisonzf2020
opened
3 months ago
1
Humaneval, use base model or instruct finetuned model?
#46
jasonzliang
opened
3 months ago
1
Some questions about the model metrics
#45
MangoFF
opened
3 months ago
1
Deepseek VL?
#44
IdiotSandwichTheThird
closed
3 months ago
1
Could you please release intermediate pretraining checkpoints at HuggingFace?
#43
Yangjinluan
opened
3 months ago
0
Scaling laws data
#42
borgr
opened
4 months ago
1
How should Deepseek SFT data that includes a system prompt be handled?
#41
xiatingyu
closed
4 months ago
1
Do the LLM and coder base models share the same architecture, or are there differences?
#40
cherishtttz
opened
4 months ago
1
Update README.md
#39
stack-heap-overflow
closed
5 months ago
0
Update README.md
#38
luofuli
closed
5 months ago
0
Question about using vllm
#37
xuyifan-0731
closed
5 months ago
1
Training data distribution
#36
pluiez
closed
5 months ago
1
feat(evaluation): add AlignBench output
#35
DeepSeekPH
closed
5 months ago
0
Problems using the AWS CLI and accessing the deepseek-ai S3 bucket
#34
go-with-me000
opened
5 months ago
1
Help needed reproducing TriviaQA results
#33
HYZ17
opened
5 months ago
4
Help needed reproducing AlignBench evaluation results
#32
FoolMark
closed
5 months ago
2
Update README.md
#31
luofuli
closed
5 months ago
0
Update README.md
#30
luofuli
closed
5 months ago
0
Update README.md
#29
DeepSeekPH
closed
5 months ago
0
About System Prompt
#28
DirtyKnightForVi
closed
5 months ago
4
Update README.md
#27
luofuli
closed
5 months ago
0
Missing files in released pretrain ckpts
#26
Wizardcoast
closed
6 months ago
1
Inquiry about Prompt Engineering and Handling Toxicity/Hallucination
#25
eric-chen-igs
opened
6 months ago
0
Programming Language in LeetCode Weekly Contest
#24
ShaneTian
opened
6 months ago
3
Will finetune scripts be provided?
#23
ftgreat
closed
5 months ago
1
docs(README): update README.md
#22
foldl
opened
6 months ago
2
67B-Instructor – will it be released shortly/ever?
#21
BuildBackBuehler
opened
6 months ago
1
lora sft deepseek 67b base version
#20
liwenju0
closed
6 months ago
0
Update README.md
#19
luofuli
closed
7 months ago
0
question on "Revisit Multi-Choice Question Benchmarks"
#18
imhmhm
closed
7 months ago
1
Update README.md
#17
luofuli
closed
7 months ago
0
Update README.md
#16
luofuli
closed
7 months ago
0
Update README.md
#15
stack-heap-overflow
closed
7 months ago
0
Fixmath
#14
DeepSeekPH
closed
7 months ago
0
GPTQ model quantization
#13
315930399
closed
7 months ago
1
DeepSeek 7B Chat Lora works great!
#12
KMnO4-zx
opened
7 months ago
4
Will technical reports be released in the future?
#11
XChen-Zero
closed
7 months ago
1
Why can't I reproduce your results?
#10
tanguagua
closed
6 months ago
4
German umlaut missing with deepseek-llm on llama
#9
p3d-dev
closed
7 months ago
1
LeetCode Weekly Contest Data
#8
tonysy
closed
7 months ago
1
The figures are great
#7
tpoisonooo
closed
7 months ago
0
update math score
#6
zdaxie
closed
7 months ago
0
Update README.md
#5
stack-heap-overflow
closed
7 months ago
0
Update README.md
#4
DOGEwbx
closed
7 months ago
0
About LR schedule
#3
futuristx
closed
7 months ago
1
can you please share sharded (<2gb / bin) model?
#2
amrrs
closed
5 months ago
2
Learning rate schedule seems very helpful.
#1
GanjinZero
closed
7 months ago
1