deepseek-ai DeepSeek-LLM issues

deepseek-ai / DeepSeek-LLM

DeepSeek LLM: Let there be answers

https://chat.deepseek.com/

MIT License

1.32k stars 87 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Is the compute calculation wrong for Chinchilla in the paper?

#48 yzlnew opened 2 months ago
1
贵团队是否会升级长上下文的版本？

#47 edisonzf2020 opened 3 months ago
1
Humaneval, use base model or instruct finetuned model?

#46 jasonzliang opened 3 months ago
1
关于模型指标有一些疑问

#45 MangoFF opened 3 months ago
1
Deepseek VL?

#44 IdiotSandwichTheThird closed 3 months ago
1
Could you please release intermediate pretraining checkpoints at HuggingFace?

#43 Yangjinluan opened 3 months ago
0
Scaling laws data

#42 borgr opened 4 months ago
1
Deepseek SFT数据包含system应该如何处理？

#41 xiatingyu closed 4 months ago
1
请问LLM和coder的base model结构是一样的吗？还是有什么区别呢？

#40 cherishtttz opened 4 months ago
1
Update README.md

#39 stack-heap-overflow closed 5 months ago
0
Update README.md

#38 luofuli closed 5 months ago
0
关于vllm使用的疑问

#37 xuyifan-0731 closed 5 months ago
1
Training data distribution

#36 pluiez closed 5 months ago
1
feat(evaluation): add AlignBench output

#35 DeepSeekPH closed 5 months ago
0
AWS CLI 使用问题与 deepseek-ai S3 桶访问问题

#34 go-with-me000 opened 5 months ago
1
TriviaQA结果复现求助

#33 HYZ17 opened 5 months ago
4
AlignBench测评结果复现求助

#32 FoolMark closed 5 months ago
2
Update README.md

#31 luofuli closed 5 months ago
0
Update README.md

#30 luofuli closed 5 months ago
0
Update README.md

#29 DeepSeekPH closed 5 months ago
0
关于System Prompt

#28 DirtyKnightForVi closed 5 months ago
4
Update README.md

#27 luofuli closed 5 months ago
0
Missing files in released pretrain ckpts

#26 Wizardcoast closed 6 months ago
1
Inquiry about Prompt Engineering and Handling Toxicity/Hallucination

#25 eric-chen-igs opened 6 months ago
0
Programming Language in LeetCode Weekly Contest

#24 ShaneTian opened 6 months ago
3
Will finetune scripts be provided?

#23 ftgreat closed 5 months ago
1
docs(README): update README.md

#22 foldl opened 6 months ago
2
67B-Instructor – will it be released shortly/ever?

#21 BuildBackBuehler opened 6 months ago
1
lora sft deepseek 67b base版本

#20 liwenju0 closed 6 months ago
0
Update README.md

#19 luofuli closed 7 months ago
0
question on "Revisit Multi-Choice Question Benchmarks"

#18 imhmhm closed 7 months ago
1
Update README.md

#17 luofuli closed 7 months ago
0
Update README.md

#16 luofuli closed 7 months ago
0
Update README.md

#15 stack-heap-overflow closed 7 months ago
0
Fixmath

#14 DeepSeekPH closed 7 months ago
0
GPTQ模型量化

#13 315930399 closed 7 months ago
1
DeepSeek 7B Chat Lora 效果太棒了！

#12 KMnO4-zx opened 7 months ago
4
Will technical reports be released in the future?

#11 XChen-Zero closed 7 months ago
1
为什么不能复现你们的结果（why can't i reproduce your results）

#10 tanguagua closed 6 months ago
4
German umlaut missing with deepseek-llm on llama

#9 p3d-dev closed 7 months ago
1
LeetCode Weekly Contest Data

#8 tonysy closed 7 months ago
1
图很好

#7 tpoisonooo closed 7 months ago
0
update math score

#6 zdaxie closed 7 months ago
0
Update README.md

#5 stack-heap-overflow closed 7 months ago
0
Update README.md

#4 DOGEwbx closed 7 months ago
0
About LR schedule

#3 futuristx closed 7 months ago
1
can you please share sharded (<2gb / bin) model?

#2 amrrs closed 5 months ago
2
Learning rate schedule seems very helpful.

#1 GanjinZero closed 7 months ago
1