deepseek-ai DeepSeek-Coder issues


DeepSeek Coder: Let the Code Write Itself

https://coder.deepseek.com/

MIT License

6.61k stars 461 forks

issues


How to finetune on a single GPU

#79 sxsxsx opened 9 months ago
1
how to finetune deepseek-coder-33b-instruct with 8*A800 80G

#78 netrookiecn closed 9 months ago
5
How do I implement LoRA fine-tuning for deepseek-coder?

#77 YCaigogogo opened 9 months ago
1
Random Real Users Data Leak

#76 Wontfallo opened 9 months ago
1
How to use this in an IDE?

#75 aliakhtar opened 9 months ago
3
Potential bug: EOS token mismatch

#74 lixinye-nju closed 9 months ago
3
Update README.md

#73 foldl opened 9 months ago
0
Support for gpt fast.

#72 briandw closed 9 months ago
1
FIM doesn't work

#71 SinanAkkoyun closed 9 months ago
15
sort all languages in README

#70 foldl closed 8 months ago
3
Deepseek coder 33B model produces repeated output during testing

#69 txy6666yr opened 10 months ago
3
What is the dataset format for fine-tuning the base model for code infilling purposes only (not instructions) ?

#68 arkaprovob opened 10 months ago
2
If model_length is set to 2048 for fine-tuning, is the fine-tuned model's maximum input limited to 2048?

#67 Stucrehap closed 10 months ago
0
When using deepspeed mii to accelerate inference, the output consists entirely of special characters such as "(" and "{". How can this be resolved?

#66 Stucrehap opened 10 months ago
1
Question about using the fine-tuned model

#65 Horizon2022 closed 9 months ago
2
Syntax checking

#64 Bytes-Explorer opened 10 months ago
0
About parsing file dependencies

#63 Grey4sh opened 10 months ago
3
close

#62 tanklandry closed 10 months ago
1
DeepSeek Chat

#61 SinanAkkoyun closed 10 months ago
3
I see the model supports amis. How should training data for amis be constructed?

#60 MarsMeng1994 closed 5 months ago
1
Visual Studio Code Extension

#59 geosem42 closed 10 months ago
2
After fine-tuning the model, how do I load the fine-tuned model and evaluate its performance on a test set?

#58 Horizon2022 closed 10 months ago
6
Update README.md

#57 BingxuanWang closed 10 months ago
0
How do I use the system prompt in the demo?

#56 LiuYang328 opened 10 months ago
1
After running the finetune script, no saved model appears

#55 nstl-zyb closed 10 months ago
5
Running finetune_deepseekcoder.py results in return code = -9 and running script directly results in RuntimeError: 'weight' must be 2-D

#54 hobpond closed 10 months ago
5
How can I create a 13B version?

#53 erfanium closed 10 months ago
1
Update eval_instruct.py

#52 itstalmeez closed 10 months ago
1
GPU memory usage is too high; the example turns out to use fp32

#51 GeeeekExplorer closed 10 months ago
1
tokenizer.model

#50 SinanAkkoyun closed 10 months ago
12
Generating SQL

#49 xiaokai01 closed 10 months ago
1
Could you provide a script for multi-node, multi-GPU fine-tuning?

#48 txy6666yr closed 10 months ago
4
Code output and official website output do not

#47 xiaokai01 closed 10 months ago
1
Add MBPP evaluation script for deepseek-coder instruct models

#46 DejianYang closed 10 months ago
0
Update Q&A in README.md

#45 BingxuanWang closed 10 months ago
0
TensorRT-LLM Support

#44 anxietymonger closed 10 months ago
5
Repo level concatenation of data

#43 Bytes-Explorer opened 10 months ago
38
Dedup of code during data prep

#42 Bytes-Explorer closed 5 months ago
7
For evaluating on the MBPP dataset, any code for the instruction-based model?

#41 wujwyi closed 10 months ago
1
Is there an API that can be called?

#40 mimadiule opened 10 months ago
1
better docs

#39 uplight-dev opened 10 months ago
0
Sagemaker hugging face deployment issue:

#38 Al-aminI opened 10 months ago
0
Question on the license

#37 UniverseFly closed 10 months ago
0
How many languages was the training based on?

#36 lionday closed 10 months ago
0
When the prompt contains many numbers, the 33b-instruct model produces repeated output.

#35 wushixong closed 10 months ago
1
How much GPU memory does the 6.7B model need?

#34 askxiaozhang closed 10 months ago
6
Instruction dataset?

#33 cdj0311 closed 10 months ago
4
How much GPU memory does 33b need, and how do I load it with quantization?

#32 xiaokai01 closed 10 months ago
2
Cannot reproduce the finetune results

#31 kylesong307 closed 10 months ago
1
Prompt format of chat model

#30 anxietymonger closed 10 months ago
2
