issues
search
deepseek-ai
/
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
https://coder.deepseek.com/
MIT License
6.61k
stars
461
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
how to finetune in single gpu
#79
sxsxsx
opened
9 months ago
1
how to finetune deepseek-coder-33b-instruct with 8*A800 80G
#78
netrookiecn
closed
9 months ago
5
怎么实现deepseek-coder的lora-finetune呢
#77
YCaigogogo
opened
9 months ago
1
Random Real Users Data Leak
#76
Wontfallo
opened
9 months ago
1
How to use this in an IDE?
#75
aliakhtar
opened
9 months ago
3
Potential bug: EOS token mismatch
#74
lixinye-nju
closed
9 months ago
3
Update README.md
#73
foldl
opened
9 months ago
0
Support for gpt fast.
#72
briandw
closed
9 months ago
1
FIM doesn't work
#71
SinanAkkoyun
closed
9 months ago
15
sort all languages in README
#70
foldl
closed
8 months ago
3
Deepseek coder 33B 模型测试输出重复问题
#69
txy6666yr
opened
10 months ago
3
What is the dataset format for fine-tuning the base model for code infilling purposes only (not instructions) ?
#68
arkaprovob
opened
10 months ago
2
请问model_length设置为2048进行微调,微调后模型输入最大只能是2048了吗?
#67
Stucrehap
closed
10 months ago
0
使用deepspeed mii加速推理的时候,输出全是特殊字符,比如“(”、"{"等,该如何解决呢
#66
Stucrehap
opened
10 months ago
1
微调后的模型使用问题
#65
Horizon2022
closed
9 months ago
2
Syntax checking
#64
Bytes-Explorer
opened
10 months ago
0
有关文件依赖关系解析
#63
Grey4sh
opened
10 months ago
3
close
#62
tanklandry
closed
10 months ago
1
DeepSeek Chat
#61
SinanAkkoyun
closed
10 months ago
3
我看模型支持了amis,请问下amis的训练数据应该如何构造?
#60
MarsMeng1994
closed
5 months ago
1
Visual Studio Code Extension
#59
geosem42
closed
10 months ago
2
请问在微调模型后,如何加载微调后的模型并在测试集上评估性能?
#58
Horizon2022
closed
10 months ago
6
Update README.md
#57
BingxuanWang
closed
10 months ago
0
请问demo中的system prompt怎么使用?
#56
LiuYang328
opened
10 months ago
1
执行finetune脚本之后,未看到模型保存
#55
nstl-zyb
closed
10 months ago
5
Running finetune_deepseekcoder.py results in return code = -9 and running script directly results in RuntimeError: 'weight' must be 2-D
#54
hobpond
closed
10 months ago
5
How can i create a 13B version?
#53
erfanium
closed
10 months ago
1
Update eval_instruct.py
#52
itstalmeez
closed
10 months ago
1
显存占用过高,发现示例是fp32
#51
GeeeekExplorer
closed
10 months ago
1
tokenizer.model
#50
SinanAkkoyun
closed
10 months ago
12
生成sql
#49
xiaokai01
closed
10 months ago
1
请问可以提供多机多卡微调的脚本吗?
#48
txy6666yr
closed
10 months ago
4
代码输出和官网输出不
#47
xiaokai01
closed
10 months ago
1
Add MBPP evaluation script for deepseek-coder instruct models
#46
DejianYang
closed
10 months ago
0
Update Q&A in README.md
#45
BingxuanWang
closed
10 months ago
0
TensorRT-LLM Support
#44
anxietymonger
closed
10 months ago
5
Repo level concatenation of data
#43
Bytes-Explorer
opened
10 months ago
38
Dedup of code during data prep
#42
Bytes-Explorer
closed
5 months ago
7
For evaluating on the MBPP dataset, any code for the instruction-based model?
#41
wujwyi
closed
10 months ago
1
有api调用吗
#40
mimadiule
opened
10 months ago
1
better docs
#39
uplight-dev
opened
10 months ago
0
Sagemaker hugging face deployment issue:
#38
Al-aminI
opened
10 months ago
0
Question on the license
#37
UniverseFly
closed
10 months ago
0
训练基于多少种语言啊
#36
lionday
closed
10 months ago
0
当Prompt中带有比较多的数字时,33b-instruct模型会重复输出。
#35
wushixong
closed
10 months ago
1
6.7B的模型需要多少显存?
#34
askxiaozhang
closed
10 months ago
6
Instruction dataset?
#33
cdj0311
closed
10 months ago
4
33b需要多少显存,怎么量化加载
#32
xiaokai01
closed
10 months ago
2
finetune效果不能复现
#31
kylesong307
closed
10 months ago
1
Prompt format of chat model
#30
anxietymonger
closed
10 months ago
2
Previous
Next