issues
search
deepseek-ai
/
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
https://coder.deepseek.com/
MIT License
5.99k
stars
431
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Clarification Request on Discrepancies Between Appendix B and Section 4.1 Results
#119
s-JoL
closed
4 months ago
4
eos_token_id for v1.5 model
#118
G07cha
closed
4 months ago
4
TensorRT Quantization Breaks for `LlamaLinearScalingRotaryEmbedding`
#117
Sanger2000
opened
4 months ago
0
Repository Level Code Completion format question
#116
zch-cc
closed
4 months ago
2
Regex of HASDEPENDENCY in Dependency Parsing
#115
alex8937
opened
5 months ago
1
ERROR: ImportError: cannot import name 'SyncManager' from partially initialized module 'multiprocessing.managers' (most likely due to a circular import)
#114
kokolerk
opened
5 months ago
3
预训练细节(fim)
#113
lightdf
opened
5 months ago
3
Please pass your input's `attention_mask` to obtain reliable results.
#112
metero20000
closed
2 months ago
1
微调后用代码中的evaluation做humaneval评测时报错Failed to extract code block with error `list index out of range`:
#111
mst272
closed
4 months ago
13
请问一下最新发布的7b-v1.5模型不支持中间补全吗
#110
Reve1ations
closed
4 months ago
9
Update README.md
#109
eltociear
closed
5 months ago
0
Possible generation bug?
#108
kyesniper
opened
5 months ago
2
Construction of the FIM training data
#107
shatealaboxiaowang
opened
5 months ago
4
Training loss extremely noisy during fine-tuning and randomly goes to 0
#106
zpx01
opened
5 months ago
1
Update leetcode contest evaluation
#105
DejianYang
closed
5 months ago
0
HF chat-ui Prompt Template (DeepSeek Coder 6.7B)
#104
GANJAC
opened
5 months ago
0
请问finetune脚本是全参微调么,最少需要多少显存和内存。
#103
juhengzhe
opened
5 months ago
5
inference with tensorrt_llm
#102
thanhtung901
opened
5 months ago
9
Update README.md
#101
timxx
opened
5 months ago
0
加载模型出现json错误
#100
mst272
closed
5 months ago
1
How to extended window size during train step2?
#99
jiejie1993
opened
5 months ago
1
不能把模型转化为gguf格式
#98
dotyuu
opened
5 months ago
1
The installed version of bitsandbytes was compiled without GPU support
#97
hs117
opened
5 months ago
1
Why is the size of the fine tuned model only a few hundred kb
#96
vvvvk1
closed
5 months ago
0
How to do code completion in Visual studio code?
#95
vikasd22
closed
5 months ago
1
deepseek coder能够在base模型基础上继续与训练吗?
#94
EnderWu
opened
5 months ago
2
Fix the position of add_generation_prompt
#93
pcystc
closed
5 months ago
0
Infilling怎么微调
#92
timxx
opened
5 months ago
1
Use Dilated Attention as Core mechanism instead of vanilla Attention with Llama model
#91
younesselbrag
opened
5 months ago
0
What's the pad token for deepseek-coder
#90
tonyaw
opened
6 months ago
2
Cutoff dates
#89
Naman-ntc
closed
5 months ago
4
When using the deepseek or generating content, the output contains the character � instead of expected characters.
#88
BinhMinhs10
closed
6 months ago
2
Evaluate on APPS
#87
Cheungki
opened
6 months ago
0
Evaluation on PAL-Math just read the completion files in rank 0.
#86
MingfengXue
closed
6 months ago
1
Can you augment the model with whole repo?
#85
vladimirpekez
opened
6 months ago
1
使用fim后human eval分数很低?
#84
nullxjx
opened
6 months ago
4
`apply_chat_template` not works as expected
#83
timxx
closed
6 months ago
3
评估instruct模型的代码 humaneval 除了python 其他都有问题,跑出来分都为0
#82
Nightbringers
closed
6 months ago
7
System prompt and User prompt support
#81
sandwu
opened
6 months ago
1
Instruct - Code Completion
#80
RussellCanfield
closed
4 months ago
10
how to finetune in single gpu
#79
sxsxsx
opened
6 months ago
1
how to finetune deepseek-coder-33b-instruct with 8*A800 80G
#78
netrookiecn
closed
6 months ago
5
怎么实现deepseek-coder的lora-finetune呢
#77
YCaigogogo
opened
6 months ago
1
Random Real Users Data Leak
#76
Wontfallo
opened
6 months ago
1
How to use this in an IDE?
#75
aliakhtar
opened
6 months ago
3
Potential bug: EOS token mismatch
#74
rookielxy
closed
6 months ago
3
Update README.md
#73
foldl
opened
6 months ago
0
Support for gpt fast.
#72
briandw
closed
6 months ago
1
FIM doesn't work
#71
SinanAkkoyun
closed
6 months ago
15
sort all languages in README
#70
foldl
closed
5 months ago
3
Previous
Next