deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
MIT License
3.68k stars
156 forks
issues
Newest
Is it not possible to use a custom API in the Cursor editor?
#97
PeyFon
opened
1 week ago
1
One account created fifty API keys; a single key works fine, but using more raises an error: "An unknown error occurred during processing: Connection error"
#96
wanglu2014
opened
4 weeks ago
0
Access to Deepseek chat is blocked for no reason.
#95
Farooq87
opened
4 weeks ago
1
default temperature of this model
#94
ssk705
opened
1 month ago
0
NaN issue when loading the model in FP16
#93
zitgit
opened
1 month ago
0
Inquiry about Key/Value Storage and Matrix Merging in DeepSeek-V2 Inference Code
#92
xlim1996
opened
2 months ago
0
doc: followup #89 add client demo for using SGLang
#91
Ying1123
closed
2 months ago
1
doc: followup #89 add client demo
#90
zhyncs
closed
2 months ago
1
doc: recommend SGLang for DeepSeek V2 inference
#89
zhyncs
closed
2 months ago
1
Function calling is harder to trigger than before
#88
whoisfucker
opened
2 months ago
5
Exploring the Combined Effects of YaRN and Adjusted rope_base Values in deepseek v2
#87
hannlp
opened
2 months ago
0
docs: fix incorrect link in README.md
#86
itaowei
opened
2 months ago
0
Question about the design of bos and eos token
#85
jojo23333
opened
3 months ago
0
How to reliably trigger tool_calls with the online API
#84
wssnail
opened
3 months ago
4
ValueError: The model's max seq len (163840) is larger than the maximum number of tokens that can be stored in KV cache (13360). Try increasing gpu_memory_utilization or decreasing max_model_len when initializing the engine.
#83
ArtificialZeng
opened
3 months ago
3
empty response from server
#82
879611427
opened
3 months ago
2
fix
#81
ArtificialZeng
opened
3 months ago
0
The open-source code on HuggingFace does not appear to implement matrix merging
#80
meteorlin
opened
3 months ago
1
Does multi-turn training require a special separator, and which separator should be used?
#79
AceCHQ
opened
3 months ago
0
Dependency problem when launching the DeepSeek-V2-Lite-Chat model
#78
Malowking
opened
3 months ago
1
How to choose GPU, CPU, and memory for a self-built LLM server
#77
zhanghanting
opened
4 months ago
0
Error executing method determine_num_available_blocks: vLLM multi node fails for both DeepSeek-Coder-V2-Instruct and DeepSeek-Coder-V2-Lite-Instruct
#76
liangfang
opened
4 months ago
1
Error when loading the 0628 version
#75
bestpredicts
opened
4 months ago
0
Why does running DeepSeek-V2-Lite-Chat (SFT) on an A800 consume 60 GB of GPU memory?!
#74
juhengzhe
opened
4 months ago
3
about the active param counts of DeepSeek-V2-Lite
#73
imhmhm
opened
4 months ago
0
Why max_model_len only 8192 when inferencing with vLLM for DeepSeek-V2-Chat?
#72
ybdesire
opened
4 months ago
0
How to call DeepSeek using DSPy's methods?
#71
buchikeke
opened
4 months ago
2
What's the Prompt and Response length in the Paper?
#70
JadeRay
opened
4 months ago
0
Add support for llama.cpp
#69
techn0man1ac
opened
4 months ago
0
Request: a VS2022 extension
#68
woaidianqian
closed
4 months ago
1
How to compute embeddings with DeepSeek in LangChain?
#67
ShuoAndy
opened
4 months ago
3
Default parameters of the web interface
#66
JaheimLee
closed
4 months ago
0
Hello, is the source code available to view?
#65
Darleen71
opened
5 months ago
0
The same request returns the same error across multiple consecutive attempts
#64
gauss-clb
opened
5 months ago
0
docs: update README for LMDeploy support
#63
zhyncs
closed
4 months ago
1
Main
#62
bang78945
opened
5 months ago
0
How to optimize the prompt definition when using DeepSeek for text moderation
#61
xfghvgnfyjssjgte
opened
5 months ago
0
How to make the model stop automatically once it has finished answering
#60
hensiesp32
opened
5 months ago
0
About the 128k needle-in-a-haystack test results for DeepSeek-Coder-V2-Lite-Base
#59
chaochen99
opened
5 months ago
1
It switched to Chinese and I can't get it to change back to English
#58
james28909
opened
5 months ago
1
It won't answer questions about the events that transpired in Tiananmen Square from April 15, 1989, to June 4, 1989.
#57
richpav
opened
5 months ago
4
Is there an example of 128k inference?
#56
520jefferson
closed
1 month ago
3
Will the Deepseek platform's API call be updated to support generating multiple texts (n>1)?
#55
zchuz
opened
5 months ago
1
The role field in Chat API responses should not be set to null
#54
jichulu
opened
5 months ago
1
Hi, could you provide code like llama3's?
#53
lambda7xx
opened
5 months ago
2
Compatibility issues with the OpenAI Python client.
#52
dennymao
opened
6 months ago
2
Sensitive-word blocking issue
#51
gauss-clb
opened
6 months ago
2
Knowledge cutoff date
#50
Shadow-Alex
opened
6 months ago
0
Confusion about model deployment
#49
ylhou
opened
6 months ago
2
Drop Token
#48
Richie-yan
closed
6 months ago
2