ProjectD-AI/llama_inference
llama inference for tencentpretrain
GNU General Public License v3.0
96 stars
11 forks
issues
Incomplete output with int8 quantization
#19
zhenglinpan
closed
1 year ago
3
How to specify load_model_path when the model is sharded
#18
caowenhero
opened
1 year ago
1
Hello, is llama 65B supported yet?
#17
zcuuu
opened
1 year ago
1
Multi-GPU inference
#16
yingzhao27
closed
1 year ago
4
When will LoRA model inference be available?
#15
isaacxie41
opened
1 year ago
1
Why does the LLaMa model have only an encoder and no decoder?
#14
yyqi17
opened
1 year ago
1
Generated output is garbled
#13
McCarrtney
opened
1 year ago
4
Code raises an error when running multi-turn dialogue
#12
LJL00000
opened
1 year ago
1
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
#11
yingzhao27
opened
1 year ago
2
fp32 precision inference
#10
biubiobiu
opened
1 year ago
3
Dev
#9
fengyh3
closed
1 year ago
0
Dev
#8
fengyh3
closed
1 year ago
0
Does llama_server.py support multi-GPU inference?
#7
yuxuan2015
opened
1 year ago
3
Request: please add a LoRA version
#6
ze00ro
opened
1 year ago
1
Dev
#5
fengyh3
closed
1 year ago
0
Dev
#4
fengyh3
closed
1 year ago
0
update generate scripts.
#3
fengyh3
closed
1 year ago
0
assert batch <= args.batch_size AssertionError
#2
baketbek
opened
1 year ago
1
Create LICENSE
#1
fengyh3
closed
1 year ago
0