issues
search
PanZezhong1725
/
operators
算子库
7
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
SoftMax:CPU,MLU,CUDA三个平台重构
#105
xgqdut2016
opened
1 hour ago
0
fix: CpuRearrangeDescriptor
#104
bitzyz
closed
1 hour ago
0
Matmul 支持 FP32
#103
Ziminli
opened
2 hours ago
0
Rearrange CPU描述符存有用户所有指针问题
#102
PanZezhong1725
opened
2 hours ago
1
修复融合算子 bug
#101
kilinchange
opened
1 day ago
0
fix: softmax remove tensor
#100
PanZezhong1725
closed
1 day ago
0
Ascend softmax
#99
zhangyue207
closed
1 day ago
0
clean_up: delete depricated codes
#98
PanZezhong1725
closed
1 day ago
0
Ascend rearrange
#97
zhangyue207
closed
1 day ago
0
Ascend matmul
#96
zhangyue207
closed
1 day ago
0
Add Pooling
#95
Ziminli
opened
2 days ago
1
softmax:重构softmax的CPU,BANG,CUDA代码
#94
xgqdut2016
closed
2 hours ago
0
Add GEMM & Expand
#93
Ziminli
opened
5 days ago
0
random_sample_workspace
#92
xgqdut2016
opened
5 days ago
0
CI: time each script
#91
PanZezhong1725
closed
6 days ago
0
Ascend Rope & Swiglu
#90
zhangyue207
opened
6 days ago
1
Add Global Average Pool
#89
Ziminli
opened
1 week ago
0
cuda/cpu 编译加 Werror 和 Wall 选项
#88
kilinchange
opened
1 week ago
0
Ascend rope
#87
zhangyue207
closed
1 week ago
0
randomSample
#86
xgqdut2016
closed
2 weeks ago
0
Ascend rms norm
#85
zhangyue207
closed
1 day ago
0
Ascend rearrange
#84
zhangyue207
closed
4 days ago
2
Ascend matmul
#83
zhangyue207
closed
6 days ago
2
randomSample:增加topp和topk为0的特殊处理
#82
xgqdut2016
closed
2 weeks ago
0
Dev ascend softmax
#81
zhangyue207
closed
2 weeks ago
2
bangRoPE
#80
xgqdut2016
closed
2 hours ago
0
bang_add
#79
xgqdut2016
opened
2 weeks ago
1
bangRMS
#78
xgqdut2016
closed
4 days ago
0
bang_rmsnorm
#77
xgqdut2016
closed
2 weeks ago
0
Support fp32 for add operator
#76
Ziminli
closed
1 week ago
1
fix: 创建tensor descriptor时使用const形状和步长
#75
PanZezhong1725
closed
2 weeks ago
0
fix(attn): add include
#74
kilinchange
closed
2 weeks ago
0
cnnl_matmul:修复cnnl计算matmul
#73
xgqdut2016
closed
2 weeks ago
0
change matmul interface
#72
zhangyue207
closed
3 weeks ago
0
bang-add:add算子的寒武纪重构
#71
xgqdut2016
closed
2 weeks ago
0
添加 attention 融合算子
#70
kilinchange
closed
2 weeks ago
1
修复rope测试旧版pytorch不支持uint64的问题
#69
PanZezhong1725
closed
1 month ago
0
Add ReLU CPU and CUDA implementation
#68
Ziminli
closed
2 hours ago
2
assert 问题
#67
kilinchange
opened
1 month ago
0
bang_rmsnorm:rmsnorm的寒武纪平台重构
#66
xgqdut2016
closed
2 weeks ago
0
Fixed Add to correctly handle any data size
#65
Ziminli
closed
1 month ago
0
将dtype_eq替换为==
#64
PanZezhong1725
opened
1 month ago
0
device id问题
#63
xgqdut2016
opened
1 month ago
0
bangRearrange:重构寒武纪平台rearrange算子
#62
xgqdut2016
closed
3 weeks ago
0
Add conv
#61
Ziminli
closed
21 hours ago
0
添加 mlp 融合算子
#60
kilinchange
closed
2 weeks ago
1
重构rms_norm算子的cpu、cuda接口
#59
JYMiracle305
closed
1 month ago
0
bangRoPE:重构RoPE的寒武纪接口
#58
xgqdut2016
closed
2 weeks ago
0
__restrict__ 问题
#57
kilinchange
opened
1 month ago
0
handle 析构问题
#56
kilinchange
opened
1 month ago
0
Next