issues
search
fengggli
/
gpu-computing-materials
A simple deep learning framework that optimizes task scheduling and memory usage on different CPU/GPU architectures.
1
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
add topology-aware apis and benchmarks
#62
fengggli
closed
4 years ago
0
network architectures
#61
fengggli
opened
4 years ago
0
hybrid parallelism and its stragegy
#60
fengggli
opened
4 years ago
5
setup and experiment and analysis in KNL
#59
fengggli
closed
4 years ago
0
server information
#58
fengggli
opened
4 years ago
2
perf tuning
#57
fengggli
closed
4 years ago
5
Refactorized
#56
fengggli
closed
5 years ago
0
14-layer resnet, worker threads + utils improve
#55
fengggli
closed
5 years ago
0
Adding numa-aware work threads
#54
fengggli
closed
4 years ago
8
unmerged code from last semester.
#53
fengggli
closed
5 years ago
0
Test speed 1 32
#52
zkSNARK
closed
5 years ago
0
downsampling dimension mismatch
#51
fengggli
closed
4 years ago
5
cudnn implementation
#50
qoofyk
closed
5 years ago
0
[conv layer selector and caffe' per-image convolution]:
#49
fengggli
closed
5 years ago
4
device convolution experiments
#48
fengggli
closed
4 years ago
0
Forward gpu
#47
zkSNARK
opened
5 years ago
6
Backward gpu
#46
zkSNARK
closed
5 years ago
0
Final Presentation
#45
fengggli
closed
5 years ago
0
Forward gpu integration
#44
zkSNARK
closed
5 years ago
1
Tmp broken cublas test
#43
zkSNARK
closed
5 years ago
0
Tmp broken cublas test
#42
zkSNARK
closed
5 years ago
0
[nnpack and float32 type]
#41
fengggli
closed
5 years ago
3
Col2im inner dev2
#40
zkSNARK
closed
5 years ago
0
figure out mapping from 6D to 1D to 2D in col2im_inner
#39
zkSNARK
closed
5 years ago
0
[resnet]: improve resnet
#38
fengggli
closed
5 years ago
3
[simple resnet]: add prototype implementation
#37
fengggli
closed
5 years ago
3
adjustments to correct the primary index in 1D loop
#36
zkSNARK
closed
5 years ago
0
Im2col inner dev
#35
zkSNARK
closed
5 years ago
1
Tpose1230 dev
#34
zkSNARK
closed
5 years ago
0
[fixed compiler warning]
#33
fengggli
closed
5 years ago
0
Remove pad dev
#32
zkSNARK
closed
5 years ago
0
Padding dev
#31
zkSNARK
closed
5 years ago
0
Tpose 3012 dev
#30
zkSNARK
closed
5 years ago
0
meeting april 10 2019
#29
zkSNARK
closed
5 years ago
0
[ResNet]: status tracker
#28
fengggli
closed
4 years ago
1
[result reporting and corrected mlp]
#27
fengggli
closed
5 years ago
1
[bug]: fc.weight.diff perioically set to 0
#26
fengggli
closed
5 years ago
3
Backward
#25
zkSNARK
closed
5 years ago
0
[Data]: Data utils for cifar10.
#24
fengggli
closed
5 years ago
0
Col2im
#23
zkSNARK
closed
5 years ago
0
Tpose 1230 2
#22
zkSNARK
closed
5 years ago
0
[cuda-pool]: an demo of device pooling layer
#21
fengggli
closed
5 years ago
0
Backward
#20
zkSNARK
closed
5 years ago
0
[net-mlp]: finish net-mlp. forward/backward both checked, add .clangformat
#19
fengggli
closed
5 years ago
1
[net-mlp]: add mlp net with forward/backward
#18
fengggli
closed
5 years ago
0
adds python examples and 2 tensor ops
#17
zkSNARK
closed
5 years ago
2
[pooling]: fix the bug: dx is not intialized, now renable the test
#16
fengggli
closed
5 years ago
0
[conv layer] add more test
#15
fengggli
closed
5 years ago
0
[pooling layer]: add forward and backward
#14
fengggli
closed
5 years ago
0
Convolution kernel implementation
#13
fengggli
closed
5 years ago
2
Next