issues
search
usersan
/
papers
読んだ論文のメモ置き場:主にエッジAI、高速化、FPGA実装関連など
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
SPEED: Speculative Pipelined Execution for Efficient Decoding
#50
tera1k
opened
1 year ago
2
A Survey of Techniques for Optimizing Transformer Inference
#49
tera1k
opened
1 year ago
0
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
#48
tera1k
opened
1 year ago
0
Training data-efficient image transformers & distillation through attention
#47
tera1k
opened
1 year ago
0
NPE: An FPGA-based Overlay Processor for Natural Language Processing
#46
tera1k
opened
1 year ago
0
Robust Speech Recognition via Large-Scale Weak Supervision
#45
tera1k
opened
1 year ago
0
Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization
#44
usersan
opened
1 year ago
1
RISC-VTF: RISC-V Based Extended Instruction Set for Transformer
#43
tera1k
opened
1 year ago
0
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
#42
tera1k
opened
1 year ago
0
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
#41
tera1k
opened
1 year ago
0
Rethinking Attention with Performers
#40
tera1k
opened
1 year ago
0
SqueezeLLM: Dense-and-Sparse Quantization
#39
tera1k
opened
1 year ago
0
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
#38
tera1k
opened
1 year ago
2
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
#37
tera1k
opened
1 year ago
0
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
#36
tera1k
opened
1 year ago
0
Attention Is All You Need
#35
tera1k
opened
1 year ago
1
LogicNets: Co-Designed Neural Networks and Circuits for Extreme-Throughput Applications
#34
usersan
opened
3 years ago
0
AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation
#33
usersan
opened
3 years ago
0
DiCENet: Dimension-wise Convolutions for Efficient Networks
#32
usersan
opened
4 years ago
0
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network
#31
usersan
opened
4 years ago
0
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
#30
usersan
opened
4 years ago
1
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
#29
usersan
opened
4 years ago
0
Panoptic Feature Pyramid Networks
#28
usersan
opened
4 years ago
0
Deep Learning Method for Automated Classification of Anteroposterior and Posteroanterior Chest Radiographs
#27
usersan
opened
4 years ago
0
Deep Anomaly Detection with Deviation Networks
#26
usersan
opened
4 years ago
0
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
#25
usersan
opened
4 years ago
0
YOLOv3: An Incremental Improvement
#24
usersan
opened
4 years ago
0
YOLO9000: Better, Faster, Stronger
#23
usersan
opened
4 years ago
0
You Only Look Once: Unified, Real-Time Object Detection
#22
usersan
opened
4 years ago
0
Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices
#21
usersan
opened
4 years ago
5
Tinier-YOLO: A Real-Time Object Detection Method for Constrained Environments
#20
usersan
opened
4 years ago
0
Enriching Variety of Layer-wise Learning Information by Gradient Combination
#19
usersan
opened
4 years ago
0
ThunderNet: Towards Real-time Generic Object Detection
#18
usersan
opened
4 years ago
0
CSPNet: A New Backbone that can Enhance Learning Capability of CNN
#17
usersan
opened
4 years ago
0
RTSeg: Real-time Semantic Segmentation Comparative Study
#16
usersan
opened
4 years ago
3
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
#15
usersan
opened
4 years ago
0
SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications
#14
usersan
opened
4 years ago
0
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
#13
usersan
opened
4 years ago
0
Scaling Binarized Neural Networks on Reconfigurable Logic
#12
usersan
opened
4 years ago
0
FINN-R: An End-to-End Deep-Learning Framework for Fast Exploration of Quantized Neural Networks
#11
usersan
opened
4 years ago
0
A Lightweight YOLOv2: A Binarized CNN with A Parallel Support Vector Regression for an FPGA
#10
usersan
opened
4 years ago
0
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference
#9
usersan
opened
4 years ago
0
Automated flow for compressing convolution neural networks for efficient edge-computation with FPGA
#8
usersan
opened
4 years ago
0
LUTNet: Rethinking Inference in FPGA Soft Logic
#7
usersan
opened
4 years ago
0
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
#6
usersan
opened
4 years ago
2
Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1
#5
usersan
opened
4 years ago
1
BinaryConnect: Training Deep Neural Networks with binary weights during propagations
#4
usersan
opened
4 years ago
0
Distilling the Knowledge in a Neural Network
#3
usersan
opened
4 years ago
0
Learning Efficient Object Detection Models with Knowledge Distillation
#2
usersan
opened
4 years ago
0
Knowledge Distillation for Optimization of Quantized Deep Neural Networks
#1
usersan
opened
4 years ago
0
Next