Wechat ID: NeuralTalk
Covering model compression, low-bit quantization, mobile inference acceleration and optimization, and deployment.
A curated list of awesome A.I. & Embedded/Mobile-devices resources, tools and more.
Looking for contributors. Submit a pull request if you have something to add :)
Please check the contribution guidelines for info on formatting and writing pull requests.
Qualcomm Adreno GPU performance comparison.
[1606.05316] Learning Infinite-Layer Networks: Without the Kernel Trick
[1608.02893] Syntactically Informed Text Compression with Recurrent Neural Networks
[1608.05148] Full Resolution Image Compression with Recurrent Neural Networks
[1707.09422] Hyperprofile-based Computation Offloading for Mobile Edge Networks
[1707.09855] Convolution with Logarithmic Filter Groups for Efficient Shallow CNN
[1707.09597] ScanNet: A Fast and Dense Scanning Framework for Metastatic Breast Cancer Detection from Whole-Slide Images
[1604.08772] Towards Conceptual Compression
ARM-software/ComputeLibrary: The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies
mil-tokyo/webdnn: Fastest DNN Execution Framework on Web Browser
jiaxiang-wu/quantized-cnn: An efficient framework for convolutional neural networks (codebook-based weight quantization; see the sketch after this list)
naibaf7/libdnn: Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
dmlc/nnvm-fusion: Kernel Fusion and Runtime Compilation Based on NNVM
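The quantized-cnn entry above is built around quantizing layer weights with learned codebooks. Purely as an illustration of that general idea, and not the repository's actual API (the function names below are made up), here is a minimal NumPy sketch that quantizes a weight matrix with a k-means style codebook:

```python
import numpy as np

def kmeans_codebook(weights, k=16, iters=20):
    """Quantize a weight array to k centroids with a toy 1-D k-means.

    Returns (codebook, codes) so the layer can be stored as small
    integer indices plus a short float codebook instead of full floats.
    """
    flat = weights.ravel()
    # Initialise centroids from evenly spaced quantiles of the weights.
    codebook = np.quantile(flat, np.linspace(0.0, 1.0, k))
    for _ in range(iters):
        # Assign each weight to its nearest centroid, then recompute centroids.
        codes = np.argmin(np.abs(flat[:, None] - codebook[None, :]), axis=1)
        for c in range(k):
            members = flat[codes == c]
            if members.size:
                codebook[c] = members.mean()
    return codebook, codes.reshape(weights.shape)

def dequantize(codebook, codes):
    """Reconstruct an approximate weight array from codebook indices."""
    return codebook[codes]

if __name__ == "__main__":
    w = np.random.randn(64, 64).astype(np.float32)
    cb, idx = kmeans_codebook(w, k=16)
    w_hat = dequantize(cb, idx)
    print("mean abs reconstruction error:", np.abs(w - w_hat).mean())
```

With a 16-entry codebook, each weight is stored as a 4-bit index plus a shared float table, which gives roughly an 8x size reduction for that layer compared with 32-bit floats.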
Model converters. For more converters, see deep-learning-model-convertor
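As a concrete example of what such converters do, the sketch below exports a small Keras model to TensorFlow Lite for on-device inference. It assumes TensorFlow 2.x and shows only one common conversion path; it is not part of the deep-learning-model-convertor list itself.

```python
import tensorflow as tf  # assumes TensorFlow 2.x

# Build (or load) a Keras model; a tiny toy network stands in here.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(8, 3, activation="relu", input_shape=(224, 224, 3)),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Convert to a TensorFlow Lite flatbuffer suitable for mobile deployment.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]  # default weight quantization
tflite_model = converter.convert()

with open("model.tflite", "wb") as f:
    f.write(tflite_model)
```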
This part contains related courses, guides, and tutorials.
Deep learning systems: UW course schedule (focused on systems design, not learning)
Efficient Convolutional Neural Network Inference on Mobile GPUs
Tutorial on Hardware Architectures for Deep Neural Networks | MIT MICRO-50
Creating insanely fast image classifiers with MobileNet in TensorFlow | HACKERNOON (see the MobileNet sketch after this list)
Building Cross-Platform CUDA Applications with CMake | NVIDIA
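For the MobileNet article above, the sketch below shows the general idea of running a pretrained MobileNet classifier with the tf.keras API. It is not the article's exact walkthrough (the article retrains MobileNet with TensorFlow's scripts), and the image path is a placeholder.

```python
import tensorflow as tf

# Load MobileNet pretrained on ImageNet; alpha is the width multiplier
# that trades accuracy for speed on mobile hardware.
model = tf.keras.applications.MobileNet(weights="imagenet", alpha=1.0)

# Classify a single image ("cat.jpg" is a placeholder path).
img = tf.keras.preprocessing.image.load_img("cat.jpg", target_size=(224, 224))
x = tf.keras.preprocessing.image.img_to_array(img)[None, ...]
x = tf.keras.applications.mobilenet.preprocess_input(x)

preds = model.predict(x)
for _, label, score in tf.keras.applications.mobilenet.decode_predictions(preds, top=3)[0]:
    print(f"{label}: {score:.3f}")
```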