facebookarchive / caffe2

Caffe2 is a lightweight, modular, and scalable deep learning framework.
https://caffe2.ai
Apache License 2.0
8.42k stars 1.95k forks source link

Caffe2 OpenCLv1.x/OpenCLv2.0 etc.. support for Mobile GPUs #283

Closed c00lrain closed 7 years ago

c00lrain commented 7 years ago

Congratulation open Caffe2 github. But surprisingly still no solution for Mobile GPUs which from Qualcomm, Imagination and ARM. Many Android and iOS mobile devices have GPU and ready to accelerate Tensor processing. Like a DL framework history Caffe2 also follow CUDA only? Even Intel-Altera and Xilinx FPGA offer OpenCL based HDL compiler for their FPGA embedded solution. OpenCL on the Roadmap or out of sight?

Yangqing commented 7 years ago

Thanks - cc @ajtulloch

For the Qualcomm SNPE integration that we announced, we will be rolling out the code and Qualcomm will set their official SNPE release, probably in June or July.

For iOS, we will also push out the metal gpu code soon - there are some build script / sync issues to be sorted out.

For OpenCL / HDL / FPGA - these are definitely interesting directions to pursue. Right now we don't have a definitive roadmap yet, though. Will update when new info pops up.

jquave commented 7 years ago

@Yangqing Is there a way the community can contribute to the Metal integration for iOS?

Yangqing commented 7 years ago

@jquave - thanks, this is definitely going to be valuable. Currently, as mentioned in #298 we are in the process of pushing our metal code proper as promised. After that we would definitely love to build the metal work together!

Yangqing commented 7 years ago

298 now contains the pointers for metal and snpe - cmake scripts would be the first thing calling for contribution.

ThatAIGeek commented 7 years ago

Just came to say that OpenCL support is much needed, and I'd love to see and use it as part of Caffe2!

futurely commented 7 years ago

https://community.arm.com/graphics/b/blog/posts/arm-compute-library-for-computer-vision-and-machine-learning-now-publicly-available