digitalbrain79 / darknet-nnpack

Darknet with NNPACK

raspberry pi gpu #6

Closed Grabber closed 7 years ago

Grabber commented 7 years ago

https://github.com/nineties/py-videocore

digitalbrain79 commented 7 years ago

@Grabber Do you know how to get the mAP of YOLO? I can get recall, but I cannot find how to compute mAP.
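For reference, a minimal sketch of how mAP is commonly computed, assuming each detection has already been marked as a true or false positive by matching it against ground-truth boxes at some IoU threshold. The `detection_t` struct and both function names are hypothetical and are not part of darknet. The sketch uses the all-point (area under the precision-recall envelope) variant of average precision; other benchmarks use an 11-point interpolation instead.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical record for one detection of a single class. */
typedef struct {
    float confidence;      /* detector score */
    int is_true_positive;  /* 1 if matched to a ground-truth box, else 0 */
} detection_t;

/* qsort comparator: highest confidence first. */
static int by_confidence_desc(const void *a, const void *b) {
    float ca = ((const detection_t *)a)->confidence;
    float cb = ((const detection_t *)b)->confidence;
    return (ca < cb) - (ca > cb);
}

/* Average precision for one class. num_gt is the number of
 * ground-truth boxes of that class. Sorts dets in place. */
double average_precision(detection_t *dets, int n, int num_gt) {
    assert(n >= 0 && num_gt >= 0);
    if (n == 0 || num_gt == 0) return 0.0;

    qsort(dets, (size_t)n, sizeof(detection_t), by_confidence_desc);

    double *prec = malloc((size_t)n * sizeof(double));
    double *rec = malloc((size_t)n * sizeof(double));
    assert(prec != NULL && rec != NULL);

    /* Precision and recall after each detection in ranked order. */
    int tp = 0;
    for (int i = 0; i < n; i++) {
        if (dets[i].is_true_positive) tp++;
        prec[i] = (double)tp / (double)(i + 1);
        rec[i] = (double)tp / (double)num_gt;
    }

    /* Make precision monotonically non-increasing (the envelope). */
    for (int i = n - 2; i >= 0; i--) {
        if (prec[i] < prec[i + 1]) prec[i] = prec[i + 1];
    }

    /* Sum the area under the envelope over each recall step. */
    double ap = 0.0, prev_rec = 0.0;
    for (int i = 0; i < n; i++) {
        ap += (rec[i] - prev_rec) * prec[i];
        prev_rec = rec[i];
    }

    free(prec);
    free(rec);
    return ap;
}

/* mAP is the mean of the per-class AP values. */
double mean_average_precision(const double *ap, int num_classes) {
    assert(num_classes > 0);
    double sum = 0.0;
    for (int c = 0; c < num_classes; c++) sum += ap[c];
    return sum / (double)num_classes;
}
```

Recall alone only gives the last point of this curve; mAP needs the full ranked list of detections per class, which is why the confidence scores have to be kept alongside the match results.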

shizukachan commented 6 years ago

I have integrated Idein's qmkl (https://github.com/Idein/qmkl) to accelerate the GEMM in convolutional_layer.c / gemm.c.

On a Pi Zero this may pay off, but I wasn't able to beat NNPACK-darknet's performance on the Raspberry Pi 3. Interestingly, copying the matrix buffers between system memory and GPU memory is faster than allocating them directly in GPU memory space. I presume this is because of MTRRs (or the ARM equivalent) leading to poor cache performance.
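To illustrate the copy-in/copy-out pattern described above, here is a rough sketch. It is not the actual qmkl integration: `staging_alloc` and `staging_free` are hypothetical stand-ins for a GPU-visible allocator (here they just wrap `malloc` and `free`), and `gemm_reference` is a plain CPU loop standing in for the accelerated kernel. It assumes densely packed row-major matrices and computes C += alpha * A * B.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Reference row-major GEMM: C += alpha * A * B,
 * where A is MxK, B is KxN and C is MxN. */
static void gemm_reference(int M, int N, int K, float alpha,
                           const float *A, int lda,
                           const float *B, int ldb,
                           float *C, int ldc) {
    for (int i = 0; i < M; i++) {
        for (int k = 0; k < K; k++) {
            float a = alpha * A[i * lda + k];
            for (int j = 0; j < N; j++) {
                C[i * ldc + j] += a * B[k * ldb + j];
            }
        }
    }
}

/* Hypothetical stand-ins for a GPU-visible allocator. */
static float *staging_alloc(size_t count) {
    return malloc(count * sizeof(float));
}
static void staging_free(float *p) {
    free(p);
}

/* Keep A, B and C in ordinary (cached) system memory and copy them
 * through staging buffers around the GEMM call.
 * Returns 0 on success, -1 if a staging allocation fails. */
int gemm_with_staging(int M, int N, int K, float alpha,
                      const float *A, const float *B, float *C) {
    assert(M > 0 && N > 0 && K > 0);
    size_t a_n = (size_t)M * K, b_n = (size_t)K * N, c_n = (size_t)M * N;

    float *dA = staging_alloc(a_n);
    float *dB = staging_alloc(b_n);
    float *dC = staging_alloc(c_n);
    if (!dA || !dB || !dC) {
        staging_free(dA);
        staging_free(dB);
        staging_free(dC);
        return -1;
    }

    /* Copy in: system memory -> staging buffers. */
    memcpy(dA, A, a_n * sizeof(float));
    memcpy(dB, B, b_n * sizeof(float));
    memcpy(dC, C, c_n * sizeof(float));

    gemm_reference(M, N, K, alpha, dA, K, dB, N, dC, N);

    /* Copy out: staging buffer -> system memory. */
    memcpy(C, dC, c_n * sizeof(float));

    staging_free(dA);
    staging_free(dB);
    staging_free(dC);
    return 0;
}
```

The point of the pattern is that the rest of the network keeps reading and writing ordinary cached memory, and only the GEMM itself touches the staging buffers.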