For ARM platform with NEON, enable the NEON based Powf()

jetpacapp / DeepBeliefSDK

The SDK for Jetpac's iOS Deep Belief image recognition framework

For ARM platform with NEON, enable the NEON based Powf() #27

Closed by CFAndy 9 years ago

CFAndy commented 10 years ago

The libm-based powf is expensive on the Tegra K1 (when cuBLAS is used), so this replaces it with a NEON-based implementation from math_neon. Processing time now drops to 600 ms on a Tegra at 1.7 GHz with the GPU at 400 MHz and the EMC at DDR3-1600.

CFAndy commented 9 years ago

Fixed a bug in Buffer.transpose: the free(_data) problem is resolved.