nvidia-holoscan / holohub

Central repository for applications and operators for Holoscan
Apache License 2.0
104 stars 65 forks source link

Update multiai ultrasound application for optimal performance #522

Open AndreasHeumann opened 2 days ago

AndreasHeumann commented 2 days ago

Use CUDA stream pool to enable operators to run in parallel on the GPU. Use BlockMemoryPool to avoid calls to cudaFree() which synchronize CPU and GPU execution. Execute post processing on GPU (needs Holoscan SDK 2.6).

Improves performance from 12 ms to 7.4 ms per frame.