Open Mengmengbai opened 1 month ago
Hi @Mengmengbai, I've added your request of a sample app to our backlog.
You'd need to implement the lightweight "coordination code" in mobile native code (e.g., Java for Android, or javascript for electron app).
This demo submits optimized on-device binaries (text encoder, unet, vae decoder) and run inference on on device via AI Hub's inference job. The coordination code for these model components (e.g., DPM scheduler) are in pytorch / python. This demo gives you a sense of the numerics and performance for each model component, excluding the coordination code.
These binaries optimized for on-device can be used directly with good performance and numerics.
Great works! Qualcomm has already supported ControlNet, but I have a confusion about how to run it on a mobile phone. As we know, there are some calculations between ControlNet and UNet, which are defined as an API by Qualcomm or do we need to implement this part ourselves? Or, is it necessary to convert both ControlNet and UNet into one model?
We hope to provide us with an Android sample for our development. We can't find relevant documents on the official website and other places.