mobiusml / aana_sdk

Aana SDK is a powerful framework for building AI-enabled multimodal applications.
https://www.mobiuslabs.com/
Apache License 2.0

[ENHANCEMENT] Enable CPU-only Deployment Option #117

Open movchan74 opened 3 months ago

movchan74 commented 3 months ago

Enhancement Description

The example applications in this repository currently run only on GPU. That is necessary for applications such as chat_with_video or llama2, but others, such as whisper, can run effectively on a CPU, albeit more slowly. The deployment configuration in deployments.py requires the number of GPUs (num_gpus) to be specified for every deployment, so running an application on a CPU currently means modifying the code. This enhancement proposes a way to override the num_gpus setting without altering the code.

Advantages

Allows users to run applications on either CPU or GPU without modifying the existing codebase.

Possible Implementation

With this enhancement, deployment would adapt to a wider range of hardware configurations without requiring direct code changes.
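One way the override might look is sketched below. This is only an illustration under stated assumptions, not the SDK's actual design: the `DEPLOYMENTS` dictionary is a hypothetical stand-in for whatever deployments.py really contains, and the `AANA_NUM_GPUS` environment variable and the `override_num_gpus` helper are invented names. The idea is that setting the variable to 0 would request a CPU-only run, while leaving it unset keeps the values from the code.

```python
import copy
import os

# Hypothetical stand-in for the entries in deployments.py.
# The real structure in the repository may differ.
DEPLOYMENTS = {
    "whisper": {"num_gpus": 0.25},
    "llama2": {"num_gpus": 0.5},
}


def override_num_gpus(deployments, env=None, var="AANA_NUM_GPUS"):
    """Return a copy of `deployments` with num_gpus replaced by the override.

    `var` is an assumed environment variable name. If it is not set,
    the configuration is returned unchanged (as a copy).
    """
    env = os.environ if env is None else env
    result = copy.deepcopy(deployments)  # never mutate the original config

    raw = env.get(var)
    if raw is None:
        return result

    num_gpus = float(raw)  # raises ValueError on non-numeric input
    if num_gpus < 0:
        raise ValueError(f"{var} must be non-negative, got {raw!r}")

    for config in result.values():
        config["num_gpus"] = num_gpus
    return result


if __name__ == "__main__":
    # Example: run with AANA_NUM_GPUS=0 to request CPU-only deployments.
    print(override_num_gpus(DEPLOYMENTS))
```

A single global override is the simplest option. A per-deployment variant would let GPU-bound applications such as llama2 keep their setting while whisper falls back to CPU, at the cost of a more involved naming scheme.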