Closed by movchan74 9 hours ago
One extra thing I would suggest for this issue is to add engine_args to the vLLM deployment config:
engine_args: CustomConfig
where CustomConfig is https://github.com/mobiusml/aana_sdk/blob/main/aana/core/models/custom_config.py
The reason is that some models, like Phi-3, require extra args that are not in the config. For example, Phi-3 needs trust_remote_code=True, but we don't have trust_remote_code in the config. There are a lot of extra options; I wouldn't add them all to the vLLM deployment config, but we can pass them through a custom dict parameter. We already do this in the HF Pipeline deployment.
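A minimal sketch of what this could look like, assuming a dataclass-style config (the class and field names here are illustrative, not the actual aana_sdk API):

```python
from dataclasses import dataclass, field
from typing import Any


# Hypothetical vLLM deployment config with a pass-through engine_args dict.
# Only a few explicit fields are kept; everything else goes through engine_args.
@dataclass
class VLLMConfig:
    model: str
    dtype: str = "auto"
    # Extra engine arguments forwarded verbatim to the vLLM engine,
    # e.g. trust_remote_code=True for Phi-3.
    engine_args: dict[str, Any] = field(default_factory=dict)

    def to_engine_kwargs(self) -> dict[str, Any]:
        # Explicit fields first; engine_args can extend or override them.
        kwargs: dict[str, Any] = {"model": self.model, "dtype": self.dtype}
        kwargs.update(self.engine_args)
        return kwargs


config = VLLMConfig(
    model="microsoft/Phi-3-mini-4k-instruct",
    engine_args={"trust_remote_code": True},
)
print(config.to_engine_kwargs())
```

The resulting kwargs dict can then be unpacked into the vLLM engine constructor, so new engine options never require changes to the deployment config schema.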
Enhancement Description
Update the vLLM version from 0.3.2 to the latest available version. This update is necessary to support the Phi-3 mini model, which is only compatible with vLLM 0.4.3 and later; the current deployment does not support this model due to the outdated vLLM version. A quick upgrade attempt was made but failed, possibly due to issues with numpy 2.0, which is not backward compatible with numpy 1.x.
Advantages
Possible Implementation
Modify the project requirements to request the latest vLLM version, and possibly pin
numpy<2
to avoid compatibility issues with the latest numpy version.
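As a sketch, assuming the project declares dependencies in a Poetry-style pyproject.toml (the exact file layout and version bounds here are assumptions, not confirmed from the repo):

```toml
[tool.poetry.dependencies]
# Phi-3 mini requires vLLM 0.4.3 or later.
vllm = ">=0.4.3"
# Pin below 2.0 until the numpy 2.x incompatibility is resolved.
numpy = "<2"
```

The numpy pin can be dropped once vLLM and the rest of the dependency tree are confirmed to work with numpy 2.x.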