OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
https://openrlhf.readthedocs.io/
Apache License 2.0

Avoid monkey patching vLLM #297

Open Atry opened 1 month ago

Atry commented 1 month ago

Currently, vLLM's vllm.worker.worker.Worker is replaced on the fly with openrlhf.trainer.ray.vllm_worker_wrap.WorkerWrap as a monkey patch.

The monkey patch could be avoided by making init_process_group and update_weight global functions and invoking them via __ray_call__.

__ray_call__ is not yet documented, but it is expected to be documented soon since it is marked as P1 in https://github.com/ray-project/ray/issues/45068.

Ray already uses __ray_call__ internally to initialize an NCCL group, which is similar to the OpenRLHF use case:

https://github.com/ray-project/ray/blob/4c1519be13087f4ccb47431f9ebc3dc446182775/python/ray/experimental/channel/torch_tensor_nccl_channel.py#L275-L282
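For illustration, here is a minimal pure-Python sketch of the pattern being proposed. It does not depend on Ray: the Worker class and its __ray_call__ method below are stand-ins that mimic the hidden actor method Ray provides (actor.__ray_call__.remote(fn, *args) runs fn with the actor instance as its first argument), and update_weight is a hypothetical global function standing in for the method currently monkey-patched onto the vLLM worker.

```python
class Worker:
    """Stand-in for vllm.worker.worker.Worker (no monkey patching needed)."""

    def __init__(self):
        self.weights = {}

    def __ray_call__(self, fn, *args, **kwargs):
        # Mimics Ray's hidden actor method: execute an arbitrary
        # function with the actor instance as the first argument.
        return fn(self, *args, **kwargs)


def update_weight(self: Worker, name, value):
    # Global function replacing the WorkerWrap method: it receives the
    # worker instance explicitly instead of being patched onto the class.
    self.weights[name] = value
    return name


worker = Worker()
worker.__ray_call__(update_weight, "layer0", 0.5)
```

With real Ray actors the call site would be worker_handle.__ray_call__.remote(update_weight, ...), so no subclass or class replacement is required on the vLLM side.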

hijkzzz commented 1 month ago

Thanks~