Ray TPU Webhook Autoscaling Support and Reliability Improvements

This PR improves the reliability of the webhook by making it stateless in between calls, fixing issues related to the sliceToWorkers mapping being cleared upon webhook restart. These changes rely on adding a k8s client to the webhook that lists the current Pods in the same namespace as the intercepted Pod. These changes remove the need to intercept Pod deletion requests. Additionally, this PR generates TPU_WORKER_HOSTNAMES when intercepting each Pod, rather than the RayCluster, supporting autoscaling RayClusters.

This PR has been tested as follows:

[x] Unit Tests
[x] Manual Tests using single-host, multi-host, and an autoscaling RayCluster with a TPU worker group added

GoogleCloudPlatform / ai-on-gke

Ray TPU Webhook Autoscaling Support and Reliability Improvements #723