vmware-archive / pcf-pipelines

PCF Pipelines
Apache License 2.0
158 stars 282 forks source link

Potential issue with GCP Global HTTPS load balancers #334

Open mjj209 opened 6 years ago

mjj209 commented 6 years ago

Issue

On GCP, if you use a global HTTPS load balancer, the reliability isn't as high as most customers require. This can potentially be mitigated by using a Regional TCP Load Balancer. The Toolsmiths team is currently testing this out, and they should have a good data set in another 2-3 weeks.

This applies to all versions of PCF

Context

Global HTTPS load balancers are really only a thing on GCP; AWS doesn't have them.

Expected result

99.999% reliability when making requests to the cf API

Current result

Less than 99.999% reliability, with the majority of issues being of failed_to_connect_to_backend

Steps to Reproduce

Install PCF on GCP, run smoke tests continuously.

Possible Fix

Copy the work we did here: https://github.com/pivotal-cf/toolsmiths-shared-scripts/commit/5282d70ddd919e4a8bb57df0da230b0399a4dec9

Logs and Screenshots

cf-gitbot commented 6 years ago

We have created an issue in Pivotal Tracker to manage this. Unfortunately, the Pivotal Tracker project is private so you may be unable to view the contents of the story.

The labels on this github issue will be updated when the story is started.

mjj209 commented 6 years ago

After several weeks of testing, the overall R&D recommendation is to switch to Regional TCP load balancers for the time being. I'm not sure if you want to go ahead up update the PCF-Pipelines, given that R&D may change the recommendation in the future. I'll leave this decision up to you. Would be happy to help you either way.

R&D may update the recommendation pending a test with modified version of the global load balancers, the test is going to happen in the next month, but rollout to the rest of the GCP infrastructure may be Q3 2018 or later.