Closed by justinvyu 1 year ago
Another failure (480s > 450s): https://console.anyscale-staging.com/o/anyscale-internal/jobs/prodjob_pw8rwtedwq9xwnisjqgzpbxx2x?pjd-section=last-log
@bveeramani I think this is still failing. Can you take a look?
The last 4 runs have been successful
This seems to be a performance regression related to spilling to disk?
Timing out job (2/21/23): https://console.anyscale-staging.com/o/anyscale-internal/jobs/prodjob_nltqmbvtuhjbiuu3senydsbedp?pjd-section=last-log
Succeeding job (2/22/23): https://console.anyscale-staging.com/o/anyscale-internal/jobs/prodjob_8ur7wqygilmzhdfw5idsec888u
See error logs from (2/21/23):
The failing run:
- read_parquet: 00:08 + 00:02 ~= 00:10 = 10 seconds
- batch_predictor.predict(...): 02:10 = 130 seconds

Past failures
This test has failed in the past for similar reasons (batch prediction taking a long time). Was that ever resolved?
Buildkite for one of these past failures: https://buildkite.com/ray-project/release-tests-branch/builds/1314#0186042a-ebf4-4f7c-b5c0-ddc950c23766