kruize / autotune

Autonomous Performance Tuning for Kubernetes!
Apache License 2.0
164 stars 54 forks source link

Stress test failed intermittently with converting `org.hibernate.exception.ConstraintViolationException` to JPA `PersistenceException #1012

Open chandrams opened 1 year ago

chandrams commented 1 year ago

Stress test fails intermittently with the below error messages in the kruize pod log on both minikube and openshift

Detail: Partition key of the failing row contains (interval_end_time) = (2023-10-28 23:59:59.558).
2023-10-2718:45:00.244 ERROR [qtp88397182-33][ExperimentDAOImpl.java(194)]-Converting `org.hibernate.exception.ConstraintViolationException` to JPA `PersistenceException` : could not execute statement
2023-10-2718:45:00.431 ERROR [qtp88397182-36][KruizeErrorHandler.java(66)]-{"message":"Out of a total of 100 records, 1 failed to save","httpcode":400,"documentationLink":"","status":"ERROR","data":[{"interval_start_time":"2023-10-28T23:44:59.631Z","interval_end_time":"2023-10-28T23:59:59.631Z","errors":[{"message":"Converting `org.hibernate.exception.ConstraintViolationException` to JPA `PersistenceException` : could not execute statement","httpcode":400,"documentationLink":"","status":"ERROR"}],"version":"1.0","experiment_name":"quarkus-resteasy-kruize-min-http-response-time-db-13"}]}
2023-10-2718:45:00.442 ERROR [qtp88397182-33][KruizeErrorHandler.java(66)]-{"message":"Out of a total of 100 records, 1 failed to save","httpcode":400,"documentationLink":"","status":"ERROR","data":[{"interval_start_time":"2023-10-28T23:44:59.558Z","interval_end_time":"2023-10-28T23:59:59.558Z","errors":[{"message":"Converting `org.hibernate.exception.ConstraintViolationException` to JPA `PersistenceException` : could not execute statement","httpcode":400,"documentationLink":"","status":"ERROR"}],"version":"1.0","experiment_name":"quarkus-resteasy-kruize-min-http-response-time-db-36"}]}
2023-10-2718:45:06.425 ERROR [qtp88397182-36][SqlExceptionHelper.java(138)]-ERROR: no partition of relation "kruize_results" found for row
  Detail: Partition key of the failing row contains (interval_end_time) = (2023-10-28 23:59:59.354).
2023-10-2718:45:06.426 INFO [qtp88397182-36][ExperimentDAOImpl.java(186)]-Create partition and retry !
2023-10-2718:45:06.642 ERROR [qtp88397182-36][SqlExceptionHelper.java(138)]-ERROR: no partition of relation "kruize_results" found for row
  Detail: Partition key of the failing row contains (interval_end_time) = (2023-10-28 23:59:59.354).
2023-10-2718:45:06.642 ERROR [qtp88397182-36][ExperimentDAOImpl.java(194)]-Converting `org.hibernate.exception.ConstraintViolationException` to JPA `PersistenceException` : could not execute statement
2023-10-2718:45:07.029 ERROR [qtp88397182-36][KruizeErrorHandler.java(66)]-{"message":"Out of a total of 100 records, 1 failed to save","httpcode":400,"documentationLink":"","status":"ERROR","data":[{"interval_start_time":"2023-10-28T23:44:59.354Z","interval_end_time":"2023-10-28T23:59:59.354Z","errors":[{"message":"Converting `org.hibernate.exception.ConstraintViolationException` to JPA `PersistenceException` : could not execute statement","httpcode":400,"documentationLink":"","status":"ERROR"}],"version":"1.0","experiment_name":"quarkus-resteasy-kruize-min-http-response-time-db-42"}]}

Steps to reproduce: