We encounter vtgate panic after running an online migration (adding a column). Even after vtgate reboot or downgrade (on 14 RC) the issue persisted.
The migration revert throwed and error (not sure but I think it was something similar: ... unexpected plan type (CallerID: user)). The only way to recover those vtgate was to manually rollback the migration (DROP the column.)
Reproduction Steps
I tried to reproduce the error, but could not. Even on the impacted infrastructure, after some times the migration was a success. (Even if it failed in multiple attempts).
What I did was:
running the migration on one vtgate:
SET @@ddl_strategy='vitess';
ALTER TABLE files ADD COLUMN `extension` char(64) COLLATE utf8mb4_unicode_ci DEFAULT NULL AFTER file_path;
After this, multiple (all) vtgate were crashing with the following. Live application was querying those vtgate, so the query that make vtgate to crash was not clear but we saw as a first error:
SQLSTATE[HY000] [2002] Connection refused (SQL: select * from files where name = chrisb limit 1)
Schema tracking was disabled, and running with or without updated column vschema was not resolving the issue.
Binary Version
Version: 14.0.0 (Git revision 4e6d95c08abaa5e2e47fc9243a52884e6ae829a6 branch 'heads/v14.0.0') built on Tue Jun 28 12:55:43 UTC 2022 by vitess@buildkitsandbox using go1.18.3 linux/amd64
Overview of the Issue
We encounter vtgate panic after running an online migration (adding a column). Even after vtgate reboot or downgrade (on 14 RC) the issue persisted. The migration revert throwed and error (not sure but I think it was something similar:
... unexpected plan type (CallerID: user)
). The only way to recover those vtgate was to manually rollback the migration (DROP the column.)Reproduction Steps
I tried to reproduce the error, but could not. Even on the impacted infrastructure, after some times the migration was a success. (Even if it failed in multiple attempts).
What I did was:
running the migration on one vtgate:
After this, multiple (all) vtgate were crashing with the following. Live application was querying those vtgate, so the query that make vtgate to crash was not clear but we saw as a first error:
Schema tracking was disabled, and running with or without updated column vschema was not resolving the issue.
Binary Version
Operating System and Environment details
Log Fragments
No response