eclipse-che / che

Kubernetes based Cloud Development Environments for Enterprise Teams
http://eclipse.org/che
Eclipse Public License 2.0

Workspace does not start in CHE on OCP 4.6.8 #18704

Closed artaleks9 closed 3 years ago

artaleks9 commented 3 years ago

Describe the bug

Che version

Steps to reproduce

  1. Install Che nightly on OCP 4.6.8
  2. Try to run a workspace from 'Get Started' on the dashboard

Expected behavior

Runtime

Installation method

Environment

Eclipse Che Logs


GMS: address=che-76bd5f4b7d-xgwq4-48102, cluster=RemoteSubscriptionChannel, physical address=10.128.3.70:7800

2020-12-28 04:14:22,778[ost-startStop-1] [INFO ] [o.jgroups.protocols.pbcast.GMS 125] - che-76bd5f4b7d-xgwq4-48102: no members discovered after 3206 ms: creating cluster as coordinator 2020-12-28 04:14:22,820[ost-startStop-1] [INFO ] [o.j.p.kubernetes.KUBE_PING 131] - namespace nightly-che set; clustering enabled


GMS: address=che-76bd5f4b7d-xgwq4-56896, cluster=WorkspaceLocks, physical address=10.128.3.70:7801

2020-12-28 04:14:25,879[ost-startStop-1] [INFO ] [o.jgroups.protocols.pbcast.GMS 125] - che-76bd5f4b7d-xgwq4-56896: no members discovered after 3048 ms: creating cluster as coordinator 2020-12-28 04:14:25,908[ost-startStop-1] [INFO ] [o.j.p.kubernetes.KUBE_PING 131] - namespace nightly-che set; clustering enabled


GMS: address=che-76bd5f4b7d-xgwq4-24227, cluster=WorkspaceStateCache, physical address=10.128.3.70:7802

2020-12-28 04:14:28,952[ost-startStop-1] [INFO ] [o.jgroups.protocols.pbcast.GMS 125] - che-76bd5f4b7d-xgwq4-24227: no members discovered after 3031 ms: creating cluster as coordinator 2020-12-28 04:14:29,192[ost-startStop-1] [INFO ] [o.f.c.i.d.DbSupportFactory 44] - Database: jdbc:postgresql://postgres:5432/dbche (PostgreSQL 9.6) 2020-12-28 04:14:29,242[ost-startStop-1] [INFO ] [o.f.c.i.util.VersionPrinter 44] - Flyway 4.2.0 by Boxfuse 2020-12-28 04:14:29,248[ost-startStop-1] [INFO ] [o.f.c.i.d.DbSupportFactory 44] - Database: jdbc:postgresql://postgres:5432/dbche (PostgreSQL 9.6) 2020-12-28 04:14:29,309[ost-startStop-1] [INFO ] [i.f.CustomSqlMigrationResolver 158] - Searching for SQL scripts in locations [classpath:che-schema] 2020-12-28 04:14:29,367[ost-startStop-1] [INFO ] [o.f.c.i.command.DbValidate 44] - Successfully validated 62 migrations (execution time 00:00.061s) 2020-12-28 04:14:29,387[ost-startStop-1] [INFO ] [o.f.c.i.m.MetaDataTableImpl 44] - Creating Metadata table: "public"."schema_version" 2020-12-28 04:14:29,443[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Current version of schema "public": << Empty Schema >> 2020-12-28 04:14:29,506[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.0.0.8.1 - 1init.sql 2020-12-28 04:14:29,870[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.0.0.9.1 - 1add_index_on_workspace_temporary.sql 2020-12-28 04:14:29,893[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.0.0.9.2 - 2update_local_links_in_environments.sql 2020-12-28 04:14:29,914[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.2.0.1 - 1increase_project_attributes_values_length.sql 2020-12-28 04:14:29,937[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.4.0.1 - 1drop_user_to_account_relation.sql 2020-12-28 
04:14:29,959[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.4.0.2 - 2create_missed_account_indexes.sql 2020-12-28 04:14:29,982[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.6.0.1 - 1add_exec_agent_where_terminal_agent_is_present.sql 2020-12-28 04:14:30,000[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.7.0.1 - 1__add_factory.sql 2020-12-28 04:14:30,164[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.7.0.2 - 2remove_match_policy.sql 2020-12-28 04:14:30,181[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.8.0.1 - 1add_foreigh_key_indexes.sql 2020-12-28 04:14:30,369[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.11.0.1 - 1optimize_user_search.sql 2020-12-28 04:14:30,399[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.19.0.0.1 - 0.1add_permissions.sql 2020-12-28 04:14:30,604[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.19.0.0.2 - 0.2add_resources.sql 2020-12-28 04:14:30,643[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 5.19.0.0.3 - 0.3add_organization.sql 2020-12-28 04:14:30,728[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.1 - 1__add_path_to_serverconf.sql 2020-12-28 04:14:30,738[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.2 - 2rename_agents_to_installers.sql 2020-12-28 04:14:30,762[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.3 - 3add_installer.sql 2020-12-28 04:14:30,839[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to 
version 6.0.0.3.1 - 3.1__remove_old_recipe_permissions.sql 2020-12-28 04:14:30,854[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.4 - 4remove_old_recipe.sql 2020-12-28 04:14:30,871[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.5 - 5add_machine_env.sql 2020-12-28 04:14:30,896[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.6 - 6remove_snapshots.sql 2020-12-28 04:14:30,914[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.7 - 7add_machine_volumes.sql 2020-12-28 04:14:30,942[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.8 - 8add_serverconf_attributes.sql 2020-12-28 04:14:30,985[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.9 - 9increase_externalmachine_env_value_length.sql 2020-12-28 04:14:31,001[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.10 - 10move_dockerimage_recipe_location_to_content.sql 2020-12-28 04:14:31,012[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.11 - 11increase_workspace_attributes_values_length.sql 2020-12-28 04:14:31,022[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.0.0.12 - 12remove_stack_sources.sql 2020-12-28 04:14:31,033[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.3.0.1 - 1add_fk_indexes.sql 2020-12-28 04:14:31,073[ost-startStop-1] [WARN ] [o.f.c.i.dbsupport.JdbcTemplate 48] - DB: identifier "che_index_factory_on_projects_loaded_action_value_action_entity_id" will be truncated to "che_index_factory_on_projects_loaded_action_value_action_entity" (SQL State: 42622 - Error Code: 0) 2020-12-28 
04:14:31,086[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.3.0.1.1 - 1.1add_fk_indexes.sql 2020-12-28 04:14:31,143[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.4.0.1 - 1add_workspace_expirations.sql 2020-12-28 04:14:31,168[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.4.0.2 - 2add_signature_key.sql 2020-12-28 04:14:31,221[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.4.0.3 - 3add_k8s_runtimes.sql 2020-12-28 04:14:31,323[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.10.0.1 - 1__add_workspace_cfg_attributes.sql 2020-12-28 04:14:31,345[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.10.0.2 - 2change_signature_key_pair_id.sql 2020-12-28 04:14:31,365[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.11.0.1 - 1add_signature_key_constraints.sql 2020-12-28 04:14:31,398[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.12.0.1 - 1rename_project_attributes_values_field.sql 2020-12-28 04:14:31,407[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.15.0.1 - 1remove_not_null_constraint_from_env_name_fields.sql 2020-12-28 04:14:31,427[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.15.0.2 - 2add_commands_to_k8s_runtime.sql 2020-12-28 04:14:31,474[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.16.0.1 - 1increase_workspace_config_attributes_values_length.sql 2020-12-28 04:14:31,485[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.16.0.2 - 2__create_workspace_activity_table.sql 2020-12-28 
04:14:31,550[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.16.0.3 - 3bootstrap_ws_activity_data.sql 2020-12-28 04:14:31,560[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 6.17.0.1 - 1convert_enums_to_strings.sql 2020-12-28 04:14:31,576[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.0.0.4.0.1 - 1__add_devfile.sql 2020-12-28 04:14:31,894[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.0.0.5.0.1 - 1devfile_command_reference.sql 2020-12-28 04:14:31,903[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.0.0.6.0.1 - 1add_devfile_component_prefs.sql 2020-12-28 04:14:31,927[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.0.0.7.0.1 - 1__add_registry_url_to_devfile_component.sql 2020-12-28 04:14:31,943[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.0.0.8.0.2.0.1 - 1devfile_metadata.sql 2020-12-28 04:14:31,955[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.0.0.8.0.2.0.2 - 2devfile_make_some_fields_optional.sql 2020-12-28 04:14:31,965[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.1.0.1 - 1change_devfile_component_preferences_type.sql 2020-12-28 04:14:31,974[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.1.0.1.1 - 1.1remove_stack_permissions.sql 2020-12-28 04:14:31,985[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.1.0.2 - 2remove_stacks.sql 2020-12-28 04:14:31,998[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.2.0.1 - 1remove_installers.sql 2020-12-28 04:14:32,014[ost-startStop-1] [INFO ] 
[o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.4.0.1 - 1__add_devfile_source_sparse_checkout_dir.sql 2020-12-28 04:14:32,024[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.4.0.2 - 2add_preview_url_to_devfile_command.sql 2020-12-28 04:14:32,045[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.6.0.1 - 1drop_che_workspace_expiration.sql 2020-12-28 04:14:32,060[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.10.0.1 - 1__add_devfile_plugin_editor_component_cpu_limit_request.sql 2020-12-28 04:14:32,083[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.10.0.2 - 2add_devfile_plugin_editor_component_ram_request.sql 2020-12-28 04:14:32,101[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.11.0.1 - 1update_inconsistent_stopped_workspace_activities.sql 2020-12-28 04:14:32,112[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.16.0.1 - 1add_devfile_component_automount_workspace_secrets.sql 2020-12-28 04:14:32,131[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.20.0.1 - 1userdevfile.sql 2020-12-28 04:14:32,180[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.20.0.1.1 - 1.1add_userdevfile_permissions.sql 2020-12-28 04:14:32,242[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Migrating schema "public" to version 7.21.0.1 - 1__remove_installers.sql 2020-12-28 04:14:32,251[ost-startStop-1] [INFO ] [o.f.c.i.command.DbMigrate 44] - Successfully applied 62 migrations to schema "public" (execution time 00:02.865s). 2020-12-28 04:14:33,850[ost-startStop-1] [INFO ] [o.j.p.kubernetes.KUBE_PING 131] - namespace nightly-che set; clustering enabled


GMS: address=che-76bd5f4b7d-xgwq4-39115, cluster=EclipseLinkCommandChannel, physical address=10.128.3.70:7803

2020-12-28 04:14:36,918[ost-startStop-1] [INFO ] [o.jgroups.protocols.pbcast.GMS 125] - che-76bd5f4b7d-xgwq4-39115: no members discovered after 3062 ms: creating cluster as coordinator 2020-12-28 04:14:37,015[ost-startStop-1] [INFO ] [o.e.c.a.w.s.WorkspaceRuntimes 175] - Configured factories for environments: '[openshift, kubernetes, no-environment]' 2020-12-28 04:14:37,015[ost-startStop-1] [INFO ] [o.e.c.a.w.s.WorkspaceRuntimes 176] - Registered infrastructure 'openshift' 2020-12-28 04:14:37,083[ost-startStop-1] [INFO ] [o.e.c.a.w.s.WorkspaceRuntimes 702] - Infrastructure is tracking 0 active runtimes 2020-12-28 04:14:37,134[ost-startStop-1] [INFO ] [o.e.c.a.c.u.ApiInfoLogInformer 36] - Eclipse Che Api Core: Build info '7.24.0-SNAPSHOT' scmRevision '6791ed488a7bce4044f7318eacfa584a1ecf8474' implementationVersion '7.24.0-SNAPSHOT' 2020-12-28 04:14:37,162[ost-startStop-1] [WARN ] [p.s.AdminPermissionInitializer 69] - Admin admin not found yet. 2020-12-28 04:14:37,212[ost-startStop-1] [INFO ] [o.e.c.c.metrics.MetricsServer 46] - Metrics server started at port 8087 successfully 28-Dec-2020 04:14:37.515 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployWAR Deployment of web application archive [/home/user/eclipse-che/tomcat/webapps/api.war] has finished in [27,462] ms 28-Dec-2020 04:14:37.516 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployWAR Deploying web application archive [/home/user/eclipse-che/tomcat/webapps/swagger.war] 28-Dec-2020 04:14:37.563 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployWAR Deployment of web application archive [/home/user/eclipse-che/tomcat/webapps/swagger.war] has finished in [47] ms 28-Dec-2020 04:14:37.564 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployDirectory Deploying web application directory [/home/user/eclipse-che/tomcat/webapps/dashboard] 28-Dec-2020 04:14:37.584 INFO [localhost-startStop-1] 
org.apache.jasper.servlet.TldScanner.scanJars At least one JAR was scanned for TLDs yet contained no TLDs. Enable debug logging for this logger for a complete list of JARs that were scanned but no TLDs were found in them. Skipping unneeded JARs during scanning can improve startup time and JSP compilation time. 28-Dec-2020 04:14:37.587 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployDirectory Deployment of web application directory [/home/user/eclipse-che/tomcat/webapps/dashboard] has finished in [23] ms 28-Dec-2020 04:14:37.587 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployDirectory Deploying web application directory [/home/user/eclipse-che/tomcat/webapps/workspace-loader] 28-Dec-2020 04:14:37.615 INFO [localhost-startStop-1] org.apache.catalina.startup.HostConfig.deployDirectory Deployment of web application directory [/home/user/eclipse-che/tomcat/webapps/workspace-loader] has finished in [28] ms 28-Dec-2020 04:14:37.636 INFO [main] org.apache.coyote.AbstractProtocol.start Starting ProtocolHandler ["http-nio-8080"] 28-Dec-2020 04:14:37.647 INFO [main] org.apache.catalina.startup.Catalina.start Server startup in 29977 ms 2020-12-28 07:46:23,281[io-8080-exec-10] [INFO ] [o.e.c.a.w.s.WorkspaceManager 694] - Workspace 'admin/java-web-spring-fz2i7' with id 'workspacert2mxi4q15r7ezm4' created by user 'admin' 2020-12-28 07:46:28,745[nio-8080-exec-1] [INFO ] [o.e.c.a.w.s.WorkspaceRuntimes 480] - Starting workspace 'admin/java-web-spring-fz2i7' with id 'workspacert2mxi4q15r7ezm4' by user 'admin' 2020-12-28 07:46:44,959[aceSharedPool-0] [ERROR] [e.c.w.i.k.n.p.PVCSubPathHelper 217] - Job command '[mkdir, -m, 777, -p, /tmp/job_mount/workspacert2mxi4q15r7ezm4/che-logs-che-plugin-broker/]' execution is failed. Logs: mkdir: cannot create directory '/tmp/job_mount/workspacert2mxi4q15r7ezm4': No space left on device \n 2020-12-28 07:46:51,026[/172.30.0.1/...] 
[ERROR] [o.e.c.w.i.k.w.b.DeployBroker 176] - Unrecoverable event occurred during plugin brokering for workspace 'workspacert2mxi4q15r7ezm4' startup: Failed, Error: failed to create subPath directory for volumeMount "claim-che-workspace" of container "che-plugin-metadata-broker-v3-4-0", workspacert2mxi4q15r7ezm4.che-plugin-broker 2020-12-28 07:46:51,365[aceSharedPool-0] [WARN ] [.i.k.KubernetesInternalRuntime 257] - Failed to start Kubernetes runtime of workspace workspacert2mxi4q15r7ezm4. org.eclipse.che.api.workspace.server.spi.InfrastructureException: Plugins installation process failed. Error: Unrecoverable event occurred: 'Failed', 'Error: failed to create subPath directory for volumeMount "claim-che-workspace" of container "che-plugin-metadata-broker-v3-4-0"', 'workspacert2mxi4q15r7ezm4.che-plugin-broker' at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.brokerphases.WaitBrokerResult.execute(WaitBrokerResult.java:72) at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.brokerphases.DeployBroker.execute(DeployBroker.java:129) at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.brokerphases.PrepareStorage.execute(PrepareStorage.java:76) at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.brokerphases.ListenBrokerEvents.execute(ListenBrokerEvents.java:63) at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.PluginBrokerManager.getTooling(PluginBrokerManager.java:129) at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.SidecarToolingProvisioner.provision(SidecarToolingProvisioner.java:90) at org.eclipse.che.workspace.infrastructure.kubernetes.KubernetesInternalRuntime.provisionWorkspace(KubernetesInternalRuntime.java:288) at org.eclipse.che.workspace.infrastructure.kubernetes.KubernetesInternalRuntime.internalStart(KubernetesInternalRuntime.java:204) at org.eclipse.che.api.workspace.server.spi.InternalRuntime.start(InternalRuntime.java:141) at 
org.eclipse.che.api.workspace.server.WorkspaceRuntimes$StartRuntimeTask.run(WorkspaceRuntimes.java:969) at org.eclipse.che.commons.lang.concurrent.CopyThreadLocalRunnable.run(CopyThreadLocalRunnable.java:38) at java.base/java.util.concurrent.CompletableFuture$AsyncRun.run(Unknown Source) at io.micrometer.core.instrument.internal.TimedRunnable.run(TimedRunnable.java:44) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) at org.eclipse.che.commons.observability.CountedThreadFactory.lambda$newThread$0(CountedThreadFactory.java:75) at java.base/java.lang.Thread.run(Unknown Source) Caused by: org.eclipse.che.api.workspace.server.spi.InfrastructureException: Unrecoverable event occurred: 'Failed', 'Error: failed to create subPath directory for volumeMount "claim-che-workspace" of container "che-plugin-metadata-broker-v3-4-0"', 'workspacert2mxi4q15r7ezm4.che-plugin-broker' at org.eclipse.che.workspace.infrastructure.kubernetes.wsplugins.brokerphases.DeployBroker.handleUnrecoverableEvent(DeployBroker.java:184) at org.eclipse.che.workspace.infrastructure.kubernetes.util.UnrecoverablePodEventListener.handle(UnrecoverablePodEventListener.java:44) at org.eclipse.che.workspace.infrastructure.kubernetes.namespace.KubernetesDeployments$4.lambda$eventReceived$0(KubernetesDeployments.java:585) at java.base/java.util.concurrent.ConcurrentLinkedQueue.forEachFrom(Unknown Source) at java.base/java.util.concurrent.ConcurrentLinkedQueue.forEach(Unknown Source) at org.eclipse.che.workspace.infrastructure.kubernetes.namespace.KubernetesDeployments$4.eventReceived(KubernetesDeployments.java:585) at org.eclipse.che.workspace.infrastructure.kubernetes.namespace.KubernetesDeployments$4.eventReceived(KubernetesDeployments.java:550) at io.fabric8.kubernetes.client.utils.WatcherToggle.eventReceived(WatcherToggle.java:49) at 
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager$1.onMessage(WatchConnectionManager.java:237) at okhttp3.internal.ws.RealWebSocket.onReadMessage(RealWebSocket.java:323) at okhttp3.internal.ws.WebSocketReader.readMessageFrame(WebSocketReader.java:219) at okhttp3.internal.ws.WebSocketReader.processNextFrame(WebSocketReader.java:105) at okhttp3.internal.ws.RealWebSocket.loopReader(RealWebSocket.java:274) at okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:214) at okhttp3.RealCall$AsyncCall.execute(RealCall.java:203) at okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) ... 1 common frames omitted 2020-12-28 07:46:51,475[aceSharedPool-0] [INFO ] [o.e.c.a.w.s.WorkspaceRuntimes 995] - Workspace 'admin:java-web-spring-fz2i7' with id 'workspacert2mxi4q15r7ezm4' start failed
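
The decisive entry in the log above is the ERROR from PVCSubPathHelper reporting "No space left on device"; the later stack trace is a consequence of it. As a minimal sketch (not part of Che), the following splits a run of concatenated Che server log text into entries and picks out the disk-full errors. The regular expression is an assumption derived only from the line prefix visible in this log, and the function names are hypothetical.

```python
import re

# Entry prefix as seen in the log above, for example:
# "2020-12-28 07:46:44,959[aceSharedPool-0] [ERROR] [e.c.w... 217] - message"
# This pattern is an assumption based on this one log, not a documented format.
ENTRY_START = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\[[^\]]*\]\s+\[(INFO |WARN |ERROR)\]"
)


def split_entries(text):
    """Split concatenated log text into (timestamp, level, body) tuples."""
    matches = list(ENTRY_START.finditer(text))
    entries = []
    for i, match in enumerate(matches):
        # An entry runs until the next entry prefix (or the end of the text),
        # so stack traces stay attached to the entry that produced them.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        body = text[match.end():end].strip()
        entries.append((match.group(1), match.group(2).strip(), body))
    return entries


def disk_full_errors(text):
    """Return ERROR entries whose body mentions a full device."""
    return [
        entry
        for entry in split_entries(text)
        if entry[1] == "ERROR" and "No space left on device" in entry[2]
    ]
```

Run against the log text above, `disk_full_errors` would be expected to return the 07:46:44,959 entry, which points at storage rather than at Che itself.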

skabashnyuk commented 3 years ago

Tested on RHPDS. Not reproduced.

Client Version: v4.2.0-alpha.0-657-g51011e4
Server Version: 4.6.3
Kubernetes Version: v1.19.0+9f84db3

skabashnyuk commented 3 years ago

@artaleks9 do you have any suspicious events or failing pods in the cluster?

artaleks9 commented 3 years ago

@skabashnyuk No, I don't see any suspicious events or failing pods related to this problem.

tolusha commented 3 years ago

It might be an underlying infrastructure issue.

artaleks9 commented 3 years ago

Do you have an explanation for why a workspace of the downstream project runs fine at the same time? And do we have any guarantee that, if we reinstall the cluster, the problem will not recur after some time, especially later for customers?

ocp46-ws-runs

skabashnyuk commented 3 years ago

Do you have an explanation for why a workspace of the downstream project runs fine at the same time?

No

And do we have any guarantee that, if we reinstall the cluster, the problem will not recur after some time, especially later for customers?

No

@artaleks9 is this error reproduced on other clusters?

artaleks9 commented 3 years ago

I can say this isn't reproduced on OCP 4.5.19 and minikube.

tolusha commented 3 years ago

I've just deployed Che 7.24.0-SNAPSHOT on a fresh OCP 4.6 cluster and was able to start a workspace.

Screenshot from 2020-12-30 09-33-12 Screenshot from 2020-12-30 09-33-18

skabashnyuk commented 3 years ago

I can say this isn't reproduced on OCP 4.5.19 and minikube.

@artaleks9 is this reproduced on minikube? Does it reproduce anywhere other than your specific cluster?

artaleks9 commented 3 years ago

No, it isn't reproduced on minikube. It's reproduced on a specific cluster: OCP 4.6.8 on OSP.

rhopp commented 3 years ago

I took a look and found the reason. The claim-che-workspace PVC is backed by a PV that is full. It is filled with data from other workspaces, which belong either to different Che instances on the cluster or to orphaned workspaces (maybe they were force-stopped, so the rm pod couldn't finish its job? Just guessing).

image

# du -hs * |sort -hr
378M    workspaceec8pdfbi04lsrmh1
122M    workspaces111wtwspztop2wp
122M    workspacegm9itltcbcim9d9j
96M     workspaceakpbnsw3bwn4jj6p
61M     workspaceu3cv03g3vnuofz52
61M     workspacesb0w0gv3xmj2c9ph
61M     workspacedv332ym248e8mp6e
61M     workspace8tu6ku1l8tvbh5ej
16K     lost+found
8.0K    workspacercp7c6a197ir1r27
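
For anyone who wants to script the same check, here is a rough Python sketch of what the `du -hs * | sort -hr` listing above does, plus a fullness check for the volume. It assumes the PV is mounted at a path you pass in; the function names are hypothetical. Unlike `du`, it sums apparent file sizes rather than allocated blocks, so the numbers will differ somewhat.

```python
import os
import shutil


def dir_size(path):
    """Sum the apparent sizes of regular files under path (symlinks skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            if os.path.islink(file_path):
                continue
            try:
                total += os.path.getsize(file_path)
            except OSError:
                # The file vanished or is unreadable; ignore it.
                pass
    return total


def usage_by_subdir(mount):
    """Return (directory name, bytes) pairs for mount, largest first."""
    sizes = [
        (entry.name, dir_size(entry.path))
        for entry in os.scandir(mount)
        if entry.is_dir(follow_symlinks=False)
    ]
    return sorted(sizes, key=lambda pair: pair[1], reverse=True)


def used_fraction(mount):
    """Fraction of the filesystem holding mount that is in use (0.0 to 1.0)."""
    usage = shutil.disk_usage(mount)
    return usage.used / usage.total if usage.total else 0.0
```

A value of `used_fraction` close to 1.0 on the workspace volume would match the "No space left on device" failure reported in this issue.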

@artaleks9 You can either investigate further (look for the owners of those workspaces) or, if you feel comfortable with it, delete the folders from the PV or delete the PVC.
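
Before deleting anything, it may help to see which directories have no known owner. The sketch below only reports candidates and deletes nothing. It assumes you have already collected the IDs of workspaces that still exist from every Che instance sharing the volume; how that list is obtained is left open here, and `find_orphans` is a hypothetical helper, not a Che API.

```python
def find_orphans(pv_dirs, known_workspace_ids):
    """Return workspace directories on the PV that match no known workspace ID.

    pv_dirs: directory names found at the root of the volume.
    known_workspace_ids: IDs of workspaces that still exist (assumed input,
    gathered separately from each Che instance that uses the volume).
    """
    known = set(known_workspace_ids)
    return sorted(
        name
        for name in pv_dirs
        # Only consider workspace directories; this skips entries
        # such as lost+found.
        if name.startswith("workspace") and name not in known
    )
```

For example, with the directory names from the listing above and a known-ID list containing only `workspacercp7c6a197ir1r27`, every other `workspace...` directory would be reported as a candidate, while `lost+found` would be ignored.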

artaleks9 commented 3 years ago

@rhopp, thanks for your investigation. It helped; the workspace can start now.