Netflix / conductor

Conductor is a microservices orchestration engine.
Apache License 2.0
12.83k stars 2.34k forks source link

docker-compose failure #1725

Closed ghost closed 4 years ago

ghost commented 4 years ago

Any docker-compose fails with an exception.

Reproduction steps

node v10.21.0
npm 6.14.4
Docker version 19.03.8
docker-compose version 1.25.5
(Docker Restart and pruning of containers/images before the build)
kishorebanala commented 4 years ago

@IanSietro I see this in logs:

Could not GET 'https://jcenter.bintray.com/com/mycila/parent-pom/5/parent-pom-5.pom'. Received status code 504 from server: Gateway Time-out

Can you please verify if you're in an unrestricted network, and that you can access these files, say from your browser, without any issues.

ghost commented 4 years ago

@kishorebanala thank you for the response, I can access the files, but I cannot verify the cause - my network changed since.

Update:

Node v14.4.0
npm 6.14.5
javac 1.8.0_251 (Java 8 Update 251)
Docker version 19.03.8
docker-compose version 1.25.5

cd docker; docker-compose up now does not break, yet it still doesn't seem to work properly:


http://localhost:5000/ → Workflow Defs:

Screenshot_2020-06-09_at_09_55_52

After changing URL to http://localhost:5000/#/workflow and refreshing the page:

Screenshot 2020-06-09 at 10 02 58

Nothing runs on http://localhost:8080/ (expected Swagger).


Suspicious cd docker; docker-compose up logs:

conductor-server_1  | 0    [main] INFO  com.netflix.conductor.bootstrap.ModulesProvider  - Starting conductor server using dynomite/redis cluster.
conductor-server_1  | 8    [main] WARN  com.netflix.conductor.bootstrap.ModulesProvider  - Starting locking module using Noop Lock.
conductor-server_1  | 282  [main] INFO  com.netflix.conductor.core.execution.tasks.SystemTaskWorkerCoordinator  - Adding the queue for system task: HTTP
conductor-server_1  | 283  [main] INFO  com.netflix.conductor.contribs.http.HttpTask  - HttpTask initialized...
conductor-server_1  | 286  [main] INFO  com.netflix.conductor.core.execution.tasks.SystemTaskWorkerCoordinator  - Adding the queue for system task: KAFKA_PUBLISH
conductor-server_1  | 286  [main] INFO  com.netflix.conductor.contribs.kafka.KafkaPublishTask  - KafkaTask initialized...
conductor-server_1  | 288  [main] INFO  com.netflix.conductor.core.execution.tasks.SystemTaskWorkerCoordinator  - Adding the queue for system task: JSON_JQ_TRANSFORM
conductor-server_1  | Exception in thread "main" java.lang.NoClassDefFoundError: org/elasticsearch/client/Client
conductor-server_1  |   at com.netflix.conductor.elasticsearch.ElasticSearchModule.configure(ElasticSearchModule.java:28)
conductor-server_1  |   at com.google.inject.AbstractModule.configure(AbstractModule.java:62)
conductor-server_1  |   at com.google.inject.spi.Elements$RecordingBinder.install(Elements.java:340)
conductor-server_1  |   at com.google.inject.spi.Elements.getElements(Elements.java:110)
conductor-server_1  |   at com.google.inject.util.Modules$OverrideModule.configure(Modules.java:177)
conductor-server_1  |   at com.google.inject.AbstractModule.configure(AbstractModule.java:62)
conductor-server_1  |   at com.google.inject.spi.Elements$RecordingBinder.install(Elements.java:340)
conductor-server_1  |   at com.google.inject.spi.Elements.getElements(Elements.java:110)
conductor-server_1  |   at com.google.inject.internal.InjectorShell$Builder.build(InjectorShell.java:138)
conductor-server_1  |   at com.google.inject.internal.InternalInjectorCreator.build(InternalInjectorCreator.java:104)
conductor-server_1  |   at com.google.inject.Guice.createInjector(Guice.java:99)
conductor-server_1  |   at com.google.inject.Guice.createInjector(Guice.java:73)
conductor-server_1  |   at com.netflix.conductor.bootstrap.Main.main(Main.java:45)
conductor-server_1  | Caused by: java.lang.ClassNotFoundException: org.elasticsearch.client.Client
conductor-server_1  |   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
conductor-server_1  |   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
conductor-server_1  |   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
conductor-server_1  |   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
conductor-server_1  |   ... 13 more
conductor-ui_1      | {"name":"Conductor UI","hostname":"4438a0fae212","pid":7,"level":30,"msg":"Serving static /app/ui/dist","time":"2020-06-09T03:14:51.250Z","src":{"file":"/app/ui/dist/webpack:/src/server.js","line":29,"func":"Main.routesConfig"},"v":0}
conductor-ui_1      | {"name":"Conductor UI","hostname":"4438a0fae212","pid":7,"level":30,"msg":"Workflow UI listening at http://0.0.0.0:5000","time":"2020-06-09T03:14:51.347Z","src":{"file":"/app/ui/dist/webpack:/src/server.js","line":44},"v":0}
docker_conductor-server_1 exited with code 1

After opening http://localhost:5000/ → Workflow Defs:

conductor-ui_1      | Error: getaddrinfo ENOTFOUND conductor-server conductor-server:8080
conductor-ui_1      |     at GetAddrInfoReqWrap.onlookup [as oncomplete] (dns.js:67:26)
conductor-ui_1      | Error: getaddrinfo ENOTFOUND conductor-server conductor-server:8080
conductor-ui_1      |     at GetAddrInfoReqWrap.onlookup [as oncomplete] (dns.js:67:26)
Screenshot 2020-06-09 at 09 47 23
apanicker-nflx commented 4 years ago

Should be fixed in #1733

ghost commented 4 years ago

@apanicker-nflx please note that the issue was resolved only partially. docker-compose still "fails", please see the previous comment: https://github.com/Netflix/conductor/issues/1725#issuecomment-641004069


Swagger not running on http://localhost:8080/


http://localhost:5000/ -> Workflow Defs throws an error:

Screenshot_2020-06-30_at_20_50_28
apanicker-nflx commented 4 years ago

The suspicious logs mentioned above seem to be from an older version which has since been fixed. Please try to run against the latest version.

imcyee commented 3 years ago

I had the exact same error, but changing CONFIG_PROP=config-local.properties temporary fix the issue. Not sure what causing it.

 environment:
      # - CONFIG_PROP=config.properties
      - CONFIG_PROP=config-local.properties # using memory

if using dynomite, you have to wait for all the service to finish running. You can check with docker ps

ruchitadamodar1922 commented 3 years ago

for me swagger is running but I still encounter the same issue.

rukter commented 3 years ago

@apanicker-nflx please note that the issue was resolved only partially. docker-compose still "fails", please see the previous comment: #1725 (comment)

Swagger not running on http://localhost:8080/

http://localhost:5000/ -> Workflow Defs throws an error:

Screenshot_2020-06-30_at_20_50_28

Hi @IanSietro Can you share step for fix this issue ?