outerbounds / terraform-aws-metaflow

Deploy production-grade Metaflow cloud infrastructure on AWS
https://registry.terraform.io/modules/outerbounds/metaflow/aws/latest
Apache License 2.0
58 stars 50 forks source link

minimal terraform example leads to internal server error #45

Open MarselScheer opened 1 year ago

MarselScheer commented 1 year ago

When I use the https://github.com/outerbounds/terraform-aws-metaflow/blob/master/examples/minimal/minimal_example.tf (from v.0.8.0 which is the current master) to deploy to aws (eu-central-1) and afterwards update METAFLOW_SERVICE_AUTH_KEY in the generated config.json a simple flow leads to this error

Validating your flow...
    The graph looks good!
Running pylint...
    Pylint is happy!
    Metaflow service error:
    Metadata request (/flows/TestGPUFlow) failed (code 500): {"message": "Internal server error"}

Since with_public_ip in the minimal example is not defined, I tried both true and false but i get in both cases the same error message.

Currently, I am using version 0.5.1 and there everythings seems to works (also with GPUs)

PS: the minimal example in the README still uses version 0.3.0