enso-org / enso

Enso Analytics is a self-service data prep and analysis platform designed for data teams.
https://ensoanalytics.com
Apache License 2.0
7.38k stars 323 forks source link

Multiple `executionContext/create` messages prevent app startup #9993

Open hubertp opened 6 months ago

hubertp commented 6 months ago

Discord username

No response

What type of issue is this?

Permanent – Occurring repeatably

Is this issue blocking you from using Enso?

Is this a regression?

What issue are you facing?

Somewhere between 08/05 and 17/05 a regression was introduced that leads to multiple executionContext/create requests being sent by GUI when creating a new project (likely also when opening an existing project).

See screenshot.

Expected behaviour

No failures on startup on a hello world example.

How we can reproduce it?

Try to open a new project with latest nightly.

Screenshots or screencasts

Screenshot from 2024-05-17 18-08-05

Logs

error: 
{code: 2004, message: "Invalid stack item"}

Enso Version

17/05 nightly

Browser or standalone distribution

Standalone distribution (local project)

Browser Version or standalone distribution

standalone

Operating System

Linux

Operating System Version

No response

Hardware you are using

No response

hubertp commented 6 months ago

I don't see any issues in nightly from 14/05 (no nightiles between 14th and 17th, so can't triage that one easily).

hubertp commented 6 months ago

It appears to be only(?) reproducible for less powerful machines. Still, I think we should discuss retries that GUI is making as some of the sent messages are not idempotent.

farmaazon commented 6 months ago

It appears to be only(?) reproducible for less powerful machines. Still, I think we should discuss retries that GUI is making as some of the sent messages are not idempotent.

I'm aware of the problem with retries, but honestly I haven't expected we hit it so early :)

Technically, on problems like "timeout" we have no idea what is the engine's state. One solution would be marking the execution context as "broken" and try to destroy it and recreate on every single failure in its creation or frame change. Perhaps visualization as well? To avoid unnecessary visualizations nobody listens to. Here the problem is, that even slight connectivity issue would destroy all caches.

So better solution would be to have an ability to read the current engine's state. Then, after a timeout, we could read the state first to check if last call was actually applied or not. But that requires adding new methods to engine.

hubertp commented 6 months ago

Are you sure something wasn't broken recently? I built my own nightlies and narrowed it down to somewhere between 14th and 15th, i.e. 52b8ed4d3a01711a70ef65c1f53d9a33adbe0509...c437721ba5623e6d167ea040832be08b2d84dbe7.

hubertp commented 6 months ago

I'm afraid I narrowed it down to your commit, @vitvakatu, 52b8ed4d3a01711a70ef65c1f53d9a33adbe0509. It's weird though, it looks innocent. To verify you can build IDE with 43c80da8a313f48bc7b492f37cc9c651cf7dea71 and 52b8ed4d3a01711a70ef65c1f53d9a33adbe0509 with a fixed nightly backend. The former will be fine with a simple Hello World project, the latter not.

farmaazon commented 5 months ago

Let's also see if it gets fixed by #9951 as it fixes the same project being initialized twice.

hubertp commented 5 months ago

Still seeing the problem. It's actually a combination of multiple executionContext/create and executionContext/push.

hubertp commented 5 months ago

This issue should be visible once the other PR gets merged.

hubertp commented 4 months ago

@farmaazon noticed a similar issue when trying it out with electron app and connecting to cloud: Screenshot from 2024-07-09 14-04-27

EDIT: changed my mind. Those are similar issues but not exactly the same. Filed https://github.com/enso-org/enso/issues/10495. This is reproducible when connecting to cloud project via electron app.