DARPA-ASKEM / terarium

https://app.terarium.ai
Apache License 2.0

[BUG]: `Transform Dataset` operator behaves anomalously - cannot generate code or access input dataframes #5056

Open liunelson opened 1 month ago

liunelson commented 1 month ago

Describe the issue

The Transform Dataset operator has a few bugs:

  1. After just two code cells (printing `d2` and asking the LLM for the join code), the session seems to have lost access to the input dataset `d1`.

  2. Starting from a new Transform Dataset operator, the LLM-generated code no longer runs to completion: the status is stuck at either Busy or Offline.

  3. Minor issue: an unexplained code output is shown in the operator every time. It is not informative and arguably should not be shown to users at all:

    {'modules': {}, 'variables': {}, 'functions': {}}

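For the minor issue in item 3, one possible approach is to hide outputs that carry no information before rendering them. The snippet below is only a sketch: `is_empty_state` and `visible_outputs` are hypothetical helpers, not part of Terarium or Beaker, and it assumes the operator receives outputs as plain Python objects.

```python
def is_empty_state(output) -> bool:
    """Return True for a non-empty dict whose values are all empty containers.

    Hypothetical helper: matches payloads such as
    {'modules': {}, 'variables': {}, 'functions': {}}.
    """
    return (
        isinstance(output, dict)
        and len(output) > 0
        and all(
            isinstance(value, (dict, list, tuple, set)) and len(value) == 0
            for value in output.values()
        )
    )


def visible_outputs(outputs):
    """Keep only the outputs that are worth showing to the user."""
    return [output for output in outputs if not is_empty_state(output)]


# Example payloads (assumed shapes, for illustration only).
outputs = [
    {'modules': {}, 'variables': {}, 'functions': {}},
    {'modules': {}, 'variables': {'d1': 'DataFrame'}, 'functions': {}},
    "hello",
]
shown = visible_outputs(outputs)
```

Here the all-empty state dict is dropped, while the state dict that lists `d1` and the plain string are kept.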

blanchco commented 1 month ago

I asked Jataware about this. Beaker seems to just hang on this command, which may cause downstream issues: https://unchartedsoftware.slack.com/archives/C049E6PA16V/p1728054249803609

blanchco commented 1 month ago

It seems this might be an issue with Jupyter itself: we run out of memory. It might be worth looking into a solution where, if Beaker starts to hang like this, a timeout fires and we start a new session, informing the user of the issue.
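A minimal sketch of that timeout-and-restart idea, using only the Python standard library. `KernelSession` and `execute_with_timeout` are hypothetical stand-ins, not the Beaker or Jupyter API; the hang is simulated with a long sleep, and the timeout value is arbitrary.

```python
import asyncio


class KernelSession:
    """Hypothetical stand-in for a Beaker/Jupyter session (not a real API)."""

    _counter = 0

    def __init__(self):
        KernelSession._counter += 1
        self.session_id = KernelSession._counter

    async def execute(self, code: str, simulated_seconds: float) -> str:
        # A hung kernel is modelled as a sleep that outlasts the timeout.
        await asyncio.sleep(simulated_seconds)
        return f"ok (session {self.session_id})"


async def execute_with_timeout(session, code, simulated_seconds, timeout=0.1):
    """Run code; if it exceeds the timeout, replace the session and build a notice.

    Returns (session_to_use_next, result_or_None, notice_or_None).
    """
    try:
        result = await asyncio.wait_for(
            session.execute(code, simulated_seconds), timeout
        )
        return session, result, None
    except asyncio.TimeoutError:
        # Start a fresh session and tell the user what happened.
        new_session = KernelSession()
        notice = (
            f"Execution exceeded {timeout}s; the session was restarted. "
            "Variables from the previous session are no longer available."
        )
        return new_session, None, notice


async def demo():
    session = KernelSession()
    # Fast call: completes within the timeout, session is kept.
    session, result, notice = await execute_with_timeout(session, "d1.head()", 0.01)
    first = (session.session_id, result, notice)
    # Simulated hang: times out, session is replaced.
    session, result, notice = await execute_with_timeout(session, "d1.merge(d2)", 5.0)
    second = (session.session_id, result, notice)
    return first, second


first, second = asyncio.run(demo())
```

In a real deployment the restart would also need to reload the input datasets into the new session, since the old kernel state is lost.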

liunelson commented 1 month ago

What's the plan for this issue?