RumbleDB / rumble

⛈️ RumbleDB 1.22.0 "Pyrenean oak" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
http://rumbledb.org/
Other
213 stars 82 forks source link

Adding scripting functionality to RumbleDB #1252

Closed DavidBuzatu-Marian closed 4 months ago

DavidBuzatu-Marian commented 5 months ago

The PR introduces scripting capabilities within the RumbleDB engine. Scripting enables support for side-effects, both in query bodies and functions. Changes are backwards compatible with previous RumbleDB versions. Scripting introduces the construct of statements, which are execution units that do not return anything and execute locally. Local execution is enforced as side-effects may affect parallel computation where cross-dependencies exist between parallel jobs. With scripting, we also enabled while loops, as well as interrupts such as break loop, continue loop, and exit returning.