-
I've seen a couple of cases recently where it would be helpful to have "runtime" asset partitions, i.e. a partition that is added to an asset when a job runs rather than at definition time.
### What we've…
-
**Minimal Code To Reproduce**
```python
from fugue import DataFrame, FugueWorkflow
from fugue.column import lit, col
import pandas as pd
def aggregate_prices(
df: DataFrame,
rollup:…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Willi…
-
### Is your feature request related to a problem?
Since early on in the project, xarray has supported wrapping `dask.array` objects in a first-class manner. However, recent work on flexible array wrapping…
-
I would like to execute SQL queries over a dataframe or lazyframe.
We expect to launch queries like `df.query("select * from df")`.
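
To make the request concrete, here is a minimal sketch of the desired behaviour using only the Python standard library. It is not this library's API: `query_rows` is a hypothetical helper, the rows are plain dictionaries standing in for a dataframe, and an in-memory SQLite database is assumed as the SQL engine. The table name `df` mirrors the query above.

```python
import sqlite3
from typing import Any


def query_rows(
    rows: list[dict[str, Any]], sql: str, table: str = "df"
) -> list[dict[str, Any]]:
    """Run `sql` against `rows`, exposed as an in-memory SQLite table.

    Hypothetical helper for illustration only; a real dataframe library
    would plan the query against its own engine instead of copying data.
    """
    if not rows:
        return []
    columns = list(rows[0].keys())
    conn = sqlite3.connect(":memory:")
    try:
        # Return rows that can be converted to dictionaries by column name.
        conn.row_factory = sqlite3.Row
        col_defs = ", ".join(f'"{c}"' for c in columns)
        conn.execute(f'CREATE TABLE "{table}" ({col_defs})')
        placeholders = ", ".join("?" for _ in columns)
        conn.executemany(
            f'INSERT INTO "{table}" VALUES ({placeholders})',
            [tuple(row[c] for c in columns) for row in rows],
        )
        return [dict(r) for r in conn.execute(sql)]
    finally:
        conn.close()


rows = [{"name": "a", "price": 1.5}, {"name": "b", "price": 3.0}]
result = query_rows(rows, "select * from df where price > 2")
```

The sketch copies the data into SQLite on every call, which is fine for showing the interface but would be too slow for large or lazy frames.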
-
**Is your feature request related to a problem? Please describe.**
Extending pandera to non-pandas dataframe-like structures is a challenge today because the schema and schema component class defin…
-
I followed everything here and am unable to create an H2O context.
https://docs.h2o.ai/sparkling-water/3.3/latest-stable/doc/rsparkling.html#install-sparklyr
### Clear libraries
```{r}
# The followin…
-
## Expected behavior
I am doing a simple ST_Contains query to check whether points lie within the polygon. I have created two separate dataframes for points and polygon using shapefile as an input f…
-
Hi friends!
Kevin from the [Fugue Project](https://github.com/fugue-project/fugue/) here.
Ploomber has a nice abstraction for [SQL tasks inside pipelines](https://docs.ploomber.io/en/latest/get…
-
There is schema evolution that's highly permissive via `df.write.option("mergeSchema", "true")` and `spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "true")`. This lets you append …