spark-dataframes Search Results

1000+ results for spark-dataframes

dagster-io/dagster #7943

Runtime / dynamic asset partitions

I've seen a couple cases recently where it would be helpful to have "runtime" asset partitions. I.e. a partition is added to an asset when a job runs, rather than at definition time. ### What we've…

sryza updated 1 year ago · 11 comments
fugue-project/fugue #392

[BUG] Aggregations on Spark dataframes fail intermittently

**Minimal Code To Reproduce** ```python from fugue import DataFrame, FugueWorkflow from fugue.column import lit, col import pandas as pd def aggregate_prices( df: DataFrame, rollup:…

jstammers updated 1 year ago · 5 comments
mlflow/mlflow #8275

mlflow.tensorflow: Failed to infer model signature on Google…

### Issues Policy acknowledgement - [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md) ### Willi…

nikshingadiya updated 1 year ago · 14 comments
pydata/xarray #6807

Alternative parallel execution frameworks in xarray

### Is your feature request related to a problem? Since early on the project xarray has supported wrapping `dask.array` objects in a first-class manner. However recent work on flexible array wrapping…

TomNicholas updated 1 year ago · 12 comments
pola-rs/polars #2871

Executing an SQL query over a dataset

I would like to execute SQL queries over a dataframe or lazyframe. We expect to launch queries like: df.query("select * from df")

calavia88 updated 1 year ago · 12 comments
unionai-oss/pandera #381

Abstract out validation logic to support non-pandas datafram…

**Is your feature request related to a problem? Please describe.** Extending pandera to non-pandas dataframe-like structures is a challenge today because the schema and schema component class defin…

cosmicBboy updated 1 year ago · 16 comments
h2oai/sparkling-water #2838

Failed to Create H20Context

Followed everything here, and unable to create h2o context. https://docs.h2o.ai/sparkling-water/3.3/latest-stable/doc/rsparkling.html#install-sparklyr ### Clear libraries ```{r} # The followin…

tsengj updated 1 year ago · 3 comments
apache/sedona #218

Tune SQL join query performance and handle Exception: Number…

## Expected behavior I am doing a simple ST_Contains query to check whether points lie within the polygon. I have created two separate dataframes for points and polygon using shapefile as an input f…

SrinivasRIL updated 1 year ago · 15 comments
ploomber/ploomber #944

Proposal - Support for SQL on Flat Files (CSV and Parquet)

Hi friends! Kevin from the [Fugue Project](https://github.com/fugue-project/fugue/) here. Ploomber has a nice abstraction for [SQL tasks inside pipelines](https://docs.ploomber.io/en/latest/get…

kvnkho updated 1 year ago · 2 comments
MrPowers/mack #43

Brainstorm middle ground type of schema evolution

There is schema evolution that's highly permissive via `df.write.option("mergeSchema", "true")` and `spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "true")`. This lets you append …

MrPowers updated 1 year ago · 6 comments
