spark-columns Search Results

1000+ results
for spark-columns

apache/iceberg #10930

Specify in lower/upper bounds in data_file struct are exact

### Proposed Change There is a need to perform exploratory aggregation queries on tables with `min/max` aggregations on data columns. Currently spec for `data_file` struct specifies the following mea…

sopel39 updated 1 month ago
6
Qbeast-io/qbeast-spark #224

Unable to supersede IdentityToZeroTransformation and NullToZ…

## What went wrong? Both `IdentityToZeroTransformation` and `NullToZeroTransformation` are to handle special instances where `LinearTransformer` is used to map `Numeric` columns, but the values are e…

Jiaweihu08 updated 1 week ago
1
apache/iceberg #10274

Spark: Schema evolution is not reflected on branches

### Apache Iceberg version 1.4.3 ### Query engine Spark ### Please describe the bug 🐞 We have added to columns in a nested struct field by using Iceberg Java API. I can query and see th…

javrasya updated 4 months ago
1
microsoft/semantic-link-labs #153

Vertipaq-Analyzer version 0.7.4

First of All, thanks for your continuous support. i tested the new release and it works as expected. But i am missing the time stamp in which the vertipaq data has been exported to the lh tables so…

muhssamy updated 4 days ago
1
delta-io/delta #2580

[Feature Request][Spark] reveal generation expression for ge…

## Feature request #### Which Delta project/connector is this regarding? - [x] Spark - [ ] Standalone - [ ] Flink - [ ] Kernel - [ ] Other (fill in here) ### Overview Delta allows spec…

keen85 updated 7 months ago
8
astral-sh/ruff #7272

Pyspark Linting Rules

Apache Spark is widely used in the python ecosystem for distributed computing. As user of spark I would like for ruff to lint problematic behaviours. The automation that ruff offers is especially usef…

sbrugman updated 1 week ago
12
apache/hudi #11803

[SUPPORT] Schema evolution using DataSource and HiveSyncTool…

**Describe the problem you faced** hello i try to test several schema evolution usecases using hudi 0.15 and spark3.5 using hms 4 first test: Adding column in PG --> debezium / schema registry ok --…

Armelabdelkbir updated 4 days ago
10
sparklyr/sparklyr #3465

src_databases(sc) has argument error when connecting to data…

I am trying to connect to a databricks cluster and trying to run the exploratory command to list databases with `src_databases(sc)`. Not sure but wanted to reach out for thoughts on what could be goin…

leesahanders updated 2 days ago
1
kubeflow/spark-operator #2026

how works cache or persist with Spark Operator

#### Please describe your question here I'm using spark operator in minikube + minio to send some SQL distributed queries over CSV 2.4GB files with 8883 lines with 20000 columns each one and recove…

masalinas updated 1 month ago
1
G-Research/spark-extension #242

Error: 'JavaPackage' object is not callable

**Description** I have two PySpark dataframes, source_df and target_df. I ran `pip install pyspark-extension` to install diff. Spark Version - 3.4.1 Scala Version - 2.12 When I run `source_…

rish-shar updated 3 months ago
4
