-
**Is your feature request related to a problem?**
Many log sources include arrays in a log line, and to efficiently extract and analyze these data, it would be very helpful to have a function in plac…
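For context, without a dedicated function this extraction currently has to be done by hand; a minimal PySpark sketch (the log format, regex, and column names here are hypothetical) that pulls an embedded array out of a raw log line:
```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import regexp_extract, split

spark = SparkSession.builder.getOrCreate()

# Hypothetical log lines that embed an array of values in brackets.
df = spark.createDataFrame(
    [("2024-01-01 INFO ids=[1,2,3]",), ("2024-01-02 INFO ids=[4,5]",)],
    ["line"],
)

# The array currently has to be pulled out manually: extract the bracketed
# text with a regex, then split it into an array column and cast it.
ids = split(regexp_extract("line", r"ids=\[([^\]]*)\]", 1), ",")
df.select(ids.cast("array<int>").alias("ids")).show()
```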
-
`spark_read_jdbc` quotes column names with double quotes instead of backticks in the generated query. This causes the results to come back as string literals instead of the column data.
```r
sc = 1;
```
This results in the error that: …
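For illustration, a PySpark sketch of the quoting behaviour being described (not the original sparklyr call), assuming a MySQL source where double-quoted identifiers are parsed as string literals; the URL, table, and column names are placeholders, and credential/driver options are omitted:
```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Placeholder MySQL connection details.
url = "jdbc:mysql://localhost:3306/testdb"

# Without ANSI_QUOTES, MySQL parses "name" as a string literal, so a query
# built with double-quoted identifiers returns the text 'name' on every row
# instead of the column's data. Backtick-quoted identifiers behave correctly.
literal_result = (
    spark.read.format("jdbc")
    .option("url", url)
    .option("query", 'SELECT "name" FROM people')
    .load()
)

correct_result = (
    spark.read.format("jdbc")
    .option("url", url)
    .option("query", "SELECT `name` FROM people")
    .load()
)
```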
-
I have recently been using the Spark version of ydata-profiling extensively to generate analysis reports; here are some issues I've encountered:
There have already been some related issues before…
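For reference, a minimal sketch of the usage in question, assuming ydata-profiling's Spark DataFrame support (`ydata-profiling[spark]`); the data here is a placeholder:
```python
from pyspark.sql import SparkSession
from ydata_profiling import ProfileReport

spark = SparkSession.builder.getOrCreate()

# Placeholder data; the real reports are generated on much larger tables.
df = spark.createDataFrame([(1, "a"), (2, "b"), (3, None)], ["id", "value"])

# ProfileReport accepts a Spark DataFrame directly when the Spark extra
# is installed.
report = ProfileReport(df, title="Example profile")
report.to_file("report.html")
```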
-
**Describe the problem you faced**
Hello, I am trying to test several schema evolution use cases using Hudi 0.15 and Spark 3.5 with HMS 4.
First test: adding a column in PG --> Debezium / schema registry OK --…
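The full pipeline (PG, Debezium, Schema Registry, HMS 4) is not reproduced here; a minimal Spark-only sketch of the "added column" step, with hypothetical table name, keys, and path:
```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical Hudi write options; the record key, precombine field and
# path are placeholders standing in for the real CDC pipeline's settings.
hudi_opts = {
    "hoodie.table.name": "customers",
    "hoodie.datasource.write.recordkey.field": "id",
    "hoodie.datasource.write.precombine.field": "ts",
    "hoodie.datasource.write.operation": "upsert",
}

base = spark.createDataFrame([(1, 100)], ["id", "ts"])
base.write.format("hudi").options(**hudi_opts).mode("overwrite").save("/tmp/customers")

# The second batch carries a newly added column, mimicking the PG/Debezium
# "add column" case; Hudi is expected to evolve the table schema.
evolved = spark.createDataFrame([(2, 200, "alice")], ["id", "ts", "name"])
evolved.write.format("hudi").options(**hudi_opts).mode("append").save("/tmp/customers")
```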
-
### Overview
Many popular Python libraries support an API for call chaining, also known as a [fluent interface](https://en.wikipedia.org/wiki/Fluent_interface). Examples are [pandas](https://towardsd…
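For concreteness, a short pandas sketch of the chaining style being referenced (the data and column names are placeholders):
```python
import pandas as pd

df = pd.DataFrame({"city": ["a", "b", "a"], "sales": [10, 20, 30]})

# The chained ("fluent") style: each method returns a new object, so the
# steps read top-to-bottom as a single pipeline.
result = (
    df.assign(sales_k=lambda d: d["sales"] / 1000)
      .query("sales > 10")
      .groupby("city", as_index=False)
      .agg(total=("sales_k", "sum"))
      .sort_values("total", ascending=False)
)
```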
-
## Problem
When writing Spark DataFrames to data lake storage, the order of the columns in the DataFrame is important. For example, if a pipeline is appending Parquet files in the lake, if t…
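A common mitigation today, not necessarily what this issue proposes, is to force a fixed column order before every write; a minimal sketch with a placeholder path and column list:
```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# The order the downstream Parquet data is expected to have.
expected_cols = ["id", "event_time", "payload"]

df = spark.createDataFrame(
    [("x", 1, "2024-01-01")], ["payload", "id", "event_time"]
)

# Reorder explicitly so appended files always share the same layout.
(df.select(*expected_cols)
   .write.mode("append")
   .parquet("/tmp/lake/events"))
```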
-
## Overview
We propose to introduce Liquid Clustering, a new effort to revamp how clustering works in Delta, which addresses the shortcomings of Hive-style partitioning and current ZORDER clusterin…
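For orientation, a rough sketch of the clustering declaration being proposed, assuming the `CLUSTER BY` SQL surface that later shipped with Delta's liquid clustering; the table and column names are placeholders and the session is assumed to be configured with Delta Lake:
```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Clustering columns are declared once instead of being baked into a
# Hive-style partition layout or applied via manual ZORDER runs.
spark.sql("""
    CREATE TABLE events (event_id BIGINT, event_time TIMESTAMP, country STRING)
    USING delta
    CLUSTER BY (event_id)
""")

# OPTIMIZE then (re)clusters the data incrementally.
spark.sql("OPTIMIZE events")
```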
-
I convert a PySpark DataFrame to two columns: a feature column, which is a dense vector, and a label column. When I transform it to a TensorFlow dataset using `make_spark_converter`, it raised a…
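A minimal sketch of the conversion path in question, assuming petastorm's documented `make_spark_converter` API; a common workaround for vector-typed feature columns is to cast them to plain arrays with `vector_to_array` before converting (the cache directory and data are placeholders):
```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import col
from pyspark.ml.linalg import Vectors
from pyspark.ml.functions import vector_to_array
from petastorm.spark import SparkDatasetConverter, make_spark_converter

spark = SparkSession.builder.getOrCreate()
# Petastorm materializes the DataFrame as Parquet under a cache directory.
spark.conf.set(SparkDatasetConverter.PARENT_CACHE_DIR_URL_CONF,
               "file:///tmp/petastorm_cache")

df = spark.createDataFrame(
    [(Vectors.dense([1.0, 2.0]), 0), (Vectors.dense([3.0, 4.0]), 1)],
    ["features", "label"],
)

# DenseVector (VectorUDT) columns are not a plain Parquet type, so cast the
# feature column to an array before handing the DataFrame to petastorm.
flat = df.withColumn("features", vector_to_array(col("features")))

converter = make_spark_converter(flat)
with converter.make_tf_dataset() as dataset:
    for batch in dataset.take(1):
        print(batch)
```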
-
## What went wrong?
When creating a table that uses BIGINT for a date column and inserting a set of 10 rows, a `ScalaMatch` error appears.
We would need to investigate the flow for the BIGINT type. Is…
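A rough repro sketch of the scenario as described, using plain Spark SQL with a placeholder table name and format; the BIGINT "date" values are hypothetical:
```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# The date column is declared as BIGINT, as in the report; the values
# below are hypothetical integer-encoded dates.
spark.sql(
    "CREATE TABLE IF NOT EXISTS t_bigint_date (id INT, event_date BIGINT) USING parquet"
)

# Insert a set of 10 rows.
rows = ", ".join(f"({i}, {20240100 + i})" for i in range(1, 11))
spark.sql(f"INSERT INTO t_bigint_date VALUES {rows}")
```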
-
### Apache Iceberg version
1.6.1 (latest release)
### Query engine
Spark
### Please describe the bug 🐞
Hi Team,
I have set up the Hive 4 Docker images, but while reading a table from Spark SQL…
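The actual failure is cut off above; for context, a minimal sketch of the Spark SQL read path being described, assuming Iceberg's Hive-catalog configuration with a placeholder catalog name, metastore URI, and table:
```python
from pyspark.sql import SparkSession

# Placeholder catalog name and metastore URI; the Iceberg runtime jar
# matching the Spark and Iceberg versions must be on the classpath.
spark = (
    SparkSession.builder
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.hive_cat", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.hive_cat.type", "hive")
    .config("spark.sql.catalog.hive_cat.uri", "thrift://hive-metastore:9083")
    .getOrCreate()
)

# Reading through Spark SQL, which is where the reported failure occurs.
spark.sql("SELECT * FROM hive_cat.db.sample_table LIMIT 10").show()
```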