spark-dataframes Search Results

1000+ results
for spark-dataframes

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

AbsaOSS/atum #28

Atum redesign

## Background Currently, Atum relies on the global state of a Spark Application. This complicates the usage of Atum for jobs that are slightly more complicated than just a pipeline of a single dataf…

yruslan updated 4 years ago
6
spotify/spark-bigquery #33

Performance tune df.saveAsBigQueryTable

Hi I am trying to load a csv zip file from google cloud into BQ, file size is 100 GB but the load is taking lot of time, is there a way to tune the df.saveAsBigQueryTable command to spe…

abhineet13 updated 7 years ago
1
unionai-oss/pandera #1540

Enhanced Validation Reporting for PySpark DataFrames in Pand…

**Is your feature request related to a problem? Please describe.** Hello, I am new to Pyspark and data engineering in general. I am looking to validate a Pyspark Dataframe given a schema. Came across…

zaheerabbas-prodigal updated 3 weeks ago
13
bpn1/ingestion #68

Spark 1.6 vs 2.0 Benchmark

test with our code & data

janehmueller updated 7 years ago
1
zouzias/spark-lucenerdd #220

Why is indexing entering a loop?

**Describe the bug** I am not sure what could be the reason , but the indexing process seems to be entering a loop for the following code until it completes successfully ``` val blockingFields…

yeikel updated 4 years ago
4
dotnet/spark #1171

[FEATURE REQUEST]: Deprecate and/or evict Microsoft.Data.Ana…

**Is your feature request related to a problem? Please describe.** I'd like to deprecate **Microsoft.Data.Analysis** from this project, or at least move it out of **Microsoft.Spark** to a distinct …

dbeavon updated 6 months ago
1
SANSA-Stack/SANSA-Stack #130

Add support for enriching R2RML TermMaps with prefixes, rang…

These extension have already been added to [r2rml model extensions](https://github.com/SmartDataAnalytics/r2rml-api-jena/blob/develop/r2rmlx-jena-api/src/main/java/org/aksw/r2rmlx/domain/api/Constrain…

Aklakan updated 3 years ago
1
pola-rs/polars #14610

Write support for Apache Iceberg

### Description I love the lazy reading for Iceberg. It's great. If Polars support writes back to an Iceberg catalog, that would make it a really powerful too working alongside the sql engines and…

randypitcherii updated 8 months ago
2
typelevel/frameless #321

The right way to convert a column ?

Hello, I am starting with Frameless and I am having a hard time converting my code based on spark-Dataframes to the Frameless framework. The blocking point I reach now is how to override a column. …

leobenkel updated 5 years ago
3
elastic/eland #332

'Requested column [0] is not in the DataFrame.'

When trying to create a Spark DataFrame from an Eland Dataframe, I get the following error : `KeyError: 'Requested column [0] is not in the DataFrame.'` I tried renaming/filtering out columns wit…

helo-ch updated 3 years ago
1

上一页 1...13 14 15 16 17 18 19...100 下一页

1000+ results for spark-dataframes

1000+ results
for spark-dataframes