-
**Description**
I have two PySpark dataframes, `source_df` and `target_df`. I ran `pip install pyspark-extension` to install diff.
Spark Version - 3.4.1
Scala Version - 2.12
When I run `source_…
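The command itself is cut off above, but the diff transformation in spark-extension compares two dataframes row-by-row on a key and labels each row with a diff value: N (no change), C (changed), D (only in the left/source side), or I (only in the right/target side). Those semantics can be sketched in plain Python (the sample rows here are made up for illustration):

```python
def diff(source, target, key="id"):
    # Label each key: N (unchanged), C (changed), D (only in source),
    # I (only in target) -- mirroring spark-extension's diff column values.
    left = {row[key]: row for row in source}
    right = {row[key]: row for row in target}
    result = {}
    for k in left.keys() | right.keys():
        if k not in right:
            result[k] = "D"
        elif k not in left:
            result[k] = "I"
        elif left[k] == right[k]:
            result[k] = "N"
        else:
            result[k] = "C"
    return result

source_df = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}, {"id": 3, "v": "c"}]
target_df = [{"id": 1, "v": "a"}, {"id": 2, "v": "x"}, {"id": 4, "v": "d"}]
changes = diff(source_df, target_df)
# changes == {1: "N", 2: "C", 3: "D", 4: "I"}
```

In PySpark itself the call would be along the lines of `source_df.diff(target_df, "id")` after importing the diff extension, if memory serves; check the spark-extension README for the exact import path for your Spark/Scala version.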
-
Dear @cryeo,
I really like your library, as it makes it possible to integrate SQL syntax directly into cells; that's a nice piece of work!
However I would like to hear from you what's the best way …
-
Users often ask about the limitations of KDF when handling large dataframes.
The User Guide should contain some recommendations and code snippets to improve the user path here:
- some benchmarks on real-w…
-
### What motivated this proposal?
Is there a way to micro-batch the actual NATS streams? Let's say I just want to try and pull 20,000 messages, if there are that many, or whatever is in the queue. …
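Independent of NATS specifics, the pull pattern being asked for can be sketched in plain Python; the deque below is a hypothetical stand-in for a subscription's pending messages (drain up to N per call, return fewer when that's all there is):

```python
from collections import deque

def fetch_batch(pending, batch_size=20_000):
    # Drain up to batch_size messages; return a short batch if fewer are queued.
    batch = []
    while pending and len(batch) < batch_size:
        batch.append(pending.popleft())
    return batch

# 45,000 fake messages -> expect batches of 20k, 20k, 5k.
pending = deque(range(45_000))
sizes = []
while pending:
    sizes.append(len(fetch_batch(pending, batch_size=20_000)))
# sizes == [20000, 20000, 5000]
```

With NATS JetStream, a pull consumer exposes a similar batch-oriented fetch; the snippet above only shows the batching logic, not the client API.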
-
I'm really interested in using Spark and would love to be able to interact with it using Ruby. This gem looks like a great option. It doesn't look like it would natively support Spark dataframes, righ…
-
The goal is to work with really large datasets and extract the results of large queries into a Spark dataframe; this will allow us to work with pqs and Spark to do large-scale feature transformation a…
-
When I write two BigQuery dataframes in a for loop using the function saveAsBigQueryTable(projectid+schemaname+tablename), it gives an error when writing the table in BigQuery: Conflic…
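One thing worth checking (an assumption, since the full error message is cut off): `projectid+schemaname+tablename` concatenates the three identifiers with no separators, so two different (project, schema, table) triples can collapse to the same string, and successive loop iterations may end up targeting the same table name. A fully qualified BigQuery table reference needs explicit separators:

```python
def unqualified(project, schema, table):
    # Plain + concatenation runs the identifiers together.
    return project + schema + table

def qualified(project, schema, table):
    # Explicit dots keep the reference unambiguous.
    return f"{project}.{schema}.{table}"

# Two different tables that collide without separators...
assert unqualified("proj", "salesdata", "q1") == unqualified("projsales", "data", "q1")
# ...but stay distinct once separators are added.
assert qualified("proj", "salesdata", "q1") != qualified("projsales", "data", "q1")
```

The identifier names here are hypothetical; the point is only that the string passed to the save call should contain separators between project, schema, and table.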
-
That would be neat. I searched around but didn't find what I was looking for. Any help appreciated!
-
from petastorm.spark import SparkDatasetConverter, make_spark_converter
# specify a cache dir first.
# the dir is used to save materialized spark dataframe files
spark.conf.set(SparkDatasetCon…
-
Some of the currently implemented caching solutions in Spark, namely `CachedWebCrawlerJob` and `PatentMetadataRetrieverJob`, rely on RDDs, while we could take advantage of the full potential of …