-
Hello,
Thanks for developing the `pbspark` library. It seems quite useful for converting on-the-wire protobuf messages into dataframes.
I have some recursive proto definitions of the form (a simplified exa…
-
Datasets and DataFrames are more efficient and should be preferred over direct RDD programming as of Spark 2.0. Build out the sparkplug-sql project to support them.
-
### Bug/Feature Request Description
In notebooks such as this one: https://github.com/Featuretools/predict-next-purchase/blob/master/Tutorial.ipynb and in the documentation: https://docs.featuretools.com/usag…
-
With the [new approach](https://github.com/GoogleCloudPlatform/openmrs-fhir-analytics/pull/178) to our query library API, the underlying runner for distributed data processing is completely separated …
-
## Description
Suggested by a first-time user:
> I imagine people using this would come from pandas or R: what would have been useful is to see a side by side "this is how you do it in pandas" vs…
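A sketch of what one such side-by-side docs entry could look like. The pandas half below actually runs; the other half is left as pseudocode comments, since the quote is truncated before naming this library's API:

```python
# Illustrative side-by-side docs snippet (the library's own syntax below
# is hypothetical pseudocode, not a real API).
import pandas as pd

pdf = pd.DataFrame({"key": ["a", "b", "a"], "value": [1, 2, 3]})

# pandas:
pandas_result = pdf.groupby("key", as_index=False)["value"].sum()

# equivalent here (pseudocode placeholder for this library):
#   df.group_by("key").agg(value=sum("value"))

assert dict(zip(pandas_result["key"], pandas_result["value"])) == {"a": 4, "b": 2}
```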
-
It would be interesting to be able to read SPSS files with Polars. Pyreadstat supports converting them to pandas. Could the same be done with Polars? Polars is a great package and would help a lot spee…
-
The Spark session created by pytest-spark is not optimized for small unit tests that only work with small dataframes.
pytest-spark seems to rely on Spark's default settings:
http…
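A minimal sketch of a workaround using pytest-spark's documented `spark_options` ini setting, dialing down a few defaults that are commonly reduced for small local test suites (the exact values are illustrative, not recommendations from the pytest-spark project):

```ini
[pytest]
spark_options =
    spark.sql.shuffle.partitions: 1
    spark.default.parallelism: 1
    spark.ui.enabled: false
```

Shipping test-friendly values like these out of the box (or documenting them) is presumably what the issue is asking for.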
-
Please find below all the output of the exercise. Step 10 is successful; step 11 fails.
The "Loading and Inspecting Parquet Files" step also fails; find the output below.
=================
Exerci…
-
- There is a bug with PySpark and pandas 1.4.x
- We need to investigate what the bug is, as it prevents us from upgrading to the pyarrow dtypes
- We should check if this bug occurs with the latest pand…
-
Spark version _:
![image](https://user-images.githubusercontent.com/22921775/171391938-cc85b808-7bdd-4bf6-b858-cab5873bd130.png)
```scala
case class RawData(
  productName: String,
  totalNumber: String…
```