spark-dataframes Search Results

1000+ results
for spark-dataframes

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

section-engineering-education/engineering-education #5003

[Languages]Creating a PySpark DataFrame: A Beginner's Guide

### Topic Suggestion Creating a PySpark DataFrame: A Beginner's Guide #### Proposed article introduction We can distribute data and conduct calculations on several nodes of a cluster using Spark, a…

FranciscaNg updated 2 years ago
2
JohnSnowLabs/spark-nlp #1023

Requesting a feature to find the similarity between text col…

**Is your feature request related to a problem? Please describe.** Problem: Need to calculate the similarity between texts stored in 2 columns of the same or different dataframes For example, the…

kaniska updated 2 years ago
6
mrpowers-io/spark-fast-tests #96

bug: ignoreNullable doesn't work for nested StructTypes

For test case: ``` test("test dataFrameComparer") { val df1 = spark.createDataFrame( spark.sparkContext.emptyRDD[Row], StructType( List( StructField("neste…

mlavengood-sayari updated 2 years ago
2
sparklyr/sparklyr #2667

Failure when using dplyr's across

Sparklyr fails to parse `dplyr` syntax that uses [`across`](https://dplyr.tidyverse.org/reference/across.html) function. # Example ```r # Settings library("sparklyr", quietly = FALSE) library("…

konradzdeb updated 2 years ago
11
mlflow/mlflow #2263

[FR] Saving torchscript models

## Describe the proposal Option to save torchscript model using `torch.jit.save` instead of `torch.save` which enables the deployment toolkits to pickup the optimized torchscript model for production…

hhsecond updated 2 years ago
13
catboost/catboost #1622

[catboost4j-spark] SparkException error when spark.executor.…

Problem: With real-world Spark dataframes (e.g. 50 vector-assembled columns with real values, 130000 rows), I get this "An active CatBoost worker is already present in the current process" error when…

candalfigomoro updated 2 years ago
3
opentargets/issues #1893

Completing PySpark courses on datacamp

Completing these courses will provide the sufficient technical knowledge for the internship: - [x] [Introduction to PySpark](https://app.datacamp.com/learn/courses/introduction-to-pyspark) - [x] [Dat…

DSuveges updated 2 years ago
1
GoogleCloudDataproc/spark-bigquery-connector #165

INVALID_ARGUMENT: request failed: Row filter for table

I did a simple select using spark.read.bigquery. it works fine, the moment I do join with other table it breaks with error saying invalid filter. Below is the code snippet val objkdf = spark.read.big…

kumgaurav updated 2 years ago
6
JuliaDataCubes/EarthDataLab.jl #218

Is it possible to "generalize" this so that ESDL.jl can beco…

Sorry if this is not the right channel to ask questions.

xiaodaigh updated 2 years ago
2
AbsaOSS/spline-spark-agent #272

Duplicated attribute IDs

**UPDATE**: A temporary workaround - https://github.com/absaoss/spline-spark-agent/issues/272#issuecomment-895947366 The issue was found in and causing AbsaOSS/spline#925 See JSON sample in http…

wajda updated 2 years ago
21

上一页 1...74 75 76 77 78 79 80...100 下一页

1000+ results for spark-dataframes

1000+ results
for spark-dataframes