-
Hi,
this is the error I get when I run `clusters = linker.cluster_pairwise_predictions_at_threshold(df_predict, threshold_match_probability=0.95)`:
```
----------------------------------------…
```
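For context, a hedged sketch of the call sequence around this method in the Splink v3 API; the linker construction and settings are not shown in the excerpt and are assumed here:

```python
# `linker` is assumed to be an already-configured Splink linker; its setup
# (input data, settings dict) is not visible in the excerpt.
df_predict = linker.predict()

# The failing call from the report, for reference.
clusters = linker.cluster_pairwise_predictions_at_threshold(
    df_predict, threshold_match_probability=0.95
)
```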
-
I wish I could join a large cuDF DataFrame with a small series/list/sequence as a SQL-style full join, or, even better, have the small series/list broadcast for the full join as in Spark SQL, while th…
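As a stopgap, a minimal sketch of how one might express this in cuDF today, assuming the small sequence shares a join key with the large frame; all names and data here are hypothetical, and `how="outer"` is cuDF's equivalent of SQL's FULL JOIN:

```python
import cudf

# Hypothetical large frame with a join key.
big = cudf.DataFrame({"key": [1, 2, 3, 5], "value": [10.0, 20.0, 30.0, 50.0]})

# Lift the small Python list into a one-column frame so it can take part in
# the join (cuDF joins work on DataFrames/Series, not raw lists).
small = cudf.DataFrame({"key": [2, 3, 4]})

# how="outer" performs a full (outer) join, keeping unmatched rows from both sides.
full = big.merge(small, on="key", how="outer")
print(full.sort_values("key"))
```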
-
When working on aggregation filters, I had an example,
```
testAggregateFilterOneCount : List Antique -> List { product : Product, vintage : Float, all : Float }
testAggregateFilterOneCount antique…
```
-
### Discussed in https://github.com/delta-io/delta-rs/discussions/599
Originally posted by **ganesh-gawande** May 9, 2022
Hi,
I am using the documentation - https://github.com/delta-io/de…
-
**Describe the problem you faced**
**Scenario #1:**
1) Created a DataFrame (**targetDf**) and used the below statement to write it to a GCS bucket location (for example, **locA**):
targetDF.write.forma…
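For reference, a hedged sketch of what the truncated write statement might look like; the `hudi` format is only an assumption based on the issue template, and the table name, record key, and bucket path are placeholders:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("gcs-write-sketch").getOrCreate()
targetDF = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "value"])

# Hypothetical completion of `targetDF.write.forma…`; format and options assumed.
(targetDF.write.format("hudi")
    .option("hoodie.table.name", "target_table")
    .option("hoodie.datasource.write.recordkey.field", "id")
    .mode("overwrite")
    .save("gs://my-bucket/locA"))
```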
-
UPDATE: closed in favor of https://github.com/dbt-labs/dbt-redshift/issues/204
### Is this your first time submitting a feature request?
- [X] I have read the [expectations for open source contr…
-
### Proposed change
I came across this problem with PySpark.
When I call `foo.show()`, if the `foo` DataFrame contains too many columns, the result won't be printed in a single row in a Jupyter noteb…
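For what it's worth, two common workarounds, sketched under the assumption that `foo` is a wide PySpark DataFrame (neither is the proposed change itself):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()
# A deliberately wide frame standing in for `foo` from the report.
foo = spark.range(3).selectExpr(*[f"id as col{i}" for i in range(30)])

# 1) Print rows vertically, one column per line, so wide rows don't wrap.
foo.show(n=5, truncate=False, vertical=True)

# 2) Go through pandas; Jupyter renders pandas DataFrames as scrollable HTML.
foo.limit(20).toPandas()
```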
-
It would be nice if there were a command to connect to an existing Livy session.
For example, connecting to the Livy session with `id=4` and `kind=pyspark` and naming it `pyspark-test`:
`%spark connect …
-
Issue writing to AWS S3 via the aws-java-sdk in a Spark context
## Describe the bug
For a given DataFrame df in a PySpark env, the operation `df.write.parquet("s3a://some-bucket/test.parquet")` star…
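A minimal repro sketch of the operation above; the `fs.s3a.*` keys are standard hadoop-aws settings, while the credentials and bucket are placeholders:

```python
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("s3a-write-repro")
    # Standard hadoop-aws settings; values here are placeholders.
    .config("spark.hadoop.fs.s3a.access.key", "<ACCESS_KEY>")
    .config("spark.hadoop.fs.s3a.secret.key", "<SECRET_KEY>")
    .getOrCreate()
)

df = spark.range(10)
df.write.parquet("s3a://some-bucket/test.parquet")
```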
-
Hi,
Thanks for this awesome lib! I'm looking for some guidance on an issue I'm having.
I'm trying to compare two dataframes for equality. It's not a requirement to know what's different, jus…
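Since the excerpt doesn't show which library this is about, here is a hedged baseline in plain pandas for an order-insensitive, boolean-only equality check; all names are illustrative:

```python
import pandas as pd

def frames_equal(a: pd.DataFrame, b: pd.DataFrame) -> bool:
    """True if both frames hold the same rows, ignoring row and column order."""
    if sorted(a.columns) != sorted(b.columns):
        return False
    cols = sorted(a.columns)
    a_norm = a[cols].sort_values(cols).reset_index(drop=True)
    b_norm = b[cols].sort_values(cols).reset_index(drop=True)
    return a_norm.equals(b_norm)

left = pd.DataFrame({"x": [1, 2], "y": ["a", "b"]})
right = pd.DataFrame({"y": ["b", "a"], "x": [2, 1]})
print(frames_equal(left, right))  # True
```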