pyspark Search Results - Githubissues

1000+ results
for pyspark

nightscape/spark-excel #856

Extract sheet names using pyspark

### Am I using the newest version of the library? - [X] I have made sure that I'm using the latest version of the library. ### Is there an existing issue for this? - [X] I have searched the existin…

Krukosz updated 2 months ago
5
apache/iceberg #11477

Error while connecting to REST catalog using Spark

### Apache Iceberg version 1.4.3 ### Query engine Spark ### Please describe the bug 🐞 Spark config and code - ``` iceberg_rest = { "spark.sql.extensions": "org.apache.iceberg.spa…

Gowthami03B updated 2 weeks ago
1
apache/sedona #1685

Issue on the documentation steps on how to configure Sedona …

## Expected behavior I want to use Apache Sedona in pyspark in an AWS Glue environment. ## Actual behavior The Sedona library does not work when following the steps described in the doc: https:…

MDiakhate12 updated 16 minutes ago
8
ClickHouse/clickhouse-java #976

Pyspark java.io.IOException: Reached end of input stream

Hi! When using pyspark to read and save a query with a limit of 1 million rows, everything works fine. However, when I try to raise the limit to 10 million rows, for example, I get this error on the same quer…

1pyxa1 updated 2 weeks ago
3
abhishekrk/test #1

pyspark

[seniorityMapping&seniorityLevelDefinitions_V8.xlsx](https://github.com/abhishekrk/test/files/1237287/seniorityMapping.seniorityLevelDefinitions_V8.xlsx) [seniorityMapping&seniorityLevelDefinitions…

abhishekrk updated 7 years ago
1
catboost/catboost #1585

Can't import catboost_spark by specifying "spark.jars" conf…

I am trying to run catboost on pyspark but the box I am running the code in does not have internet so I cannot use ```spark.jars.packages``` config so I downloaded the jar file (catboost-spark_2.12-…

tgamal updated 2 weeks ago
6
vmware-archive/gpdb-sandbox-tutorials #18

Pyspark tutorial?

Hi, I have gone through the tutorial and would like to try pyspark on hdfs. I notice pyspark is pre-installed (2.0.x). But it doesn't support the pre-installed python (version 2.6.6). To make it work,…

zhang2jg updated 7 years ago
4
sdv-dev/SDV #1641

Support for pyspark

### Problem Description ### Expected behavior ### Additional context

sm823zw updated 8 months ago
2
kubeflow/pipelines #11099

[bug] Cannot run sql commands from pyspark

My team wants to use py-spark in Kubeflow pipeline nodes. This py-spark pipeline node is communicating with a completely independent MinIO instance and runs ANSI SQL commands to it. When we create…

gioargyr updated 1 month ago
1
sdv-dev/SDV #573

Pyspark backend option?

### Problem Description SDV is AWESOME! And one of the very few players in this space to be able to handle multi-tables. However, it is quite limited with sklearn as a backend. What would it tak…

tomrod updated 8 months ago
3
