-
When using glow.py in a Conda environment with the following setup:
- Python version: 3.10
- PySpark version: 3.5.1
- Glow.py version: 2.0.0 (installed via pip install glow.py)
Attempting to loa…
-
### What happened?
Discovered in https://github.com/NickCrews/mismo/issues/64. CC @jstammers. Here is a more minimal reproducer.
Run with `uv run script.py` to get uv to install the deps automa…
-
Wondering if we could make use of the [persist](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.persist.html) or cache methods in pyspark to load the da…
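For reference, a minimal sketch of what that could look like (the read path and variable names below are placeholders, not taken from the project):
```python
from pyspark.sql import SparkSession
from pyspark.storagelevel import StorageLevel

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("data/")  # placeholder source

# cache() keeps the data around after the first action;
# persist() lets you choose the storage level explicitly.
df = df.persist(StorageLevel.MEMORY_AND_DISK)
df.count()  # first action materialises the cached data

# ... subsequent queries reuse the cached data ...
df.unpersist()  # release it when done
```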
-
The current implementation divides the dataset into partitions by assigning each row a partition ID computed with the PySpark function `spark_partition_id`, and then querying each partition. I thin…
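As I read it, the approach is roughly the following (a sketch with placeholder names, not the actual code):
```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import col, spark_partition_id

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("data/")  # placeholder source

# Tag every row with the ID of the partition it lives in.
tagged = df.withColumn("_pid", spark_partition_id())

# Then query the data one partition at a time by filtering on the tag.
for pid in range(tagged.rdd.getNumPartitions()):
    chunk = tagged.filter(col("_pid") == pid)
    # ... process `chunk` ...
```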
-
When I try to use the `pyspark` package, I get this error:
> Backend 'setuptools.build_meta:__legacy__' is not available.
Here is the `pyproject.toml`:
```toml
[tool.poetry]
name = "cowapp"…
-
This is my docker-compose file:
```yaml
version: '3.8'
services:
  spark-master:
    image: bitnami/spark
    container_name: spark-master
    environment:
      - SPARK_MODE=master
      - S…
-
Apache Spark is widely used in the Python ecosystem for distributed computing. As a user of Spark, I would like ruff to lint problematic behaviours. The automation that ruff offers is especially usef…
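For illustration, here is a hypothetical example of the kind of problematic behaviour such a rule could flag (the rule does not exist yet; the names and path below are placeholders):
```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import col, sum as spark_sum

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("events/")  # placeholder source

# Problematic: collect() pulls the whole DataFrame onto the driver
# just to do an aggregation in plain Python.
total = sum(row["amount"] for row in df.collect())

# Preferred: keep the aggregation inside Spark.
total = df.agg(spark_sum(col("amount"))).first()[0]
```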
-
### We would like to learn about your use case. For example, if this feature is needed to adopt Narwhals in an open source project, could you please enter the link to it below?
I'm building a pl…
-
### Missing functionality
After Databricks Runtime 14, the DataFrame type changed in notebooks. It was `pyspark.sql.dataframe.DataFrame`, but now it is `pyspark.sql.connect.dataframe.DataFrame`
…
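For context, code that needs to accept both types can check against the two classes; a minimal sketch (the helper name is mine, not an existing API):
```python
from pyspark.sql import DataFrame as ClassicDataFrame

try:
    # Available when running against Spark Connect (e.g. newer Databricks runtimes).
    from pyspark.sql.connect.dataframe import DataFrame as ConnectDataFrame
    _DATAFRAME_TYPES = (ClassicDataFrame, ConnectDataFrame)
except ImportError:
    _DATAFRAME_TYPES = (ClassicDataFrame,)

def is_spark_dataframe(obj) -> bool:
    """Return True for both classic and Spark Connect DataFrames."""
    return isinstance(obj, _DATAFRAME_TYPES)
```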
-
Since this project uses pyspark to read parquet files, pyspark and its dependencies Spark and Hadoop are required, but the documentation currently lacks a guideline on how to run the script on Wi…