spark-dataframes Search Results

1000+ results
for spark-dataframes

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

dbt-labs/dbt-core #1860

Support Dask as an Adapter

### Describe the feature Support Dask just as Spark is supported. ### Who will this benefit? This will benefit realtime / web-request use cases where milliseconds matter. The same isomorphic Mac…

talebzeghmi updated 1 year ago
10
MobileTeleSystems/Ambrosia #10

Fractional split bug on duplicated dataframes indices

Fractional split feature of `Splitter` returns an undesired result when one tries to split a `pandas` dataframe with duplicated indices without passing any argument for `id_column`. The following …

xandaau updated 1 year ago
1
GoogleCloudDataproc/spark-bigquery-connector #443

com.google.cloud.spark.bigquery.ArrowSchemaConverter Unsuppo…

I'm using spark-bigquery-with-dependencies_2.11-0.21.1.jar and having trouble with reading BigQuery data from Spark on Yarn cluster. Pipeline: BigQuery -> Spark 2.3.2 with HDP 3.1.5 , Python 3.6 …

appden1 updated 1 year ago
7
kedro-org/kedro-plugins #108

Snowflake Data Connectors (SnowPark)

## Description I think there's scope to create a series of data connectors that would allow Kedro users to connect to Snowflake in different ways. This usage pattern was identified in the kedro-org/k…

yetudada updated 1 year ago
13
apache/arrow #19099

[Python] write_to_dataset poor performance when splitting

Hello, Posting this from github (master @wesm asked for it :) ) ```java import pandas as pd import numpy as np import pyarrow.parquet as pq import pyarrow as pa idx = pd.date_…

asfimport updated 1 year ago
4
kedro-org/kedro-viz #907

Kedro-Viz to show preview of data

## Description Kedro-viz supports Plotly. Plotly has cool tables -https://plotly.com/python/table/ the idea is simply show the first 5/10 rows of the dataset on Kedro-viz ### Implementa…

rashidakanchwala updated 1 year ago
18
unionai-oss/pandera #996

Config option `strict = "filter"` does not work on spark dat…

**Describe the bug** When using a SchemaModel on a pyspark dataframe with the config option `strict = "filter"` set, a `TypeError: drop() got an unexpected keyword argument 'inplace'` is raised. -…

nwoodbury updated 1 year ago
6
dask/dask #5506

Data distribution guarantees for CUDA-based multi-node multi…

This is a feature request / discussion issue to outline some problems we are having on RAPIDS cuML and, hopefully, converge on a good solution. **Problem:** Dask Arrays & Dataframes are assumed to …

cjnolet updated 1 year ago
10
apache/sedona #464

Distance Join Query result using SpatialRDDs is sparse (does…

## Expected behavior Want to do Distance Join Query between two dataframes. So followed the [documentation](https://datasystemslab.github.io/GeoSpark/tutorial/geospark-core-python/#write-a-distance-j…

firasomrane updated 1 year ago
4
pandas-dev/pandas #28142

Typing Stubs and PEP 561 compatibility

xref https://github.com/pandas-dev/pandas/pull/28135#issuecomment-524659775 do we want to make pandas PEP 561 compatible? https://mypy.readthedocs.io/en/latest/installed_packages.html#making-pep…

simonjayhawkins updated 10 months ago
63

上一页 1...62 63 64 65 66 67 68...100 下一页

1000+ results for spark-dataframes

1000+ results
for spark-dataframes