spark-dataframes Search Results

1000+ results
for spark-dataframes

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

AbsaOSS/cobrix #556

Fixed length Cobol EBCDIC file with header, body and trailer…

## Background [Optional] We have fixed length ebcdic file with file structure starts with header, body records and trailer. We user cobrix library (**za.co.absa.cobrix:spark-cobol_2.12:2.6.0**) in **…

ManojKolisetty-git updated 1 year ago
8
MrPowers/mack #64

Feature Request: Flatten Nested Schema

Having to flatten nested schemas can be cumbersome. A function to flatten the structure with various options would be handy. Some configurations to include could be: - Include Parent Struct Nam…

gardnmi updated 1 year ago
1
snowflakedb/snowpark-python #596

SNOW-686233: Ability to Convert 'snowflake.snowpark.datafram…

## What is the current behavior? Creating DataFrame / Executing sql statements on snowflake returns a `snowflake.snowpark.dataframe.DataFrame` by default without an option to convert it to a `spark…

amithadiraju1694 updated 1 year ago
10
MobileTeleSystems/Ambrosia #10

Fractional split bug on duplicated dataframes indices

Fractional split feature of `Splitter` returns an undesired result when one tries to split a `pandas` dataframe with duplicated indices without passing any argument for `id_column`. The following …

xandaau updated 1 year ago
1
snowflakedb/snowflake-connector-python #397

SNOW-194062: feature request: `fetch_dask_dataframe()`

I've been working with this library for a few days. Thanks so much for maintaining it! It has made it really easy to work with data from Snowflake using PyData libraries. I'd like to propose a feat…

jameslamb updated 1 year ago
8
qt4cg/qtspecs #45

Second parameter of fn:sum must be neutral element for +

Currently fn:sum specifies the intent of the second parameter in a note: > The second argument allows an appropriate value to be defined to represent the sum of an empty sequence. For example, when…

ghislainfourny updated 1 year ago
6
GoogleCloudDataproc/spark-bigquery-connector #821

Unable to write spark dataframe to new bigquery table | BigQ…

In trying to write from a spark dataframe to bigquery with dataproc (image version image 2.0.45-debian10) in [direct mode](https://github.com/GoogleCloudDataproc/spark-bigquery-connector#writing-data-…

hkarimi-bc updated 1 year ago
6
stitchfix/hamilton #17

Add caching for hamilton

**The problem** We want to enable caching of functions and their downstream results. Say we want to alter a function and rerun the entire DAG. The function that we want to alter runs late enough…

elijahbenizzy updated 1 year ago
4
dbt-labs/dbt-core #1860

Support Dask as an Adapter

### Describe the feature Support Dask just as Spark is supported. ### Who will this benefit? This will benefit realtime / web-request use cases where milliseconds matter. The same isomorphic Mac…

talebzeghmi updated 1 year ago
10
apache/arrow #19099

[Python] write_to_dataset poor performance when splitting

Hello, Posting this from github (master @wesm asked for it :) ) ```java import pandas as pd import numpy as np import pyarrow.parquet as pq import pyarrow as pa idx = pd.date_…

asfimport updated 1 year ago
4

上一页 1...63 64 65 66 67 68 69...100 下一页

1000+ results for spark-dataframes

1000+ results
for spark-dataframes