dask-dataframes Search Results

1000+ results
for dask-dataframes

pandas-dev/pandas #42600

ENH: Serialize view of ArrowStringArray

Currently Pandas serializes views of ArrowStringArrays by serializing the whole thing, rather than a subset. Here is an example: ```python In [1]: import pandas as pd In [2]: s = pd.Series([c …

mrocklin updated 2 years ago
4
toddfarmer/arrow-migration #2366

Python: Convert non-range Pandas indices (optionally) to Arr…

***Note**: This issue was originally created as [ARROW-376](https://issues.apache.org/jira/browse/ARROW-376). Please see the [migration documentation](https://gist.github.com/toddfarmer/12aa88361532d2…

toddfarmer updated 1 year ago
11
toddfarmer/arrow-migration #328

Python: Convert non-range Pandas indices (optionally) to Arr…

***Note**: This issue was originally created as [ARROW-376](https://issues.apache.org/jira/browse/ARROW-376). Please see the [migration documentation](https://gist.github.com/toddfarmer/12aa88361532d2…

toddfarmer updated 1 year ago
11
catalyst-cooperative/pudl #1457

Reduce memory usage and manage RMI/PPL data dependencies

In the course of [setting up continuous integration](https://github.com/catalyst-cooperative/rmi-ferc1-eia/issues/151) in the `rmi-ferc1-eia` repository, we discovered that the current plant part list…

zaneselvans updated 1 year ago
5
rapidsai/cudf #10169

[FEA] Standardize applymap support with pandas to enable Das…

Today, we support the `applymap` interface on Series but not DataFrames. Pandas supports `applymap` on DataFrames but not Series. In pandas, the interface applies a scalar function/UDF to eve…

beckernick updated 2 years ago
5
rapidsai/cudf #6755

[FEA] Can't run full broadcast join a big cudf with a small …

I wish I could join a large cuDF with a small series/list/sequence in terms of full join in sql, or even better with the small series/list being broadcast for the full join like in spark sql, while th…

roe246 updated 2 years ago
6
dask/distributed #7289

Numba serialization is slow sometimes

I'm using Dask + Datashader over here: https://github.com/mrocklin/dask-tutorial/blob/main/2-dataframes-at-scale.ipynb I'm finding that I'm spending around 20s serializing things, this is mostly in…

mrocklin updated 1 year ago
2
dask-contrib/dask-sql #684

[DF] Some grouped aggregations fail

Repro: ``` import pandas as pd from dask_sql import Context c = Context() df = pd.DataFrame({"id": [0, 1, 1, 2], "val": [1, 1, 2, 1]}) c.create_table("df", df) c.sql(""" SELECT val, …

randerzander updated 2 years ago
2
pangeo-data/foss4g-2022 #50

Update content of dask_introduction.ipynb

Here are some propositions as discussed in https://github.com/pangeo-data/foss4g-2022/pull/45. Please indicate whether it's OK for you (especially @tinaok): - [x] Add a little part on Dask Clust…

guillaumeeb updated 2 years ago
3
rapidsai/cudf #3682

[BUG] Dask DataFrames groupby...apply

**Describe the bug** An AttributeError occurs when I use groupby...apply on a Dask DataFrame. > AttributeError: 'SeriesGroupBy' object has no attribute 'apply' **Steps/Code to reproduce bug** `from…

xiaonans updated 2 years ago
11
