dask-dataframes Search Results

1000+ results
for dask-dataframes

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pandas-dev/pandas #26925

Index dtype is changed to object after left join

#### Code Sample ```python import pandas as pd import numpy as np left = pd.DataFrame( columns=['A'], index=pd.Index([], name='id', dtype=np.int64) ) right = pd.DataFrame( [[…

kayibal updated 1 year ago
12
apache/arrow #17606

[PYTHON] serialize_pandas should pass through the preserve_i…

I'm doing some benchmarking of Arrow serialization for dask.distributed to serialize dataframes. Overall things look good compared to the current implementation (using pickle). The biggest difference…

asfimport updated 1 year ago
2
fugue-project/fugue #404

[FEATURE] Ray/Dask engines guess optimal default partitions

When converting local dataframes to a Ray Dataset and Dask DataFrame or when there is a group-map operation, Ray requires users to be explicit about the number of partitions and reducers. However, mos…

goodwanghan updated 1 year ago
1
dask/dask #6460

Bug? Dask randomly hangs on 1-2 open tasks if the memory tar…

Pretty random I get the following situation: The dask `cluster = LocalCluster(n_workers=28, host='192.168.56.11')` works good until some lonely task is hanging. Checking the logs, I see a trac…

cgi1 updated 1 year ago
21
apache/arrow #16145

[Python] Add adapter to write pandas.DataFrame in user-selec…

While we can convert a `pandas.DataFrame` to a single (arbitrarily large) `arrow::RecordBatch`, it is not easy to create multiple small record batches – we could do so in a streaming fashion and immed…

asfimport updated 1 year ago
5
ioos/erddapy #228

GSoC 2022 ideas

Some users are looking for tools to help them assemble ERDDAP urls for use in their own workflows, while others would prefer to work at a higher, more opinionated level. I believe we can more cleanly …

ocefpaf updated 1 year ago
18
pyOpenSci/software-submission #31

OpenOmics: Library for integration of multi-omics, annotatio…

Submitting Author: Jonny Tran (@JonnyTran) All current maintainers: @JonnyTran Package Name: openomics One-Line Description of Package: Library for integration of multi-omics, annotation, and int…

JonnyTran updated 1 year ago
58
pyOpenSci/software-peer-review #82

Discuss possible collaborations / integrations with Pangeo

Thanks for all of your amazing work on pyOpenSci. It's great to see the progress this project has made. I'm opening this issue to discuss how we (in the Pangeo project) can leverage and collaborate…

rabernat updated 1 year ago
4
flyteorg/flyte #427

[Backend][Plugin]Support for Dask clustered tasks in Flyte

**Why would this plugin be helpful to the Flyte community** Users could write very short running distributed array jobs using DASK. This makes it possible to have very small runtime jobs multi-plexed…

kumare3 updated 1 year ago
13
dask/dask #9567

Non-working implicit dask dataframe promotion in map_overlap

_map_overlap()_ is unable to handle raw pandas DataFrame, unlike _map_partition()_. ``` import dask.dataframe import pandas as pd df = pd.util.testing.makeMixedDataFrame() # Works fine da…

epizut updated 1 year ago
2

上一页 1...87 88 89 90 91 92 93...100 下一页

1000+ results for dask-dataframes

1000+ results
for dask-dataframes