-
Modin(Ray, Dask): For performance of larger dataframes, it would be cool if the summaries do not depend on pandas, but could also work with scalable implementations, such as dask. We can easily test t…
-
It would be great to have a function like dask's to_delayed in order to take xarray datasets and convert them to pandas dataframes chunkily.
http://stackoverflow.com/questions/40475884/how-to-conve…
-
(Real-world use case) Say you have 10,000 single-partition dask DataFrames, each with known divisions. You want to combine them into one. Some don't overlap at all, but say 40% of them do overlap with…
-
Splitting out from #62 this other approach to managing memory, to address the running-out-of-memory issue described by https://github.com/EcohydrologyTeam/ClearWater-modules/issues/57#issuecomment-183…
-
This is essentially the same issue as [this one](https://github.com/dask/community/issues/151) on dask/community, but I thought it would be worth a try to see if anyone can help here. Please let m…
-
I think that groupby with Dask can be improved using some Dask related statements to declare the aggragations, etc. see https://docs.dask.org/en/latest/dataframe-groupby.html and https://examples.dask…
-
Implement the plan we've discussed to abstract out model I/O, so that:
- model runs by interacting with a dictionary of data frames in memory
- presently the model reads/writes to HDF5 during the …
-
Hi, would it be possible to have zero-copy conversion to/from Dask dataframes in modin?
- `ddf = mdf.to_dask_dataframe()`
- `mdf = mpd.from_dask_dataframe(ddf)`
This is useful in many cases when …
-
This issue is an attempt to clearly highlight an existing bug in Dask that has been (less-clearly) reported in a number of places (e.g. #7449 and #7777): The current serialization machinery used in Da…
-
It would be nice if we could seamlessly hold `dask` arrays and dataframes inside of an AnnData object.
I would consider this issue closed once most operations on an AnnData would work when it conta…