-
It may be helpful to indicate size of datasets that can be used with Red Amber and what operations will be supported.
For a comparison with other dataframes, see Table 3 in [Towards Scalable Datafram…
-
As stated in the docs, "Blaze includes nascent support for out-of-core processing with Pandas DataFrames and NumPy NDArrays". http://blaze.readthedocs.org/en/latest/ooc.html#parallel-processing.
Sho…
-
#### [FEA] API to write dask dataframes to local storage of each node in multi-node cluster
### Example requested API:
```python
df.to_parquet(xxx, write_locally_per_node=True)
```
**Pleas…
-
-
I recently discovered modin and loved the clean approach to working with large dataframes in a simple manner. One of the things that struck me was that the [Modin architecture](https://modin.readthedo…
-
Potential candidates include (but would be in no way limited to):
* xarray
* vaex
* Dask
The idea here is that data cleaning and higher-level functions to manipulate data structures are pretty…
-
Hi, I am trying to get the points from lidar data and am having issues with v2.perception.utils.lidar_utils.convert_range_image_to_point_cloud.
I am using python3.10 and tensorflow 2.12 in a docker…
-
# Repro
Run:
```
import dask.dataframe as dd
import pandas as pd
ddf1 = dd.from_pandas(pd.DataFrame([{'foo': float('nan')}]), npartitions=1)
ddf2 = dd.from_pandas(pd.DataFrame([{'foo': ['s…
-
Hello
I start to use dask- sql but I cant make any simple query, I can just make a total selection with `select * from df;`. Beside this query I cant do anything else, in every query I get the sa…
-
Sometimes it's not clear when running a number of SQL scripts in a session whether all scripts "clean up" after themselves (dropping temp tables, unpersisting tables, de-registering UDFs, etc).
It …