-
There is content from my thesis in chapter 6.2 that compares and contrasts Modin and Dask/Koalas. It would be good to add a summary of this content (and the API plot) to the README: http://www2.eecs.berk…
-
@quasiben has been playing with Dask and R with [reticulate](https://github.com/rstudio/reticulate) and [rpy2](https://rpy2.readthedocs.io/) with the objective of providing Dask's concurrent.futures A…
-
@charlesbluca shared a Dask-SQL test/snippet that seems to "hang" when query planning is enabled in `dask.dataframe`. It turns out the operation does *eventually* finish, but that graph materializ…
-
I have a pandas dataframe that looks like this:

| index | type | value1 | value2 |
|-------|------|--------|--------|
| 0     | a    | .5     | .6     |
| 1     | b    | .25    | .2     |
| 2     | c    | .25    | .2     |

Then a dask …
-
Apologies if this has already been requested, or is clearly impossible for some reason. My Dask knowledge isn't super deep.
I know that OSErrors, which can occur due to a disk being full, are handl…
-
Implement functionality to submit jobs to Dask and/or condor for the preselection looper.
Implementation would go in this function in `prep_helper.py`:
https://github.com/cmstas/HggAnalysisDev/blo…
-
I'm curious about mechanisms to address data skew in sorting/shuffling/merge operations, where some values are far more common than others. This comes up a lot when talking to Spark people as a commo…
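
One commonly discussed mitigation for this kind of skew is key salting: rows that share a hot key get an extra "salt" component so they no longer all land in the same partition. The sketch below is a minimal, standard-library-only illustration of that idea and is not how Dask or Spark implement shuffles. The helpers `partition_of` and `salted_partition`, the toy key distribution, and the choice of four partitions are all assumptions made for the example.

```python
import zlib
from collections import Counter


def partition_of(key, npartitions):
    """Deterministic hash partitioner (hypothetical stand-in for a shuffle)."""
    return zlib.crc32(repr(key).encode()) % npartitions


def salted_partition(key, row_id, hot_keys, npartitions):
    """Spread rows of a hot key round-robin over all partitions.

    The salt is derived from the row position; non-hot keys keep their
    usual partition, so only the skewed keys are affected.
    """
    base = partition_of(key, npartitions)
    if key in hot_keys:
        salt = row_id % npartitions
        return (base + salt) % npartitions
    return base


# Toy skewed data: key "a" accounts for 90% of the rows (assumed distribution).
keys = ["a"] * 900 + ["b"] * 50 + ["c"] * 50
npartitions = 4

# Rows per partition without and with salting of the hot key.
plain = Counter(partition_of(k, npartitions) for k in keys)
salted = Counter(
    salted_partition(k, i, {"a"}, npartitions) for i, k in enumerate(keys)
)

print("largest partition, plain: ", max(plain.values()))
print("largest partition, salted:", max(salted.values()))
```

Without salting, every row for the hot key hashes to a single partition; with salting, those rows are split evenly. The trade-off is that a merge against salted keys generally needs the other side's matching rows replicated once per salt value, and a sort needs the salted pieces recombined afterwards.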
-
Occurs [here](https://github.com/iiasa/message_data/actions/runs/8247073169/job/22554473807#step:14:68) when importing pyam, via ([among others](https://github.com/iiasa/message_data/actions/runs/8247…
-
**Minimal Code To Reproduce**
```python
import fugue_sql
dag = fugue_sql.FugueSQLWorkflow()
df = dag.df([[0, "hello"], [1, "world"]], "a:int64,b:str")
dag("SELECT * FROM df WHERE a > 0 YIELD …
-
Per the example:
```
dataset = [['Milk', 'Onion', 'Nutmeg', 'Kidney Beans', 'Eggs', 'Yogurt'],
['Dill', 'Onion', 'Nutmeg', 'Kidney Beans', 'Eggs', 'Yogurt'],
['Milk', 'Appl…