-
`gdpperworking_hour_us_inflation_adjusted` is mapped to ILOSTAT `gdp_205u_noc_nb`.
https://github.com/open-numbers/ddf--gapminder--systema_globalis/blob/master/etl/translation_dictionaries/indicato…
-
Specifically, I found that `index.dt.floor(...)` can result in invalid divisions if the divisions already align with the period passed to floor. For example:
```python
import dask.dataframe as dd
…
-
**What happened**:
When using `npartitions="auto"` in `DataFrame.set_index()` on a local distributed cluster, a "Could not deserialize task" error occurs (see code and output below).
This happen…
-
The DDF team has identified some generic checking requirements which might be best implemented through the checking of USDM JSON data against a defined USDM schema (e.g., JSON-Schema). These checks i…
-
**What happened**:
I'm working on pandas 1.5 compatibility (cf #8776) and came across this apparent bug in groupby + transform/apply/shift.
If we are grouping on the dataframe index, we should be …
-
#### Issue summary & value proposition
Visually represent the relationship between multiple selections so users know they are getting the data they expect.
#### Problem
Visualization is challeng…
-
The tests enabled by PR #3480 contain some that fail. These are
```
pdf = pd.DataFrame({'x': [0, 1, 2, 3, 4, 6, 7, 8, 9, 10],
'y': list('abcbabbcda')})
ddf = dd.f…
-
**What happened**: When using ```shuffle = 'disk'``` merging took 50 minutes compared to 2 minutes when using ```shuffle = 'tasks'```. Also, Dask dashboard was showing very low CPU utilization when us…
-
When I load a csv first into dask, and then into dask dataframe using .from_dask_dataframe, ._meta_nonempty does not exist, causing downstream problems in analysis (e.g. `with spatial_shuffle`). My h…
-
### Functionality
- [ ] about `type` what happens if we have a link as a piece of information to list?. The type only covers (text, number, and date)
- [ ] How to store the `.json` file locally inst…