-
This post summarizes some of the inconsistencies in the reader / writer APIs.
Inconsistencies naturally arise as open source projects develop. I think the project is now in a place where we can th…
-
When I try
```python
import dask.dataframe as dd
ddf = dd.read_csv('random_people.csv')
```
It reports to me:
`
AttributeError Traceback (most recent call last)
…
-
Similar to https://github.com/dask/dask/issues/7095 , I'd also like to be able to use `rank` on groupby Series and DataFrame objects to create per-group rank columns. Today, I can do this in pandas bu…
-
We are using both Python (https://github.com/semio/ddf_utils) and Javascript tooling to generate the datapackage.json with its ddfSchema property.
When running on the very same dataset, the Python-…
-
According to the `ddf.repartition` docs `partition_size`: Max number of bytes of memory for each partition. Use numbers or strings like 5MB. If specified npartitions and divisions will be ignored.
…
-
**Feature request**
Implement `Catalog.to_dask_dataframe()` which would return `self._ddf.copy()`.
**Before submitting**
Please check the following:
- [x] I have described the purpose of the…
-
Lots of checks have been specified for things like:
- If there are multiple codes for xxx, they must be unique.
- The same code must not be used in xxx and yyy
- The xxx attribute must be coded accord…
-
chef changes the `day` concept while it shouldn't, probably some parsing thing:
https://github.com/open-numbers/ddf--open_numbers--covid_government_response/blob/master/ddf--datapoints--stringency_…
semio updated
4 years ago
-
**What happened**: When loading a Parquet file, I specified a column twice in the "columns=" argument and the column was loaded twice, that is, there were two columns in the resulting DataFrame with t…
-
Hello,I met this question,can you help me?
python grad_check.py
Traceback (most recent call last):
File "grad_check.py", line 8, in
from ddf import ddf
File "/media/omnisky/34B22D6336…