-
I'm attempting to use the p2p shuffle implementation (using the branch proposed for merge in #7326) to shuffle a ~1TB dataset.
The data exists on disk as ~300 parquet files (that each expand to aroun…
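For reference, a minimal sketch of opting into the p2p shuffle via dask's config (assuming a dask version that recognizes the `dataframe.shuffle.method` key; on the #7326-era branch the same choice is passed per call, e.g. `ddf.shuffle("col", shuffle="p2p")`):

```python
import dask

# opt into the p2p shuffle globally; per-call overrides are also possible
# on versions that accept a shuffle= keyword
dask.config.set({"dataframe.shuffle.method": "p2p"})

method = dask.config.get("dataframe.shuffle.method")
```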
-
**What happened**:
I use the `distributed.utils_test` fixture `cleanup` in my pytest tests. I have some code that uses the dask `threading` scheduler. However, the `cleanup` fixture has a `check_th…
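For context, a minimal sketch of the pattern that can trip a thread-leak check (the helper name here is hypothetical): the threaded scheduler spawns a pool of worker threads that outlive the call.

```python
import dask

def run_on_threads():
    # the threaded scheduler keeps its thread pool alive after compute()
    # returns, which a fixture that checks for leaked threads can flag
    return dask.delayed(sum)([1, 2, 3]).compute(scheduler="threads")
```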
-
**What happened**:
Due to a mistake in our code, we were persisting a dask dataframe on one scheduler, but then ran the compute while specifying the threads scheduler. What was weird was that the compu…
-
I'm trying to process parquet files stored in AWS S3.
The files are read simply with
```python
from distributed import Client
import dask.dataframe as dd

with Client(n_workers=6) as client:
    df = dd.read_parquet('s3://lightnings_*.gzip.parquet')
…
-
When I load a CSV into dask first, and then into a dask-geopandas dataframe using `.from_dask_dataframe`, `._meta_nonempty` does not exist, causing downstream problems in analysis (e.g. with `spatial_shuffle`). My h…
-
Is it possible to delete the specified DDF when a drop table command is executed on it?
-
Following up on https://stackoverflow.com/questions/48592049/dask-dataframe-groupby-apply-efficiency/48592529 with an example.
Read data with a sorted index column and perform a groupby; shouldn't re…
-
Using `Distance` 2.0.0 and `mrds` 3.0.0.
The following code uses these duiker data: [DaytimeDistances.txt](https://github.com/user-attachments/files/17680350/DaytimeDistances.txt)
```r
DuikerCam…
-
When creating a GeoDataFrame from a dask dataframe, we could pass through the `crs` keyword to the underlying geopandas.GeoDataFrame constructor:
https://github.com/geopandas/dask-geopandas/blob/5b…
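For reference, the `crs` keyword on the plain geopandas constructor that would be forwarded (a minimal sketch, assuming geopandas and shapely are installed):

```python
import geopandas
from shapely.geometry import Point

# constructing a GeoDataFrame with an explicit crs, the keyword the
# issue proposes passing through from dask-geopandas
gdf = geopandas.GeoDataFrame(
    {"geometry": [Point(0, 0), Point(1, 1)]}, crs="EPSG:4326"
)
```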
-
```python
import pandas as pd
from dask_expr import from_pandas

df = pd.DataFrame({"a": [1, 2, 3], "bb": 1}, index=["a", "a", "b"])
ddf = from_pandas(df)
ddf.a["b"].compute()
```
This raises
```
Traceback (most r…