-
Trying to pass in dask dataframes/series into dask-ml GridSearchCV using XGBClassifier as an estimator results in data validation errors pre-train because:
1. xgboost will convert dask dataframe …
-
**Minimal Code To Reproduce**
```python
import fugue_sql
dag = fugue_sql.FugueSQLWorkflow()
df = dag.df([[0, "hello"], [1, "world"]], "a:int64,b:str")
dag("SELECT * FROM df WHERE a > 0 YIELD …
-
### pycaret version checks
- [X] I have checked that this issue has not already been reported [here](https://github.com/pycaret/pycaret/issues).
- [X] I have confirmed this bug exists on the [latest…
-
If this repository is going to go semi-public then it should probably have a more user-focused name. `expr` corresponds to an internal detail. It's useful for us but doesn't mean anything to a user.…
-
**Is your feature request related to a problem? Please describe.**
It would be great if filter operations in a sql clause could be pushed down to the io layer for formats like `parquet` that support …
-
**Describe the issue**:
As part of https://github.com/geopandas/dask-geopandas/pull/285, we found that dask-expr will lose the type of a pandas DataFrame subclass in `groupby.agg` if (and only if?)…
-
## Feature Request
For any checks in `sklearn.utils.estimator_checks` that generate `pandas` DataFrames, `scipy` sparse arrays, or `numpy` arrays, implement equivalent checks in `dask-ml`, but whic…
-
```
import dask.dataframe as dd
from dask.diagnostics import ProgressBar
def split_url(url):
parts = url.split()
return parts[1] if len(parts) > 1 else url
def deduplicate_csv(input_fi…
-
Datashader has traditionally focused on 2D aggregation and we've given some thought if we could maybe support 3D aggregations, but one low hanging fruit we haven't much considered is 1D aggregations. …
-
### Problem description
The usage of an index build pipeline `build_dataset_indices__bag` may build indices of incompatible types when building an index for a date type column, leaving the dataset …