-
## About
When looking at Apache Spark from the perspective of a dataframe library, I guess [Use CrateDB with DataFrame libraries](https://cratedb.com/docs/crate/clients-tools/en/latest/connect/df.h…
amotl updated
6 months ago
-
It would be convenient to only have to `using DataFrameMacros` instead of `using DataFrames, DataFrameMacros`. DataFramesMeta.jl uses Reexport.jl to achieve this.
-
### Bug report checklis
- [X] Searched the [issues page](https://github.com/e2nIEE/pandapower/issues) for similar reports
- [X] Read the relevant sections of the [documentation](https://pandapower.r…
-
Most of the data ingestion utilities (`pd.read_csv`, `pd.read_parquet`, `pd.read_sql`) since Pandas 2.2 include a `dtype_backend='pyarrow'` option to use PyArrow types instead of NumPy backend.
It …
-
As [Polars](https://github.com/pola-rs/polars) is becoming more and more popular I am wondering if there are any consideration to support its dataframes. Given the performance oriented focus of this p…
-
We added a line in [disrordered_tracing()](https://github.com/AFM-SPM/TopoStats/blob/5d01d437144455d179254e7d8eaa98edcc0ff58d/topostats/tracing/disordered_tracing.py#L425) to substitute the `-` in col…
-
Currently, data frames can be split by random splits.
I would like the following types of splits:
1. Row based data frame split/reweighting
a. by column (i.e. Split data frame by where column m…
-
### Overview
We force our users to use pandas dataframes to reduce a dependency on polars, which is less widely supported than polars. https://github.com/narwhals-dev/narwhals is a really cool proj…
-
There are some minor bugs in the code generated from the incidence-to-prevalence prompt.
I had to edit the prompt and code a bit to get the right behaviour:
```
I have three dataframes d1, d2, d3. Do…
-
Sometimes, a dataset is broken into multiple tables. For instance, data is often split into multiple CSV files so that each download is smaller.
When we need to reconstruct a single DataFrame from mu…