-
Thank you for the wonderful library!
**Is your feature request related to a problem? Please describe.**
I'm wondering if RecBole's [data flow](https://recbole.io/docs/user_guide/data/data_flow.htm…
-
I am currently partitioning a docx file using unstructured with the following input parameters:
```json
{
"filename": "document.docx",
"response_type": "application/json",
"coordinates": fals…
-
Right now, we're copying entire dataframes to avoid mutating the original in a pyjanitor function. This of course can come with big computational costs depending on the size of the dataframe involved.…
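The defensive-copy pattern described above can be sketched as follows. This is a minimal illustration, not pyjanitor's actual code: the helper `add_ratio` and the column names `a`, `b`, and `ratio` are hypothetical.

```python
import pandas as pd


def add_ratio(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: return a new frame with an extra 'ratio' column."""
    # Defensive copy so the caller's frame is never mutated.
    # The cost of this copy grows with the size of df.
    out = df.copy()
    out["ratio"] = out["a"] / out["b"]
    return out


original = pd.DataFrame({"a": [1.0, 2.0], "b": [2.0, 4.0]})
result = add_ratio(original)
```

The caller's `original` keeps its two columns, while `result` carries the added one; the price is a full copy of the input on every call.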
-
At the moment, the functions herein operate primarily on `DataFrame`s, because they inherently bind timestamps/frequencies to the data samples, which are used in nearly every function in this package. However, I perso…
-
The Dask documentation suggests [using Dask DataFrames](https://docs.dask.org/en/latest/best-practices.html#load-data-with-dask). Is this even possible with Woodwork? Is it worth it?
-
Within our analysis notebooks, we can easily generate strings of text with data interpolated into them. To facilitate **automatic** analysis of our data, we should engineer a set of prompts that can b…
-
## Feature Description
Spark Connect Support
## Is your feature request related to a problem?
In Spark Connect, RDD is not supported, so PipelineDP does not work. See https://github.com/apache/sp…
-
Looking at the API documentation for `vaex.open_many`, I don't see a way to know what kind of files vaex expects.
https://vaex.readthedocs.io/en/latest/_modules/vaex.html#open_many
Nor …
-
Hello,
Currently, we cannot sample from a GroupedDataFrame directly.
```julia
julia> df = DataFrame(rand(100000, 100), :auto);
gdf = groupby(df, :x1);
# Code above from #31…
-
This extension will allow users to return correlation data for the numerical columns in their synthesis, grouped by a chosen variable (e.g., gender, age, or race). Therefore, users can assess the performan…
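A minimal sketch of per-group correlations, assuming the data lives in a pandas `DataFrame`. The function name `correlations_by_group` and the sample columns are hypothetical and are not part of the proposed extension.

```python
import pandas as pd


def correlations_by_group(df: pd.DataFrame, by: str) -> pd.DataFrame:
    """Pairwise Pearson correlations of the numeric columns within each group."""
    # Keep numeric columns only, excluding the grouping column itself
    # in case it is numeric (for example, age).
    numeric_cols = df.select_dtypes(include="number").columns.difference([by])
    return df.groupby(by)[list(numeric_cols)].corr()


data = pd.DataFrame(
    {
        "gender": ["A", "A", "A", "B", "B", "B"],
        "x": [1.0, 2.0, 3.0, 1.0, 2.0, 3.0],
        "y": [2.0, 4.0, 6.0, 3.0, 2.0, 1.0],
    }
)
corr = correlations_by_group(data, "gender")
```

The result is indexed by (group, column), so `corr.loc[("A", "x"), "y"]` reads the correlation between `x` and `y` inside group `A`. Running the same function on real and synthetic data would give two tables that can be compared group by group.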