-
Hi, gota looks really great, but it doesn't seem to be maintained or used any more. What are people using instead when they want to work with dataframes in Go?
-
Since version 1.2, scikit-learn has a `set_output` API that can make transformers output pandas DataFrames.
- [ ] Investigate how that affects us.
-
This template is a collection of ETL pipelines, covering everything from extracting data from various sources to loading it via the REST API into the specific application.
-
We want to be able to perform split-apply-combine operations on (geo)pandas dataframes in a distributed / scalable way both on a single multi-core machine and in the cloud. It looks like lithops may t…
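
To make the requested pattern concrete, here is a rough sketch of split-apply-combine written against a generic executor with a `map` method. It uses the standard library's `ThreadPoolExecutor` as a stand-in, not lithops; whether a lithops executor can be dropped into this shape is an open question here. The helper names `summarize` and `split_apply_combine` and the columns `region` and `value` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

import pandas as pd


def summarize(group: pd.DataFrame) -> pd.DataFrame:
    """Apply step: reduce one group to a single summary row."""
    return pd.DataFrame(
        {"region": [group["region"].iloc[0]], "total": [group["value"].sum()]}
    )


def split_apply_combine(df, key, func, executor):
    # Split: one sub-frame per distinct key value (groupby sorts keys by default).
    groups = [group for _, group in df.groupby(key)]
    # Apply: the executor decides where each call runs (threads here).
    results = list(executor.map(func, groups))
    # Combine: stitch the per-group results back into one frame.
    return pd.concat(results, ignore_index=True)


# Hypothetical toy data for illustration.
df = pd.DataFrame({"region": ["a", "b", "a", "b"], "value": [1, 2, 3, 4]})

with ThreadPoolExecutor(max_workers=2) as pool:
    out = split_apply_combine(df, "region", summarize, pool)

print(out)
```

Keeping the executor as a parameter means the split and combine steps stay the same on a single machine and in the cloud; only the object passed in would change. For CPU-bound work on a multi-core machine, a process-based executor is the more likely choice than threads.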
-
# :beetle:
- [X] I have checked that this issue has not already been reported.
### Bug summary
Our current version of `DataFrames.jl` is not compatible with all of the code that is implemente…
-
Are there plans to support `to_spark` for remote dataframes? Currently the `RemoteDataFrame` class's `to_spark` method resolves to the base class, where it raises a not-implemented error despite the `int…
-
Hello,
Good job on Spark.jl.
I have an issue. I tried to learn Spark and followed the documentation:
> This is a quick introduction into the Spark.jl core functions. It closely follows the …
-
Consider the following MWE, where I created a sparse 10,000 × 1,500 matrix and made the first column the label. The entire dataset takes only a few MB in memory. The `task.fit` step encounters …
-
This seems to be incompatible with the latest DataFrames version, 22.0, if you do add ShapML. I was able to download it from the GitHub repository, but that resulted in a downgrade to DataFrames 21.8. …
-
Current (as a 2-step process):
%storage read --object "gs://bucket/path/to/csv" --variable temp_str
df = pd.read_csv(StringIO(temp_str))
Proposed (one step, plus avoid creating a copy o…