-
We have seen users who write using the default from R, and then are frustrated when their Python colleagues can't read. We have considered changing to arrow for a long time:
https://github.com/rstu…
-
If I have a weight variable that's part of the DataFrame, I might want to do this:
```
glm(@formula(y ~ x), df, ..., wts = df[:weight]
```
However this doens't work if `x` or `y` contain missi…
-
**Describe the problem you faced**
When trying to use the [observe](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.observe.html) function on datafra…
-
### Description
Essentially the suggestion is just to provide a function:
`df.lookup(row_condition, column)`
as syntactic sugar for the more verbose:
`df.filter(row_condition)[column].item…
-
The functionality of this packages is awesome, but for the use case my team and I have, it's rendered essentially useless due to the fact that patito.polars.DataFrames can't be reverted back to polars…
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
It is a common use-case to dynamically download datasets (usually CSV or JSON) from third-p…
-
When running a comparison on dataframes with a single column, the following exception is thrown:
```
/opt/venv/lib/python3.8/site-packages/datacompy/spark.py:356: in rows_both_mismatch
self._…
-
### Description
When joining data, it is useful to have the possibility of adding an "indicator" column displaying information on the source of each row, similar to the `indicator` parameter in `pa…
-
If you have a styled ```df = pandas.DataFrame().style```, you can pass it as ```sdf = solara.DataFrame(df.data)``` but you loose the styling that was applied to df. So if columns were hidden by the st…
-
I tried the to_dataframe example at the bottom of the README file and got the error KeyError: 'VARCHAR(32)'.
On line 366 there is a map of all the possible datatypes, but I was thinking it's missin…