-
I'm using spark-bigquery-with-dependencies_2.11-0.21.1.jar and having trouble with reading BigQuery data from Spark on Yarn cluster.
Pipeline:
BigQuery -> Spark 2.3.2 with HDP 3.1.5 , Python 3.6 …
-
## Description
I think there's scope to create a series of data connectors that would allow Kedro users to connect to Snowflake in different ways. This usage pattern was identified in the kedro-org/k…
-
**Describe the bug**
When using a SchemaModel on a pyspark dataframe with the config option `strict = "filter"` set, a `TypeError: drop() got an unexpected keyword argument 'inplace'` is raised.
-…
-
## Description
Kedro-viz supports Plotly.
Plotly has cool tables -https://plotly.com/python/table/
the idea is simply show the first 5/10 rows of the dataset on Kedro-viz
### Implementa…
-
This is a feature request / discussion issue to outline some problems we are having on RAPIDS cuML and, hopefully, converge on a good solution.
**Problem:** Dask Arrays & Dataframes are assumed to …
-
## Expected behavior
Want to do Distance Join Query between two dataframes. So followed the [documentation](https://datasystemslab.github.io/GeoSpark/tutorial/geospark-core-python/#write-a-distance-j…
-
I have different excel files with multiple sheets in a directory. I am not sure about the sheet names.
How do I read/ Iterate over all the sheet names without providing the names explicitly.
-
**Describe the bug**
See the example below. Reading an Excel from a WebserviceFileDataObject creates a file called result.xlsx in the specified path.
When reading this same ExcelFileDataObject with…
-
xref https://github.com/pandas-dev/pandas/pull/28135#issuecomment-524659775
do we want to make pandas PEP 561 compatible?
https://mypy.readthedocs.io/en/latest/installed_packages.html#making-pep…
-
### What's the use case?
When opening a connection to Snowflake, it is possible to use `cursor.fetch_pandas.all()` to create a DataFrame without using type hints. However, when that DataFrame is pass…