-
# The problem
Hello,
I'm working on a Change Data Capture (CDC) pipeline, and my goal is to replicate data from a Parquet file into a Delta table by applying the required inserts, updates, and deletes. I followed the t…
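
One way to reason about which rows need an insert, update, or delete is to compare the incoming snapshot with the current target by key. The following is a minimal plain-Python sketch of that classification step, not the Delta Lake API; the function name `classify_changes`, the `id` key column, and the sample rows are hypothetical and chosen only for illustration. In Delta Lake itself this logic would typically be expressed as a merge against the target table.

```python
from typing import Dict, List, Tuple

Row = Dict[str, object]


def classify_changes(
    source: List[Row], target: List[Row], key: str = "id"
) -> Tuple[List[Row], List[Row], List[Row]]:
    """Split a full source snapshot into inserts, updates, and deletes.

    Assumes `key` uniquely identifies a row in both inputs (hypothetical
    column name) and that `source` is a complete snapshot, so a key that
    is missing from it means the row was deleted upstream.
    """
    source_by_key = {row[key]: row for row in source}
    target_by_key = {row[key]: row for row in target}

    # Keys present only in the source are new rows.
    inserts = [r for k, r in source_by_key.items() if k not in target_by_key]
    # Keys present on both sides with different values are updates.
    updates = [
        r
        for k, r in source_by_key.items()
        if k in target_by_key and r != target_by_key[k]
    ]
    # Keys present only in the target no longer exist in the source.
    deletes = [r for k, r in target_by_key.items() if k not in source_by_key]
    return inserts, updates, deletes


# Illustrative data only.
source_rows = [{"id": 1, "v": "a"}, {"id": 2, "v": "b2"}, {"id": 4, "v": "d"}]
target_rows = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}, {"id": 3, "v": "c"}]
inserts, updates, deletes = classify_changes(source_rows, target_rows)
```

Unchanged rows (here `id` 1) fall into none of the three lists, which is what keeps a repeated run from rewriting data that has not changed.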
-
I want to filter inside a for loop, like this:
```
sc
```
-
When using dplyr's `summarise()` with the `.groups = "drop"` parameter, a new column called `.groups` is created, as shown below.
```r
spark_version
```
-
- Are there any major differences other than `processor` being able to take >1 input?
---
My attempt at finding the differences from the docs:
`processor` is on the driver side and `transform…
-
Is it possible to have 2 input layers and concatenate them in SparkNLP, as with the tf.keras functional API? There doesn't seem to be any documentation on this. The use case is to pass 2 different text c…
-
Because of some stability issues in SnappyData 1.1.0, we considered upgrading to the recently released version, SnappyData 1.2.0.
We have been using SnappyData in production for more than a year.
P…
-
As far as I can tell, functionality to create an Optimus DataFrame from an existing Spark DataFrame is not supported. This would be useful when working with data from a source that Optimus doesn't cur…
-
**Is your feature request related to a problem? Please describe.**
Most datasets I deal with are XML or JSON based. In contrast to tabular data, "cells" can be nested, possibly with a key.
Preproce…
-
Things don't work as described in the official [overview.ipynb](https://github.com/microsoft/SynapseML/blob/master/notebooks/features/lightgbm/LightGBM%20-%20Overview.ipynb).
I am trying to build a mod…