-
I have a parquet dataset with a column consisting of serialized tf.Example protobufs. When I write this dataset and read without any compression I have no problems deserializing the protos. When I wri…
-
Hi,
It would be great to implement [jay](https://github.com/h2oai/datatable/issues/1109), binary format to write dataframes for R data.tables as well.
-
I'm running a .NET Spark app for batch processing with a selection of GitHub projects data. My program runs as expected up through making a Spark Sql call:
```csharp
// ...Code creating spark sess…
-
If you have a schema that contains a list-of-struct, selecting a subset of the inner columns doesn't work. Example
`list`
If the schema for this column was
```
A (list)
B …
-
Hi,
We must have a feature selection that is not manual. Gisele recommended this one:
https://spark.apache.org/docs/2.2.0/ml-features.html#chisqselector
The issue (mentioned by @waltersf ) is…
-
https://cloud.onehouse.ai/c3eb3868-6979-41cd-9018-952d29a43337/data/lakes/iceberg/databases/taxis
reran extractor service
```
root@spark:/opt/lakeview# java -jar LakeView-1.0-SNAPSHOT-all.jar -p …
-
```
prepareDF()
longStringDF
.write
.format("carbon")
.save(writerPath)
sqlContext.read.format("carbon").load(writerPath).show(false)
```
```
private def prepa…
-
So far, aggregation can only work without windowing (without the `OVER` keyword).
Unfortunately, I do (still) not know how to implement the very broad windowing techniques from SQL in dask.
-
I created an MOR table t_mor with 3 columns a,b,c and a as recordKey, c as precombine field. I do 4 operations with the spark-sql.
1. insert into t_mor 1, 1,1
2. insert into t_mor 2,2,2
3. update …
-
Real life customer issues:
1. The `is set` and `is not set`) filters in the events explorer [don't work](https://app.posthog.com/events#q=%7B%22kind%22%3A%22DataTableNode%22%2C%22full%22%3Atrue%2C%22…