-
when trying to read data from Kafka which is stored as Json from schema registry. using below configuration. pls tell me where do I'm wrong with the configuration
- --checkpoint
- "OCR.VEHIC…
-
Got below error after running 'select * from hive.redact order by time_stamp desc'
Data is hudi parquet (hive - STORED AS INPUTFORMAT 'com.uber.hoodie.hadoop.HoodieInputFormat' ) on s3.
But enti…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues.
### What happened
In my requir…
-
### Bug description
I built gluten+velox using branch-1.1, submitted a tpch query using spark-shell, and the data was stored in s3. However, the following error occurred during execution:
`Reason:…
-
Vectorized Query Execution could have big performance improvement for SQL engines like Hive, Drill, and Presto. Instead of processing one row at a time, Vectorized Query Execution could streamline ope…
-
Is it possible - similarly to Turbodbc - to get the resultset in Arrow format efficiently?
-
Using sparklyr version 0.2.29, and sparkapi 0.3.18
On an AWS EM cluster, running the following to connect fails:
```
Sys.setenv(SPARK_HOME="/usr/lib/spark")
sc /etc/spark/conf
drwxr-xr-x 3 root roo…
-
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
### Search before asking
- [X] I have searched in the [issues](http…
-
It would be better if we eliminate the need for avro.schema (infer it?/obtain it from PCollection?) and use org.apache.beam.sdk.schemas.Schema instead.
Link to discussion on user@: [https://lists.…
-
An "empty" parquet file, created via pyarrow, seems to be leading to Barrage writing issues.
```python
from deephaven import parquet
my_table = parquet.read("Empty1.parquet")
```
This manif…