-
### Describe the feature
I am currently facing an issue using DBT with Spark on AWS/Glue/EMR environment as discussed already in https://github.com/dbt-labs/dbt-spark/issues/215 (but already raised h…
-
# Context
Spark / Spark SQL supportes _nested_ datatypes. So called [`StructType`](https://docs.databricks.com/sql/language-manual/data-types/struct-type.html) are basically a _container_ for other a…
-
### Version
main branch
### Describe what's wrong
I want to use federation query using hive metastore stored in 2 hadoop clusters.
So we added two hive catalogues to metalake.
There is a diff…
-
I want to generate spark-sql data lineage for tables, atlas version 2.0.0, spark-atlas-connector version master branch code,
but I met a bug:
the sql cmd:
`./bin/spark-sql -e 'drop table if…
-
Hey there,
As the title says, i am trying to query an existing cassandra DB from nodejs using your library. I am using a spark cluster on a LAN
Here's what i have done so far :
using :
- CentOS 7
- …
-
1. when write data fail in hudi some partitions , then data file is not exist
2. use sparksql to query data,return success
3. use trino(v391) to query data, return fail
trino error log:
```
io…
-
**Is your feature request related to a problem?**
Apache Iceberg is designed for managing large analytic tables in a scalable and performant way, using features like schema evolution, partitioning,…
-
Creating an open source data lake with SQL access is key to the reference architecture of this book
Please adjust the TOC for the data lake part so it fit's your needs. Please propose a reference a…
-
**Describe the problem you faced**
I am unable to create a hudi table using the data that I have with POPULATE_META_FIELDS being enabled. I can create the table with POPULATE_META_FIELDS set to fal…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
- Join the mailing list to engage in conversations and get faster support at dev-subscri…