-
-
(you don't have to strictly follow this form)
## Question
查询数据量
```
SELECT count(t0.`rate_sum`) FROM `db`.`xxx_table` AS t0 WHERE t0.`sdate` >= '2024-10-01' AND t0.`sdate` = '2024-10-01' AND t…
-
According to the current status in the readme, it seems only cloud storages are supported for the catalog.
For companies running on premise data lake, HDFS is widely used.
It would be nice if HDFS…
-
Recently, #767 has been merged into main. With that we should add some tests that check the code from the PR. As HDFS is an optional feature it would be fine if these tests only run if one wants to op…
-
I have been using pyarrow's pa.hdfs.connect() and pq.ParquetDataset to read files before using daft to read from pyarrow. The alternative is to simply use pandas' read_parquet and then daft's from_pan…
-
### Backend
VL (Velox)
### Bug description
After executing the SQL, if I exit the spark-sql command line using Ctrl+C or quit command, a core dump occurs. https://github.com/apache/incubator-…
-
Hi,
I have my K8S setup where we have deployed HDFS as helm chart with 3 Journalnodes, 3 Datanodes, 2 Namenodes and other HDFS3-HA components.
But when we do helm install, the Namenode pod goes fo…
-
# Problem
Some data products are very fragile when some of their services are restarted. These restarts may induce long response times or even data loss causing major disruptions in production enviro…
-
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
### Search before asking
- [X] I have searched in the [issues](http…
-
### OpenLineage integration/client
Spark integration
### OpenLineage version
main branch
### Technology and package versions
_No response_
### Environment configuration
_No response_
### Deplo…