-
cc @Fokko .
This is a very simple implementation of an Iceberg client for Dask. It works for the few datasets I have available, including
- version metadata choice
- snapshot time …
-
We have to use a private Maven repository proxy to download the packages listed under `deps`.
The SparkApplication, however, stays in status SUBMISSION_FAILED because it is unable to verify the certificate of…
-
### What happened?
I am trying to deploy a Beam job (Python Beam) that runs on a PortableRunner (Flink Runner) in my Kubernetes cluster.
I have not previously experienced issues with Beam using the Flin…
-
## Description
My PySpark application needs to access a GCP storage bucket, and I've mounted the secret (containing the service account key) as a volume mount. However, I am getting an error.
```
Trace…
-
Configuring monitoring for Spark applications is slightly confusing.
With this we set up monitoring:
```yaml
monitoring:
metricsProperties: |
*.sink......
*.source....
expo…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Willi…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
- Join the mailing list to engage in conversations and get faster support at dev-subscri…
-
I am not sure whether this is an issue with Spark/K8S or the entry-point of the image, but the result is that the driver cannot access the files passed through `--py-files` and `--files` when these ar…
-
### Description
We are replacing Presto in our system with Velox, but we have done a lot of our own research and performance optimization based on Presto. We found that there is a performance gap b…
-
Example: https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/examples/spark-pi.yaml
Operator values
```
operatorVersion: v1beta2-1.2.1-3.0.0
imagePullSecrets:
- name: ha…