-
## Bug
#### Which Delta project/connector is this regarding?
- [ x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)
### Describe the problem
I downloaded a c…
-
I have a hadoop cluster with spark enabled on it. Can I run spark -perf on this cluster? Or should it be standalone cluster with spark on it?
-
### Cosmetic Item Name
wraithbinder,Orbuculum Equinox
### Description
when you put a spark for the power of light, then it has the effect of an immortal object, and if for the powers of darkness, t…
-
Use cases for parsing massive amounts of data:
- Return summed taxonomic distribution of reads at multiple levels (domain, family, etc.)
- Return summed functional distribution (KOs, Pfam, etc.)
…
-
Schema measurement seems to result in duplicate columns when the hive table is partitioned by same column, e.g. if a hive table contains column_a and partitioned by column_a, schema measurement is tre…
-
# Error when using spark_apply method
I am using Spark Connect to perform operations with tables hosted in Unity Catalog (Databricks). When I want to use the `spark_apply` method to process them I…
-
This [tutorial](https://tutorials.rc.nectar.org.au/deploying-a-hadoopspark-cluster/01-overview) seems to be well out of date. I managed to get it running by folking the elasticluster [repo](https://gi…
-
**Observed Behavior**
When deploying multiple clusters of the same product (Airflow, NiFi, ...) into one namespace and then deleting one of them, it can happen that the `roleBinding` and `serviceAccou…
-
## Bug Report
### Affected tool(s) or class(es)
_Tool/class name(s), special parameters?_
`SortSamSpark --sort-order coordinate`
### Affected version(s)
- [ ] Latest public release version …
-
Here is the config
Spark- 3.1.1
Hadoop - 3.2
Deequ - 2.0.0-spark-3.1
There is an error with the ColumnProfileRunner (other methods are working well)
Traceback (most recent call last):
…