-
This is a request about using CellPy on a cloud platform, and specifically for using Unity Catalog for data Governance, which is useful for example if you want to use databricks. Unity Catalog uses Ap…
-
After finding an efficient way to upload big files to a flask server, I now have to send this data to a database so that the user can query the database and retrieve desired records. Querying a databa…
-
**Describe the bug**
Accorrding to [the code here ](https://github.com/apache/spark/blob/branch-3.3/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcUtils.scala#L220), Spark …
-
**Describe the bug**
Vowpal Wabbit's [One Against All](https://github.com/VowpalWabbit/vowpal_wabbit/wiki/One-Against-All-(oaa)-multi-class-example) classifier does not work via the MMLSpark interfac…
-
spark-monitor branch [l4jv2](https://github.com/mspnp/spark-monitoring/tree/l4jv2) to support custom logging after DBR version upgrade to 12.2 are having column mapping issues. Some of the SparkLoggin…
-
A related issue: https://github.com/rapidsai/cudf/issues/11462
We would like to transfer a cuDF dataframe between a JVM process and a Python process without data copy . This is primarily used in Py…
-
Hi All,
I am new to Spark and Scala. I have the source code for Spark SQL Performance Tests and dsdgen .
Can anyone tell me how to proceed next ? I am done with building by giving command bin/run…
-
We had a use case at Argenta, where we worked with table of about 300 cols and ~2 mil. of rows.
There, the preprocessing took a lot of time and memory especially.
What we’d need is to find any dat…
-
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
### Search before asking
- [X] I have searched in the [issues](http…
-
I want to upgrade Hudi from 0.11.1 to 0.13.1, but I encountered the problem of duplicate data. I have never encountered it before with the same configuration.
I have synchronized many tables, and t…