-
### Validation
- [X] I've read the [FAQ](https://github.com/xenia-project/xenia/wiki/FAQ).
- [X] The Xenia build used is from the master branch. (not MLBS/AlexVS/Canary/pull requests, etc)
- [X] This…
-
## Design
Kudo serialization format is optimized for columnar batch serialization used during spark shuffle, which significantly improved serialization/deserialization time compared to jcudf serial…
-
Recently introduced committees and majorities (see https://github.com/space-meridian/roadmap/issues/59) created an opportunity for building reputation scores at the level of individual checker nodes (…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
- Join the mailing list to engage in conversations and get faster support at dev-subscri…
-
### rdd
#### from *.gz to rdd
```
* *scala
// from *.gz file to rdd
rdd = sc.textFile("hdfs://path/to/*.gz")
* *pyspark
from pyspark.sql import SparkSession
spark = SparkSession.builder…
-
please join us @cchunharas
-
**Name of the component**: Spark
**What would you like to be added**: Create a new component
**Why is this needed?** : A couple of our hot prospects use Spark. We would like to demonstrate t…
-
hive.exec.dynamicparition
hadoop committer algorithms
hadoop commit path
-
1. Leave solder there, to bernard opinion. Do nothing.
2. Have the project compiling in intellij 12.
3. Attach SparkDS file (DataSource file).
4. Make it work with our common server:
http://107.21.…
-
This is first on trial basis.