-
## Enhancement
The current approach to reading Hive external tables involves three steps:
1. Retrieving all partitions from the HMS
2. Fetching all data files from the partition directory
3. Send…
-
While writing mapReduce_Like.js, I found several points to consider.
https://github.com/7k8m/json.filed/blob/master/examples/mapReduce_Like.js
Distributed processing frameworks like MapReduce have "master node…
-
**Describe the bug**
The query-frontend logs query performance stats, but the bloom-filter-related metrics ("index_bloom_filter_ratio" and "index_post_bloom_filter_chunks") are always equal to zero,…
-
Symptom: moco fails with an exception in DistributedDataParallel:
```
Traceback (most recent call last):
File "/home/jovyan/work/triton-no-conda/pytorch/benchmarks/dynamo/torchbench.py", line 481, i…
-
When running a distributed query, the value of rows_before_limit_at_least is greater than the total number of rows returned by the GROUP BY statement when the total number of rows is less than LIMIT.
…
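
To make the symptom concrete, here is a toy Python sketch. It is not the engine's actual code and not a confirmed diagnosis. It assumes, purely for illustration, that each shard reports its own pre-LIMIT group count and that those counts are added together, so a key present on several shards is counted more than once. The shard data, the `LIMIT` value, and all names are hypothetical.

```python
from collections import Counter

# Hypothetical toy data: two shards holding (key, value) rows.
# The key "b" appears on both shards.
shards = [
    [("a", 1), ("b", 2), ("a", 3)],
    [("b", 4), ("c", 5)],
]
LIMIT = 10  # larger than the number of distinct groups


def local_group_by(rows):
    """Aggregate one shard's rows: sum of values per key."""
    totals = Counter()
    for key, value in rows:
        totals[key] += value
    return totals


# Each shard aggregates independently.
partials = [local_group_by(rows) for rows in shards]

# Assumed (illustrative) counter: add up the per-shard group counts.
naive_rows_before_limit = sum(len(p) for p in partials)

# The initiator merges partial aggregates, so overlapping keys collapse.
merged = Counter()
for p in partials:
    merged.update(p)  # Counter.update adds values for matching keys
result = sorted(merged.items())[:LIMIT]

print("rows returned:", len(result))
print("naive rows_before_limit:", naive_rows_before_limit)
```

Under these assumptions the summed counter is larger than the number of rows actually returned, even though the result has fewer rows than LIMIT. That matches the shape of the mismatch described above.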
-
### Checklist
- [ ] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused b…
-
Example:
https://www.gbif.org/dataset/2cd6ba56-b0ee-4565-94d8-4016e25c39ae
The source archive has eml.xml, meta.xml, image.tsv and occurrence.tsv but the image and occurrence TSVs have no headers, al…
-
Refer to #2 for a quick summary on Spark and RDDs. Note that RDDs are parameterized on a single type. You can have an `RDD` or an `RDD` but not an `RDD` or `RDD`. Consequently, the functions used to j…
-
Hi,
I've really liked how Stanza just "works" out of the box over the last month or so. However, I have recently hit a wall and the documentation is a little sparse on the Stanza CoreNLP cl…
-
The following jmd file fails to weave: https://gist.github.com/aaowens/30528f61528104988a53e15c5402a86f
If I try to run it with
```
weave("Parallel-VFI-examples.jmd")
```
I get errors like
…