Open dhutchis opened 8 years ago
Removing the hash partitioning works because STORE
used with LOAD
partitions a relation on a single worker by default (this is a bug: https://github.com/uwescience/raco/issues/494).
Interestingly, running this query on the demo server seems to hang forever (in the attached screenshot it's been running over 2 hours):
@shrjain and I found the following query on the example TwitterK dataset that fails. (Replace the IP address with that of the demo myria service if it changes. Or pick another dataset you can load from either the local filesystem or a url.) It is simplified from a real query we are running on genomic data.
It fails because it throws an exception on this line of QueryConstruct.
The line checks that all FileScan operators are inside fragments that are assigned to no more than one worker. However, the FileScan for this query is placed inside a fragment that is assigned to more than one worker, because it includes logic for the UnionAll.
Likely there is a problem with the code that assigns fragments to workers. The problem goes away if you remove the "partition on
[b]
" from the two store statements. The hash partitioning is messing up the assignment somehow.