Closed mimaric closed 4 years ago
rerun tests
Addressing #526 in this PR as well
rerun tests
rerun tests
@ahehn-nv This is not the final commit, no need to re-review yet
rerun tests
rerun tests
rerun tests
@ahehn-nv While re-reviewing can you also check those few unresolved comments from previous reviews?
When creating indices generation of one
Index
is overlapped with transferring of previously generatedIndex
to host memory.When calculating overlaps while one batch of device indices in being processed the other batch is being loaded into device memory simultaneously.
TODO:
IndexCache
synchronize generation and communication streams using events, notcudaStreamSynchronize()
: #524Closes #318 Closes #526