The particle rung distribution from a GPU build is very different with and without "gpu-local-tree-walk" enabled. With that option disabled, the rung distribution matches that of a cpu build, as expected.
Example rung distributions of step 1 of a ~4 billion particle run:
The particle rung distribution from a GPU build is very different with and without "gpu-local-tree-walk" enabled. With that option disabled, the rung distribution matches that of a cpu build, as expected. Example rung distributions of step 1 of a ~4 billion particle run: