These are not the prettiest changes, since I'm optimizing for as little branching as possible, which tends to run faster on GPUs. In my testing, the erratic five flush filter runs 2-3x as fast with the improved seed functions.
Also added a very basic timer so that seeds/second benchmarks can be run.
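
For reference, a seeds/second measurement can be sketched roughly as below. This is not the timer from this change, just a minimal host-side illustration; `seeds_per_second` and `benchmark` are hypothetical names, and the `search` callable is assumed to block until all GPU work has finished (otherwise only the launch is timed).

```cpp
#include <chrono>
#include <cstdint>
#include <utility>

// Hypothetical helper: turn a seed count and an elapsed time into seeds/second.
// Returns 0 when no measurable time has elapsed, to avoid dividing by zero.
double seeds_per_second(std::uint64_t seeds, double elapsed_seconds) {
    return elapsed_seconds > 0.0
        ? static_cast<double>(seeds) / elapsed_seconds
        : 0.0;
}

// Hypothetical wrapper: time one call to `search(seeds)` with a monotonic clock.
// Assumption: `search` only returns once the device has finished its work.
template <typename Search>
double benchmark(Search&& search, std::uint64_t seeds) {
    const auto start = std::chrono::steady_clock::now();
    std::forward<Search>(search)(seeds);
    const auto end = std::chrono::steady_clock::now();
    const std::chrono::duration<double> elapsed = end - start;
    return seeds_per_second(seeds, elapsed.count());
}
```

`std::chrono::steady_clock` is used rather than `system_clock` because it never jumps backwards, which matters for short benchmark runs.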