Closed GuDuJian-J-Zhang closed 1 year ago
Thanks for the PR!
What you've implemented is essentially compaction using an atomic add operation, which is totally valid here. What we have instead is a "stable" compaction (i.e., preserves the ordering of the input elements) using a regular sum scan (also known as prefix sum).
I expect both approaches are valid and performance should be similar (seems to be confirmed with the performance numbers visible on your screenshots) so feel free to use this approach although we probably won't merge the changes as it is functionally equivalent to what's already in place.
ok, got it. thanks for your reply
added
probe_spawn_tile_count_buffer_
to hold spawned tile countand mark as
overrideable
at the end of theSpawnScreenProbes
passso that we can remove
CompactScreenProbes
, which is , from my understand, not necessary for the gi pipelinepipeline screenshot before and after this change: