Describe the New Feature:
1) Devise a work queue approach where the executive class spawns one thread
per CPU core and assigns CTAs to threads as they complete.
2) For atomic ops, rather than locking, asynchronously push data into a
local queue, when it overflows then lock and do a bulk update. Also do a
bulk update when the CTA completes to eliminate stragglers and when a fence
instruction is called.
Which milestone does the feature belong to?
1.0.0
Which branch does the new feature go in?
Branch
Original issue reported on code.google.com by gregory....@gatech.edu on 30 Jul 2009 at 3:27
Original issue reported on code.google.com by
gregory....@gatech.edu
on 30 Jul 2009 at 3:27