raxtechbits / gpuocelot

Automatically exported from code.google.com/p/gpuocelot

Multithread the emulator! #16

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
Describe the New Feature:
1) Devise a work-queue approach in which the executive class spawns one thread
per CPU core and hands each thread a new CTA as soon as it finishes its
previous one.
2) For atomic ops, rather than locking on every operation, asynchronously push
the data into a local queue. When the queue overflows, lock and do a bulk
update. Also do a bulk update when the CTA completes (to eliminate
stragglers) and whenever a fence instruction is executed.
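
A rough sketch of how the two points above could fit together, in standalone C++ rather than against the actual Ocelot classes. Every name here (GlobalMemory, AtomicBuffer, runCta, launch) is hypothetical and made up for illustration; the buffer capacity of 64 is arbitrary, and the "kernel" just adds 1 to a single cell per emulated thread. The work queue is reduced to a shared atomic counter that workers use to claim the next CTA. Deferring updates like this is only valid for atomics whose return value is unused (reduction-style adds); an atomic that reads back the old value would still need to apply immediately.

```cpp
#include <algorithm>
#include <atomic>
#include <cstddef>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical stand-in for emulated global memory guarded by a single lock.
struct GlobalMemory {
    std::vector<long> cells;
    std::mutex lock;
};

// Per-worker local queue of pending atomic adds, stored as (address, value).
class AtomicBuffer {
public:
    AtomicBuffer(GlobalMemory& memory, std::size_t capacity)
        : memory_(memory), capacity_(capacity) { pending_.reserve(capacity); }

    // Queue an atomic add; the lock is taken only when the queue is full.
    void add(std::size_t address, long value) {
        pending_.emplace_back(address, value);
        if (pending_.size() >= capacity_) flush();
    }

    // Bulk update: apply every pending add under one lock acquisition.
    // A fence instruction would also call this.
    void flush() {
        if (pending_.empty()) return;
        std::lock_guard<std::mutex> guard(memory_.lock);
        for (const auto& update : pending_) {
            memory_.cells[update.first] += update.second;
        }
        pending_.clear();
    }

private:
    GlobalMemory& memory_;
    std::size_t capacity_;
    std::vector<std::pair<std::size_t, long>> pending_;
};

// Hypothetical CTA body: every emulated thread adds 1 to cell 0.
void runCta(unsigned ctaId, unsigned threadsPerCta, AtomicBuffer& buffer) {
    (void)ctaId;  // unused in this toy kernel
    for (unsigned t = 0; t < threadsPerCta; ++t) buffer.add(0, 1);
    buffer.flush();  // flush at CTA completion so no updates straggle
}

// Spawn one worker per core; each claims the next unclaimed CTA
// as soon as it finishes its current one.
long launch(unsigned ctaCount, unsigned threadsPerCta) {
    GlobalMemory memory;
    memory.cells.assign(1, 0);
    std::atomic<unsigned> nextCta{0};
    // hardware_concurrency() may return 0 when the core count is unknown.
    unsigned workers = std::max(1u, std::thread::hardware_concurrency());

    std::vector<std::thread> pool;
    for (unsigned w = 0; w < workers; ++w) {
        pool.emplace_back([&] {
            AtomicBuffer buffer(memory, 64);
            for (unsigned cta = nextCta.fetch_add(1); cta < ctaCount;
                 cta = nextCta.fetch_add(1)) {
                runCta(cta, threadsPerCta, buffer);
            }
        });
    }
    for (auto& worker : pool) worker.join();
    return memory.cells[0];
}
```

Calling launch(100, 32) runs 100 CTAs of 32 emulated threads each and returns the final value of cell 0. With a capacity of 64, a 32-thread CTA never overflows the queue, so each CTA takes the lock once (at completion) instead of 32 times.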

Which milestone does the feature belong to?
1.0.0

Which branch does the new feature go in?
Branch

Original issue reported on code.google.com by gregory....@gatech.edu on 30 Jul 2009 at 3:27

GoogleCodeExporter commented 8 years ago

Original comment by gregory....@gatech.edu on 2 Feb 2012 at 6:01