jiffyrune / gpuocelot

Automatically exported from code.google.com/p/gpuocelot

Lazy Evaluation of PTX Kernels in the CUDA Runtime #19

GoogleCodeExporter closed this issue 9 years ago

GoogleCodeExporter commented 9 years ago
Describe the New Feature:
The current implementation loads and parses every PTX kernel declared in a
program at kernel registration time. Parsing should instead be deferred until
a kernel is first used.

Ideally, registering a kernel should only add an entry flagged as not yet
parsed. On its first execution, the kernel should be translated, the flag
updated, and the kernel then executed.
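
A minimal sketch of the registration scheme described above, assuming a simple name-to-entry map. `LazyKernelRegistry`, `ParsedKernel`, and the `parse` step are hypothetical names used for illustration only; they are not part of the Ocelot code base, and the "parsing" here is a placeholder that only records the PTX size.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical stand-in for a translated kernel (not an Ocelot type).
struct ParsedKernel {
    std::string name;
    std::size_t ptxBytes = 0;
};

class LazyKernelRegistry {
public:
    // Registration only stores the PTX text and leaves the entry
    // flagged as "not yet parsed". No parsing happens here.
    void registerKernel(const std::string& name, std::string ptx) {
        Entry entry;
        entry.ptx = std::move(ptx);
        entries_[name] = std::move(entry);
    }

    // True once the kernel has been parsed by a launch.
    bool isParsed(const std::string& name) const {
        return entries_.at(name).parsed;
    }

    // Number of times the (placeholder) parser has run.
    int parseCount() const { return parseCount_; }

    // The first launch parses the kernel and sets the flag;
    // later launches reuse the cached result.
    const ParsedKernel& launch(const std::string& name) {
        auto it = entries_.find(name);
        if (it == entries_.end()) {
            throw std::out_of_range("unregistered kernel: " + name);
        }
        Entry& entry = it->second;
        if (!entry.parsed) {
            entry.kernel = parse(name, entry.ptx);
            entry.parsed = true;
        }
        assert(entry.parsed);
        return entry.kernel;  // a real runtime would execute it here
    }

private:
    struct Entry {
        std::string ptx;
        bool parsed = false;
        ParsedKernel kernel;
    };

    // Placeholder for PTX parsing and translation (assumption).
    ParsedKernel parse(const std::string& name, const std::string& ptx) {
        ++parseCount_;
        ParsedKernel kernel;
        kernel.name = name;
        kernel.ptxBytes = ptx.size();
        return kernel;
    }

    std::unordered_map<std::string, Entry> entries_;
    int parseCount_ = 0;
};
```

With this layout, a kernel that is registered but never launched costs only the storage of its PTX text. The sketch is not thread-safe; a runtime that launches kernels from several host threads would need to guard the parse-and-flag step.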

Which milestone does the feature belong to?
1.0.0

Which branch does the new feature go in?
Trunk

Original issue reported on code.google.com by gregory....@gatech.edu on 13 Aug 2009 at 8:15

GoogleCodeExporter commented 9 years ago
This feature is now implemented. The runtime loads a module on its first use
rather than when the program starts.

Original comment by gregory....@gatech.edu on 1 Jul 2010 at 9:00