Lazy Evaluation of PTX Kernels in the CUDA Runtime

jiffyrune / gpuocelot

Automatically exported from code.google.com/p/gpuocelot

0 stars 0 forks source link

Describe the New Feature:
The current implementation loads and parses all PTX kernels declared within
a program upon kernel registration.  Make this lazily evaluated instead.

Ideally, registering a kernel should add an entry with a flag saying that
it has not yet been parsed.  Upon the first execution, it should be
translated and then executed and the flag should be updated.

Which milestone does the feature belong to?
1.0.0

Which branch does the new feature go in?
Trunk

Original issue reported on code.google.com by gregory....@gatech.edu on 13 Aug 2009 at 8:15

jiffyrune / gpuocelot

Lazy Evaluation of PTX Kernels in the CUDA Runtime #19