fangq / mcxcl

Monte Carlo eXtreme for OpenCL (MCXCL)
http://mcx.space/wiki/?Learn#mcxcl

optimize photon workload at work-group level #16

Closed leimingyu closed 7 years ago

leimingyu commented 7 years ago

Use local memory to dynamically distribute the photon workload among the threads in each work-group. The OpenCL lib directory for CUDA is also added to the Makefile.
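To illustrate the idea of dynamic distribution, here is a minimal sketch in plain C, not the kernel code from this pull request. It emulates one work-group serially: the photon pool stands in for a counter in local memory, and each claim stands in for an atomic decrement. The function name `simulate_workgroup`, the `cost` array (ticks a work-item spends on each photon), and `MAX_GROUP` are all hypothetical names chosen for the example.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_GROUP 64  /* hypothetical upper bound on work-group size */

/* Serial emulation of a work-group drawing photons from a shared pool.
 * pool       : photons assigned to the whole work-group
 * group_size : number of work-items in the group
 * cost[i]    : ticks work-item i needs per photon (assumed, must be > 0)
 * done[i]    : output, number of photons work-item i ended up simulating
 */
void simulate_workgroup(unsigned pool, size_t group_size,
                        const unsigned *cost, unsigned *done)
{
    unsigned busy[MAX_GROUP] = {0};  /* remaining ticks per work-item */
    size_t active = 0;               /* work-items currently busy */

    assert(group_size > 0 && group_size <= MAX_GROUP);
    for (size_t i = 0; i < group_size; i++) {
        assert(cost[i] > 0);
        done[i] = 0;
    }

    /* One pass of the inner loop is one tick of simulated time. */
    while (pool > 0 || active > 0) {
        for (size_t i = 0; i < group_size; i++) {
            /* An idle work-item claims the next photon, if any is left.
             * In a kernel this would be an atomic op on a __local counter. */
            if (busy[i] == 0 && pool > 0) {
                pool--;
                done[i]++;
                busy[i] = cost[i];
                active++;
            }
            /* Advance the photon this work-item is working on. */
            if (busy[i] > 0 && --busy[i] == 0)
                active--;
        }
    }
}
```

With a pool of 12 photons and two work-items whose per-photon costs are 1 and 3 ticks, the faster work-item ends up with 9 photons and the slower one with 3, so neither sits idle while photons remain. A fixed even split would instead leave the faster work-item waiting on the slower one.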

fangq commented 7 years ago

Your tab setting still seems to be different from mine (8 spaces per tab), but I accepted it anyway.

With the way you calculate photons_per_blk, will mcxcl still run exactly the specified total number of photons if that number is not a multiple of the thread count? (Previously this was taken care of by oddphotons.)
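The concern can be made concrete with a small sketch in plain C. This is one common way to split a total across threads without losing the remainder, not necessarily how mcxcl or `oddphotons` handles it. The function name `photons_for_thread` and its parameters are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Number of photons thread `tid` should simulate when `total` photons are
 * split across `num_threads` threads. Every thread gets the integer share
 * total / num_threads, and the first (total % num_threads) threads take one
 * extra photon, so the per-thread counts add up to exactly `total`.
 */
unsigned long photons_for_thread(unsigned long total, size_t num_threads,
                                 size_t tid)
{
    assert(num_threads > 0 && tid < num_threads);

    unsigned long base = total / num_threads;      /* even share */
    unsigned long leftover = total % num_threads;  /* photons left over */

    return base + (tid < leftover ? 1UL : 0UL);
}
```

For example, 1000003 photons on 1024 threads gives a base share of 976 with 579 left over, so threads 0 to 578 run 977 photons and the rest run 976. If only the integer share were used, the run would simulate 999424 photons instead of the requested 1000003.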