Better placing of the threadpool count limiter for when it runs in parallel.
Added the simple "map" example back into the synthetic example notebook, which helps test the code automatically runs in parallel when present with more than 256 "pixels" of data.