Closed: khawandc closed this issue 6 years ago
We are fairly observation-based rather than column-based, but you should be able to accomplish what you want by doing:
- Give the `parallel` call an `outputopts` option so that each child has a name for where to store its output.
- Once `parallel` is done, load the original dataset, and then bring in (likely via `merge`) all the generated variables in the ancillary files.
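The steps above could be sketched roughly as follows. This is only an illustration under several assumptions, not the package's documented recipe: the file names `full.dta`, `child_fit.do`, and `fitted_1.dta` through `fitted_3.dta` and the key variable `id` are hypothetical; it swaps the `outputopts` suggestion for the child-number global `$pll_instance` to name each child's output; and it assumes `parallel do` accepts a `nodata` option that skips splitting the loaded data. Check both against `help parallel` for your installed version.

```stata
* ---- child_fit.do (hypothetical): executed once by each child ----
* Assumption: parallel sets the global pll_instance to the child number (1, 2, 3).
local i = $pll_instance
use full.dta, clear            // each child loads the FULL data set itself
regress y`i' x`i'              // child 1 runs reg y1 x1, child 2 runs reg y2 x2, ...
predict yhat`i', xb            // fitted values from this child's regression
keep id yhat`i'                // id is a hypothetical unique observation key
save fitted_`i'.dta, replace   // ancillary file named after the child

* ---- parent session: run after child_fit.do has been saved ----
parallel setclusters 3               // one child per regression
parallel do child_fit.do, nodata     // assumption: nodata prevents splitting

use full.dta, clear
forvalues i = 1/3 {
    * bring each child's fitted values back into the original data
    merge 1:1 id using fitted_`i'.dta, nogenerate
}
```

If the children do not start in the parent's working directory, the relative paths above would need to be replaced with absolute ones.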
Hope that helps.
I am looking to run a few distinct commands in parallel on all observations in a single data set, but I do not know how to disable parallel's splitting of the data set.
Expected behavior and actual behavior
What I want is to run each command (`reg y1 x1`, `reg y2 x2`, etc.) in parallel on the full data set of 1,000,000 observations and have each one create a fitted-value variable. Instead, parallel starts three tasks and runs each regression on only 333,333 observations.
Any easy workarounds?
Thanks for your time. Love the code/idea.
System information
Some relevant information
Output from `creturn list`: