Open vkhodygo opened 1 year ago
Add all elements required to conduct multiruns of SimPaths in parallel.
As a brief list to work on:
simpaths.jar
with dependencesinput
folder in the same directory as script - this should also have been pre-processed with SimPathsStart
output/logs
folder - though would suggest a test and create line (which I can add after getting it to run first off)Add scripts for data conversion
There were two R scripts deleted in 01df37665552708b01fdfd6eedfba4c39bb38fe7 which convert to arrow/parquet if these are any use? One converts Person.csv
, BenefitUnit.csv
and Household.csv
output files to three parquet files. The other joins them to one combined database.
If possible solving #6 might be priority - want to test things in running order so would add tests for the above files/directories after parsing arguments.
There were two R scripts deleted in 01df376 which convert to arrow/parquet if these are any use? One converts
Person.csv
,BenefitUnit.csv
andHousehold.csv
output files to three parquet files. The other joins them to one combined database.
I do have a slightly better version which I'll add once you're done with #4
If possible solving #6 might be priority - want to test things in running order so would add tests for the above files/directories after parsing arguments.
I'll get to it, thanks for letting me know.