Data intensive research, such as Next Generation Sequencing, can require extensive parallelization of analysis routines to efficiently process data in a timely fashion. While parallelization provides marked benefits in the speed of analysis, it creates extra challenges when attempting to share research analysis processes with other researchers.
There are many different parallelization architectures, and few standards
Some researchers lack access to any parallelization infrastructure
We could develop suggestions and practices for researchers to use in sharing research analyses involving parallelization. This might involve using parameterization to allow other researchers a choice of whether to use parallelization or not, or provide them the ability to scale its parallelization to the infrastructure available to them.
Data intensive research, such as Next Generation Sequencing, can require extensive parallelization of analysis routines to efficiently process data in a timely fashion. While parallelization provides marked benefits in the speed of analysis, it creates extra challenges when attempting to share research analysis processes with other researchers.
We could develop suggestions and practices for researchers to use in sharing research analyses involving parallelization. This might involve using parameterization to allow other researchers a choice of whether to use parallelization or not, or provide them the ability to scale its parallelization to the infrastructure available to them.