demiangomez / Parallel.GAMIT

Python wrapper to parallelize GAMIT executions
BSD 3-Clause "New" or "Revised" License

Add a "Check node" object to verify a node before sending jobs #6

Closed demiangomez closed 6 years ago

demiangomez commented 6 years ago

When a job server is created for Parallel Python, a "check node" object should handle job creation and verify that each node in the cluster has all the dependencies it needs to run. If a node is missing any required program or dependency, it should be removed from the cluster and execution should continue without it.
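A minimal sketch of the check described above, using only the standard library. The function names `check_node_dependencies` and `usable_nodes`, and the idea of collecting one report per node into a dictionary, are assumptions made for illustration; they are not taken from the Parallel.GAMIT code. In practice the first function would be submitted as a job to each node, and the second would run on the machine that owns the job server.

```python
import importlib.util
import shutil


def check_node_dependencies(programs, modules):
    """Return a list describing what is missing on the machine running this.

    programs: executable names expected on PATH.
    modules:  top-level Python module names expected to be importable.
    (Hypothetical helper, not part of Parallel.GAMIT.)
    """
    missing = []
    for prog in programs:
        # shutil.which returns None when the executable is not on PATH
        if shutil.which(prog) is None:
            missing.append("program: " + prog)
    for mod in modules:
        # find_spec returns None when a top-level module cannot be located
        if importlib.util.find_spec(mod) is None:
            missing.append("module: " + mod)
    return missing


def usable_nodes(reports):
    """Keep only the nodes whose report lists nothing missing.

    reports: dict mapping node name -> list returned by
             check_node_dependencies on that node.
    """
    return [node for node, missing in reports.items() if not missing]
```

With reports gathered from every node, `usable_nodes(reports)` gives the list of nodes that can safely receive jobs, and the rest are dropped from the cluster.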

demiangomez commented 6 years ago

Behavior changed: execution is now cancelled when a node with missing dependencies is found, so that the error cannot be ignored.
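The fail-fast behavior could look roughly like the sketch below. `NodeDependencyError` and `verify_cluster` are hypothetical names, and the input is assumed to be a dictionary mapping each node name to the list of dependencies found missing on it; none of this is taken from the actual implementation.

```python
class NodeDependencyError(RuntimeError):
    """Raised when at least one node is missing a dependency (hypothetical)."""


def verify_cluster(reports):
    """Abort if any node reports missing dependencies.

    reports: dict mapping node name -> list of missing items
             (an empty list means the node passed the check).
    Returns the sorted node names when every node passes.
    """
    bad = {node: missing for node, missing in reports.items() if missing}
    if bad:
        details = "; ".join(
            "{}: {}".format(node, ", ".join(missing))
            for node, missing in sorted(bad.items())
        )
        # Raising here stops the run instead of silently skipping the node
        raise NodeDependencyError("Nodes with missing dependencies: " + details)
    return sorted(reports)
```

Raising an exception rather than filtering the node list means a misconfigured node has to be fixed, or explicitly removed from the configuration, before any job is submitted.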

demiangomez commented 6 years ago

As of today, only pyArchiveService has incorporated this feature. It will be applied to ScanArchive and the other programs soon.

demiangomez commented 6 years ago

A new class called pyJobServer has been added and is now implemented in all PG programs. Still pending: verifying a condition where a timeout is sometimes observed.

demiangomez commented 6 years ago

All programs now support this object.