RSE-Cambridge / data-acc

Data Accelerator: Creates a burst buffer from generic hardware and integrates it with Slurm https://www.hpc.cam.ac.uk/research/data-acc http://www.stackhpc.com
https://rse-cambridge.github.io/data-acc
Apache License 2.0
17 stars 11 forks source link

Consider picking devices based on expected lifespace #135

Open JohnGarbutt opened 4 years ago

JohnGarbutt commented 4 years ago

Balance load between the NVMe drives, rather than just picking at random.

Likely should update drive health on restart of dacd and after delete of buffers. Need to look into secure erase / reset / discard at delete of buffer, to make sure Lustre does the correct thing.