Hi HPC Community,
Talon2 is back online and compute services have been resumed. All jobs that were suspended appear to have resumed correctly. As always, please notify us immediately if anything looks out of place.
Please note that we are still solving the Lustre Metadata high-availability issue. Lustre is extremely robust but this issue could cause loss of redundancy. However, we feel the risk of this is low enough that we can safely resume service.
A few more updates: