Now watches for nodes that have been reloading for too long. "too long" is
currently defined as 30 minutes, to keep false positives to a minimum. Sends mail to testbed-ops if/when it finds any. The timing is not precise, as it only polls in between loading machines, but this is fine for our purposes.
Showing
Please register or sign in to comment