1. 10 Aug, 2016 1 commit
      Rejiggered reload_daemon to enforce a max time. · b6d272a2
      Mike Hibler authored
      There are now some sitevars to control its behavior; the one of interest here
      is reload/failtime:
      
      The way the reload daemon is supposed to work now is that nodes will be
      started on their reloading adventure with an os_load. If they are still there
      after reload/retrytime minutes, then they will either be rebooted (if the
      os_load was successful) or os_load'ed again (if the first os_load failed
      outright). The logic for either of these is that there might have been some
      transient condition that caused the failure. If we do have to perform this
      "retry" then we will send email to testbed-ops if reload/warnonretry is set.
      If, after another reload/retrytime minutes, a node is still there, then the
      node will be sent to hwdown, possibly powering it off or booting it into the
      admin MFS depending on the setting of reload/hwdownaction.
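
The retry sequence described above can be sketched as a small decision function. This is only an illustration of the stated policy, not the daemon's actual code, which this message does not show. The `Node` fields, the `next_action` name, and the action strings are hypothetical; `retrytime` stands in for the reload/retrytime sitevar, and the second interval is assumed to end 2 * retrytime minutes after the node entered reloading.

```python
from dataclasses import dataclass


@dataclass
class Node:
    minutes_in_reloading: float  # time since the node entered reloading
    osload_succeeded: bool       # did the first os_load succeed?
    retried: bool = False        # has the single retry already been done?


def next_action(node: Node, retrytime: float) -> str:
    """Return the action the described policy would take for one node."""
    if not node.retried:
        if node.minutes_in_reloading < retrytime:
            return "wait"
        # First interval expired: retry once, in case the failure was transient.
        return "reboot" if node.osload_succeeded else "os_load"
    if node.minutes_in_reloading < 2 * retrytime:
        return "wait"
    # Still in reloading after the retry interval: give up on the node.
    return "hwdown"
```

In this sketch a caller would send the testbed-ops email whenever the result is "reboot" or "os_load" and reload/warnonretry is set, and would consult reload/hwdownaction when the result is "hwdown".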
      
      So really, reload/failtime should not be needed. All nodes should exit
      reloading in 2 * reload/retrytime minutes. But it is there as a backstop
      (and because I didn't understand the logic of the reload daemon at first!)
      Well, it also comes into play if the reload daemon is restarted after being
      down for a long period of time. In this case, all nodes in reloading will
      get moved to hwdown. May need to reconsider this...
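
The restart case can be illustrated with a short sketch. It assumes reload/failtime is measured in minutes since a node entered reloading, which the message does not state explicitly; the function name `nodes_past_failtime` and the timestamps are hypothetical.

```python
def nodes_past_failtime(entered, now, failtime):
    """Return the nodes that have been in reloading for at least failtime.

    entered maps a node id to the minute at which it entered reloading;
    now is the current minute on the same clock.
    """
    return sorted(n for n, t in entered.items() if now - t >= failtime)


entered = {"pc1": 0, "pc2": 10}

# Daemon running normally: neither node has hit the backstop yet.
running = nodes_past_failtime(entered, now=30, failtime=120)

# Daemon restarted after a long outage: every node is past failtime,
# so all of them would be moved to hwdown at once.
after_outage = nodes_past_failtime(entered, now=600, failtime=120)
```

Under these assumptions the outage itself, rather than any fault in the nodes, is what pushes them past the limit.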