1. 02 Apr, 2018 3 commits
    • Leigh B Stoller's avatar
    • Gary Wong's avatar
      Fix reservation approval admission control bug. · 493ce602
      Gary Wong authored
      (If the reservation being approved isn't in the list returned by LookupAll(),
      presumably because it was previously pending, then append it during
      feasibility checking.)
    • David Johnson's avatar
      Fix a race in kill/restart of pubsubd in rc.bootsetup . · a3b1a555
      David Johnson authored
      pubsubd wasn't restarting, surely because the existing pubsubd was still
      running and/or socket state was still live in the kernel even after
      putative death.  This took a long time to manifest, and it's not clear
      exactly what the problem was, but making sure pubsubd is dead (and is no
      longer holding its specific port) is appropriate even if we assume
      REUSEADDR is working, and fixes the current problem.  This was only
      observable on the pc3000s and c220g2s, as far as I saw.
  2. 30 Mar, 2018 6 commits
    • Kirk Webb's avatar
      Add s5048 model definitions. · c5ee65fa
      Kirk Webb authored
    • Mike Hibler's avatar
      Install the right file Mike.. · 396431a0
      Mike Hibler authored
    • Mike Hibler's avatar
    • Mike Hibler's avatar
      Initialize port range from defs- vars. · e593d62b
      Mike Hibler authored
    • Mike Hibler's avatar
      Support for frisbee direct image upload to fs node. · 99943a19
      Mike Hibler authored
      We have had issues with uploading images to boss where they are then written
      across NFS to ops. That seems to be a network hop too far on CloudLab Utah
      where we have a 10Gb control network. We get occasional transcient timeouts
      from somewhere in the TCP code. With the convoluted path through real and
      virtual NICs, some with offloading, some without, packets wind up getting
      out of order and someone gets far enough behind to cause problems.
      So we work around it.
      If IMAGEUPLOADTOFS is defined in the defs-* file, we will run a frisbee
      master server on the fs (ops) node and the image creation path directs the
      nodes to use that server. There is a new hack configuration for the master
      server "upload-only" which is extremely specific to ops: it validates the
      upload with the boss master server and, if allowed, fires up an upload
      server for the client to talk to. The image will thus be directly uploaded
      to the local (ZFS) /proj or /groups filesystems on ops. This seems to be
      enough to get around the problem.
      Note that we could allow this master server to serve downloads as well to
      avoid the analogous problem in that direction, but this to date has not
      been a problem.
      NOTE: the ops node must be in the nodes table in the DB or else boss will
      not validate proxied requests from it. The standard install procedure is
      supposed to add ops, but we have a couple of clusters where it is not in
      the table!
    • Leigh B Stoller's avatar
      Minor fixes. · e4485562
      Leigh B Stoller authored
  3. 29 Mar, 2018 5 commits
  4. 28 Mar, 2018 6 commits
  5. 27 Mar, 2018 1 commit
  6. 26 Mar, 2018 14 commits
  7. 22 Mar, 2018 2 commits
  8. 19 Mar, 2018 1 commit
  9. 18 Mar, 2018 1 commit
  10. 14 Mar, 2018 1 commit