1. 31 Jan, 2017 2 commits
  2. 20 Jan, 2017 1 commit
    • Mike Hibler's avatar
      New 'subbossinfo' command. · d75093f8
      Mike Hibler authored
      When invoked by a subboss, returns key=value pairs from subboss_attributes
      for all services for that subboss. Will be used to configure subbosses,
      eliminating the need to customize startup scripts per-subboss.
      d75093f8
  3. 17 Jan, 2017 1 commit
    • Mike Hibler's avatar
      Implement heartbeat/status reports in Frisbee. · 2be46ba4
      Mike Hibler authored
      There are three pieces here, a change to the frisbee protocol itself, an
      Emulab event component to get status back to the portal, and the surrounding
      infrastructure to make it all work.
      
      Frisbee heartbeat messages:
      
      Added a new message type to the frisbee protocol, "Progress". In theory it
      operates by having the server send a multicast progress request to its clients
      which includes an interval at which to report (or "just once") and an
      indication of what to report (nothing, progress summary, or full stats). The
      client then sends unicast "fire and forget" UDP replies according to that
      schedule. However, I took a shortcut for the moment and just added a command
      line option to the client to tell it to report a summary at the indicated
      interval (-H <interval>).  So the server never sends requests.
      
      This is implemented in the client by a fourth thread since I wanted it to
      operate independent of packet reception (which would cause clients to report
      in a highly synchronized fashion due to multicast). The server instance just
      logs progress reports into its log.
      
      This protocol addition should be fully backward compatible as both client and
      server ignore (but log) unknown messages.
      
      Emulab progress report events:
      
      When this is compiled in (-DEMULAB_EVENTS) and turned on (-E <server>), the
      frisbee server instances will send a FRISBEEPROGRESS event to the indicated
      event server for every progress report it receives (in addition to logging the
      events to its own log). Right now it will create an event with key/value pairs
      for the information in a client summary reply:
      
      TSTAMP is the client's time at which it sends the event. Could be used by the
      received to determine latency of the report if it cared (and if it assumed
      that the clocks are in sync). We don't care about this.
      
      SEQUENCE is the report number. Again, could be used by the receiver, in this
      case to detect loss, if it cared. We don't.
      
      CHUNKS_RECV is complete chunks that the client has received from the network.
      CHUNKS_DECOMP is chunks decompressed by the client.  BYTES_WRITTEN is bytes
      written to disk by the client.
      
      Any of the three can be used by the event receiver as an indication of life
      and/or progress. However, only the last would be a reasonable indicator of
      time remaining since it is the last (and slowest) phase of imaging. To
      estimate time remaining we could compare that value to the amount of
      uncompressed data that is in the image. This makes the sketchy assumptions
      that time for writes to the disk are uniform and that the number and distance
      of seeks is uniform, but it is better than a sharp stick in the eye.
      
      Emulab infrastructure:
      
      There is a new sitevar "images/frisbee/heartbeat" which can be set to a
      non-zero value to tell the frisbee MFS to fire off frisbee with -H <value>
      and thus make reports. The default value of zero means to not make reports.
      The tmcd "loadinfo" command sends this through via the HEARTBEAT=<value>
      param.
      
      REQUIRED A TMCD VERSION BUMP TO 41.
      2be46ba4
  4. 17 Nov, 2016 1 commit
  5. 21 Oct, 2016 1 commit
    • Mike Hibler's avatar
      Fix assorted lint. · 4d94c464
      Mike Hibler authored
      Primarily I was after what was causing the occasional segfault.
      That problem was caused by calling tmcc on a node that was free.
      Seems we were derefing some NULL columns returned by mysql because
      we assumed that there would always be a row in experiments for the
      node in question.
      
      Since I do need to call tmcd from the "pxewait" initramfs on Moonshot
      ARM nodes, I cleaned up this assumption.
      4d94c464
  6. 18 Oct, 2016 1 commit
  7. 04 Oct, 2016 1 commit
  8. 19 Sep, 2016 1 commit
  9. 12 Sep, 2016 1 commit
    • Mike Hibler's avatar
      Modify NOVIRTNFSMOUNTS to allow mounts on vnodes with routable IPs. · 470a81e5
      Mike Hibler authored
      This is different than the traditional behavior of this defs- variable.
      Previously it caused tmcd to not expose any NFS mounts to shared-host vnodes.
      We relax that now to allow exposing such mounts to vnodes with routable IP
      addresses.
      
      The rationale for this change is simply that the original option was only
      intended to prevent exporting mounts to hosts that could not reach the FS
      node anyway due to their unroutable cnet IPs.
      470a81e5
  10. 04 Sep, 2016 1 commit
  11. 29 Aug, 2016 1 commit
    • Leigh B Stoller's avatar
      Fix for bug Kirk reported; we were returning two sets of accounts to · 9f49cc7e
      Leigh B Stoller authored
      geni slice nodes when the project was a local project. In this case, we
      want to return the project accounts and ignore the ssh keys sent in the
      geni API call (a future change might involve a merge of accounts, but
      not unless someone actually needs it). And for a nonlocal project we of
      course still want to return the geni API ssh keys, but not return the
      project member accounts, since they are just stub accounts and don't
      actually have any ssh keys associated with them. They just cause
      confusion.
      9f49cc7e
  12. 10 Jun, 2016 2 commits
    • Mike Hibler's avatar
      Allow doloadinfo() to return more than the stock 2K of info. · fa686a25
      Mike Hibler authored
      At least for TCP based calls. We will need this for long-ish delta chains.
      I didn't think this warranted a version number bump even though it is
      possible that an old MFS that makes a UDP-based call will only wind up
      getting the first line (image). The reasoning here is that MFSes that old
      could only handle one line anyway in rc.frisbee!
      fa686a25
    • Leigh B Stoller's avatar
      NFS mount changes, still a work in progress, bound to change: · e369c1a8
      Leigh B Stoller authored
      * The Emulab portal now adds a toplevel element (Emulab namespace)
        directing the CM to use standard emulab mounts (read: /users).
        We clear that element from the other portals.
      
      * The CM looks for that tag, and allows it only if the caller is the local
        SA. The default for nfsmounts setting for geni experiment containers is
        "genidefault", but that is set to "emulabdefault" when allowed.
      
      * tmcd changes; no using nfsmounts slot instead of nonfsmounts. "none"
        means no mounts (duh), "emulabdefault" means standard mounts we all know
        and love, "genidefault" means no /users mounts.
      
        In addition, when we are doing emulabdefault mounts on a geni experiment
        node, we do not return accounts that are specified in the rspec, but
        rather we return the local project accounts only.
      e369c1a8
  13. 25 Apr, 2016 1 commit
  14. 19 Apr, 2016 1 commit
  15. 14 Apr, 2016 1 commit
  16. 07 Apr, 2016 2 commits
  17. 01 Apr, 2016 2 commits
  18. 28 Mar, 2016 1 commit
  19. 05 Feb, 2016 1 commit
  20. 31 Jan, 2016 1 commit
    • Mike Hibler's avatar
      Tweaks to TRIM reporting code. · f927aba6
      Mike Hibler authored
      Make the interval between TRIM operations a per nodetype (or per node)
      attribute instead of a global site variable. The sitevar will still be
      used to turn TRIM on or off globally.
      f927aba6
  21. 21 Jan, 2016 1 commit
  22. 23 Dec, 2015 1 commit
  23. 10 Nov, 2015 1 commit
  24. 07 Oct, 2015 2 commits
  25. 02 Sep, 2015 1 commit
  26. 01 Sep, 2015 2 commits
  27. 11 Jul, 2015 1 commit
  28. 25 Jun, 2015 1 commit
    • Leigh B Stoller's avatar
      Add new options to CreateSliver/Provision; supply an x509 certificate and · 8be26639
      Leigh B Stoller authored
      private key.
      
      The goal is to distribute an experiment wide certificate and private
      key. At the moment this is just a self signed x509 certificate and the
      accompanying rsa key. In PEM format. The same cert/key will be distributed
      across multiple aggregates.
      
      An openssh key pair can be trivially derived from the private key. Or the
      public part can be derived from the certificate. A quick google will show
      show.
      
      Initially, you will need to run tmcc directly to get them, using the
      geni_certificate and geni_key commands.
      8be26639
  29. 06 Apr, 2015 2 commits
  30. 31 Mar, 2015 1 commit
    • Mike Hibler's avatar
      Add sitevar to determine whether clients should use UDP or TCP for NFS. · f1ae820e
      Mike Hibler authored
      Yes, out of the blue and off the wall. But I got tired of trying to
      guess what we had Linux and FreeBSD use. I was surprised to discover
      that we were using UDP on Linux (which caused Clemson CloudLab to fail
      because they have jumbo frames enabled on their control net switches
      but ops had the MTU set to 1500).
      
      Anyway, here it is. The default setting is UDP for backward compat.
      We should probably set it to TCP nowadays. There is also an 'osdefault'
      setting which says use the default setting on the client OS.
      f1ae820e
  31. 06 Mar, 2015 1 commit
  32. 05 Mar, 2015 1 commit
  33. 03 Feb, 2015 1 commit