1. 06 Dec, 2005 18 commits
    • Mike Hibler's avatar
      Phase II in disk state saving for swapout. · ed0d25b4
      Mike Hibler authored
      Exec summary: after this checkin, the infrastructure exists (once enabled)
      to create swapout-time "delta" images for all machines in experiments.
      There is only a single, cumulative swap image per node (i.e., all diffs
      are from the base image, not from the previous swap).
      
      What doesn't yet exist, is the mechanism for reloading the delta at
      swapin time.  That is Phase III.
      
      The nitty-gritty:
      
      1. Keep disk image signature files for all nodes in an experiment.
      
         New fields in the DB to track, for each disk partition, what image the
         partition was loaded from.  This enables us at swapin or os_load time to
         create signature files in /proj/<pid>/exp/<eid>/swapinfo for the current
         contents of a node disk/partition.  All nodes with the same image loaded
         will share (via symlink) the same signature file.  TODO: no longer
         referenced signature files should be removed.
      
         Signature info is only collected in the swapinfo directory if the
         experiment is set to have disk state saving enabled (see #5 below).
         Info consists of the <vname>.sig file, which is the file created
         by imagehash, and <vname>.part which says what the root disk is
         for the node and whether to look at the whole disk or just a single
         partition when crafting the delta image.
      
      2. Swapout-time hook for creating swapout image.
      
         If the experiment is marked as allowing disk state saving, tbswap
         will arrange to run and then monitor the create-swapimage command
         on each node.  This script will run the modified version of imagezip
         which uses the signature file to create a delta image.
      
         The command to run and maximum timeout are specified via sitevars
         (previously checked in).  Note that the tbswap script currently has
         special knowledge of /usr/local/bin/create-swapimage as a swapout
         time script.  If the swap/swapout_command sitevar is set to that,
         Magic Stuff shall occur (i.e. it will monitor the command and make
         periodic reports of progress).  The sitevars are a total hack and
         will disappear at some point.
      
      3. Client-side script for creating swapout image.
      
         os/create-swapimage, very similar to create-image.  Uses the info
         stashed in /proj/..blahblah../swapinfo to create a delta image.
      
         XXX fer now hack: the script first looks in /proj/<pid>/bin for an
         imagezip binary to use.  Failing that, it uses the one in the MFS.
         This allows for easier development of the imagezip changes (i.e.,
         don't have to update the MFS every time.
      
      4. Auto creation of signature files for new images.
      
         The create_image script (the one that runs on boss when creating images
         for users) has been modified to automatically create a signature via
         imagehash.  The .sig file winds up in /usr/testbed/images/sigs or
         in /proj/<pid>/images/sigs.  From there it will be copied at swapin/os_load
         time to the per-expt swapinfo directory for any node that uses the images.
      
         The process for creating standard system images (aka, "Mike") has not
         yet been modified.  When the image creation/installation procedure
         is formalized into a script, this will be done.
      
      5. Web changes to set/clear saving of disk state at swapout time.
      
         Add a checkbox to the experiment create page to allow setting "save
         swap state".  Also added to the experiment modify page, but currently
         "if (0)"ed out as it will need some additional support.  The showstuff
         page will show it.
      
         Taking a page from Leigh's hack book, if EXPOSESTATESAVE in defs.php3
         is set to zero (as it is now), then the checkbox doesn't appear in the
         create experiment page except for STUDLY users.
      ed0d25b4
    • Timothy Stack's avatar
      Pass the control net IP address to the proxy on ops since names with · 8c89fa8f
      Timothy Stack authored
      underscores don't resolve.  Also, fix a couple minor bugs in the
      filehandle resolver and add some stats for lookups.
      8c89fa8f
    • Timothy Stack's avatar
      Decrease the minimum time between updates. · a62cce01
      Timothy Stack authored
      a62cce01
    • Kevin Atkinson's avatar
      · e3d00a07
      Kevin Atkinson authored
      Improved sync test:  Will now deadlead if the asynchronous (-a) doesn't work.
      e3d00a07
    • Mike Hibler's avatar
      88d3cd60
    • Mike Hibler's avatar
      Wee little hack: · abfbb5d7
      Mike Hibler authored
       Common mistake: forget the -i before the imagename, e.g.,
       "os_load FBSD54-STD pcNN", which results in pcNN getting loaded
       with the default image.  So if the first arg fails as a node, but
       is an image ID, assume they have made this mistake and stop.
      abfbb5d7
    • Leigh Stoller's avatar
      bbd1834a
    • Timothy Stack's avatar
      c91ded41
    • Leigh Stoller's avatar
      Temporary change to linktest while we continue to debug; Always run · 428c5121
      Leigh Stoller authored
      linktest at level 3 if a mere user. Studly users still have control
      though. Note that errors are no longer mailed to user by linktest_control.
      
      Also moved duplicated code to get dbuid (and email address) to top of
      file.
      428c5121
    • Mike Hibler's avatar
      Accidentally left in some future changes... · 7c6a150e
      Mike Hibler authored
      7c6a150e
    • Mike Hibler's avatar
      bbc76645
    • Mike Hibler's avatar
      efaf93de
    • Mike Hibler's avatar
      Fix a cut/paste error in a print statement. · 5ba66394
      Mike Hibler authored
      5ba66394
    • Mike Hibler's avatar
      60e46ec4
    • Mike Hibler's avatar
      Minor tweak to free block handling: wait til we have accumulated and merged · dc7b2389
      Mike Hibler authored
      all free blocks before throwing out ones that are too small (-F).  There
      were a fair number of cases where a small free chunk was adjacent to a larger
      one and we were tossing out the smaller.  This does increase the risk that
      we will run out of memory building the free list.  If that happens, we can
      make an incremental cleanup pass.
      dc7b2389
    • Leigh Stoller's avatar
      Temporary change while debugging linktest; only notify testbed ops · 9cdd0345
      Leigh Stoller authored
      when linktest fails on swapin path. Also, add a file link to the
      swaplog file, which still contains the linktest output.
      9cdd0345
    • Leigh Stoller's avatar
      Oops, remove experimental archiving code. · d923a8b2
      Leigh Stoller authored
      d923a8b2
    • Leigh Stoller's avatar
  2. 05 Dec, 2005 7 commits
  3. 04 Dec, 2005 1 commit
  4. 03 Dec, 2005 1 commit
  5. 02 Dec, 2005 7 commits
  6. 01 Dec, 2005 6 commits