1. 30 Apr, 2003 2 commits
  2. 29 Apr, 2003 5 commits
    • Chad Barb's avatar
      · 40bc8389
      Chad Barb authored
      (hopefully) fixed bug in startexp
      affecting batch system by switching two cleanup checks.
      (I wasn't giving it an opportunity to bail on a batchexp failing
       soon enough.)
      Also, tidied up return code for tbswap.
    • Robert Ricci's avatar
    • Chad Barb's avatar
      · 59d5e48f
      Chad Barb authored
      Don't reboot nodes when recovering from a failed assign during update.
    • Chad Barb's avatar
      · a3a818af
      Chad Barb authored
      Mask off '64' bit from return from assign_wrapper, so
      batch_exp doesn't choke on it.
    • Chad Barb's avatar
      Robust Experiment Modify -and- · 7308f458
      Chad Barb authored
      Various Other changes to get Expt Modify ready for prime time.
       - If assign fails on a modify, experiment will
         be restored to old state, *not* swapped out.
       - Reboot option has been improved to reboot all
         nodes as part of os_setup, not in separate
       - Different assign error codes result in different
         retry behavior for assign_wrapper
         (Follow's Rob's change to assign to make it
          pass back special code for non-retriable faults)
       - '64' bit in assign_wrapper exit code indicates to tbswap
         that db/phys state hadn't been mucked with before
         the exit occurred
         (ergo, '65' and '1' are the common return codes,
          though the old 4,8,16,32 are still there for assign failing.)
       - (tbswap still returns codes from assign wrapper)
       - Added 5 sec pause between assign attempts.
       - Cleaned up tbswap code.
       - Physical state backup/restore removed from tbprerun,
         put into swapexp.
       - Interfaces table now getting cleaned up correctly
         (Mike noticed problem)
       - Changed menu display in showexp to show
         the "modify" menu option for swapped out experiments
         (like it used to.)
       - A couple other changes.
       Still admin-only, but I plan to change that soon.
      To do:
       - Erase expt backups in /tmp after using them.
       - Re-viz failed experiments.
  3. 28 Apr, 2003 3 commits
    • Leigh B. Stoller's avatar
      Change 2002 to 2003 on any file that was modified in 2003, but di not · 9e26c63a
      Leigh B. Stoller authored
      get an updated copyrights message.
    • Leigh B. Stoller's avatar
      Add several minor experiment_stats fields; swap_errors (a count), · 27558935
      Leigh B. Stoller authored
      swap_exitcode (last error), idle_swaps (a count), batch (a flag to
      indicate a batch experiment).
      Add a operational log. Okay, its not actually a log, but a table that
      will grow forever until it consumes the earth. Its a small table
      though, so it will take a few years. Its cross indexed with the
      experiment_stats table, so by massaging this table along with the
      stats table, we can get a good picture of what was running on the
      testbed when, and how many resources it was using. Sorry, not a log
      file, but we can easily generate a log file from tbe table if the Boss
      really wants one. The table entry averages 28 bytes.
      Move stats to their own main menu item (admin mode only). Remove from
      the showexp_list page since that was bogus.
    • Leigh B. Stoller's avatar
      Add support for new {user,group,project,experiment}_stats tables. · 5e5508bf
      Leigh B. Stoller authored
      The first three are aggregate tables, while the experiment stats table
      gets a record for each new experiment, and is updated when an
      experiment is swapped in/out/modify or terminated. Look at the table
      to see what is tracked. Once the experiment_stats record is updated,
      the aggregate tables are updated as necessary. There are a bunch of
      ugly changes to assign_wrapper to get the stats. Note that pnodes is
      not incremented until an experiment sucessfully swaps in. This is in
      leu of getting status codes; I'm not tracking failed operations yet,
      nor creating the log file that Jay wants. I'll do that in the next
      round of changes when we see how useful these numbers are.
      Most of the changes are to create/delete table entries where
      appropriate, and to display the records. Display is only under admin
      mode, and the display is raw; just a dump of the assoc tables in php.
      The last 100 experiment stats records are available via the Experiment
      List page, using the "Stats" show option at the top. Bad place, but
      will do for now.
  4. 23 Apr, 2003 2 commits
  5. 21 Apr, 2003 2 commits
  6. 18 Apr, 2003 2 commits
  7. 17 Apr, 2003 2 commits
  8. 16 Apr, 2003 4 commits
  9. 14 Apr, 2003 4 commits
  10. 11 Apr, 2003 1 commit
  11. 08 Apr, 2003 2 commits
  12. 07 Apr, 2003 2 commits
    • Leigh B. Stoller's avatar
    • Chad Barb's avatar
      · 99a867cb
      Chad Barb authored
      Modify os_setup return codes to enable "intelligent" retry;
      Now os_setup returns:
        0 on success
        1 on one or more retry-friendly errors
       -1 on no-retry errors
      tbswap.in checks os_setup's return code,
      and will only retry on 1.
  13. 04 Apr, 2003 2 commits
  14. 03 Apr, 2003 4 commits
    • Robert Ricci's avatar
      Handle counters that wrap around. · d23235d0
      Robert Ricci authored
    • Chad Barb's avatar
      · 765de560
      Chad Barb authored
      Added new feature 'Experiment Modify'.
      Now available (to admins only for now) from the showexp page.
      Warning! doing a modify which alters the topology will probably
      require a "reboot all nodes" afterwards.
      (There will be a checkbox soon in the modify experiment page.)
      Adding/removing delay nodes seems to work fine without reboots, though.
      Warning! If the new version of the experiment cannot be mapped
       (not enough nodes available, for instance) the experiment will be
       swapped out! This will get fixed later.
      Prerun backs up the experiment topology, so using a bad NS
      file doesn't result in experiment termination.
      As part of this, added library functions to libdb to
      delete, backup, and restore both virtual and physical experiment state.
    • Mac Newbold's avatar
    • Mac Newbold's avatar
      Remove webidlecheck, since it no longer gets used. Removal of idlecheck · 4b26a132
      Mac Newbold authored
      forthcoming, along with addition of idletimes, idlemail, and webidlemail.
  15. 02 Apr, 2003 1 commit
  16. 27 Mar, 2003 2 commits