1. 24 Jul, 2001 1 commit
    • Leigh B. Stoller authored · 4303838f
      Add new console_reset script to bulk reset all of the tip lines on
      plastic. Useful for when plastic reboots, although it needs to be
      run from paper of course. We should probably set up some kind of
      reboot utility for when plastic reboots, since when capture fires up,
      the "acl" files are all in the wrong group/mode. On the other hand,
      this shared file based permission scheme is going to end up changing
      anyway soon, since it won't work when the serial lines are on other
      machines. Will need a tip permserver at that point.
  2. 10 Jul, 2001 1 commit
    • Mac Newbold authored · e1f4263c
      Added sched_reserve. Works kind of like sched_reload, but for a reservation.
      Rob and I noticed that when we need to do maintenance on a node, it's hard
      to grab it when it gets freed before it gets taken again. So this will allow
      admins to set up nodes so that when the current owner frees them, they
      are immediately moved into another experiment. For instance, if I need to do
      an upgrade on all the machines in the testbed, I can do:
      
      sched_reserve testbed down pc1 pc2 pc3 [...] pc40
      
      and it will immediately reserve any free nodes, skip any that are already
      in testbed/down, and add an entry to the next_reserve table for any nodes
      that are already reserved. Then nfree checks for entries in the next_reserve
      table before it lets any nodes go (much like it does for the reloads table),
      and moves them into the expt before they can get reserved again. So we can
      guarantee that the next time a node is freed, it will be saved for us.
      
      Note that this can also be useful to accumulate nodes for a large experiment,
      since you can make sure you get them as soon as the current owner is done.
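The sched_reserve behaviour described in this commit message can be sketched roughly as follows. This is an illustrative in-memory model only, not the testbed's actual code: the dictionaries standing in for the reserved and next_reserve tables, and the Python function signatures, are assumptions made for the example.

```python
# Illustrative model only: plain dicts stand in for the database tables.
# reserved:     node -> (pid, eid) for nodes that are currently allocated
# next_reserve: node -> (pid, eid) for nodes claimed for when they are freed

def sched_reserve(pid, eid, nodes, reserved, next_reserve):
    """Reserve free nodes now; queue already-reserved nodes for later."""
    target = (pid, eid)
    for node in nodes:
        owner = reserved.get(node)
        if owner is None:
            reserved[node] = target        # free node: take it immediately
        elif owner == target:
            continue                       # already in the target experiment
        else:
            next_reserve[node] = target    # busy: claim it for when it is freed


def nfree(node, reserved, next_reserve):
    """Free a node, honouring any pending next_reserve entry first."""
    pending = next_reserve.pop(node, None)
    if pending is not None:
        reserved[node] = pending           # move straight into the waiting expt
    else:
        reserved.pop(node, None)           # no claim: the node becomes free
```

In this model a node with a next_reserve entry never passes through the free state, which is the guarantee the commit message describes.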
  3. 20 Jun, 2001 1 commit
  4. 24 May, 2001 1 commit
  5. 17 May, 2001 1 commit
  6. 10 May, 2001 2 commits
  7. 12 Apr, 2001 1 commit
  8. 11 Apr, 2001 2 commits
  9. 30 Mar, 2001 1 commit
    • Leigh B. Stoller authored · 74d21844
      Reloading daemon. Looks for free nodes that have not been reloaded
      since the last reservation (as determined by last_reservation table).
      Picks one (randomly) from that set of nodes, and calls sched_reload on
      it. Then waits until the node has finished reloading, as determined by
      the reserved table, which gets cleared by the tmcd when the node first
      reboots after a scheduled reload. Sleeps 30 seconds, and then goes
      around again. So at most one node is tied up in a reload at a time,
      which seems like a good balance between trying to keep the machines in
      a pristine state, and having nodes available for use.
      
      The advantage of this approach is that instead of calling sched_reload
      on 40 nodes (after generating a new image) and watching the network
      melt down, we can let the nodes reload at a slower pace. We could call
      sched_reload on allocated nodes so that they will load when freed, but
      we run into the problem of big experiments ending and causing meltdown.
      
      The downside is that this approach is a little too aggressive. Nodes
      will end up reloading after just a single experiment. Need finer-grained
      control over when to reload, but I will leave that as an exercise for
      later.
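One pass of the reloading daemon described above might look like the sketch below. It is a hedged illustration, not the daemon's real implementation: the callables passed in (free_nodes, needs_reload, sched_reload, is_reloading) are hypothetical stand-ins for the database queries and the sched_reload script, and the 5-second poll interval is an assumption. Only the 30-second pause between passes comes from the commit message.

```python
import random
import time


def reload_one(free_nodes, needs_reload, sched_reload, is_reloading,
               sleep=time.sleep, poll_interval=5):
    """Pick one stale free node at random, reload it, and wait until done.

    All four callables are hypothetical hooks supplied by the caller.
    Returns the node that was reloaded, or None if nothing needed it.
    """
    candidates = [n for n in free_nodes() if needs_reload(n)]
    if not candidates:
        return None
    node = random.choice(candidates)
    sched_reload(node)
    # Block here so that at most one node is tied up in a reload at a time.
    while is_reloading(node):
        sleep(poll_interval)
    return node


def reload_daemon(free_nodes, needs_reload, sched_reload, is_reloading,
                  pause=30):
    """Run forever: one reload pass, then the 30 second pause."""
    while True:
        reload_one(free_nodes, needs_reload, sched_reload, is_reloading)
        time.sleep(pause)
```

Waiting for each reload to finish before choosing the next node is what keeps the load on the network bounded, at the cost of reloading slowly after a new image is generated.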
  10. 29 Mar, 2001 1 commit
    • Leigh B. Stoller authored · 407cd1ac
      A set of changes for exporting project and user directories only to
      machines in the experiment. exports_setup runs on paper and queries
      the DB, generating a set of export lines. That is sent over to
      plastic, and piped into exports_setup.proxy, which generates a new
      /etc/exports file and restarts mountd. All of the client setup scripts
      have been changed to mount the project and user directories.
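The export-line generation step could be sketched as below. Everything specific here is an assumption for illustration: the /proj/<pid> and /users/<uid> paths, the shape of the input records, and the simplified "directory host host ..." line layout. The commit message does not show the real query or the exports options that exports_setup emits.

```python
def export_lines(experiments):
    """Build one export line per directory, listing only permitted hosts.

    experiments: iterable of dicts with hypothetical keys
      'pid'   -> project name
      'users' -> list of user names in the experiment
      'nodes' -> list of node hostnames allocated to the experiment
    """
    hosts_by_dir = {}
    for expt in experiments:
        dirs = ["/proj/" + expt["pid"]]
        dirs += ["/users/" + user for user in expt["users"]]
        for directory in dirs:
            hosts_by_dir.setdefault(directory, set()).update(expt["nodes"])
    # Sort directories and hosts so the generated file is stable between runs.
    return [
        f"{directory} {' '.join(sorted(hosts))}"
        for directory, hosts in sorted(hosts_by_dir.items())
        if hosts
    ]
```

Grouping by directory rather than by experiment means that a project directory shared by several experiments appears once, with the union of their nodes.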
  11. 23 Mar, 2001 1 commit
  12. 19 Mar, 2001 1 commit
  13. 15 Mar, 2001 1 commit
  14. 21 Feb, 2001 1 commit
  15. 27 Jan, 2001 1 commit
  16. 08 Jan, 2001 2 commits
  17. 07 Jan, 2001 1 commit
  18. 04 Jan, 2001 5 commits
  19. 03 Jan, 2001 1 commit
  20. 02 Jan, 2001 1 commit
  21. 28 Dec, 2000 2 commits
  22. 27 Dec, 2000 1 commit
  23. 21 Dec, 2000 3 commits
  24. 14 Dec, 2000 2 commits
  25. 11 Dec, 2000 2 commits
  26. 07 Dec, 2000 1 commit
  27. 06 Dec, 2000 2 commits