Commit 2a142d24 authored by Kirk Webb

Added my last big plab commit to the ChangeLog.

parent 98d2488c
kwebb 2003/10/23 15:37:17 MDT
Modified files:
Well, here it is: The checkin implementing robust recovery/retry and
asynchronous safe termination in plab allocation/deallocation/setup.
Here are some of the more prominent changes/additions:
* Bounded plab agent communication
Scripts should never hang waiting for plab xmlrpc commands to complete;
they have their own internal timeouts. Node.create() in libplab is an
exception, but is always run under a timeout constraint in vnode_setup
and can be changed easily if the need arises.
* Wrote functions in libplab to handle retry, recovery, and timeouts for
remote command execution.
* Wrapped critical sections with a signal watcher.
* Added code to handle various error conditions properly.
* Added a libtestbed function, TBForkCmd, which runs a given program in
a child process, and can optionally catch incoming SIGTERMs and terminate
the child (then exit itself).
* Fixed up vnode_setup to batch the 'plabnode free' operation, along with
a few other cleanups. This should alleviate Jay's concern about how
long it used to take to tear down a plab experiment.
* Whacked plabmonitord into better shape: fixed a couple of bugs, taught it
how to daemonize, and implemented a priority list for testing broken plab
nodes. This list causes new (as yet unseen) nodes to be tried before ones
that have already been tested.
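
For readers unfamiliar with the pattern, the bounded-communication and retry items above can be illustrated with a minimal sketch. This is not the libplab code: the function name run_with_retry and its parameters are hypothetical, and it uses Python's standard subprocess module instead of the plab xmlrpc interface. It only shows the general idea of giving each attempt a hard timeout and retrying a fixed number of times so that the caller can never hang.

```python
import subprocess
import sys
import time


def run_with_retry(argv, timeout_s, max_tries, delay_s=0.0):
    """Run argv up to max_tries times, bounding each attempt by timeout_s.

    Hypothetical helper for illustration only (not a libplab function).
    Returns a tuple (succeeded, attempts_made).
    """
    for attempt in range(1, max_tries + 1):
        try:
            result = subprocess.run(argv, timeout=timeout_s)
            if result.returncode == 0:
                return True, attempt
        except subprocess.TimeoutExpired:
            # subprocess.run kills and reaps the child before raising,
            # so a timed-out attempt leaves no stray process behind.
            pass
        if attempt < max_tries:
            # Optional pause before the next attempt.
            time.sleep(delay_s)
    return False, max_tries


if __name__ == "__main__":
    # A command that sleeps longer than the timeout is abandoned and retried.
    slow = [sys.executable, "-c", "import time; time.sleep(30)"]
    print(run_with_retry(slow, timeout_s=0.5, max_tries=2))
```

Because the worst case is bounded by roughly max_tries * (timeout_s + delay_s), a caller can budget its own overall deadline, which is the property the first item describes.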
Revision Changes Path
2.15 +50 -1 testbed/tbsetup/
1.34 +17 -25 testbed/tbsetup/
1.24 +611 -198 testbed/tbsetup/plab/
1.4 +34 -2 testbed/tbsetup/plab/
1.3 +85 -17 testbed/tbsetup/plab/
1.8 +13 -28 testbed/tbsetup/plab/
stoller 2003/10/16 12:06:44 MDT
Modified files: