- 22 Dec, 2004 3 commits
-
-
Leigh B. Stoller authored
-
Leigh B. Stoller authored
Mac's statewait library to wait for nodes to through a MFSETUP,ISUP transition. This ensures that the nodes actually came up in the MFS, and not something else. No retry; just exit with failure if any node fails.
-
Leigh B. Stoller authored
Usage: node_admin [-h] [-n | -w] <on | off> [node ....] node_admin [-h] [-n | -w] -p pid,eid <on | off> -h This message -n Do not reboot node -w Wait for node to come back up if rebooted -e Operate on all nodes in an experiment
-
- 03 Aug, 2004 1 commit
-
-
Leigh B. Stoller authored
Add explicit -w option to wait for node to reboot.
-
- 12 Jan, 2004 1 commit
-
-
Leigh B. Stoller authored
filename to boot, and all local nodes will boot the same pxeboot kernel, which has been extended to allow for jumping directly into a specific MFS (in addition to the usual testbed boot into a partition or multiboot kernel). Bootinfo and the bootwhat protocol extended to tell the client node what MFS to jump into directly, without a reboot. pxe_boot_path and next_pxe_boot_path are now deprecated, with bootinfo used to control which MFS to boot. Nodes now boot a single pxeboot kernel, and bootinfo tells them what to do next. Bootinfo greatly simplifed. temp_boot_osid has been added to allow for temporary booting of different kernels (such as with ndoe_admin or create_image). Unlike next_boot_osid which is a one-shot boot, temp_boot_osid causes the node to boot that OS until told not too. next_boot_path and def_boot_path in the nodes table are now ignored. Bootinfo gets path info strictly from the os_info table entry for the osid given in one of def_boot_osid, temp_boot_osid, or next_boot_osid. This makes the selection of what to do in bootinfo a lot simpler (and for TBBootWhat in libdb). The os_info table also modified to include an MFS flag so that bootinfo knows to tell the client that the path refers to an MFS and not a multiboot kernel. Change to boot sequence; free nodes no longer boot into the default OSID. Instead, they are told to wait in pxeboot until told what to do, which will typically be when the node is allocated and a specific OSID picked. If the node needs to be reloaded, then the node is told to jump directly into the Frisbee MFS, which saves one complete reboot cycle whether the node has the requested OS installed, or not. New program added called "bootinfosend" that is used by node_reboot to "wake up" up nodes sitting in pxewait mode, so that they query bootinfo again and boot. node_reboot changed to look at the event state of a node, and use bootinfosend to wake up nodes, rather then power cycle, since pxeboot does not repsond to pings. Retry (if the UDP packet is lost) is handled by stated. Event support added to bootinfo, to replace the event generation that was in proxydhcp. I have not included the caching that Mac had in proxydhcp since it does not appear that bootinfo packets are lost very often. Cleaned up all of the event and DB queury code to use lib/libtb for DB access, and moved all of the event code into a separate file. The event sequence when a node boots now looks like this: 'SHUTDOWN' --> 'PXEBOOTING' (BootInfo) 'PXEBOOTING', --> 'PXEBOOTING' (BootInfo Retry) 'PXEBOOTING', --> 'BOOTING' (Node Not Free) 'PXEBOOTING', --> 'PXEWAIT' (Node is Free) 'PXEWAIT', --> 'PXEWAKEUP' (Node Allocated) 'PXEWAKEUP', --> 'PXEWAKEUP' (Bootinfo Retry) 'PXEWAKEUP', --> 'PXEBOOTING' (Node Woke Up) Change stated to support resending PXEWAKEUP events when node times out. After 3 tries, node is power cycled. Other minor cleanup in stated. Clean up and simplify os_select, while adding support for temp_next_boot and removing all trace of def_boot_path and next_boot_path processing. Remove all pxe_boot_path and next_pxe_boot_path processing. Changed command line interface to support "clearing" fields. For example, node_admin changed to call os_select like this to have the node temporarily boot the FreeBSD MFS: os_select -t FREEBSD-MFS pcXXX which sets temp_boot_osid. To turn admin mode off: os_select -c -t pcXXX which says to clear temp_boot_osid. sql/database-fill-supplemental.sql modifed to add os_info table entries for the FreeBSD, Frisbee, and newnode MFS's. Be sure to change dhcpd config, restart dhcp, kill proxydhcp, restart bootinfo,
-
- 18 Oct, 2002 1 commit
-
-
Mac Newbold authored
Changes to watch out for: - db calls that change boot info in nodes table are now calls to os_select - whenever you want to change a node's pxe boot info, or def or next boot osids or paths, use os_select. - when you need to wait for a node to reach some point in the boot process (like ISUP), check the state in the database using the lib calls - Proxydhcp now sends a BOOTING state for each node that it talks to. - OSs that don't send ISUP will have one generated for them by stated either when they ping (if they support ping) or immediately after they get to BOOTING. - States now have timeouts. Actions aren't currently carried out, but they will be soon. If you notice problems here, let me know... we're still tuning it. (Before all timeouts were set to "none" in the db) One temporary change: - While I make our new free node manager daemon (freed), all nodes are forced into reloading when they're nfreed and the calls to reset the os are disabled (that will move into freed).
-
- 07 Jul, 2002 1 commit
-
-
Leigh B. Stoller authored
-
- 23 May, 2002 2 commits
-
-
Robert Ricci authored
request.
-
Leigh B. Stoller authored
-
- 15 Oct, 2001 1 commit
-
-
Leigh B. Stoller authored
testbed boot process and the freebsd pxeboot/MFS kernel. I'm hoping this will be useful enough for users to load their own disk images, but we shall see. Usage is simple: /usr/testbed/bin/node_admin on|off pcXXX The "on" state boots the MFS, and the "off" states sets it back to normal.
-