lbto software startup/shutdown and troubleshooting july 18, 2006 chris biddick 1 cjb
TRANSCRIPT
Start TCS networks
netconfig start (on all servers, but not jet)• networkserver subsystem manager• rpcserver RPC manager• gshmserver reflective memory manager
netconfig start passive (TO station only)
prevents subsystems from running here
3 cjb
Start MCSPU on jet
User telescope rpcconfig start 10.144.0.20 [passive] ./gotemp (used)
Will start mcstemp (most recent control program) and the engineering interface program mcsDisplay.
./go (not used)Will start mcsworking (stable control program) and the engineering interface program
4 cjb
Start subsystems
netconfig start [left | right] SUBStarts subsystem SUB on lightest load server
netconfig start [left | right] SUB on addressStarts subsystem SUB on server ‘address’ – name or IP address
netconfig start [left | right] SUB on localStarts subsystem SUB attached to local terminal
Alternate side notation:• left SUB SUBL
• right SUB SUBR
5 cjb
Start subsystems
When subsystems are started normally• stdout is directed to /dev/null
• stderr is available (but not much use)
When started ‘on local’• both stdout and stderr are available and
attached to controlling terminal
6 cjb
Current subsystems
LSS Logging control ECS Enclosure control CSQ IIF receiver PMC left Primary mirror control PSF left Active optics control MCS Mount control PCS Pointing control SDB Star database
7 cjb
Start GUIs
GUIs are normal executables installed in a system path
ECSGUI [READ_WRITE] ECSGUI [READ_WRITE] & For sided subsystems add ‘left’ or ‘right’
as argument: PSFGUI left
8 cjb
Current GUIs ECSGUI [READ_WRITE] PMCGUI {left | right} PSFGUI {left | right} MCSGUI (old) mcsgui (OpenGL) PCSGUI (old) pcsgui (OpenGL) presetgui (OpenGL) WXGUI
9 cjb
TO station start buttons ECSGUI TEL, ECS, ECSGUI
READ_WRITE MATGUI CSQ, MATGUI MCSGUI MCS, mcsgui (OpenGL) PCSGUI PCS, pcsgui (OpenGL) PMCGUI TEL, LSS, left PMC, PMCGUI left PRESETGUI PCS, MCS, SDB, presetgui PSFGUI TEL, LSS, left PSF, PSFGUI left TELGUI TEL, TELGUI WXGUI ECS, WXGUI
10 cjb
Stop subsystems
netconfig stop [left | right] SUBStops subsystem SUB – times out in seven seconds
netconfig stop allStops all subsystems
netconfig kill [left | right] SUBSends SIGKILL (9) instead of SIGINT (2)
If GUI is left running when SUB stopped GUI should turn red
12 cjb
Stop MCSPU on jet
Command ‘stop’ in engineering interface will stop just the interface
Command ‘stopall’ will stop both the engineering interface and mcstemp
14 cjb
Stop TCS networks
netconfig stop (on each server)• Stop all TCS network processes on local
server
• You may stop TCS networks without stopping subsystems – network state is kept in files on disk
15 cjb
TCS state files
LBT.confOverall configuration parameters in
/usr/commonsw/NetConfig
rpc_IPAddr.confRPC functions and aliases on local server in /var/state/LBT
nameprocs.confSubsystem running on local server in /var/state/LBT
LSSEventClientList.datList of event callbacks on local server in /var/state/LBT
16 cjb
Troubleshooting - netconfig
netconfig ps• Shows all TCS processes on local server
netconfig –l• Shows all known TCS subsystems on all
servers
netconfig –s• Shows all servers with subsystem count and
load
17 cjb
Troubleshooting - rpcconfig rpcconfig stop [address]
• Stops rpcserver on ‘address’ rpcconfig start address [cold] [passive]
• Starts rpcserver on ‘address’• ‘cold’ skips reading state file• ‘passive’ disables rpc health check
rpcconfig –l• Shows number of registered functions and aliases on local server
rpcconfig –f [address]• Lists all functions on server address
rpcconfig –a [address]• Lists all aliases on server address
18 cjb
Troubleshooting - gshmconfig gshmconfig stop
• Stop gshmserver on local server gshmconfig start
• Start gshmserver on local server gshmconfig –b
• Shows reflective memory IP broadcast address on local server
gshmconfig –l• Lists segment key, size, and all servers currently on
the reflective memory network• Repeat and check that all servers stay at same ID
19 cjb
Troubleshooting - monitor
monitor –o• Shows computers as they join or drop off the
shared memory network
• Shows restart requests
20 cjb
Troubleshooting - syslog
/var/log/messages on local server ‘tcslog’ will show messages with LBT
designation as they occur
21 cjb
Troubleshooting - DDViewerGUI
DDViewerGUI shows hierarchical view of reflective memory on local server
Available on every server and TO station Updates about every two seconds
22 cjb
Troubleshooting - Events
No LSS - XML files on local server• /tmp/SUB.log
LSS - MySql database• mysql –u lss –p –h mysql LBT_log
• password lss
• select date,description from Event where (date like ‘%Jul 18%’ and description like ‘%left PSF%’);
23 cjb
Mountain subsystem debugging
Use your account on a server Get current CVS tree Modify subsystem code Build and install (may require removal of
existing package) netconfig stop SUB netconfig start SUB on server
24 cjb