best pratices how to configure a loghost on sunfire

4
'Best Practices' and configuring loghost on Sun Fire[TM] 3800,4800,4900,6800, and E6900 servers [Video] [ID 1008676.1] Modified 01-AUG-2011 Type HOWTO Migrated ID 211896 Status PUBLISHED Applies to: Sun Fire 4800 Server Sun Fire 4810 Server Sun Fire 6800 Server Sun Fire E4900 Server Sun Fire E6900 Server All Platforms Goal This document gives 'Best Practice's and step by step instructions on how to configure a loghost on the Sun Fire[TM] 3800-6800 and E4900/E6900 servers. Available for this topic, a Video Tutorial; Brief how-to video tutorials that provide step-by-step instructions answering Sun's most frequently asked questions. View the video ans detailed instructions below. Video - How to Archive SC Logs (05:30) Solution It is considered a 'Best Practice' to configure a 'loghost's for each server and production domain. A 'Loghost' on a Solaris platform can permanently save messages that are logged in the System Controller's NVRAM buffer. This will insure that they are not lost due to either a off of the small first in first out buffer in the system controller. Properly stored, they can be quickly accessed if a domain outage occurs even if the server controlling the domain files when sent to SUN Engineers can speed up troubleshooting and resolve problems quickly and accurately. Configuration Best practices. 1) Log files to an independent Solaris platform not the domain('s) associated with the server. This is to remove the issue which is created when the domain which is down is the collecting the loghost data. This issue presents two problems. First, data, that is needed for troubleshooting becomes inaccessible. Secondly, as soon as Solaris goes down on logging, all further messages from the system controller are lost forever. Failure to follow these best practices will lead to an increase in the time it takes to troubleshoot severe and could drastically increase downtime if the situation arises. 2) The loghost could be any type of Solaris machine running on the same subnet as the system controllers. If the machine can ping the system consoles it could generally be us 3) Capture Platform and also domain logs in the loghost. Platform logs give a great snapshot of what is happening globally in a server and domain logs give a much more detaile issues may be occurring on a failing domain. Getting Started: The first step is to create 'loghost's is to configure the 'loghost' on the System Controller(SC). Setup the IP address of the admin workstation, and then, configure the syslog.con workstation itself. 1. On the System Controller(SC) of the Sun Fire platform, do the following: First, log on to the SC, either by telnet, ssh or through the serial port, then issue the following command: r12-1a:SC> setupplatform -p loghost Loghosts -------- Loghost []: 172.16.40.10 Log Facility [local0]: local0 Note: You must not use a domain on the platform itself as the loghost. The format is slightly different below 5.12.5, an OBP upgrade is long overdue. Document: 1003034.1 has links to the latest OBP patches. This setting will send platform messages with facility local0, to the admin workstation with IP address 172.16.40.10. The local0 facility is used to differentiate messages coming from the platform. Log Facility will default to local0, this setting allows syslogd on the admin workstation to determine which file to log the message into. For domains, setup the loghost the same way, but use the 'setupdomain' command. For instance, if you want to configure domains A and B loghost facility: r12-1a:A> setupdomain -p loghost Loghosts -------- Loghost []: 172.16.40.10 Rate this document https://support.oracle.com/CSP/main/article?cmd=show&type=NOT&id... 1 of 4 10/25/2011 10:11 AM

Upload: danielmsilva

Post on 18-Apr-2015

20 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Best Pratices How to Configure a Loghost on SunFire

'Best Practices' and configuring loghost on Sun Fire[TM] 3800,4800,4900,6800, and E6900 servers [Video] [ID 1008676.1]

Modified 01-AUG-2011 Type HOWTO Migrated ID 211896 Status PUBLISHED

Applies to:

Sun Fire 4800 Server

Sun Fire 4810 Server

Sun Fire 6800 Server

Sun Fire E4900 Server

Sun Fire E6900 Server

All Platforms

Goal

This document gives 'Best Practice's and step by step instructions on how to configure a loghost on the Sun Fire[TM] 3800-6800 and E4900/E6900 servers.

Available for this topic, a Video Tutorial; Brief how-to video tutorials that provide step-by-step instructions answering Sun's most frequently asked questions. View the video answer and/or follow the

detailed instructions below.

Video - How to Archive SC Logs (05:30)

Solution

It is considered a 'Best Practice' to configure a 'loghost's for each server and production domain.

A 'Loghost' on a Solaris platform can permanently save messages that are logged in the System Controller's NVRAM buffer. This will insure that they are not lost due to either a power event, or by ro

off of the small first in first out buffer in the system controller. Properly stored, they can be quickly accessed if a domain outage occurs even if the server controlling the domain is unresponsive. These

files when sent to SUN Engineers can speed up troubleshooting and resolve problems quickly and accurately.

Configuration Best practices.

1) Log files to an independent Solaris platform not the domain('s) associated with the server. This is to remove the issue which is created when the domain which is down is the same domain which wa

collecting the loghost data. This issue presents two problems. First, data, that is needed for troubleshooting becomes inaccessible. Secondly, as soon as Solaris goes down on the domain which is

logging, all further messages from the system controller are lost forever. Failure to follow these best practices will lead to an increase in the time it takes to troubleshoot severe problems on the serve

and could drastically increase downtime if the situation arises.

2) The loghost could be any type of Solaris machine running on the same subnet as the system controllers. If the machine can ping the system consoles it could generally be used.

3) Capture Platform and also domain logs in the loghost. Platform logs give a great snapshot of what is happening globally in a server and domain logs give a much more detailed error report of what

issues may be occurring on a failing domain.

Getting Started:

The first step is to create 'loghost's is to configure the 'loghost' on the System Controller(SC). Setup the IP address of the admin workstation, and then, configure the syslog.conf file on the admin

workstation itself.

1. On the System Controller(SC) of the Sun Fire platform, do the following:

First, log on to the SC, either by telnet, ssh or through the serial port, then

issue the following command:

r12-1a:SC> setupplatform -p loghost

Loghosts

--------

Loghost []: 172.16.40.10

Log Facility [local0]: local0

Note: You must not use a domain on the platform itself as the loghost.

The format is slightly different below 5.12.5, an OBP upgrade is long overdue.

Document: 1003034.1 has links to the latest OBP patches.

This setting will send platform messages with facility local0, to the admin

workstation with IP address 172.16.40.10. The local0 facility is used to

differentiate messages coming from the platform.

Log Facility will default to local0, this setting allows syslogd on the admin

workstation to determine which file to log the message into.

For domains, setup the loghost the same way, but use the 'setupdomain' command.

For instance, if you want to configure domains A and B loghost facility:

r12-1a:A> setupdomain -p loghost

Loghosts

--------

Loghost []: 172.16.40.10

Rate this document

https://support.oracle.com/CSP/main/article?cmd=show&type=NOT&id...

1 of 4 10/25/2011 10:11 AM

Page 2: Best Pratices How to Configure a Loghost on SunFire

Log Facility [local0]: local1

r12-1a:B> setupdomain -p loghost

Loghosts

--------

Loghost []: 172.16.40.10

Log Facility [local0]: local2

NOTE: local8 and higher are not allowed and won't work - there are 8 local

facilities, local0-local7. See the syslog.conf(4) man page for more information.

It's also possible to leave the Log Facility set to it's default of "local0"

but if this is done it will be more difficult to separate messages coming from the platform and domains, into different messages files on the admin workstation.

2. On the admin workstation, do the following:

Now, configure the syslog.conf file on the admin workstation, to place the

messages into specific messages files. To do this, create the following message

files, and simply add the following lines in the file /etc/syslog.conf:

% ssh -l root 172.16.40.10

# touch /var/adm/messages.platform

# touch /var/adm/messages.domainA

# touch /var/adm/messages.domainB

# vi /etc/syslog.conf

[...]

#

# non-loghost machines will use the following lines to cause "user"

# log messages to be logged locally.

#

ifdef(`LOGHOST', ,

user.err /dev/sysmsg

user.err /var/adm/messages

user.alert `root, operator'

user.emerg *

)

local0.notice /var/adm/messages.platform <= ADD THIS LINE

local1.notice /var/adm/messages.domainA <= ADD THIS LINE

local2.notice /var/adm/messages.domainB <= ADD THIS LINE

NOTE: You must use TABs between the two entries spaces will not work.

Then restart the syslog daemon:

# /etc/init.d/syslog stop

# /etc/init.d/syslog start

or just force it to re-read the configuration file:

# pkill -HUP syslogd

If the system is running Solaris 10 first of all you'll need to allow remote logging in /etc/default/syslogd by enabling LOG_FROM_REMOTE=YES

then use the following commands:

# svcadm restart svc:/system/system-log:default

or just force it to re-read the configuration file:

# pkill -HUP syslogd

3. To test that your loghost configurations are working correctly, do the

following:

To test the platform shell loghost file, on the main SC run setfailover off

followed by setfailover on You should see the messages appear in the

/var/adm/messages.platform file.

To test the domain shell loghost files, there are three methods. The first method

requires an outage of the domain whereas the second does not. The third one

is very simple and non-intrusive.

Method 1

From the domain shell, perform a "setkeyswitch on". Note, the keyswitch must be

initially in the off position. If the keyswitch is initially in the on position,

you will need to perform a "setkeyswitch off" followed by a "setkeyswitch on".

If the domain loghost setup is correct, you should see the output from LPOST

appear in the domain shell's loghost file.

https://support.oracle.com/CSP/main/article?cmd=show&type=NOT&id...

2 of 4 10/25/2011 10:11 AM

Page 3: Best Pratices How to Configure a Loghost on SunFire

Method 2

If an outage is not acceptable, the following procedure may be performed. This

procedure relies on a spare System Board being available, or a Dynamic

Reconfiguration(DR) operation.

Spare System Board available

If there is a spare System Board available perform the following:

1) First, add the spare System Board to the domain for which you wish to test

the loghost setup, at the platform shell.

2) Power off the System Board if it is powered on. If it is powered off already,

ignore this step.

3) Power on the System Board

4) Perform a "testboard SB#" in the domain shell you are testing, where # is the

System Board number.

The testboard command will cause an LPOST to be run on the specified System

Board only, and will cause output to be displayed to the domain loghost(if

configured correctly), as well as to the console.

5) You can then re-assign the System Board to another domain for testing, and

follow the steps above, for each domain that is to be tested.

No spare System Boards, DR can be performed

If there are no spare System Boards, and a DR operation can be performed on a

System Board in one of the domains, perform the following:

1) DR a System Board out from a domain on the system, via the cfgadm command

from Solaris.

2) Ensure the System Board is assigned to the domain you wish to test. You may

need to perform a deleteboard/addboard operation at the platform level.

3) Power off the System Board if it is powered on. If it is powered off already,

ignore this step.

4) Power on the System Board

5) Perform a "testboard SB#" in the domain shell you are testing, where # is the

System Board number.

The testboard command will cause an LPOST to be run on the specified System

Board only, and will cause output to be displayed to the domain loghost(if

configured correctly), as well as to the console.

6) You can then re-assign the System Board to another domain for testing, and

follow the steps above for each domain that is to be tested.

7) Once finished, assign the System Board to the original domain, and DR the

board back into the domain.

Method 3

If an outage is not acceptable and you do not have spare board or do not want

to use DR then do the following:

1) Look at the current keyswitch setting in the domain shell

2) If it is 'on' then perform "setkeyswitch secure" and then "setkeyswitch on".

If it is 'secure' then perform "setkeyswitch on" and then "setkeyswitch secure".

The setkeyswitch command will cause a transition of keyswitch between 'on' and

'secure' positions and will cause output to be displayed to the domain loghost

(if configured correctly), as well as to the domain buffer (you may check it

with "showlogs" command).

Product

Sun Fire 6800 Server

Sun Fire 4810 Server

Sun Fire 4800 Server

Sun Fire 3800 Server

Sun Fire E6900 Server

Sun Fire E4900 Server

https://support.oracle.com/CSP/main/article?cmd=show&type=NOT&id...

3 of 4 10/25/2011 10:11 AM

Page 4: Best Pratices How to Configure a Loghost on SunFire

Attachments

How to Archive SC Logs (8,163.86 KB)

Related

Products

Sun Microsystems > Servers > Midrange Servers > Sun Fire 4800 Server

Sun Microsystems > Servers > Midrange Servers > Sun Fire 4810 Server

Sun Microsystems > Servers > Midrange Servers > Sun Fire 6800 Server

Sun Microsystems > Servers > Midrange Servers > Sun Fire E4900 Server

Sun Microsystems > Servers > Midrange Servers > Sun Fire E6900 Server

Sun Microsystems > Servers > Midrange Servers > Sun Fire 3800 Server

Keywords

CONTROLLER; LOGHOST; SUNFIRE; SYSTEM CONTROLLER

Back to top

Copyright (c) 2007, 2010, Oracle. All rights reserved. Legal Notices and Terms of Use | Privacy Statement

https://support.oracle.com/CSP/main/article?cmd=show&type=NOT&id...

4 of 4 10/25/2011 10:11 AM