Navigation
Records in this category
- How to install OpenSSH in AIX
- AIX HACMP Cluster tips
- Frequently asked questions on HACMP Cluster
- Configuring LVM Filesystems for Optimum Performance in AIX
- Very useful AIX commands
- Setting Up the Disk Quota System in AIX
- How to reduce the var or tmp filesystem
- AIX Command Crib Sheet
- Here is there error I get when I execute a basic topas command: # topas exec(): 0509-036 Cannot load program topas because of the following errors: 0509-130 Symbol resolution failed for /usr/lib/libperfstat.a(shr.o) because: 0509-136 Symbol ptx_get_rsetsyswait (number 81) is not exported from dependent module /unix. 0509-136 Symbol ptx_get_rsetsysinfo (number 83) is not exported from dependent module /unix. 0509-136 Symbol ptx_get_rsetsysstats (number 84) is not exported from dependent module /unix. 0509-026 System error: Error 0 0509-192 Examine .loader section symbols with the 'dump -Tv' command. Any reason what this would cause this error?
- how to find Get full path of a running process in AIX
- How to exclude group of files in TSM backup in AIX ?
- How to get history of command in AIX5.3
- Aix 5.2 increasing the paging space - How to?
- Unable to create spot from mksysb for one nim client ?
- How to change Parameters like SHMMAX in AIX
- How do i configure aix 5.3 to use a smtp relay server?
- How to check the status of services in AIX like Chargen , echo , finger , who
- How to create more space on a filesystem in AIX
- Anonymous FTP System Settings AIX (with POSIX compliant ftp account home directory)
- How to remove an LPAR through HMC via command line ? HMC is not accessible through GUI
- Tomcat 6.0.20 installation on AIX 5.3
- what will be impact of changing hard file size/soft file size unlimited in AIX for particular user? How to change hard file size/soft file size to unlimited at the shell prompt?
- Installing Apache Directory on AIX 5.3
- How do I create a user with admin rights in AIX 5.3
- From HMC How to get cpu and memory information of lpar ?
- Please explain the real life implication of shared processors on AIX
- How torepair the log on a jfs2 filesystem in AIX
- How to check Call Home facility from HMC for p570 and p595 lpars?
- What is the appropriate syntax to edit the 'DH' entry in sendmail.cf?
- How to customize login banner for telnet in AIX.
- How to download cfg2html script for aix and configure it ?
- Unix command to count number of files in a directory
- Physical Memory usage - AIX server
- commands to get system information for AIX
- create filesystem on lv and mount it at boot up
- How to change Default Gateway in AIX
- Script to copy files from one AIX server to another
- Script to compare two directories and copy files missing from A to B
- How to create Lpar on IBM pSeries Servers
- How to send multiple email attachments from AIX?
- How to enable SFTP logging in AIX?
- How to send Oracle audit logs to syslog
- How do you find out the wwpn of the hba currently plugged into an ibm system
- Laserjet network printer setup in AIX
- How to check if AIX 5.1. has a 64-bit kernel or 32-bit
- Installing of SSH on AIX 5.2
- How to STOP TELNET Service in AIX 5.2
- How to mount tape drive in AIX system
- How do I rename an existing file system in AIX 5.2
- Aix monitoring tools
- Search AIX for core file, omitting /proc
- How to copy select files based on date
- How to I check dsmcad port on AIX
- Automatic startup of a service on AIX
- How to do a OS backup in AIX
- How to break the rootvg mirror
Tags
Sticky FAQs
AIX
AIX HACMP Cluster tips
Quorum Issues
Quorum presents some tricky issues within an HACMP cluster. The cluster administrator has two choices:Fortunately, there is a solution to this apparent quandry. The varyonvg command will online a quorum-disabled volume group with missing physical volumes if the MISSINGPV_VARYON environment variable is set to TRUE when the varyonvg command is invoked.
- Enable quorum checking in shared volume groups. This results in the entire volume group going offline if 50% or more of the VGDAs are lost. A typical shared volume group will have each physical volume's data mirrored on a second physical volume. Since this results in the volume group having an even number of physical volumes, loss of half of the physical volumes will cause quorum to be lost and the rest of the physical volumes will go offline.
Obviously, this isn't acceptable behaviour in a high-availability environment.
- Disable quorum checking in shared volume groups. This avoids the problem described above since the volume group remains online as long as at least one physical volume is still available. Unfortunately, a volume group with quorum disabled can't be varied online if any of the physical volumes are unavailable.
Obviously, this is also unacceptable behaviour in a high-availability environment.
So . . . configure your shared volume groups with quorum disabled and put the line
MISSINGPV_VARYON=TRUEin each node's /etc/environment file (changes to this file don't take effect until after the next reboot).Note that the MISSINGPV_VARYON environment variable is a relatively new feature of AIX. Verify that it works as described above on your version of AIX before relying on it.
Also note that setting MISSINGPV_VARYON in /etc/environment doesn't work in HACMP/ES clusters because HACMP/ES ignores the contents of /etc/environment. In an HACMP/ES cluster, find the commented out line in /usr/es/sbin/cluster/utlitities/clvaryonvg and uncomment it (it's near line 400).
One last piece of advice and warning:
If half of your disks go offline with the result that you lose exactly half of your disk mirrors then your volume group will remain online if you've disabled quorum. If you then experience a failover and the takeover node is able to online the half of your disks which were previously offline then everything will (probably) appear to work. The problem is that you are using OLD DATA!!!!!There are no perfect solutions to this problem although here's a bit of advice which will probably help a lot:This scenario is disconcertingly easy to create (cut the paths to half of the disks for the first node and the paths to the other half of the disks from the second node and then trigger a failover). Trust me! You do NOT want this to happen!
- Always, ALWAYS, ALWAYS respond to broken disk mirrors promptly since that avoids the old data being very old if the above scenario occurs. This is probably an excellent place to use AIX's Error Notification mechanism (smit screens to do this are provided with HACMP 'classic' and HACMP/ES).
- One way to avoid the above scenario is to triple-mirror all logical logical volumes and turn quorum back on for the volume group. If done correctly, this will allow the disk subsystem to survive the loss of one disk copy and should prevent the disks from staying online or being brought online in any combination which can result in a failure along the lines of the above scenario.
Obviously, triple-mirroring logical volumes is a rather expensive proposition. Then again, the consequences of running with old data are likely to be even more expensive.
Finding mounted file systems associated with a volume group
The following shell script will list the currently mounted file systems associated with a specified volume group (the -l option specifies the default behaviour of listing the logical volumes and the -f option specifies that file system mount points are to be listed instead).This script could be used in a crontab job which periodically verifies that nobody has mounted a shared file system without adding it to the appropriate resource group.
#!/bin/ksh
#
# list the mounted file systems associated with a specified volume group
#
# This script was developed and tested on AIX 4.3.2.
#
# Author: Daniel Boulet (danny@matildasystems.com)
showlv=1
if [ "$1" = "-l" ] ; then
showlv=1
shift
elif [ "$1" = "-f" ] ; then
showlv=0
shift
fi
if [ $# -ne 1 ] ; then
echo "Usage: $0 [ -l | -f ] __vgname__"
exit 1
fi
if getlvodm -d "$1" > /dev/null 2> /dev/null ; then
getlvodm -L "$1" | while read lvname junk ; do
if df | grep -q "^/dev/$lvname " ; then
if [ $showlv -eq 1 ] ; then
echo $lvname
else
df | grep ^/dev/$lvname | awk '{print $7}'
fi
fi
done
else
echo $0: "$1 isn't the name of a volume group"
exit 1
fi
exit 0
Centralized syslog messages
This is a really simple tip which lets you see cluster-related syslog messages in proper time-ordering.Once you've done this, syslog messages from the cluster (including everything you see in /var/adm/cluster.log) will appear in the specified file on the non-cluster node in proper time-ordering.
- Configure syslogd on each cluster node to send all syslog messages to a non-cluster system. Assuming that the non-cluster system is called critter, add the following line to /etc/syslog.conf on each cluster node:
*.debug @critterand refresh syslogd on each cluster node using the command:refresh -s syslogd- Configure syslogd on the non-cluster system to send log messages to a file. For example, adding the following line to /etc/syslog.conf and refreshing syslogd on the non-cluster system sends all syslog messages to the file /var/log/debug:
*.debug /var/log/debugNote that the protocol which syslogd uses to send messages to other systems is somewhat unreliable so some messages might be lost in transit (the frequency of lost messages is typically quite low and depends on the quality of your network infrastructure, your network load and the relative system loads of the machines in question).
Related entries:
- Veritas Cluster - Commands
- The best DNS FAQ for solaris
- IBM High Availability Geographic Clustering software
- Frequently asked questions on HACMP Cluster
Last update: 2009-06-19 18:01
AuthorLuke Francis
Revision: 1.0
You cannot comment on this entry