Table of Contents

Concepts and Facilities Guide

Contents

About This Guide

Chapter 1: HACMP for AIX

High Availability Cluster Multi-Processing for AIX

HACMP and HACMP/ES

High Availability and Hardware Availability

High Availability vs. Fault Tolerance

Role of HACMP

Cluster Multi-Processing

The Availability Costs and Benefits Continuum

Enhancing Availability with the AIX 5L Software

Journaled File System and Enhanced Journaled File System

Disk Mirroring

Process Control

Error Notification

Physical Components of an HACMP Cluster

Nodes

Shared External Disk Devices

Networks

Clients

Goal of HACMP: Eliminating Scheduled Downtime

Chapter 2: HACMP Cluster Nodes, Sites, Networks, and Heartbeating

Cluster Nodes and Cluster Sites

Nodes

Sites

Cluster Networks

Physical and Logical Networks

HACMP Communication Interfaces

HACMP Communication Devices

Subnet Routing Requirements in HACMP

Service IP Label/Address

IP Alias

IP Address Takeover

IPAT and Service IP Labels

IP Address Takeover via IP Aliases

IP Address Takeover via IP Replacement

Heartbeating over Networks and Disks

Heartbeating in HACMP: Overview

Heartbeating over TCP/IP Networks

Heartbeating over Point-to-Point Networks

Heartbeating over Disks

Chapter 3: HACMP Resources and Resource Groups

Cluster Resources: Identifying and Keeping Available

Identifying Cluster Resources

Types of Cluster Resources

Volume Groups

Logical Volumes

Filesystems

Applications

Service IP Labels/Addresses

Tape Resources

Communication Links

Cluster Resource Groups

Participating Nodelist

Default Node Priority

Home Node

Startup, Fallover and Fallback

Resource Group Policies and Attributes

Overview

Resource Group Startup, Fallover and Fallback

Settling Time, Dynamic Node Priority and Fallback Timer

Distribution Policy

Cluster Networks and Resource Groups

Resource Group Dependencies

Child and Parent Resource Groups Dependencies

Resource Group Location Dependencies

Sites and Resource Groups

Chapter 4: HACMP Cluster Hardware and Software

Enhancing Availability with IBM Hardware

IBM pSeries

RS/6000 SP System

Disk Subsystems

HACMP Required and Supported Hardware

HACMP Cluster Software

Software Components of an HACMP Node

HACMP Software Components

Complementary Cluster Software

Chapter 5: Ensuring Application Availability

Overview

Eliminating Single Points of Failure in an HACMP Cluster

Potential Single Points of Failure in an HACMP Cluster

Eliminating Nodes as a Single Point of Failure

Eliminating Applications as a Single Point of Failure

Eliminating Communication Interfaces as a Single Point of Failure

Eliminating Networks as a Single Point of Failure

Eliminating Disks and Disk Adapters as a Single Point of Failure

Minimizing Scheduled Downtime with HACMP

Starting Cluster Services without Stopping Applications

Dynamic Automatic Reconfiguration (DARE)

Resource Group Management

Cluster Single Point of Control (C-SPOC)

Dynamic Adapter Swap

Automatic Verification and Synchronization

Minimizing Unscheduled Downtime

Recovering Resource Groups on Node Startup

Fast Recovery

Delayed Fallback Timer for Resource Groups

Minimizing Takeover Time: Fast Disk Takeover

Maximizing Disaster Recovery

Cross-Site LVM Mirroring

Cluster Events

Processing Cluster Events

Emulating Cluster Events

Customizing Event Processing

Customizing Event Duration

Chapter 6: HACMP Cluster Configurations

Sample Cluster Configurations

Standby Configurations

Standby Configurations: Example 1

Standby Configurations: Example 2

Takeover Configurations

One-Sided Takeover

Mutual Takeover

Two-Node Mutual Takeover Configuration for Concurrent Access

Eight-Node Mutual Takeover Configuration for Concurrent Access

Cluster Configurations with Multi-Tiered Applications

Cluster Configurations with Resource Group Location Dependencies

Publishing Model with Same Node and Different Nodes Dependencies

Cross-Site LVM Mirror Configurations for Disaster Recovery

Cluster Configurations with Dynamic LPARs

DLPARs and Capacity Upgrade on Demand

Chapter 7: HACMP Configuration Process and Facilities

Information You Provide to HACMP

Information on Physical Configuration of a Cluster

AIX 5L Configuration Information

Establishing the Initial Communication Path

Information Discovered by HACMP

Cluster Configuration Options: Standard and Extended

Configuring an HACMP Cluster Using the Standard Configuration Path

Configuring an HACMP Cluster Using the Extended Configuration Path

Overview: HACMP Administrative Facilities

Cluster Security

Installation, Configuration and Management Tools

Two-Node Cluster Configuration Assistant

Smart Assists for Integrating Specific Applications with HACMP

General Application Smart Assist

Smart Assist API

Planning Worksheets

Starting, Stopping and Restarting Cluster Services

SMIT Interface

Web-Based SMIT Interface

HACMP System Management with C-SPOC

Cluster Snapshot Utility

Customized Event Processing

Resource Group Management Utility

HACMP File Collection Management

Monitoring Tools

Cluster Manager

Cluster Information Program

Application Monitoring

Show Cluster Applications SMIT Option

Cluster Status Utility (clstat)

HAView Cluster Monitoring Utility

Cluster Monitoring and Administration with Tivoli Framework

Application Availability Analysis Tool

Persistent Node IP Labels

HACMP Verification and Synchronization

Troubleshooting Tools

Log Files

Resetting HACMP Tunable Values

Cluster Status Information File

Automatic Error Notification

Custom Remote Notification

User-Defined Events

Event Preambles and Summaries

Trace Facility

Cluster Test Tool

Emulation Tools

HACMP Event Emulator

Emulation of Error Log Driven Events

Chapter 8: HACMP 5.4: Summary of Changes

List of New Features

New Features That Enhance Ease of Use

New Features That Enhance Geographic Distance Capability

Other Features

Discontinued Features

Features Discontinued in HACMP 5.2 and Up

Features Discontinued in HACMP 5.3

Where You Go from Here

Notices for HACMP Concepts and Facilities Guide

Index