Table of Contents

Planning Guide

Contents

About This Guide

Chapter 1: Overview of the Planning Process

Overview of the Planning Process Goals

Planning Guidelines

Eliminating Single Points of Failure: Configuring Redundant Components Supported by HACMP

Overview of the Planning Tools

Paper Worksheets

Online Planning Worksheets Application

Overview of the Planning Process

Step 1: Planning for Highly Available Applications

Step 2: Planning Cluster Topology

Step 3: Planning Cluster Network Connectivity

Step 4: Planning Shared Disk Devices

Step 5: Planning Shared LVM Components

Step 6: Planning Resource Groups

Step 7: Planning Cluster Event Processing

Step 8: Planning HACMP Clients

Chapter 2: Initial Cluster Planning

Prerequisites

Overview

Planning Cluster Security

Managing User Account Security

Managing Cluster Security

Planning Cluster Nodes

Planning Cluster Sites

Planning Resources and Site Policy

HACMP/XD for GLVM Mirroring Overview

HAGEO for AIX 5L Overview

HACMP/XD for Metro Mirror Overview

Cross-Site LVM Overview

Completing the Cluster Site Worksheet

Application Planning

Planning for Capacity Upgrade on Demand

Application Servers

Applications Integrated with HACMP

Application Monitoring

Planning Considerations for Multi-Tiered Applications

Planning Applications and Application Servers

Completing the Application Worksheet

Completing the Application Server Worksheet

Completing the Application Monitoring Worksheet

Planning for AIX 5L Fast Connect

Planning Considerations for Fast Connect

Fast Connect as a Highly Available Resource

Completing the Fast Connect Worksheet

Planning for Highly Available Communication Links

SNA and X.25 Links Required Software and Hardware

Completing the Communication Links Worksheets

Drawing a Cluster Diagram

Where You Go from Here

Chapter 3: Planning Cluster Network Connectivity

Prerequisites

Overview

General Network Considerations for HACMP

Supported Network Types

IP Aliases

Network Connections

ARP Cache Updating

IP Labels

Cluster Partitioning

General Network Connection Example

HACMP Configuration in Switched Networks

HACMP and Virtual Ethernet (VLAN)

Heartbeating in HACMP

Topology Services and Heartbeat Rings

Heartbeating over IP Aliases

Overview

Setting up Heartbeating over IP Aliases

How HACMP Assigns Heartbeat Rings

Examples of Heartbeat Rings

Viewing IP Addresses Assigned by HACMP for Heartbeating over IP Aliases

Heartbeating over Disk

Heartbeating over Disk and Fast Method for Node Failure Detection

Designing the Network Topology

Eliminating the IP Subsystem as Single Points of Failure

Eliminating Networks as Single Points of Failure

Planning Point-to-Point Networks

Eliminating Network Interface Cards as a Single Point of Failure

Planning for IP Address Takeover via IP Aliases

Assigning IP Labels for IPAT via IP Aliases

Planning for Service IP Label Alias Placement

Types of Distribution for Service IP Label Aliases

Planning for Site-Specific Service IP Labels

Planning for IP Address Takeover via IP Replacement

Hardware Address Takeover with IPAT via IP Replacement

Planning for Other Network Conditions

Using HACMP with NIS and DNS

Monitoring Clusters

Planning for VPN Firewall Network Configurations in HACMP

Setting Failure Detection Parameters

Setting Values for the Network Grace Period

Identifying Service Adapter Failure for Two-Node Clusters

Choosing IP Addresses for the netmon.cf File

Setting RS232 TTY Baud Rates

Planning Networks for Inter-Node Communication with Oracle

Planning for SP Networks

SP Planning Considerations

Handling SP Network Failure

Special SP Switch Failure Considerations

Planning for the SP Switch Network

Completing the Network Worksheets

Completing the TCP/IP Networks Worksheet

Completing the TCP/IP Network Interface Worksheet

Completing the Point-to-Point Networks Worksheet

Completing the Serial Network Interface Worksheet

Defining Hardware Addresses

Selecting an Alternate Hardware Address

Avoiding Network Conflicts

Adding the Network Topology to the Cluster Diagram

Where You Go from Here

Chapter 4: Planning Shared Disk and Tape Devices

Prerequisites

Overview

Choosing a Shared Disk Technology

Obtaining HACMP APARS

Disk Planning Considerations

IIBM pSeries

IBM Serial Storage Architecture Disk Subsystem

Disk Power Supply Considerations

SCSI Device Power Considerations

IBM SSA Disk Subsystem Power Considerations

Planning for Non-Shared Disk Storage

Licensing

Starting Applications

Planning a Shared SCSI Disk Installation

HACMP and Virtual SCSI

Disk Adapters

Cables

Sample IBM 2104 Expandable Storage Plus Configuration

Sample IBM DS4000 Storage Server Configuration

Sample IBM 2105 Enterprise Storage Server Configuration

Planning a Shared IBM SSA Disk Subsystem Installation

AIX 5L and HACMP Levels

Disk Adapters

Using SSA Facilities for High Availability

Configuring to Minimize Single Points of Failure

Configuring for Optimal Performance

Testing Loops

Planning for RAID and SSA Concurrent Volume Groups

SSA Disk Fencing in Concurrent Access Clusters

Completing the Disk Worksheets

Completing the Shared SCSI Disk Worksheet

Completing the Shared SCSI Disk Array Worksheet

Completing the IBM SSA Disk Subsystems Worksheet

Adding the Disk Configuration to the Cluster Diagram

Planning for Tape Drives as Cluster Resources

Reserving and Releasing Shared Tape Drives

Recovery Procedures

Where You Go from Here

Chapter 5: Planning Shared LVM Components

Prerequisites

Overview

Planning for LVM Components

Physical Volumes

Volume Groups

Logical Volumes

Filesystems

Planning LVM Mirroring

Mirroring Physical Partitions

Mirroring Journal Logs

Mirroring across Sites

Planning for Disk Access

Enhanced Concurrent Access

Non-Concurrent Access

Using Fast Disk Takeover

Using Quorum and Varyon to Increase Data Availability

Using Quorum

Using Forced Varyon

Forced Varyon and Cluster Partitioning

Other Ways to Force a Varyon

Using NFS with HACMP

Relinquishing Control over NFS filesystems in an HACMP Cluster

Reliable NFS Server Capability

Shared Volume Groups

NFS Exporting Filesystems and Directories

NFS and Fallover

NFS Cross-Mounting in HACMP

Completing the Shared LVM Components Worksheets

Planning for LVM Components

Completing the Non-Shared Volume Group Worksheet

Completing the Shared Volume Group and Filesystem Worksheet

Completing the NFS-Exported Filesystem Worksheet

Completing Concurrent Access Worksheets

Adding LVM Information to the Cluster Diagram

Where You Go from Here

Chapter 6: Planning Resource Groups

Prerequisites

Overview

General Rules for Resources and Resource Groups

Two Types of Resource Groups: Concurrent and Non-Concurrent

Concurrent Resource Groups

Non-Concurrent Resource Groups

Resource Group Policies for Startup, Fallover and Fallback

Resource Group Attributes

How Resource Group Attributes Relate to Startup, Fallover, and Fallback

Settling Time for Startup

Node Distribution Policy

Dynamic Node Priority Policy

Delayed Fallback Timer

Resource Group Dependencies

Moving Resource Groups to Another Node

Using clRGmove to Move Resource Groups

Planning Cluster Networks and Resource Groups

Aliased Networks and Resource Groups

IPAT via IP Replacement Networks and Resource Groups

Planning Service IP Labels in Resource Groups

Planning Parallel or Serial Order for Processing Resource Groups

Dependent Resource Groups and Parallel or Serial Order

Planning Resource Groups in Clusters with Sites

Concurrent Resource Groups and Sites

Non-Concurrent Resource Groups and Sites

General Resource Group Behavior in Clusters with Sites

Special Considerations for Using Sites

Resource Group Behavior Examples in Clusters with Sites

Customizing Inter-Site Resource Group Recovery

Planning for Replicated Resources

Configuration of Replicated Resources

Processing of Replicated Resources

Moving Resource Groups with Replicated Resources

Recovering Resource Groups on Node Startup

Planning for Workload Manager

About Workload Manager Classes

Workload Manager Reconfiguration, Startup, and Shutdown

Limitations and Considerations

Assigning WLM Classes to HACMP Resource Groups

Completing the Resource Group Worksheet

Where You Go from Here

Chapter 7: Planning for Cluster Events

Prerequisites

Overview

Planning Site and Node Events

Planning node_up and node_down Events

Establishing Initial Cluster Membership

Rejoining the Cluster

node_up Events with Dependent Resource Groups or Sites

node_down Events

Network Events

Sequence of Network Events

Network Interface Events

Failure of a Single Network Interface Does Not Generate Events

Cluster-Wide Status Events

Resource Group Event Handling and Recovery

Resource Group Events

Customizing Cluster Event Processing

Event Notification

Pre- and Post-Event Scripts

Pre- and Post-Event Scripts May No Longer Be Needed

Dependent Resource Groups and Pre- and Post-Event Scripts

Event Recovery and Retry

Custom Remote Notification of Events

Planning for Custom Remote Notification

Customizing Event Duration Time Until Warning

User-Defined Events

Writing Recovery Programs

Event Summaries and Preamble

Completing the Cluster Event Worksheet

Where You Go from Here

Chapter 8: Planning for HACMP Clients

Prerequisites

Overview

Client Application Systems

NFS Servers

Terminal Servers

Clients Running Clinfo

Reconnecting to the Cluster

Customizing the clinfo.rc Script

Clients Not Running Clinfo

Network Components

Where You Go from Here

Chapter 9: Using Online Planning Worksheets

Prerequisites

Overview of the Online Planning Worksheets Application

Limitations

Installing the Online Planning Worksheets Application

Installation Prerequisites

User Permission Requirements

Downloading the Application on an AIX 5L System

Installing the Application on a Windows System

Starting and Stopping the Application

Running the Application from an AIX 5L Installation

Running the Application from a Windows Installation

Running the Application from the IBM Website

Stopping the Application

Where You Go from Here

Using the Online Planning Worksheets Application

Understanding the Main Window

Navigating Your Cluster Configuration

Viewing Standard and Extended Configuration Panels

Entering Data in the Configuration Panels

Using Online Help

Creating a New Cluster Definition File

Opening an Existing Cluster Definition File

Adding, Modifying, and Deleting Cluster Information

Adding Notes about the Cluster Configuration

Validating a Cluster Definition

Saving a Cluster Definition File

Creating an HTML Configuration Report

Planning a Cluster

Overview of the Planning Process

Defining the Cluster

Defining Cluster Security

Adding Nodes to the Cluster

Planning Site Configuration

Creating Resource Groups

Defining Disk Resources

Disks

Adding Tape Resources

Adding Resource Associations

Defining Networks

Adding Applications

Specifying TTY Node/Port Pairs

Specifying Cluster Events

Defining File Collections

File Collections Global Settings

Specifying Verification Options

Understanding the Cluster Definition File

Cluster Definition File Format

Sample Cluster Definition File

Cluster Definition Schema Files

Converting an HACMP Cluster Configuration into OLPW

Exporting a Cluster Definition File from SMIT

Converting a Snapshot to a Cluster Definition File

Applying Worksheet Data to Your HACMP Cluster

Prerequisites

Applying Your Cluster Configuration File

Troubleshooting Cluster Definition File Problems

Where You Go from Here

Appendix A: Planning Worksheets

Appendix B: Applications and HACMP

Notices for HACMP Planning Guide

Index