6.1 Introduction to availability

Also known as resiliency, availability is the description of the system's ability to respond to requests no matter the circumstances. Availability requires that the topology provide some degree of process redundancy in order to eliminate single points of failure. Whereas vertical scalability (multiple application servers on one system) can provide this by creating multiple processes, the physical machine then becomes a single point of failure. For this reason, a High Availability topology typically involves horizontal scaling across multiple machines or LPARs.

For more information see IBM Redbook, IBM WebSphere Application Server Network DeploymentWebSphere Application Server Network Deployment V6: High Availability Solutions, SG24-6688, for running DMgr and node agents as OS services (Windows®, UNIX).
xxxx