Unplanned downtime



Tru64 UNIX

Unplanned downtime is caused by hardware, software, or administrator failures that the system was not protected against. These failures cause an unanticipated denial of service to users and applications.

Customer problem

Today's NonStop e-business environment does not leave room for unplanned downtime. Systems, applications and data must be available to users 100% of the time to meet daunting customer service levels.

HP's answer

Tru64 UNIX's evolutionary reliability and availability architecture and design show that HP is committed to meeting your availability and performance needs with leading high-availability solutions. HP's goal is to minimize unplanned downtime so that system availability stays at the highest possible level. Here's how:

TruCluster Server: No matter how reliable your single system is, only a clustered configuration can keep your business running absolutely uninterrupted. TruCluster Server software guarantees your service levels with maximum availability while reducing your management costs. How? By managing all systems in the cluster as one single system. No other vendor can offer you this level of availability and ease of management at the same time.

Alternate pathing: Tru64 UNIX supports alternate pathing with automatic failover (transparent to applications) for storage, network peripherals, and cluster interconnects. In a cluster environment, the device request dispatcher (DRD) provides failover to storage through other systems in the cluster, essentially providing alternate pathing at the system level. The Logical Storage Manager (LSM) software also provides transparent failover to mirror sets.
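The idea behind alternate pathing with transparent failover can be illustrated with a minimal sketch. This is not Tru64 UNIX code: the `MultipathDevice` class, the `PathFailedError` exception, and the path names are hypothetical and exist only to show how a request can be retried on another path without the caller noticing the failure.

```python
class PathFailedError(Exception):
    """Raised by a path when it cannot complete an I/O request (hypothetical)."""


class MultipathDevice:
    """Hypothetical device reachable over several redundant paths."""

    def __init__(self, paths):
        # paths: ordered list of (name, callable) pairs; the callable performs the I/O.
        self.paths = list(paths)
        self.failed = set()

    def submit(self, request):
        """Try each healthy path in order so callers never see a single-path failure."""
        for name, send in self.paths:
            if name in self.failed:
                continue
            try:
                return send(request)
            except PathFailedError:
                # Remember the bad path and fall through to the next one.
                self.failed.add(name)
        raise PathFailedError("all paths to the device have failed")


def broken_path(request):
    # Simulates a failed adapter, cable, or interconnect.
    raise PathFailedError("link down")


def healthy_path(request):
    return f"done: {request}"


device = MultipathDevice([("primary", broken_path), ("alternate", healthy_path)])
result = device.submit("read block 7")
print(result)
print(sorted(device.failed))
```

In this sketch the caller issues one request and receives one result; the failed primary path is recorded so that later requests skip it until it is serviced.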

Automatically remove failing components: Tru64 UNIX serviceability features allow the system to automatically detect failing components such as CPUs, memory, or environmental devices, and shut them down to avoid system crashes. The system therefore stays available until you are ready to service these components at a convenient time.

Memory error resiliency: Tru64 UNIX guards memory with a "just-in-time" scrubbing model, where correctable errors are scrubbed when encountered by the operating system or application code, ensuring maximum availability. To enhance this capability, Tru64 UNIX has implemented a trigger mechanism, called the memory troller, which proactively locates and scrubs correctable memory errors before the operating system or application reads the memory. The troller arranges for the memory page containing the error to be marked bad and, if the page is free (or when it becomes free), maps the page out so that it will not be reused.
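The troller behavior described above can be modeled in a few lines. The following is a simplified sketch, not the Tru64 UNIX implementation: the `Page` record and the `troll`, `release`, and `allocatable` functions are hypothetical names chosen for illustration, and real scrubbing operates on physical memory rather than on Python objects.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """Hypothetical model of one physical memory page."""
    number: int
    has_correctable_error: bool = False
    in_use: bool = False
    marked_bad: bool = False
    mapped_out: bool = False


def troll(pages):
    """One background pass: scrub correctable errors and mark affected pages bad."""
    for page in pages:
        if page.has_correctable_error:
            page.has_correctable_error = False  # scrub: rewrite the corrected data
            page.marked_bad = True
        if page.marked_bad and not page.in_use:
            page.mapped_out = True  # a free bad page is mapped out immediately


def release(page):
    """Free a page; a page already marked bad is mapped out instead of reused."""
    page.in_use = False
    if page.marked_bad:
        page.mapped_out = True


def allocatable(pages):
    """Page numbers that may still be handed out to new requests."""
    return [p.number for p in pages if not p.in_use and not p.mapped_out]


pages = [
    Page(0),
    Page(1, has_correctable_error=True),
    Page(2, has_correctable_error=True, in_use=True),
]
troll(pages)
```

After the pass, page 1 (free) is mapped out at once, while page 2 (in use) is only marked bad and is mapped out later, when `release(pages[2])` frees it.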

Protect your systems from disaster: Tru64 UNIX solutions protect IT systems from disaster by replicating systems at multiple sites to ensure continuous operation in the event of a disaster (e.g., fire, flood, or earthquake). There are several ways to achieve degrees of disaster protection, from backups to extended clusters to more complex configurations. To learn more about how to enhance high availability and include disaster protection, see the related white paper. It includes product information on Campus-Wide Disaster Tolerant Clusters and Data Replication Manager, key components of a robust disaster-protected environment. It also covers the Enterprise Volume Manager and Reliable Transaction Router products, which enhance IT environments that require high availability and disaster protection.