Job Management Partner 1 Version 10, Job Management Partner 1/Automatic Job Management System 3 Administration Guide

[Contents][Glossary][Index][Back][Next]

12.1.1 How disaster recovery works

This subsection describes how disaster recovery works in the context of JP1/AJS3.

Organization of this subsection
(1) Begin operation on main site and copying between shared disks
(2) A major event causes the main site to go down
(3) Switch operation to the remote site
(4) Rebuild the main site
(5) Return operation to the main site

(1) Begin operation on main site and copying between shared disks

After building the disaster recovery environment, begin operation on the main site and initiate the process of copying data between the main and remote sites.

The JP1/AJS3 services on the logical host (for disaster recovery operation) at the remote site do not start running while the JP1/AJS3 system is still operating at the main site.

Figure 12-2 Initiating operation at main site and copying between shared disks

[Figure]

Supplementary note
Because the logical host prepared for disaster recovery at the remote site does not run any JP1/AJS3 services while the JP1/AJS3 system is operating at the main site, you cannot connect to the logical host at the remote site using JP1/AJS3 - View, JP1/AJS3 - Web Operation Assistant, or other products.

(2) A major event causes the main site to go down

When a large-scale disaster or other event causes the system at the main site to stop running, the copy process between the shared disks is terminated by a hardware operation at the remote site, and the write protection applied to the remote volume is lifted.

Figure 12-3 The main site goes down in the event of a large-scale disaster

[Figure]

(3) Switch operation to the remote site

Operation switches to the main site, from which point the logical host on the remote site becomes the active site. Suppression of the JP1/AJS3 services on the remote site is then released, a disaster recovery restart is performed for the JP1/AJS3 services on the remote site, and work task operation resumes.

Figure 12-4 Switching operation to remote site

[Figure]

(4) Rebuild the main site

Rebuild the JP1/AJS3 system on the disaster-affected main site. When this is done, begin copying data from the remote site back to the main site.

Figure 12-5 Rebuilding the main site

[Figure]

Supplementary note
Until the system is recovered on the main site, use the standard JP1/AJS3 backup process to take backups of the system on the remote site.

(5) Return operation to the main site

Stop the JP1/AJS3 services at the remote site, and make sure that no jobs are running. Having done so, stop the process of copying data from the remote site to the main site. Then, use a hardware operation to lift the write-protection applied to the main volume, and switch operation so that the logical host on the main site becomes the active site (called returning operation). When this process is completed, the hold placed on the JP1/AJS3 services on the main site is released, and a hold is instead placed on the JP1/AJS3 services on the remote site. A remote copy process is initiated from the main site to the remote site, the JP1/AJS3 services are started on the main site, and work task operation is resumed. The first time that the JP1/AJS3 services start after returning operation, the services are subject to a disaster recovery start.

Figure 12-6 Returning operation to the main site

[Figure]