Patent Number: 7,613,749

Title: System and method for application fault tolerance and recovery using topologically remotely located computing devices

Abstract: A system and method for application fault tolerance and recover using topologically remotely located computing devices are provided. A primary computing device runs one instance of an application (i.e. the primary application instance) at a production site and an active standby computing device runs a second instance of the application (i.e. the shadow application instance) at a recovery site which may be topologically remotely located from the production site. The two instances of the application are brought into a consistent state by running an initial application "checkpoint" on the primary computing device followed by an application "restart" on the active standby computing device. Events occurring in the primary application instance may be automatically and continuously recorded in a log and transferred to the recovery site using a peer-to-peer remote copy operation so as to maintain the states of the application instances consistent.

Inventors: Flynn, Jr.; John Thomas (Morgan Hill, CA), Howie; Mihaela (Burlingame, CA)

Assignee: International Business Machines Corporation

International Classification: G06F 12/00 (20060101); G06F 17/30 (20060101)

Expiration Date: 1/03/02017