JP2013149201A

Movatterモバイル変換

Info

Publication number: JP2013149201A
Application number: JP2012011231A
Authority: JP
Inventors: Junichiro Tsuchiya; 潤一郎土屋
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2012-01-23
Filing date: 2012-01-23
Publication date: 2013-08-01
Anticipated expiration: 2032-01-23
Also published as: JP5549688B2

Abstract

PROBLEM TO BE SOLVED: To shorten the start time of a standby server in an information processing system including standby servers which are less than operation servers.SOLUTION: A monitoring server instructs the first start of a standby server to a management server in a predetermined timing, and when detecting the failure of an operation server, outputs a predetermined instruction to the management server. When receiving the instruction of the first start, the management server instructs the first start to the standby server, and when receiving the predetermined instruction, transmits virtualization information to the standby server, and instructs second start. When receiving the instruction of the first start, the standby server stops after initializing hardware, and when receiving the virtualization information and the instruction of the second start, sets virtualization to the hardware, and executes the second start.

Description

Translated fromJapanese

本発明は、情報処理システム、特に、情報処理システムのフェイルオーバの動作に関する。 The present invention relates to an information processing system, and more particularly to a failover operation of the information processing system.

情報処理システムの利用の増加に伴い、情報処理システムで使用されるサーバの数は、増加している。また、処理の中断が問題となる基幹システムで使用されるサーバの数も、増加している。そのため、サーバの信頼性の向上が、望まれている。 As the use of information processing systems increases, the number of servers used in information processing systems has increased. In addition, the number of servers used in the backbone system where interruption of processing is a problem is increasing. Therefore, improvement of server reliability is desired.

そこで、サーバは、起動時に、ハードウェアの不具合の確認として、ＰＯＳＴ（Power On Self Test）を行う。 Therefore, the server performs a POST (Power On Self Test) at the time of startup as confirmation of a hardware failure.

さらに、サーバは、サーバのハードウェアの監視、遠隔操作、及び、ハードウェア・イベントを記録する管理用のコントローラ（ＢＭＣ：Base Management Controller）を備える。なお、サーバの機能を実現するハードウェアとは、ＣＰＵ（Central Processing Unit）、ＢＩＯＳ（Basic Input Output system）、メモリなどである。 Further, the server includes a management controller (BMC: Base Management Controller) that records server hardware monitoring, remote control, and hardware events. The hardware that realizes the server function includes a CPU (Central Processing Unit), a BIOS (Basic Input Output System), a memory, and the like.

しかし、サーバの構成部品は、常に高度化及び集積化が進んでいる。そのため、ハードウェアなど部品レベルでの信頼性の確保は、限界がある。 However, server components are constantly becoming more sophisticated and integrated. For this reason, there is a limit to ensuring reliability at the component level such as hardware.

そこで、業務を運用するために使用されているサーバ（運用サーバ）のほかに、障害が発生した運用サーバの業務の処理を引き継ぐサーバを備えた情報処理システムが用いられている。業務の処理を引き継ぐサーバは、「待機サーバ」と言われる。 Therefore, in addition to a server (operation server) used to operate a business, an information processing system including a server that takes over the business processing of the operation server in which a failure has occurred is used. A server that takes over business processing is called a “standby server”.

なお、運用サーバの障害時に待機サーバが運用を引き継ぐことは、「フェイルオーバ」と言われる。（例えば、特許文献１参照）
一般的に、待機サーバは、運用サーバとは機能や仕様は同じであるが、物理的に異なる装置やボードで構成される。そのため、運用サーバに搭載されているハードウェアに固定の設定情報は、待機サーバに搭載されているハードウェアの設定情報と異なる。例えば、運用サーバのＮＩＣ（Network Interface Card）のＭＡＣ（Medium Access Control）アドレスやＨＢＡ（Host Bus Adapter）のＷＷＮ（World Wide Name）は、待機サーバのＮＩＣのＭＡＣアドレスやＨＢＡのＷＷＮと異なる。そのため、運用サーバから待機サーバに業務が切り替わると、運用サーバを使用している装置（クライアント装置）は、接続先の切り替えが必要となる。しかし、クライアント装置は、一般的に、サーバと物理的にも離れており、数も多い。そのため、情報処理システムの管理者や利用者がすべてのクライアント装置の設定を変更することは、実質的に困難である。In addition, when the standby server takes over the operation when the operation server fails, it is called “failover”. (For example, see Patent Document 1)
In general, the standby server has the same functions and specifications as the operation server, but is configured by physically different devices and boards. Therefore, the setting information fixed to the hardware mounted on the operation server is different from the hardware setting information mounted on the standby server. For example, the MAC (Medium Access Control) address of the NIC (Network Interface Card) of the operation server and the WWN (World Wide Name) of the HBA (Host Bus Adapter) are different from the MAC address of the NIC of the standby server and the WWN of the HBA. For this reason, when a job is switched from the operation server to the standby server, a device (client device) using the operation server needs to switch the connection destination. However, the client device is generally physically separated from the server, and there are many numbers. Therefore, it is practically difficult for an administrator or user of the information processing system to change the settings of all client devices.

つまり、フェイルオーバを適用する情報処理システムは、クライアント装置の接続先を、自動的に、運用サーバから待機サーバに変更するできることが望ましい。 In other words, it is desirable that the information processing system to which failover is applied can automatically change the connection destination of the client device from the active server to the standby server.

サーバ側がクライアント装置に接続先の変更を通知できる場合、通知を受け取ったクライアント装置は、接続先を変更できる。しかし、フェイルオーバする際、運用サーバは、障害を発生しているため、クライアント装置に通知できない。また、待機サーバは、クライアント装置と接続を確保できていないため、クライアント装置に通知できない。つまり、運用サーバ及び待機サーバは、クライアント装置に、接続先の変更を通知できない。 When the server side can notify the client device of the change of the connection destination, the client device that has received the notification can change the connection destination. However, when a failover occurs, the operation server cannot notify the client device because a failure has occurred. In addition, since the standby server cannot secure a connection with the client device, it cannot notify the client device. That is, the operation server and the standby server cannot notify the client device of the change of the connection destination.

そこで、クライアント装置側が設定を変更せずにフェイルオーバを実現できる情報処理システムが用いられている。このような情報処理システムは、いろいろの種類がある。例えば、運用サーバ及び待機サーバの設定を仮想化し、仮想化した設定のための情報（以下、「仮想化情報」と言う。）を引き継ぐ情報処理システムが用いられている。この仮想化情報は、ＢＩＯＳの設定なども含む。なお、仮想化したＢＩＯＳは、「仮想ＢＩＯＳ」と言われる。 Therefore, an information processing system is used in which failover can be realized without changing the setting on the client device side. There are various types of such information processing systems. For example, an information processing system that virtualizes the settings of the operation server and the standby server and takes over information for virtualized settings (hereinafter referred to as “virtualized information”) is used. This virtualization information includes BIOS settings and the like. The virtualized BIOS is referred to as “virtual BIOS”.

このような情報処理システムの運用サーバは、仮想化情報をハードウェアに設定して動作する。そして、運用サーバに障害が発生し、待機サーバが業務の処理を引き継ぐとき、待機サーバは、運用サーバと同じ仮想化情報をハードウェアに設定して処理を引き継ぐ。つまり、待機サーバは、仮想化情報を用いて、クライアント装置にとって、運用サーバと同様の接続先として動作する。そのため、クライアント装置は、設定を変更しなくても、フェイルオーバした待機サーバに接続可能である。 The operation server of such an information processing system operates by setting virtualization information in hardware. When a failure occurs in the operation server and the standby server takes over the business process, the standby server takes over the process by setting the same virtualization information as the operation server in the hardware. That is, the standby server operates as a connection destination similar to the operation server for the client device using the virtualization information. Therefore, the client device can connect to the standby server that has failed over without changing the setting.

仮想化された運用サーバ及び待機サーバを管理するサーバは、「管理サーバ」と言われる。管理サーバは、各サーバの起動時、サーバに仮想化を実現するために必要な情報（仮想化情報）を通知する。 A server that manages the virtualized operation server and standby server is referred to as a “management server”. The management server notifies the server of information (virtualization information) necessary for realizing virtualization when each server is activated.

運用サーバ及び待機サーバは、起動時に、管理サーバから仮想化情報を受け取り、仮想化情報をハードウェアに設定して、運用を開始する。（例えば、特許文献２参照） When the operation server and the standby server are activated, the operation server and the standby server receive virtualization information from the management server, set the virtualization information in hardware, and start operation. (For example, see Patent Document 2)

特開２００８−２９３２４５JP 2008-293245 A特開２００７−０９４６１１JP2007-094611

フェイルオーバにおける運用サーバから待機サーバへの移行において、待機サーバは、ハードウェアの初期化を含む起動処理を行う。しかし、サーバの高度化に伴い、サーバに搭載されているＣＰＵの数やメモリの容量などハードウェアの規模は、増加している。そのため、サーバの起動処理におけるハードウェアの初期化の時間は、増加している。 In the transition from the active server to the standby server in the failover, the standby server performs a startup process including initialization of hardware. However, with the advancement of servers, the scale of hardware such as the number of CPUs mounted on the servers and the memory capacity is increasing. For this reason, the hardware initialization time in the server startup process is increasing.

つまり、フェイルオーバにおける待機サーバの起動の時間が、増大し、問題となっている。 That is, the standby server startup time in failover increases, which is a problem.

例えば、上述の特許文献２の発明は、運用サーバが障害時に、待機サーバに運用サーバの設定を引き継がせるものである。つまり、特許文献２の発明は、運用サーバが障害を発生してから、待機サーバが起動するものであり、待機サーバの起動処理の時間が増加するという問題点があった。 For example, the above-described invention of Patent Document 2 allows the standby server to take over the setting of the operation server when the operation server fails. That is, the invention of Patent Document 2 has a problem that the standby server is started after a failure occurs in the operation server, and the standby server startup processing time is increased.

そこで、上述の特許文献１においては、記憶装置の論理区画を用意し、待機サーバは、ＯＳ（Operating System）の読み込む前の状態で停止する。そして、運用サーバが故障すると、待機サーバは、ＯＳの論理区画への読み込みから再開し、起動時間の短縮を図っている。 Therefore, in Patent Document 1 described above, a logical partition of the storage device is prepared, and the standby server is stopped in a state before the OS (Operating System) is read. When the operation server fails, the standby server restarts from reading the OS into the logical partition to shorten the startup time.

しかし、待機サーバは、情報処理システムの冗長性を確保するための構成であり、待機中に業務処理しない。また、障害の発生する運用サーバは、運用サーバの一部である。そのため、情報処理システムは、運用サーバと同じ台数の待機サーバを含む必要はなく、運用サーバより少ない台数の待機サーバで冗長性を確保できる。 However, the standby server is a configuration for ensuring redundancy of the information processing system, and does not perform business processing during standby. In addition, the operation server in which a failure occurs is a part of the operation server. Therefore, the information processing system does not need to include the same number of standby servers as the active servers, and redundancy can be ensured with a smaller number of standby servers than the active servers.

ただし、待機サーバの数が運用サーバより数が少ない場合、待機サーバは、引き継ぐ運用サーバが事前に決まらない。 However, when the number of standby servers is smaller than that of the active server, the standby server is not determined in advance as the active server to take over.

しかし、特許文献１は、運用サーバに対応した待機サーバを選定して、ＮＩＣやホストバスアダプタなどＩ／Ｏデバイスを初期化して、論理区画を確保して、ＯＳの読み込む前の状態で停止するものである。そのため、上述の特許文献１は、ネットワークに接続した運用サーバに対応した数だけ、ＯＳを読み込む前の状態にした待機サーバを備えることが必要である。つまり、上述の特許文献１に記載の発明は、運用サーバより少ない待機サーバに対応できないという問題点があった。 However, Patent Document 1 selects a standby server corresponding to an operation server, initializes an I / O device such as a NIC or a host bus adapter, secures a logical partition, and stops in a state before reading the OS. Is. Therefore, Patent Document 1 described above needs to include as many standby servers that are in a state before the OS is read, corresponding to the number of operation servers connected to the network. That is, the above-described invention described in Patent Document 1 has a problem that it cannot cope with fewer standby servers than the operation server.

本発明の目的は、上記問題点を解決し、運用サーバより少ない待機サーバを含む情報処理システムにおいて、待機サーバの起動時間を短縮する情報処理システム、及び、情報処理システムの制御方法を提供することにある。 An object of the present invention is to solve the above-described problems and provide an information processing system and a control method for the information processing system that reduce the startup time of the standby server in an information processing system that includes fewer standby servers than operating servers. It is in.

本発明の情報処理システムは、業務を運用するための情報処理を実行する運用サーバと、前記運用サーバの障害時に業務の処理を引き継ぐ待機サーバと、前記運用サーバと前記待機サーバとの引き継ぎを管理する管理サーバと、前記運用サーバの障害を検出する監視サーバとを含み、前記待機サーバは、前記待機サーバの少なくとも一部のハードウェアを初期化した後に停止する第１の起動と、前記ハードウェアの仮想化の設定を含む第２の起動とを制御する入出力手段と、前記管理サーバからの第１の起動の指示を受けとり、前記入力手段に前記第１の起動を指示し、前記管理サーバから前記ハードウェアを仮想化するための仮想化情報と、前記第２の起動の指示とを受けとり、前記入出力手段に前記仮想化情報を渡し、前記第２の起動を指示する管理制御手段とを含み、前記監視サーバは、所定のタイミングで前記管理サーバに前記待機サーバの前記第１の起動のための指示を出し、前記運用サーバの障害を検出したときに前記管理サーバに所定の指示を出力し、前記管理サーバは、前記監視サーバから前記第１の起動のための指示を受けとり、前記待機サーバに前記第１の起動を指示し、前記監視サーバから前記所定の指示を受けとり、前記待機サーバに前記仮想化情報と、前記第２の起動の指示を送信する。 An information processing system according to the present invention manages an operation server that executes information processing for operating a business, a standby server that takes over processing of a business when the operation server fails, and takes over between the operation server and the standby server A first server that stops after initializing at least a part of the hardware of the standby server, and the hardware An input / output unit that controls the second activation including the virtualization setting; and a first activation instruction from the management server; the input unit is instructed to perform the first activation; and the management server The virtualization information for virtualizing the hardware and the second activation instruction are received, the virtualization information is passed to the input / output means, and the second activation is performed. The monitoring server issues an instruction for the first activation of the standby server to the management server at a predetermined timing, and the management server detects a failure of the operation server. A predetermined instruction is output to the server, and the management server receives the instruction for the first activation from the monitoring server, instructs the standby server to perform the first activation, and receives the predetermined activation from the monitoring server. An instruction is received, and the virtualization information and the second activation instruction are transmitted to the standby server.

本発明の情報処理システムの制御方法は、業務を運用するための情報処理を実行する運用サーバと、前記運用サーバの障害時に業務の処理を引き継ぐ待機サーバと、前記運用サーバと前記待機サーバとの引き継ぎを管理する管理サーバと、前記運用サーバの障害を検出する監視サーバとを含み、前記監視サーバは、所定のタイミングで前記管理サーバに前記待機サーバの第１の起動のための指示を出力し、前記運用サーバの障害を検出したときに前記管理サーバに所定の指示を出力し、前記管理サーバは、前記監視サーバから前記第１の起動のための指示を受けとり、前記待機サーバに第１の起動を指示し、前記監視サーバから前記所定の指示を受けとり、前記待機サーバに仮想化情報を送り、第２の起動を指示し、前記待機サーバは、前記管理サーバからの前記第１の起動の指示を受けとり、ハードウェアを初期化した後に停止し、前記管理サーバから前記仮想化情報と前記第２の起動の指示とを受けとり、前記仮想化情報を設定してハードウェアを起動する。 An information processing system control method according to the present invention includes: an operation server that executes information processing for operating a business; a standby server that takes over business processing when the operation server fails; and the operation server and the standby server A management server that manages takeover; and a monitoring server that detects a failure of the operation server, and the monitoring server outputs an instruction for first activation of the standby server to the management server at a predetermined timing. When a failure of the operation server is detected, a predetermined instruction is output to the management server, the management server receives the instruction for the first activation from the monitoring server, and the standby server receives the first instruction Instructing activation, receiving the predetermined instruction from the monitoring server, sending virtualization information to the standby server, instructing second activation, and the standby server Receives the first activation instruction from the management server, stops after initializing the hardware, receives the virtualization information and the second activation instruction from the management server, and sets the virtualization information Then start the hardware.

本発明の待機サーバは、ハードウェアを初期化した後に停止する第１の起動と、ハードウェアの仮想化を含む第２の起動とを制御する入出力手段と、管理サーバからの前記第１の起動の指示を受けとり、前記入力手段に前記第１の起動を指示し、前記管理サーバから前記ハードウェアの仮想化のためお仮想化情報と前記第２の起動の指示とを受けとり、前記入出力手段に前記仮想化情報を渡し、前記第２の起動を指示する管理制御手段とを含む。 The standby server according to the present invention includes an input / output unit that controls a first activation that stops after initialization of hardware and a second activation that includes hardware virtualization, and the first server from the management server. Receiving an activation instruction, instructing the input means to perform the first activation, receiving virtualization information and the second activation instruction for virtualization of the hardware from the management server; Management control means for passing the virtualization information to the means and instructing the second activation.

本発明の待機サーバの起動方法は、管理サーバからの第１の起動の指示を受けとり、ハードウェアを初期化した後に停止する第１の起動を行い、前記管理サーバからハードウェアの仮想化のための仮想化情報と第２の起動の指示とを受けると前記ハードウェアの仮想化を含む第２の起動を行う。 The standby server activation method of the present invention receives a first activation instruction from the management server, performs a first activation that stops after initializing the hardware, and for hardware virtualization from the management server When the virtualization information and the second activation instruction are received, the second activation including the hardware virtualization is performed.

本発明の待機サーバのプログラムは、管理サーバからの第１の起動の指示を受け取り、ハードウェアを初期化した後に停止する第１の起動処理と、前記管理サーバからハードウェアの仮想化するための仮想化情報と第２の起動の指示とを受けとり、ハードウェアの仮想化を含む第２の起動処理とをコンピュータに実行させる。 The standby server program according to the present invention receives a first activation instruction from the management server, and initializes the hardware and then stops, and the hardware virtualization from the management server The computer receives the virtualization information and the second activation instruction, and causes the computer to execute a second activation process including hardware virtualization.

本発明によれば、運用サーバより少ない台数の待機サーバを含む情報処理システムにおいて、待機サーバの起動時間を短縮できる。 According to the present invention, in the information processing system including a smaller number of standby servers than the operation server, it is possible to shorten the startup time of the standby server.

また、本発明によれば、待機サーバの起動時間が短くなるため、フェイルオーバの処理を高速化できる。 Further, according to the present invention, the startup time of the standby server is shortened, so that the failover process can be speeded up.

図１は、本発明における第１実施の形態に係る情報処理システムの構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of the configuration of the information processing system according to the first embodiment of the present invention.図２は、第１の実施形態に係る管理サーバの構成の一例を示すブロック図である。FIG. 2 is a block diagram illustrating an example of the configuration of the management server according to the first embodiment.図３は、第１の実施形態に係る管理サーバのハードウェア構成の一例を示すブロック図である。FIG. 3 is a block diagram illustrating an example of a hardware configuration of the management server according to the first embodiment.図４は、第１の実施形態に係る待機サーバの構成の一例を示すブロック図である。FIG. 4 is a block diagram illustrating an example of the configuration of the standby server according to the first embodiment.図５は、第１の実施形態に係る待機サーバのハードウェア構成の一例を示すブロック図である。FIG. 5 is a block diagram illustrating an example of a hardware configuration of the standby server according to the first embodiment.図６は、第１の実施形態に係る監視サーバの構成の一例を示すブロック図である。FIG. 6 is a block diagram illustrating an example of the configuration of the monitoring server according to the first embodiment.図７は、第１の実施形態に係る監視サーバのハードウェア構成の一例を示すブロック図である。FIG. 7 is a block diagram illustrating an example of a hardware configuration of the monitoring server according to the first embodiment.図８は、第１の実施形態に係る動作の一例を示すシーケンス図である。FIG. 8 is a sequence diagram illustrating an example of an operation according to the first embodiment.図９は、第１の実施形態に係る動作の一例を示すシーケンス図である。FIG. 9 is a sequence diagram illustrating an example of an operation according to the first embodiment.

次に、本発明の実施形態について図面を参照して詳細に説明する。 Next, embodiments of the present invention will be described in detail with reference to the drawings.

なお、各図面は、本発明の実施形態を説明するものである。そのため、本発明は、各図面の記載に限られるわけではない。また、各図面の同様の構成には、同じ番号を付し、同様の説明を省略することがある。 Each drawing explains an embodiment of the present invention. Therefore, the present invention is not limited to the description of each drawing. Moreover, the same number is attached | subjected to the same structure of each drawing, and the same description may be abbreviate | omitted.

（第１の実施形態）
図１は、本発明のおける第１の実施形態に係る情報処理システム１０の構成の一例を示すブロック図である。(First embodiment)
FIG. 1 is a block diagram showing an example of the configuration of aninformation processing system 10 according to the first embodiment of the present invention.

情報処理システム１０は、管理サーバ２０と、運用サーバ３０と、待機サーバ４０と、監視サーバ５０とを含む。なお、管理サーバ２０、運用サーバ３０、待機サーバ４０、及び、監視サーバ５０は、ＬＡＮ（Local Area Network）やファイバーチャネルのようなネットワーク、又は、装置の内部バスなど一般的な通信路を介して接続している。 Theinformation processing system 10 includes amanagement server 20, anoperation server 30, astandby server 40, and amonitoring server 50. Themanagement server 20, theoperation server 30, thestandby server 40, and themonitoring server 50 are connected via a general communication path such as a LAN (Local Area Network) or a fiber channel network, or an internal bus of the apparatus. Connected.

管理サーバ２０は、運用サーバ３０と待機サーバ４０との業務のための動作の引き継ぎ（フェイルオーバ）を管理する。 Themanagement server 20 manages takeover (failover) of operations for operations between theoperation server 30 and thestandby server 40.

運用サーバ３０は、情報処理ステム１０で運用される、図示しないクライアント装置のための業務（サービス）を運用するための情報処理（以下、単に「業務の処理」とも言う）を実行する。 Theoperation server 30 executes information processing (hereinafter also simply referred to as “business processing”) for operating a business (service) for a client device (not shown) that is operated by theinformation processing system 10.

待機サーバ４０は、運用サーバ３０が故障などで動作できなくなった場合、運用サーバ３０の業務の処理を引き継ぐ。 When theoperation server 30 becomes unable to operate due to a failure or the like, thestandby server 40 takes over the business processing of theoperation server 30.

監視サーバ５０は、運用サーバ３０及び待機サーバ４０の状態を監視し、管理サーバ２０に通知する。 The monitoringserver 50 monitors the status of theoperation server 30 and thestandby server 40 and notifies themanagement server 20 of the status.

なお、図１は、図面の煩雑さを避けるため、説明に関係する構成を記載したものである。本実施形態の情報処理システム１０は、図１の記載に限られるわけではない。例えば、運用サーバ３０及び待機サーバ４０は、情報処理システム１０に２台以上含まれても良い。待機サーバ４０が２台以上含まれる場合、管理サーバ２０は、運用サーバ３０の業務の処理を引き継ぐ待機サーバ４０を所定の順番で選択して、フェイルオーバすればよい。例えば、管理サーバ２０は、ラウンドロビンのように、順番に待機サーバ４０を選択して、フェイルオーバしても良い。さらに、運用サーバ３０が論理的に複数のサーバとして動作している場合、管理サーバ２０は、複数の待機サーバ４０を用いてフェイルオーバしても良い。また、管理サーバ２０は、フェイルオーバする運用サーバ３０の付加状況を基に、１台の待機サーバ４０に複数台の運用サーバ３０の業務の処理を引き継がせても良い。 FIG. 1 shows a configuration related to the description in order to avoid complexity of the drawing. Theinformation processing system 10 of the present embodiment is not limited to the description of FIG. For example, two ormore operation servers 30 andstandby servers 40 may be included in theinformation processing system 10. When two ormore standby servers 40 are included, themanagement server 20 may select thestandby servers 40 that take over the business processing of theoperation server 30 in a predetermined order and perform failover. For example, themanagement server 20 may perform failover by selecting thestandby servers 40 in order as in round robin. Further, when theoperation server 30 is logically operating as a plurality of servers, themanagement server 20 may perform failover using a plurality ofstandby servers 40. In addition, themanagement server 20 may cause onestandby server 40 to take over the processing of a plurality ofoperation servers 30 based on the additional status of theoperation server 30 to be failed over.

また、管理サーバ２０及び監視サーバ５０は、１つのサーバで実現しても良い。あるいは、情報処理システム１０は、管理サーバ２０又は監視サーバ５０を、２台以上含んでも良い。 Further, themanagement server 20 and themonitoring server 50 may be realized by one server. Alternatively, theinformation processing system 10 may include two ormore management servers 20 ormonitoring servers 50.

続いて、各サーバの構成について説明する。 Next, the configuration of each server will be described.

まず、管理サーバ２０の構成について説明する。 First, the configuration of themanagement server 20 will be described.

図２は、管理サーバ２０の機能的な構成の一例を示すブロック図である。 FIG. 2 is a block diagram illustrating an example of a functional configuration of themanagement server 20.

なお、図２は、本実施形態の説明に関連する構成を記載したものである。例えば、表示部など説明に関係しない構成の記載は、省略した。 FIG. 2 describes a configuration related to the description of the present embodiment. For example, description of components that are not related to the description such as a display unit is omitted.

管理サーバ２０は、管理処理部２１０を含む。 Themanagement server 20 includes amanagement processing unit 210.

管理処理部２１０は、運用サーバ３０と待機サーバ４０とのフェイルオーバを管理する。そのため、管理処理部２１０は、運用サーバ３０及び待機サーバ４０に設定する仮想化情報と、仮想化情報を設定した運用サーバ３０又は待機サーバ４０に関する情報を含む紐付け情報とを保持（格納）する。 Themanagement processing unit 210 manages failover between theoperation server 30 and thestandby server 40. Therefore, themanagement processing unit 210 holds (stores) virtualization information set in theoperation server 30 and thestandby server 40 and association information including information related to theoperation server 30 or thestandby server 40 in which the virtualization information is set. .

管理サーバ２０は、監視サーバ５０からフェイルオーバの指示を受信すると、紐付け情報を参照して障害が発生した運用サーバ３０の仮想化情報を取得する。そして、管理サーバ２０は、待機サーバ４０に取得した仮想化情報を通知して、フェイルオーバにおける、運用サーバ３０から待機サーバ４０への切り替えを管理する。なお、本実施形態に用いる仮想化情報に、特に制限は無い。例えば、仮想化情報は、特許文献２に記載されている仮想化情報である。 When receiving the failover instruction from the monitoringserver 50, themanagement server 20 refers to the association information and acquires the virtualization information of theoperation server 30 in which the failure has occurred. Then, themanagement server 20 notifies thestandby server 40 of the acquired virtualization information, and manages switching from theoperation server 30 to thestandby server 40 in failover. There is no particular limitation on the virtualization information used in the present embodiment. For example, the virtualization information is the virtualization information described in Patent Document 2.

図３は、管理サーバ２０のハードウェア構成の一例を示すブロック図である。 FIG. 3 is a block diagram illustrating an example of a hardware configuration of themanagement server 20.

なお、図３は、本実施形態の説明に関連する構成を記載したものである。例えば、キーボード、マウス、及び、ディスプレイといった、説明に関係しない構成の記載は、省略した。 FIG. 3 shows a configuration related to the description of the present embodiment. For example, descriptions of components that are not related to the description such as a keyboard, a mouse, and a display are omitted.

管理サーバ２０は、ＣＰＵ２２０と、ＲＯＭ（Read Only Memory）２２２と、ＲＡＭ（Random Access Memory）２２４と、記憶装置２２６と、ＮＩＣ（ネットワークインターフェース回路又はネットワークインタフェースカード）２２８とを含む。 Themanagement server 20 includes aCPU 220, a ROM (Read Only Memory) 222, a RAM (Random Access Memory) 224, astorage device 226, and a NIC (network interface circuit or network interface card) 228.

ＣＰＵ２２０は、ＲＯＭ２２２又は記憶装置２２６に含まれるプログラムを読み出して実行し、管理サーバ２０としての機能を実行する。ＣＰＵ２２０は、プログラムを実行する際、ＲＡＭ２２４及び記憶装置２２６を一時記憶として使用してもよい。また、ＣＰＵ２２０は、ＮＩＣ２２８を用いて図３には示していないネットワーク（通信路）を介して他の装置又はサーバとデータをやり取りする。なお、ＣＰＵ２２０は、ＮＩＣ２２８を介して図示しない他の装置からプログラムを取得してもよい。また、ＣＰＵ２２０は、コンピュータで読み取り可能な記憶媒体２３０に含まれるプログラムを、図示しない読み取り装置を用いて読み出して動作しても良い。 TheCPU 220 reads out and executes a program included in theROM 222 or thestorage device 226 and executes a function as themanagement server 20. TheCPU 220 may use theRAM 224 and thestorage device 226 as temporary storage when executing the program. Further, theCPU 220 uses theNIC 228 to exchange data with other devices or servers via a network (communication path) not shown in FIG. Note that theCPU 220 may acquire a program from another device (not shown) via theNIC 228. Further, theCPU 220 may operate by reading a program included in the computer-readable storage medium 230 using a reading device (not shown).

ＲＯＭ２２２は、ＣＰＵ２２０が実行するプログラム及び固定データを保持する。 TheROM 222 holds programs executed by theCPU 220 and fixed data.

ＲＡＭ２２４は、ＣＰＵ２２０を作業領域としてデータ又はプログラムを一時記憶する。 TheRAM 224 temporarily stores data or programs using theCPU 220 as a work area.

記憶装置２２６は、ＣＰＵ２２０が実行するプログラムやデータを記憶する。また、記憶装置２２６は、ＣＰＵ２２０の作業領域となる場合もある。 Thestorage device 226 stores programs and data executed by theCPU 220. Thestorage device 226 may be a work area for theCPU 220.

ＮＩＣ２２８は、ＣＰＵ２２０と図３には示していないネットワークを介した他の装置とのデータのやり取りを中継する。 TheNIC 228 relays data exchange between theCPU 220 and another device via a network not shown in FIG.

さらに、管理サーバ２０は、個別のサーバとしてではなく、装置の一部の論理構成として実現しても良い。例えば、個々のサーバを１つのブレード（ブレードサーバ）として構成し、抜き差し可能な複数のブレードサーバを搭載できる筐体を備えたブレードサーバ装置が用いられている。このブレードサーバ装置の管理サーバ２０の機能は、いずれかのブレードサーバが実現しても良く、ブレードサーバとは別にブレードサーバ装置の筐体に設けられた管理モジュールとして実現しても良い。また、管理サーバ２０は、ブレードサーバ装置に接続した別装置のサーバとして実現しても良い。 Furthermore, themanagement server 20 may be realized as a logical configuration of a part of the apparatus, not as an individual server. For example, a blade server device is used that includes a case in which each server is configured as one blade (blade server) and a plurality of blade servers that can be inserted and removed can be mounted. The function of themanagement server 20 of the blade server device may be realized by any blade server, or may be realized as a management module provided in the casing of the blade server device separately from the blade server. Themanagement server 20 may be realized as a server of another device connected to the blade server device.

次に、運用サーバ３０及び待機サーバ４０の構成について説明する。運用サーバ３０と待機サーバ４０は、同様の機能を備える。そのため、待機サーバ４０の構成について説明し、運用サーバ３０の構成の詳細な説明は、省略する。なお、運用サーバ３０と待機サーバ４０は同じ構成のため、本実施形態の説明において、運用サーバ３０の構成の符号は、待機サーバ４０の構成の符号と同じとする。 Next, the configuration of theoperation server 30 and thestandby server 40 will be described. Theoperation server 30 and thestandby server 40 have similar functions. Therefore, the configuration of thestandby server 40 will be described, and a detailed description of the configuration of theoperation server 30 will be omitted. Since theoperation server 30 and thestandby server 40 have the same configuration, in the description of the present embodiment, the configuration code of theoperation server 30 is the same as the configuration code of thestandby server 40.

図４は、待機サーバ４０の機能的な構成の一例を示すブロック図である。 FIG. 4 is a block diagram illustrating an example of a functional configuration of thestandby server 40.

なお、図４は、本実施形態の説明に関連する構成を記載したものである。例えば、表示部など説明に関係しない構成の記載は、省略した。 FIG. 4 shows a configuration related to the description of this embodiment. For example, description of components that are not related to the description such as a display unit is omitted.

待機サーバ４０は、ＢＭＣ４１０と、ＢＩＯＳ４２０と、仮想化情報格納部４３０と、ハードウェア（ＨＷ）情報格納部４４０と、仮想入出力情報（仮想ＢＩＯＳ情報）格納部４５０とを含む。 Thestandby server 40 includes aBMC 410, aBIOS 420, a virtualizationinformation storage unit 430, a hardware (HW)information storage unit 440, and a virtual input / output information (virtual BIOS information)storage unit 450.

ＢＭＣ４１０は、待機サーバ４０のハードウェアの監視、遠隔操作、ハードウェア・イベントを記録する管理用のコントローラである。ＢＭＣ４１０は、待機サーバ４０の他のハードウェアとは異なる電源を備え、他のハードウェアの電源をオフした状態でも、電源の供給を受け、動作可能である。そして、ＢＭＣ４１０は、待機サーバ４０の他のハードウェアの電源のオン（電源を入れる）及びオフ（電源を切る）を制御する。 TheBMC 410 is a management controller that records hardware monitoring, remote control, and hardware events of thestandby server 40. TheBMC 410 has a power supply different from other hardware of thestandby server 40, and can operate by receiving power supply even when the power of the other hardware is turned off. TheBMC 410 controls turning on (turning on the power) and turning off (turning off the power) of other hardware of thestandby server 40.

このように、ＢＭＣ４１０は、通常、電源が入った（オン）状態である。つまり、待機サーバ４０の電源のオン・オフは、ＢＭＣ４１０以外のハードウェアの電源のオン・オフとなる。そのため、以下、ＢＭＣ４１０を除いた待機サーバ４０の他のハードウェア（以下、単に「ハードウェア」と言う）の電源をオン及びオフすることを、待機サーバ４０の電源をオン及びオフすると言う場合もある。 Thus, theBMC 410 is normally in a power-on (on) state. That is, the power on / off of thestandby server 40 is the power on / off of hardware other than theBMC 410. Therefore, hereinafter, turning on and off the other hardware (hereinafter simply referred to as “hardware”) of thestandby server 40 excluding theBMC 410 may be referred to as turning on and off thestandby server 40. is there.

また、ＢＭＣ４１０は、図４には示していないネットワーク（通信路）を介し、管理サーバ２０から、待機サーバ４０のハードウェアを仮想化するための仮想化情報を受け取る。さらに、ＢＭＣ４１０は、後ほど詳細に説明する本実施形態の待機サーバ４０の起動処理を管理する。なお、ＢＭＣ４１０は、管理コントローラ又は管理制御部と言うこともできる。 Further, theBMC 410 receives virtualization information for virtualizing the hardware of thestandby server 40 from themanagement server 20 via a network (communication path) not shown in FIG. Further, theBMC 410 manages the activation process of thestandby server 40 according to the present embodiment, which will be described in detail later. TheBMC 410 can also be called a management controller or a management control unit.

ＢＩＯＳ４２０は、待機サーバ４０において、ハードウェアと最も近い、低レベルの入出力を管理および制御する。さらに、ＢＩＯＳ４２０は、電源オン時などに、ハードウェアの初期化処理、起動時のテスト（ＰＯＳＴ）、及び、仮想化情報のハードウェアへの設定を制御する。なお、ＢＩＯＳ４２０は、入出力システム、又は、入出力部と言うこともできる。 TheBIOS 420 manages and controls the low-level input / output closest to the hardware in thestandby server 40. Furthermore, theBIOS 420 controls the hardware initialization process, the startup test (POST), and the setting of the virtualization information in the hardware when the power is turned on. TheBIOS 420 can also be called an input / output system or an input / output unit.

仮想化情報格納部４３０は、管理サーバ２０から受け取った仮想化情報を格納する。 The virtualizationinformation storage unit 430 stores the virtualization information received from themanagement server 20.

ＨＷ情報格納部４４０は、出荷時などに設定された待機サーバ４０のハードウェアに設定されたハードウェア情報（ＨＷ情報）、つまり仮想化を適用しない場合の待機サーバ４０のＨＷ情報を格納する。 The HWinformation storage unit 440 stores hardware information (HW information) set in hardware of thestandby server 40 set at the time of shipment, that is, HW information of thestandby server 40 when virtualization is not applied.

仮想化情報格納部４３０に仮想化情報が格納されている場合、ＢＩＯＳ４２０は、仮想化情報格納部４３０に格納されている仮想化情報を待機サーバ４０のハードウェアに設定する。 When the virtualization information is stored in the virtualizationinformation storage unit 430, theBIOS 420 sets the virtualization information stored in the virtualizationinformation storage unit 430 in the hardware of thestandby server 40.

一方、仮想化情報格納部４３０に仮想化情報が格納されていない場合、ＢＩＯＳ４２０は、ＨＷ情報格納部４４０に格納されているＨＷ情報を待機サーバ４０のハードウェアに設定する。 On the other hand, when the virtualization information is not stored in the virtualizationinformation storage unit 430, theBIOS 420 sets the HW information stored in the HWinformation storage unit 440 in the hardware of thestandby server 40.

仮想ＢＩＯＳ情報格納部４５０は、仮想入出力情報（仮想ＢＩＯＳ情報）を格納する。 The virtual BIOSinformation storage unit 450 stores virtual input / output information (virtual BIOS information).

仮想ＢＩＯＳ情報は、ＢＩＯＳ４２０の一部又は全部を仮想化する場合に用いる情報である。例えば、仮想ＢＩＯＳ情報は、管理サーバ２０から受け取ったブートデバイス情報である。仮想ＢＩＯＳ情報が仮想ＢＩＯＳ情報格納部４５０に格納されている場合、ＢＩＯＳ４２０は、仮想ＢＩＯＳ情報を用いて、仮想ＢＩＯＳとしての機能を実現する。 The virtual BIOS information is information used when a part or all of theBIOS 420 is virtualized. For example, the virtual BIOS information is boot device information received from themanagement server 20. When the virtual BIOS information is stored in the virtual BIOSinformation storage unit 450, theBIOS 420 realizes a function as a virtual BIOS using the virtual BIOS information.

また、仮想ＢＩＯＳ情報格納部４５０は、高速フラグを格納してもよい。なお、高速フラグは、後ほど詳細に説明する起動動作を指定するフラグである。例えば、高速フラグが格納されている場合、待機サーバ４０は、後ほど説明する第１の起動の実施を示す。 The virtual BIOSinformation storage unit 450 may store a high speed flag. Note that the high-speed flag is a flag that specifies a startup operation that will be described in detail later. For example, when the high-speed flag is stored, thestandby server 40 indicates the implementation of the first activation described later.

図５は、待機サーバ４０のハードウェア構成の一例を示すブロック図である。 FIG. 5 is a block diagram illustrating an example of a hardware configuration of thestandby server 40.

なお、図５は、本実施形態の説明に関連する構成を記載したものである。例えば、キーボード、マウス、及び、ディスプレイといった、説明に関係しない構成の記載は、省略した。 FIG. 5 describes a configuration related to the description of the present embodiment. For example, descriptions of components that are not related to the description such as a keyboard, a mouse, and a display are omitted.

待機サーバ４０は、ＣＰＵ４７０と、ＲＯＭ４７２と、ＲＡＭ４７４と、記憶装置４７６と、ＮＩＣ４７８とを含む。 Thestandby server 40 includes aCPU 470, aROM 472, aRAM 474, astorage device 476, and aNIC 478.

ＣＰＵ４７０は、ＲＯＭ４７２又は記憶装置４７６に含まれるプログラムを読み出して実行し、待機サーバ４０としての機能を実行する。ＣＰＵ４７０は、プログラムを実行する際、ＲＡＭ４７４及び記憶装置４７６を一時記憶として使用する。また、ＣＰＵ４７０は、ＮＩＣ４７８を介して図５には示していないネットワーク（通信路）に接続した装置やサーバとデータをやり取りする。なお、ＣＰＵ４７０は、ＮＩＣ４７８を介して図示しない他の装置からプログラムを取得してもよい。また、ＣＰＵ４７０は、コンピュータで読み取り可能な記憶媒体４８０に含まれるプログラムを、図示しない読み取り装置を用いて読み出して動作しても良い。 TheCPU 470 reads out and executes a program included in theROM 472 or thestorage device 476 and executes a function as thestandby server 40. TheCPU 470 uses theRAM 474 and thestorage device 476 as temporary storage when executing the program. Further, theCPU 470 exchanges data with a device or server connected to a network (communication path) not shown in FIG. TheCPU 470 may acquire a program from another device (not shown) via theNIC 478. Further, theCPU 470 may operate by reading a program included in the computer-readable storage medium 480 using a reading device (not shown).

ＲＯＭ４７２は、ＣＰＵ４７０が実行するプログラム及び固定データを保持する。 TheROM 472 holds programs executed by theCPU 470 and fixed data.

ＲＡＭ４７４は、ＣＰＵ４７０を作業領域としてデータ又はプログラムを一時記憶する。 TheRAM 474 temporarily stores data or programs using theCPU 470 as a work area.

記憶装置４７６は、ＣＰＵ４７０が実行するプログラムやデータを記憶する。また、記憶装置４７６は、ＣＰＵ４７０の作業領域となる場合もある。 Thestorage device 476 stores programs and data executed by theCPU 470. Thestorage device 476 may be a work area for theCPU 470.

ＮＩＣ４７８は、ＣＰＵ４７０と図５には示していないネットワーク（通信路）を介した他の装置とのデータのやり取りを中継する。 TheNIC 478 relays data exchange between theCPU 470 and other devices via a network (communication path) not shown in FIG.

次に、監視サーバ５０の構成について説明する。 Next, the configuration of themonitoring server 50 will be described.

図６は、監視サーバ５０の機能的な構成の一例を示すブロック図である。 FIG. 6 is a block diagram illustrating an example of a functional configuration of themonitoring server 50.

なお、図６は、本実施形態の説明に関連する構成を記載したものである。例えば、表示部といった、説明に関係しない構成の記載は、省略した。 FIG. 6 shows a configuration related to the description of the present embodiment. For example, the description of a configuration that is not related to the description, such as a display unit, is omitted.

監視サーバ５０は、情報処理システム１０の全体を監視する。 The monitoringserver 50 monitors the entireinformation processing system 10.

そのため、監視サーバ５０は、監視処理部５１０を含む。 Therefore, the monitoringserver 50 includes amonitoring processing unit 510.

監視サーバ５０の監視処理部５１０は、運用サーバ３０、及び待機サーバ４０の動作を監視する。そして、監視処理部５１０は、運用サーバ３０の障害を検出すると、管理サーバ２０にフェイルオーバを指示する。なお、監視処理部５１０は、管理サーバ２０を監視しても良い。 Themonitoring processing unit 510 of themonitoring server 50 monitors the operations of theoperation server 30 and thestandby server 40. When themonitoring processing unit 510 detects a failure in theoperation server 30, themonitoring processing unit 510 instructs themanagement server 20 to perform a failover. Note that themonitoring processing unit 510 may monitor themanagement server 20.

また、監視サーバ５０は、情報処理システム１０の管理者などから、情報処理システム１０に含まれるいずれのサーバを待機サーバ４０とするかの設定、及び、待機サーバ４０のフェイルオーバの順番を受け取り保持しても良い。この場合、監視サーバ５０は、管理サーバ２０に、待機サーバ４０の指示、及び、フェイルオーバ時に運用を引き継ぐ待機サーバ４０を指定しても良い。 In addition, the monitoringserver 50 receives and holds, from an administrator of theinformation processing system 10, the setting of which server included in theinformation processing system 10 is thestandby server 40 and the failover order of thestandby server 40. May be. In this case, the monitoringserver 50 may designate thestandby server 40 to themanagement server 20 and thestandby server 40 to take over the operation at the time of failover.

さらに、監視サーバ５０は、各サーバの個別の構成を監視しても良い。例えば、監視サーバ５０は、運用サーバ３０及び待機サーバ４０のＢＭＣ４１０やＢＩＯＳ４２０、各サーバのＯＳやアプリケーションを監視しても良い。 Furthermore, the monitoringserver 50 may monitor the individual configuration of each server. For example, the monitoringserver 50 may monitor theBMC 410 andBIOS 420 of theoperation server 30 and thestandby server 40 and the OS and application of each server.

また、監視サーバ５０は、情報処理システム１０の起動時など所定のタイミングで、管理サーバ２０に待機サーバ４０の第１の起動を指示する。 In addition, the monitoringserver 50 instructs themanagement server 20 to first start thestandby server 40 at a predetermined timing such as when theinformation processing system 10 is started.

図７は、監視サーバ５０のハードウェア構成の一例を示すブロック図である。 FIG. 7 is a block diagram illustrating an example of a hardware configuration of themonitoring server 50.

なお、図７は、本実施形態の説明に関連する構成を記載したものである。例えば、キーボード、マウス、及び、ディスプレイといった、説明に関係しない構成の記載は、省略した。 FIG. 7 describes a configuration related to the description of the present embodiment. For example, descriptions of components that are not related to the description such as a keyboard, a mouse, and a display are omitted.

監視サーバ５０は、ＣＰＵ５２０と、ＲＯＭ５２２と、ＲＡＭ５２４と、記憶装置５２６と、ＮＩＣ５２８とを含む。 The monitoringserver 50 includes aCPU 520, aROM 522, a RAM 524, astorage device 526, and a NIC 528.

ＣＰＵ５２０は、ＲＯＭ５２２又は記憶装置５２６に含まれるプログラムを読み出して実行し、監視サーバ５０としての機能を実行する。ＣＰＵ５２０は、プログラムを実行する際、ＲＡＭ５２４及び記憶装置５２６を一時記憶として使用する。また、ＣＰＵ５２０は、ＮＩＣ５２８を介して図７には示していないネットワーク（通信路）に接続した装置とデータをやり取りする。なお、ＣＰＵ５２０は、ＮＩＣ５２８を介して他の装置からプログラムを取得してもよい。また、ＣＰＵ５２０は、コンピュータで読み取り可能な記憶媒体５３０に含まれるプログラムを、図示しない読み取り装置を用いて読み出して動作しても良い。 TheCPU 520 reads out and executes a program included in theROM 522 or thestorage device 526 and executes a function as themonitoring server 50. TheCPU 520 uses the RAM 524 and thestorage device 526 as temporary storage when executing the program. TheCPU 520 exchanges data with a device connected to a network (communication path) not shown in FIG. 7 via the NIC 528. Note that theCPU 520 may acquire a program from another device via the NIC 528. Further, theCPU 520 may operate by reading a program included in the computer-readable storage medium 530 using a reading device (not shown).

ＲＯＭ５２２は、ＣＰＵ５２０が実行するプログラム及び固定データを保持する。 TheROM 522 holds programs executed by theCPU 520 and fixed data.

ＲＡＭ５２４は、ＣＰＵ５２０を作業領域としてデータ又はプログラムを一時記憶する。 The RAM 524 temporarily stores data or programs using theCPU 520 as a work area.

記憶装置５２６は、ＣＰＵ５２０が実行するプログラムやデータを記憶する。また、記憶装置５２６は、ＣＰＵ５２０の作業領域となる場合もある。 Thestorage device 526 stores programs and data executed by theCPU 520. In addition, thestorage device 526 may be a work area for theCPU 520.

ＮＩＣ５２８は、ＣＰＵ５２０と図７には示していないネットワーク（通信路）を介した他の装置とのデータのやり取りを中継する。 The NIC 528 relays data exchange between theCPU 520 and other devices via a network (communication path) not shown in FIG.

なお、既に説明しているが、管理サーバ２０と監視サーバ５０は、１つのサーバで実現してもよい。 Although already described, themanagement server 20 and themonitoring server 50 may be realized by a single server.

続いて、情報処理システム１０の動作について説明する。 Next, the operation of theinformation processing system 10 will be described.

本実施形態に情報処理システム１０は、待機サーバ４０の起動を２つの段階に分ける。この起動に基づき、情報処理システム１０は、待機サーバ４０の数が運用サーバ３０の数より少ない場合にも対応でき、且つ、フェイルオーバ時の待機サーバ４０の起動時間を短くできる。 In this embodiment, theinformation processing system 10 divides activation of thestandby server 40 into two stages. Based on this activation, theinformation processing system 10 can cope with the case where the number ofstandby servers 40 is smaller than the number ofoperation servers 30 and can shorten the activation time of thestandby server 40 at the time of failover.

次に、本実施形態の特徴である待機サーバ４０の２つの段階の起動の動作について、それぞれ図面を参照して説明する。 Next, the operation of starting the two stages of thestandby server 40, which is a feature of the present embodiment, will be described with reference to the drawings.

まず、第１の段階の起動の動作を説明する。 First, the startup operation in the first stage will be described.

図８は、第１の段階の起動の動作の一例を示すシーケンス図である。 FIG. 8 is a sequence diagram illustrating an example of the activation operation in the first stage.

なお、図８は、動作の説明に関連する構成を記載し、他の構成の記載は省略した。 FIG. 8 shows a configuration related to the description of the operation, and the description of the other configuration is omitted.

待機サーバ４０に第１の起動を実行させるため、監視サーバ５０は、所定のタイミングで、待機サーバ４０の設定及び指示を、管理サーバ２０に出力する（１００１）。なお、所定のタイミングは、特に制限など無い。ただし、このタイミングは、例えば、情報処理システム１０の起動時など、運用サーバ３０が動作を開始する前が望ましい。 In order to cause thestandby server 40 to execute the first activation, the monitoringserver 50 outputs the setting and instruction of thestandby server 40 to themanagement server 20 at a predetermined timing (1001). The predetermined timing is not particularly limited. However, this timing is preferably before theoperation server 30 starts operating, for example, when theinformation processing system 10 is activated.

管理サーバ２０は、指示された待機サーバ４０に、一般的な起動ではなく、本実施形態の第１の起動を指示する（１００２）。 Themanagement server 20 instructs the instructedstandby server 40 to perform the first activation of this embodiment instead of a general activation (1002).

指示を受けた待機サーバ４０のＢＭＣ４１０は、第１の起動を指示されたことを格納する（１００３）。ＢＭＣ４１０は、第１の起動を指示されたことの格納として、例えば、フラグ（高速フラグ）を用意し、仮想ＢＩＯＳ情報格納部４５０に高速フラグを格納してもよい。なお、本実施形態において、高速フラグの格納は、仮想ＢＩＯＳ情報格納部４５０に限る必要はない。例えば、ＢＭＣ４１０が、高速フラグを格納しても良い。 TheBMC 410 of thestandby server 40 that has received the instruction stores that the first activation has been instructed (1003). TheBMC 410 may prepare, for example, a flag (high-speed flag) as storage indicating that the first activation has been instructed, and store the high-speed flag in the virtual BIOSinformation storage unit 450. In the present embodiment, the high-speed flag storage need not be limited to the virtual BIOSinformation storage unit 450. For example, theBMC 410 may store the high speed flag.

第１の起動の指示（高速フラグ）を格納後、ＢＭＣ４１０は、格納完了を管理サーバ２０に返す（１００４）。 After storing the first activation instruction (high-speed flag), theBMC 410 returns the storage completion to the management server 20 (1004).

格納完了を受けた管理サーバ２０は、待機サーバ４０のＢＭＣ４１０に電源オンを指示する（１００５）。 Upon receiving the storage completion, themanagement server 20 instructs theBMC 410 of thestandby server 40 to turn on the power (1005).

電源オンの指示を受けた待機サーバ４０のＢＭＣ４１０は、本実施形態の第１の段階の起動を開始する（１００６）。 TheBMC 410 of thestandby server 40 that has received the power-on instruction starts activation of the first stage of the present embodiment (1006).

待機サーバ４０の第１の段階の起動の以降の動作については、後ほど説明する。 The operation after the start-up of thestandby server 40 in the first stage will be described later.

第１の段階の起動を開始したＢＭＣ４１０は、管理サーバ２０に電源オンの開始を通知する（１００７）。 TheBMC 410 that has started the first stage notifies themanagement server 20 of the start of power-on (1007).

電源オンの開始（又は、完了）を受け取った管理サーバ２０は、待機サーバ４０の電源オンの開始を監視サーバ５０に通知する（１００８）。 Themanagement server 20 that has received the start (or completion) of power-on notifies themonitoring server 50 of the start of power-on of the standby server 40 (1008).

なお、管理サーバ２０は、第１段階の起動を開始すると待機サーバ４０に対しての処理が終了する。そのため、ＢＭＣ４１０は、管理サーバ２０に電源オンの完了を通知しても良い。電源オンの完了を受け取った管理サーバ２０は、監視サーバ５０に電源オンの完了を通知する。 Note that themanagement server 20 ends the processing for thestandby server 40 when starting the first stage of activation. Therefore, theBMC 410 may notify themanagement server 20 of the completion of power-on. Themanagement server 20 that has received the power-on completion notifies themonitoring server 50 of the power-on completion.

また、待機サーバ４０で電源オンを開始すると、監視サーバ５０の処理は、終了する。そのため、管理サーバ２０は、電源オンの開始に換えて、電源オンの終了又は設定完了を返しても良い。 Further, when thestandby server 40 starts to turn on the power, the processing of themonitoring server 50 ends. Therefore, themanagement server 20 may return a power-on end or a setting completion instead of the power-on start.

待機サーバ４０の第１段階の起動の残りの動作について説明する。 The remaining operation of the first activation of thestandby server 40 will be described.

電源オンの指示を受けたＢＭＣ４１０は、待機サーバ４０の電源をオンする。待機サーバ４０の電源がオンすると、ＢＩＯＳ４２０は、ハードウェアの立ち上げ、例えば、ハードウェアの初期化処理（１００９）とハードウェアのＰＯＳＴ（１０１０）とを行う。 Receiving the power-on instruction, theBMC 410 turns on thestandby server 40. When thestandby server 40 is powered on, theBIOS 420 starts up the hardware, for example, performs a hardware initialization process (1009) and a hardware POST (1010).

特許文献２に記載の待機サーバは、この後、通常の起動処理を継続する。 Thereafter, the standby server described in Patent Document 2 continues normal startup processing.

また、特許文献１に記載の待機サーバは、この後、運用サーバに対応した設定（論理区画の確保）を行う。その後、特許文献１に記載の待機サーバは、ＯＳをロードするところで停止する。 In addition, the standby server described in Patent Document 1 performs setting corresponding to the operation server (reserving logical partitions) thereafter. Thereafter, the standby server described in Patent Document 1 stops when the OS is loaded.

本実施形態の待機サーバ４０のＢＩＯＳ４２０は、次の説明するように、特許文献２及び特許文献１に記載の待機サーバとは異なる動作を行う。 TheBIOS 420 of thestandby server 40 according to the present embodiment performs an operation different from that of the standby server described in Patent Document 2 and Patent Document 1, as described below.

ハードウェアの設定が終了すると、ＢＩＯＳ４２０は、ＢＭＣ４１０に第１の起動を指示されているか否かを問い合わせる（１０１１）。なお、既に説明した高速フラグを用いる場合、ＢＩＯＳ４２０は、ＢＭＣ４１０に高速フラグを格納しているか否かを問い合わせる。 When the hardware setting is completed, theBIOS 420 inquires of theBMC 410 whether or not the first activation is instructed (1011). When using the high-speed flag already described, theBIOS 420 inquires whether the high-speed flag is stored in theBMC 410.

ＢＭＣ４１０は、ＢＩＯＳ４２０に第１の起動を指示されているか否か（指示状態）を返す（１０１２）。例えば、高速フラグを用いる場合、ＢＭＣ４１０は、高速フラグを格納しているか否かを返す。 TheBMC 410 returns whether or not theBIOS 420 is instructed to perform the first activation (instruction state) (1012). For example, when the high speed flag is used, theBMC 410 returns whether or not the high speed flag is stored.

ここでは、既に説明したとおり第１の起動の指示を受けているため、ＢＭＣ４１０は、ＢＩＯＳ４２０に指示されていることを返す。なお、ここで、第１の起動を指示されていないことを返した場合、ＢＩＯＳ４２０は、通常の起動を行う。 Here, since the first activation instruction has been received as described above, theBMC 410 returns an instruction to theBIOS 420. Here, when it is returned that the first activation is not instructed, theBIOS 420 performs normal activation.

第１の起動を指示されているため、ＢＩＯＳ４２０は、以降の処理、例えば、ハードウェアに仮想化情報を設定せず、停止状態（スリープなど）を開始する（１０１３）。本実施形態において、ＢＩＯＳ４２０が実現するスリープに特に制限はない。例えば、高速フラグを用いる場合、ＢＩＯＳ４２０は、ＢＭＣ４１０が高速フラグの格納をやめるまで、ＢＭＣ４１０の高速フラグの格納を定期的に確認（ポーリング）してもよい。あるいは、ＢＩＯＳ４２０は、ＡＣＰＩ（Advanced Configuration and Power Interface）に定義されたシステムスリープに移行し、ＢＭＣ４１０などからスリープ終了の要求を受けるまで、スリープを継続しても良い。 Since the first activation is instructed, theBIOS 420 starts a subsequent process (for example, sleep state) without setting the virtualization information in the hardware (1013). In the present embodiment, there is no particular limitation on the sleep realized by theBIOS 420. For example, when using the high-speed flag, theBIOS 420 may periodically check (polling) the storage of the high-speed flag of theBMC 410 until theBMC 410 stops storing the high-speed flag. Alternatively, theBIOS 420 may shift to the system sleep defined in ACPI (Advanced Configuration and Power Interface) and continue the sleep until a sleep termination request is received from theBMC 410 or the like.

ここまでの動作で、待機サーバ４０の第１の起動は、終了する。 With the operation so far, the first activation of thestandby server 40 ends.

つまり、待機サーバ４０は、ハードウェアの初期化や初期テストを実施するが、仮想化情報の設定など、運用サーバ３０の業務の引き継ぎの際に必要となる情報を設定しないでスリープする。 That is, thestandby server 40 performs hardware initialization and an initial test, but sleeps without setting information necessary for taking over the operation of theoperation server 30 such as setting of virtualization information.

なお、管理サーバ２０は、第１の起動指示（１００２）と電源オンの指示（１００５）を１つの指示としても良い。この場合、ＢＭＣ４１０は、第１の起動指示を格納（１００３）後、待機サーバ４０の電源のオン（１００６）を指示する。 Themanagement server 20 may use the first activation instruction (1002) and the power-on instruction (1005) as one instruction. In this case, after storing the first activation instruction (1003), theBMC 410 instructs thestandby server 40 to turn on (1006).

次に、第２の起動の動作について説明する。 Next, the second activation operation will be described.

第２の起動は、待機サーバ４０が第１の起動のスリープ状態に移行後、監視サーバ５０が運用サーバ３０の障害を検出した場合の動作である。 The second activation is an operation when themonitoring server 50 detects a failure of theoperation server 30 after thestandby server 40 shifts to the sleep state of the first activation.

図９は、第２の段階の起動の動作の一例を示すシーケンス図である。 FIG. 9 is a sequence diagram illustrating an example of the activation operation in the second stage.

運用サーバ３０の障害（２０００）を検出した監視サーバ５０は、フェイルオーバを管理サーバ２０に要求する（２００１）。そのため、監視サーバ５０は、管理サーバ２０に所定の指示を出力する。なお、情報処理システム１０が複数の待機サーバ４０を含む場合、管理サーバ２０は、監視サーバ５０が指定した待機サーバ４０を用いてフェイルオーバする。あるいは、管理サーバ２０は、所定の規則に従って待機サーバ４０を決定してもよい。 The monitoringserver 50 that detects the failure (2000) of theoperation server 30 requests themanagement server 20 to perform a failover (2001). Therefore, the monitoringserver 50 outputs a predetermined instruction to themanagement server 20. When theinformation processing system 10 includes a plurality ofstandby servers 40, themanagement server 20 fails over using thestandby server 40 specified by the monitoringserver 50. Alternatively, themanagement server 20 may determine thestandby server 40 according to a predetermined rule.

フェイルオーバの指示を受けた管理サーバ２０は、待機サーバ４０がフェイルオーバ可能か否かを確認するため、待機サーバ４０に装置状態を要求する（２００２）。本実施形態の待機サーバ４０は、スリープ状態がフェイルオーバ可能な状態である。そのため、例えば、管理サーバ２０は、ＢＭＣ４１０に待機サーバ４０の電源状態を問い合わせても良い。 Receiving the failover instruction, themanagement server 20 requests the apparatus state from thestandby server 40 in order to confirm whether thestandby server 40 can fail over (2002). Thestandby server 40 of the present embodiment is in a state where the sleep state can be failed over. Therefore, for example, themanagement server 20 may inquire theBMC 410 about the power supply state of thestandby server 40.

ＢＭＣ４１０は、待機サーバ４０の装置状態を管理サーバ２０に返す（２００３）。なお、電源状態を要求されて場合、ＢＭＣ４１０は、スリープなど待機サーバ４０の電源状態を管理サーバ２０に返す。 TheBMC 410 returns the device status of thestandby server 40 to the management server 20 (2003). When the power status is requested, theBMC 410 returns the power status of thestandby server 40 such as sleep to themanagement server 20.

管理サーバ２０は、受け取った待機サーバ４０の装置状態（例えば、電源状態）を確認する（２００４）。待機サーバ４０の装置状態がフェイルオーバ可能でない場合、管理サーバ２０は、所定の規則の従って再実行（リトライ）するか、監視サーバ５０又は図示しない情報処理システム１０の保守サーバにフェイルオーバができないことを通知する。なお、管理サーバ２０は、リトライとして、例えば、別の待機サーバ４０を選択して装置状態の確認する（２００２）。 Themanagement server 20 confirms the device status (for example, power status) of the received standby server 40 (2004). If the device state of thestandby server 40 is not capable of failover, themanagement server 20 re-executes (retry) according to a predetermined rule, or notifies themonitoring server 50 or the maintenance server of the information processing system 10 (not shown) that failover cannot be performed. To do. For example, themanagement server 20 selects anotherstandby server 40 and confirms the device status as a retry (2002).

待機サーバ４０がフェイルオーバ可能な場合、管理サーバ２０は、管理サーバ２０の管理処理部２１０が格納する紐付け情報と仮想化情報とを基に、障害を起こした運用サーバ３０に設定した仮想化情報を取得する。そして、管理サーバ２０は、待機サーバ４０に、取得した仮想化情報を送り、待機サーバ４０に仮想化情報の設定を指示する（２００５）。 When thestandby server 40 can fail over, themanagement server 20 uses the association information stored in themanagement processing unit 210 of themanagement server 20 and the virtualization information to set the virtualization information set in theoperation server 30 in which the failure has occurred To get. Then, themanagement server 20 sends the acquired virtualization information to thestandby server 40 and instructs thestandby server 40 to set the virtualization information (2005).

仮想化情報を受け取ったＢＭＣ４１０は、仮想化情報を仮想化情報格納部４３０に格納する（２００６）。ＢＭＣ４１０は、仮想化情報とともに、管理サーバ２０から仮想ＢＩＯＳ情報を受け取っても良い。仮想ＢＩＯＳ情報を受け取ったＢＭＣ４１０は、仮想ＢＩＯＳ情報を仮想ＢＩＯＳ情報格納部４５０に格納する。 TheBMC 410 that has received the virtualization information stores the virtualization information in the virtualization information storage unit 430 (2006). TheBMC 410 may receive virtual BIOS information from themanagement server 20 together with the virtualization information. TheBMC 410 that has received the virtual BIOS information stores the virtual BIOS information in the virtual BIOSinformation storage unit 450.

情報の格納が終了後、ＢＭＣ４１０は、仮想化情報の格納終了を管理サーバ２０に通知する（２００７）。 After storing the information, theBMC 410 notifies themanagement server 20 of the end of storing the virtualization information (2007).

仮想化情報の格納を受け取った管理サーバ２０は、待機サーバ４０のＢＭＣ４１０に、待機サーバ４０の第２の段階の起動を指示する（２００８）。 Themanagement server 20 that has received the storage of the virtualization information instructs theBMC 410 of thestandby server 40 to start thestandby server 40 in the second stage (2008).

第２の段階の起動の指示を受けたＢＭＣ４１０は、第２の起動を開始する（２００９）。 TheBMC 410 that has received the second-stage activation instruction starts the second activation (2009).

具体的には、ＢＭＣ４１０は、ＢＩＯＳ４２０に第２の起動を指示する（２０１０）。本実施形態に第２の起動の開始の指示は、特に制限はない。例えば、ＢＩＯＳ４２０が、ＢＭＣ４１０の高速フラグをポーリングしている場合、ＢＭＣ４１０は、高速フラグを解除しても良い。この場合、ＢＭＣ４１０は、具体的な指示をＢＩＯＳ４２０に送る必要が無い。このように本実施形態は、必ずしも通知しない場合も含むため、図９の矢印は、破線を用いた。あるいは、ＢＩＯＳ４２０がＡＣＰＩのスリープ状態の場合、ＢＭＣ４１０は、ＢＩＯＳ４２０にスリープ解除を指示してもよい。 Specifically, theBMC 410 instructs theBIOS 420 to perform the second activation (2010). The instruction for starting the second activation is not particularly limited in the present embodiment. For example, when theBIOS 420 is polling the high speed flag of theBMC 410, theBMC 410 may cancel the high speed flag. In this case, theBMC 410 does not need to send a specific instruction to theBIOS 420. As described above, since this embodiment includes a case where notification is not always performed, a broken line is used for the arrow in FIG. Alternatively, when theBIOS 420 is in the ACPI sleep state, theBMC 410 may instruct theBIOS 420 to cancel the sleep.

この後のＢＩＯＳ４２０の動作は、後ほど説明する。 The subsequent operation of theBIOS 420 will be described later.

ＢＩＯＳ４２０に第２段階の起動を指示後、ＢＭＣ４１０は、管理サーバ２０に第２段階の起動の開始を通知する（２０１１）。 After instructing theBIOS 420 to start the second stage, theBMC 410 notifies themanagement server 20 of the start of the second stage start (2011).

起動の開始を受け取った管理サーバ２０は、監視サーバ５０にフェイルオーバの開始を通知する（２０１２）。 Themanagement server 20 that has received the start of startup notifies themonitoring server 50 of the start of failover (2012).

なお、管理サーバ２０は、第２段階の起動を開始すると待機サーバ４０に対しての処理が終了する。そのため、シーケンス２０１１において、ＢＭＣ４１０は、管理サーバ２０に起動の完了を通知しても良い。起動の完了の通知を受けた管理サーバ２０は、監視サーバ５０にフェイルオーバの完了を通知しても良い。 In addition, themanagement server 20 will complete | finish the process with respect to thestandby server 40, if the start of a 2nd step is started. Therefore, in thesequence 2011, theBMC 410 may notify themanagement server 20 of the start completion. Themanagement server 20 that has received the notification of completion of activation may notify themonitoring server 50 of the completion of failover.

また、フェイルオーバの開始後、監視サーバ５０が待機サーバ４０を運用中のサーバとして監視する場合、管理サーバ２０は、フェイルオーバの開始の通知に換えて、フェイルオーバの完了を通知しても良い。 In addition, when themonitoring server 50 monitors thestandby server 40 as an active server after the failover is started, themanagement server 20 may notify the completion of the failover instead of the notification of the failover start.

第２の段階の起動の開始の指示を受けたＢＩＯＳ４２０は、スリープを終了する。そして、ＢＩＯＳ４２０は、仮想化情報をハードウェアに設定する。なお、本実施形態のＢＩＯＳ４２０の仮想化情報の取得は、特に制限はない。例えば、ＢＩＯＳ４２０は、仮想化情報格納部４３０及び仮想ＢＩＯＳ情報格納部４５０が情報を格納しているかを確認しても良い。本実施形態の説明では、他の例として、ＢＩＯＳ４２０が、ＢＭＣ４１０から仮想化情報と仮想ＢＩＯＳ情報とを取得するとして説明する。 TheBIOS 420 that has received the instruction to start the activation in the second stage ends the sleep. Then, theBIOS 420 sets the virtualization information in hardware. In addition, acquisition of the virtualization information of theBIOS 420 of this embodiment is not particularly limited. For example, theBIOS 420 may check whether the virtualizationinformation storage unit 430 and the virtual BIOSinformation storage unit 450 store information. In the description of the present embodiment, as another example, theBIOS 420 is described as acquiring virtualization information and virtual BIOS information from theBMC 410.

ＢＩＯＳ４２０は、ＢＭＣ４１０に仮想化情報（及び仮想ＢＩＯＳ情報）を要求する（２０１３）。 TheBIOS 420 requests virtualization information (and virtual BIOS information) from the BMC 410 (2013).

ＢＭＣ４１０は、シーケンス２００６で格納した、仮想化情報格納部４３０に格納されている仮想化情報と、仮想ＢＩＯＳ情報格納部４５０に格納している仮想ＢＩＯＳ情報とを、ＢＩＯＳ４２０に送る（２０１４）。 TheBMC 410 sends the virtualization information stored in the virtualizationinformation storage unit 430 and the virtual BIOS information stored in the virtual BIOSinformation storage unit 450 stored in thesequence 2006 to the BIOS 420 (2014).

ＢＩＯＳ４２０は、仮想化情報と仮想ＢＩＯＳ情報とを基にハードウェアの仮想化を含む論理的な設定を行い、さらに、仮想ＢＩＯＳ情報を基に仮想ＢＩＯＳとしての動作を開始する（２０１５）。 TheBIOS 420 performs logical settings including hardware virtualization based on the virtualization information and the virtual BIOS information, and further starts an operation as a virtual BIOS based on the virtual BIOS information (2015).

ハードウェアの設定を終了後、ＢＩＯＳ４２０は、ＯＳをロードする（２０１６）。 After completing the hardware setting, theBIOS 420 loads the OS (2016).

そして、待機サーバ４０は、運用に必要なアプリケーションの動作を開始し、運用サーバ３０の業務の処理を引き継ぐ。 Then, thestandby server 40 starts operation of an application necessary for operation, and takes over the business processing of theoperation server 30.

ここまで説明した動作で、第２の起動の動作は、終了する。 With the operation described so far, the second activation operation is completed.

この後、待機サーバ４０は、運用に必要なアプリケーションの動作を開始する。 Thereafter, thestandby server 40 starts an operation of an application necessary for operation.

なお、管理サーバ２０は、仮想化情報設定要求（２００５）と起動指示（２００８）を１つの指示としても良い。この場合、ＢＭＣ４１０は、仮想化情報を設定（２００６）後、第２の起動を開始する（２００９）。 Themanagement server 20 may use the virtualization information setting request (2005) and the start instruction (2008) as one instruction. In this case, theBMC 410 starts the second activation after setting the virtualization information (2006) (2009).

本実施形態に係る情報処理システム１０は、運用サーバ３０より少ない待機サーバ４０において、待機サーバ４０の起動時間を短縮できる。 Theinformation processing system 10 according to the present embodiment can shorten the activation time of thestandby server 40 in thestandby servers 40 that are fewer than theoperation server 30.

その理由は、次のとおりである。 The reason is as follows.

本実施形態の待機サーバ４０は、障害が発生してフェイルオーバが開始する前にハードウェアの初期化など第１の段階の起動を終了している。そのため、障害が発生してフェイルオーバする場合、待機サーバ４０は、仮想化を含む第２の起動を処理すればよい。つまり、待機サーバ４０は、第１の段階の起動の時間だけ、フェイルオーバでの起動時間を短くできる。 Thestandby server 40 of the present embodiment has finished the first stage of activation such as hardware initialization before a failure occurs and failover starts. Therefore, when a failure occurs and a failover occurs, thestandby server 40 may process the second activation including virtualization. That is, thestandby server 40 can shorten the start-up time for failover by the start time for the first stage.

なお、一般的に、サーバの起動において、ハードウェアの仮想化を含む論理的な設定に必要な時間は、ハードウェアの初期化に必要な時間に比べ短い。つまり、待機サーバ４０の第２の段階の起動時間は、第１の段階の起動時間に比べ短い。そのため、本実施形態の待機サーバ４０は、フェイルオーバで必要となる起動時間を大幅に削減できる。 In general, the time required for logical setting including hardware virtualization in starting the server is shorter than the time required for hardware initialization. That is, the startup time of the second stage of thestandby server 40 is shorter than the startup time of the first stage. Therefore, thestandby server 40 of the present embodiment can greatly reduce the startup time required for failover.

さらに、本実施形態の待機サーバ４０は、第１の段階の起動後でも、ハードウェアの仮想化を設定していない。そして、待機サーバ４０は、第２の段階の起動で、仮想化情報を受け取り、ハードウェアを仮想化する。そのため、待機サーバ４０は、いずれの運用サーバ３０についてもフェイルオーバに対応できる。つまり、本実施形態の情報処理システム１０は、運用サーバ３０より少ない台数の待機サーバ４０でフェイルオーバを実現できる。 Furthermore, thestandby server 40 of the present embodiment does not set hardware virtualization even after the first stage startup. Then, thestandby server 40 receives the virtualization information and virtualizes the hardware at the start of the second stage. Therefore, thestandby server 40 can cope with failover for anyoperational server 30. That is, theinformation processing system 10 according to the present embodiment can implement failover with a smaller number ofstandby servers 40 than theoperation server 30.

以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 While the present invention has been described with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

１０情報処理システム
２０管理サーバ
３０運用サーバ
４０待機サーバ
５０監視サーバ
２１０管理処理部
２２０ＣＰＵ
２２２ＲＯＭ
２２４ＲＡＭ
２２６記憶装置
２２８ＮＩＣ
２３０記憶媒体
４１０ＢＭＣ
４２０ＢＩＯＳ
４３０仮想化情報格納部
４４０ＨＷ情報格納部
４５０仮想ＢＩＯＳ情報格納部
４７０ＣＰＵ
４７２ＲＯＭ
４７４ＲＡＭ
４７６記憶装置
４７８ＮＩＣ
４８０記憶媒体
５１０監視処理部
５２０ＣＰＵ
５２２ＲＯＭ
５２４ＲＡＭ
５２６記憶装置
５２８ＮＩＣ
５３０記憶媒体DESCRIPTION OFSYMBOLS 10Information processing system 20Management server 30Operation server 40Standby server 50Monitoring server 210Management processing part 220 CPU
222 ROM
224 RAM
226storage device 228 NIC
230 Storage medium 410 BMC
420 BIOS
430 Virtualizationinformation storage unit 440 HWinformation storage unit 450 Virtual BIOSinformation storage unit 470 CPU
472 ROM
474 RAM
476storage device 478 NIC
480Storage medium 510Monitoring processing unit 520 CPU
522 ROM
524 RAM
526 storage device 528 NIC
530 storage media

Claims

Translated fromJapanese

業務を運用するための情報処理を実行する運用サーバと、
前記運用サーバの障害時に業務の処理を引き継ぐ待機サーバと、
前記運用サーバと前記待機サーバとの引き継ぎを管理する管理サーバと、
前記運用サーバの障害を検出する監視サーバとを含み、
前記待機サーバは、
前記待機サーバの少なくとも一部のハードウェアを初期化した後に停止する第１の起動と、
前記ハードウェアの仮想化の設定を含む第２の起動とを制御する
入出力手段と、
前記管理サーバからの第１の起動の指示を受けとり、前記入力手段に前記第１の起動を指示し、
前記管理サーバから前記ハードウェアを仮想化するための仮想化情報と、前記第２の起動の指示とを受けとり、前記入出力手段に前記仮想化情報を渡し、前記第２の起動を指示する
管理制御手段と、
を含み、
前記監視サーバは、
所定のタイミングで前記管理サーバに前記待機サーバの前記第１の起動のための指示を出し、
前記運用サーバの障害を検出したときに前記管理サーバに所定の指示を出力し、
前記管理サーバは、
前記監視サーバから前記第１の起動のための指示を受けとり、前記待機サーバに前記第１の起動を指示し、
前記監視サーバから前記所定の指示を受けとり、前記待機サーバに前記仮想化情報と、前記第２の起動の指示を送信する
情報処理システム。An operation server that executes information processing for operating business,
A standby server that takes over business processing in the event of a failure of the operational server;
A management server that manages the takeover between the operation server and the standby server;
A monitoring server that detects a failure of the operational server,
The standby server is
A first start to stop after initializing at least some of the hardware of the standby server;
Input / output means for controlling the second startup including the hardware virtualization setting;
Receiving a first activation instruction from the management server, instructing the first activation to the input means;
Receives virtualization information for virtualizing the hardware and the second activation instruction from the management server, passes the virtualization information to the input / output means, and instructs the second activation Control means;
Including
The monitoring server is
An instruction for the first activation of the standby server is issued to the management server at a predetermined timing,
When a failure of the operation server is detected, a predetermined instruction is output to the management server,
The management server
Receiving an instruction for the first activation from the monitoring server, instructing the standby server to perform the first activation;
An information processing system that receives the predetermined instruction from the monitoring server and transmits the virtualization information and the second activation instruction to the standby server.

前記待機サーバは、
前記仮想化情報を格納する仮想化情報格納手段と、
仮想化を適用しないときのハードウェアに関するハードウェア情報を格納するハードウェア情報格納手段と
を含む請求項１に記載の情報処理システム。The standby server is
Virtualization information storage means for storing the virtualization information;
The information processing system according to claim 1, further comprising: hardware information storage means for storing hardware information related to hardware when virtualization is not applied.

前記待機サーバは、
前記入出力手段の一部又は全部を仮想化するための情報である仮想入出力情報を格納する仮想入出力情報格納手段
を含む請求項１及び請求項２に記載の情報処理システム。The standby server is
The information processing system according to claim 1, further comprising virtual input / output information storage means for storing virtual input / output information that is information for virtualizing a part or all of the input / output means.

前記管理サーバは、
前記仮想化情報と、前記仮想化情報と関係するサーバに関する情報を含む紐付け情報とを含む管理処理手段を
含む請求項２又は請求項３に記載の情報処理システム。The management server
The information processing system according to claim 2 or 3, further comprising management processing means including the virtualization information and association information including information related to a server related to the virtualization information.

業務を運用するための情報処理を実行する運用サーバと、
前記運用サーバの障害時に業務の処理を引き継ぐ待機サーバと、
前記運用サーバと前記待機サーバとの引き継ぎを管理する管理サーバと、
前記運用サーバの障害を検出する監視サーバとを含み、
前記監視サーバは、
所定のタイミングで前記管理サーバに前記待機サーバの第１の起動のための指示を出力し、
前記運用サーバの障害を検出したときに前記管理サーバに所定の指示を出力し、
前記管理サーバは、
前記監視サーバから前記第１の起動のための指示を受けとり、前記待機サーバに第１の起動を指示し、
前記監視サーバから前記所定の指示を受けとり、前記待機サーバに仮想化情報を送り、第２の起動を指示し、
前記待機サーバは、
前記管理サーバからの前記第１の起動の指示を受けとり、ハードウェアを初期化した後に停止し、
前記管理サーバから前記仮想化情報と前記第２の起動の指示とを受けとり、前記仮想化情報を設定してハードウェアを起動する
情報処理システムの制御方法。An operation server that executes information processing for operating business,
A standby server that takes over business processing in the event of a failure of the operational server;
A management server that manages the takeover between the operation server and the standby server;
A monitoring server that detects a failure of the operational server,
The monitoring server is
Outputting an instruction for first activation of the standby server to the management server at a predetermined timing;
When a failure of the operation server is detected, a predetermined instruction is output to the management server,
The management server
Receiving an instruction for the first activation from the monitoring server, instructing the standby server to perform the first activation,
Receiving the predetermined instruction from the monitoring server, sending virtualization information to the standby server, instructing a second activation;
The standby server is
Receives the first start instruction from the management server, stops after initializing the hardware,
A control method for an information processing system that receives the virtualization information and the second activation instruction from the management server, sets the virtualization information, and activates hardware.

ハードウェアを初期化した後に停止する第１の起動と、ハードウェアの仮想化を含む第２の起動とを制御する入出力手段と、
管理サーバからの前記第１の起動の指示を受けとり、前記入力手段に前記第１の起動を指示し、前記管理サーバから前記ハードウェアの仮想化のための仮想化情報と前記第２の起動の指示とを受けとり、前記入出力手段に前記仮想化情報を渡し、前記第２の起動を指示する管理制御手段と
を含む待機サーバ。Input / output means for controlling the first activation that stops after initializing the hardware and the second activation that includes hardware virtualization;
The first activation instruction is received from the management server, the first activation is instructed to the input means, and the virtualization information for hardware virtualization and the second activation instruction are transmitted from the management server. A management server that receives the instruction, passes the virtualization information to the input / output unit, and instructs the second activation.

管理サーバからの第１の起動の指示を受けとり、ハードウェアを初期化した後に停止する第１の起動を行い、
前記管理サーバからハードウェアの仮想化のための仮想化情報と第２の起動の指示とを受けると前記ハードウェアの仮想化を含む第２の起動を行う
待機サーバの起動方法。Receives the first activation instruction from the management server, performs the first activation to stop after initializing the hardware,
A standby server activation method for performing second activation including virtualization of hardware when receiving virtualization information for hardware virtualization and a second activation instruction from the management server.

管理サーバからの第１の起動の指示を受け取り、ハードウェアを初期化した後に停止する第１の起動処理と、
前記管理サーバからハードウェアの仮想化するための仮想化情報と第２の起動の指示とを受けとり、ハードウェアの仮想化を含む第２の起動処理と
をコンピュータに実行させる待機サーバのプログラム。A first activation process for receiving a first activation instruction from the management server and stopping after initializing the hardware;
A standby server program that receives virtualization information for hardware virtualization and a second activation instruction from the management server, and causes a computer to execute a second activation process including hardware virtualization.