JPH11184643A

Movatterモバイル変換

Info

Publication number: JPH11184643A
Application number: JP9365609A
Authority: JP
Inventors: Hideki Matsuhashi; 英樹松橋
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1997-12-22
Filing date: 1997-12-22
Publication date: 1999-07-09

Abstract

PROBLEM TO BE SOLVED: To always maintain constant the physical arrangement relation of disk devices constituting a disk array without degrading the reliability concerning a disk array device having a hot swap function. SOLUTION: When any faulty disk device occurs among disk devices 22-1 to 22-n constituting a disk array, a recovery means 33 recovers data on the faulty disk device onto a backup disk device 22 through a hot swap function and changes the disk array into configuration using a backup disk device 22 in place of the faulty disk device. When the faulty disk device is exchanged with a normal disk device, the recovery means 33 recovers the data on the backup disk device 22 onto the exchanged disk device in the state of maintaining the redundant state of the disk array without stopping job operation and changes the disk array again into configuration using the exchanged disk device in place of the backup disk device 22.

Description

Translated fromJapanese

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は予備のディスク装置
を備えたディスクアレイ装置に関し、より詳細には、故
障ディスク装置に代えて予備ディスク装置を使用してい
る状態において、故障ディスク装置が正常なディスク装
置に交換されたとき、予備ディスク装置を使用しない元
のディスクアレイ構成に戻すようにしたディスクアレイ
装置の管理方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a disk array device having a spare disk device, and more particularly, to a case in which a spare disk device is used instead of a failed disk device. The present invention relates to a disk array device management method for returning to an original disk array configuration that does not use a spare disk device when the disk array device is replaced.

【０００２】[0002]

【従来の技術】ディスクアレイ装置は、比較的安価で小
型のディスク装置を数台から十数台並べて並列動作させ
ることにより、データ転送速度や信頼性の向上を図った
ものであり、代表的なものに、ＲＡＩＤ（Ｒｅｄｕｎｄ
ａｎｔａｒｒａｙｏｆｉｎｅｘｐｅｎｓｉｖｅ）
１乃至５の各装置が知られている。何れも冗長化構成に
よって、障害発生時には正常なディスク装置のデータか
ら障害の発生したディスク装置のデータを復元すること
ができる。2. Description of the Related Art A disk array device is designed to improve data transfer speed and reliability by arranging several to a dozen or more relatively inexpensive and small disk devices and operating them in parallel. What is RAID (Redund
ant array of inexpensive)
Each of the devices 1 to 5 is known. In any case, due to the redundant configuration, when a failure occurs, data of a failed disk device can be restored from data of a normal disk device.

【０００３】しかしながら、使用不可能となるような故
障の発生したディスク装置をそのままにしておくと冗長
構成が維持できないため、別のディスク装置にも障害が
発生した場合にデータの復元が不可能となる。そこで、
ディスクアレイ装置内の１つのディスク装置に使用不可
能となるような故障が発生した場合には、正常なディス
ク装置を使って冗長構成を速やかに維持するようにして
いる。[0003] However, if a failed disk device that cannot be used is left as it is, a redundant configuration cannot be maintained. Therefore, if another disk device also fails, data cannot be restored. Become. Therefore,
When a failure occurs such that one disk device in the disk array device becomes unusable, a normal disk device is used to quickly maintain the redundant configuration.

【０００４】このようなことを実現するため、例えば特
開平７−１５２４９５号公報に見られるように、故障デ
ィスク装置が発生したとき、その故障ディスク装置が正
常なディスク装置と人手によって交換されるまで縮退さ
れた状態で運転を続け、正常なディスク装置と交換され
たらデータの復旧を行って冗長構成に復帰する方式が知
られているが、人手によるディスク装置の交換が遅れる
とそれだけ縮退運転期間が長くなり、信頼性の点で問題
がある。In order to realize such a situation, for example, as disclosed in Japanese Patent Application Laid-Open No. 7-152495, when a failed disk device occurs, it is necessary to manually replace the failed disk device with a normal disk device. It is known to continue operation in a degraded state and recover data after replacing the disk unit with a normal disk unit to return to a redundant configuration. It is longer and has problems in reliability.

【０００５】このため、多少コストがかかるが、予めデ
ィスクアレイ装置内に予備のディスク装置を準備してお
き、故障ディスク装置が発生したら業務運用中に自動的
に予備ディスク装置に対して故障ディスク装置のデータ
を復旧し、故障ディスク装置以外のディスク装置と前記
予備ディスク装置とでディスクアレイを再構成して運用
を続ける方式が一般に採用されている。このような機能
はホットスワップ機能などと呼ばれている。また、故障
ディスク装置はシステムの電源を切ることなく正常な新
しいディスク装置と交換できる。従来は、このようにし
て交換した新しいディスク装置を新たな予備ディスク装
置としていた（例えば前記公報の「従来の技術」参
照）。[0005] For this reason, although it costs a little, a spare disk device is prepared in advance in the disk array device, and if a failed disk device occurs, the failed disk device is automatically replaced with the spare disk device during business operation. In general, a method of restoring the above data and reconstructing a disk array by using a disk device other than the failed disk device and the spare disk device to continue operation is generally adopted. Such a function is called a hot swap function or the like. Further, a failed disk device can be replaced with a normal new disk device without turning off the system. Conventionally, a new disk device replaced in this way is used as a new spare disk device (for example, see "Conventional technology" in the above-mentioned publication).

【０００６】[0006]

【発明が解決しようとする課題】上述したようにホット
スワップ機能を備えた従来のディスクアレイ装置におい
ては、故障ディスク装置と交換されたディスク装置は新
たな予備ディスク装置としてシステムに残されるため、
故障発生，自動復旧，ディスク交換が繰り返されると、
ディスクアレイを構成するディスク装置の物理的な配置
関係が初期の関係から大きく変化してしまう。As described above, in the conventional disk array device having the hot swap function, the disk device replaced with the failed disk device is left in the system as a new spare disk device.
When failures occur, automatic recovery, and disk replacement are repeated,
The physical arrangement relationship of the disk devices constituting the disk array greatly changes from the initial relationship.

【０００７】例えば図１３（ａ）の平面図に示すよう
に、合計３０台のディスク装置が５行，６列に配置され
ており、初期の状態においては、１行目から５行目まで
の各行の５台のディスク装置毎にそれぞれ論理的な１つ
のディスク装置を構成するディスクアレイＤＡ１〜ＤＡ
５が構築されており、また６行目の５台のディスク装置
は予備ディスク装置として待機しているとする。この状
態で、例えばディスクアレイＤＡ１の５番目のディスク
装置が故障した為、ホットスワップ機能により１番目の
予備ディスク装置を使った自動復旧が行われ、その後に
利用者が故障ディスク装置を交換すると、図１３（ｂ）
の平面図に示すように、ディスクアレイＤＡ１の構成デ
ィスクは１行目の４台と６行目の１台になる。同様に、
故障発生，自動復旧，ディスク交換が何度か繰り返され
ると、例えば図１３（ｃ）に示すように、ディスクアレ
イＤＡ１〜ＤＡ５の構成ディスクは物理的に各行に散在
し、また予備ディスク装置も各行に分散する。この結
果、各ディスクアレイを構成するディスク装置がどれ
で、予備としてのディスク装置がどれであるかを、物理
的な位置関係を基に特定することができなくなり、保
守，管理に支障が生じるという問題があった。For example, as shown in the plan view of FIG. 13A, a total of 30 disk devices are arranged in 5 rows and 6 columns, and in an initial state, the first to fifth rows are arranged. Disk arrays DA1 to DA constituting one logical disk device for each of the five disk devices in each row
5 is constructed, and the five disk devices in the sixth row are on standby as spare disk devices. In this state, for example, since the fifth disk device of the disk array DA1 has failed, automatic recovery using the first spare disk device is performed by the hot swap function, and when the user subsequently replaces the failed disk device, FIG. 13 (b)
As shown in the plan view of FIG. 7, the disks constituting the disk array DA1 are four in the first row and one in the sixth row. Similarly,
If the failure occurrence, the automatic recovery, and the disk replacement are repeated several times, for example, as shown in FIG. 13C, the disks constituting the disk arrays DA1 to DA5 are physically scattered in each row, and the spare disk device is also in each row. Disperse in. As a result, it is not possible to identify which disk device constitutes each disk array and which disk device is a spare disk device based on the physical positional relationship, which causes a problem in maintenance and management. There was a problem.

【０００８】そこで本発明の目的は、ディスクアレイを
構成するディスク装置の物理的な配置関係を常に維持す
るのに好適なディスクアレイ装置の管理方法を提供する
ことにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a disk array device management method suitable for always maintaining the physical arrangement of the disk devices constituting the disk array.

【０００９】[0009]

【課題を解決するための手段】本発明は上記の目的を達
成するために、ディスクアレイを構成する複数のディス
ク装置に加えて１台以上の予備ディスク装置を備え、デ
ィスクアレイを構成する何れかのディスク装置に故障が
発生した場合に運用業務を停止することなく故障ディス
ク装置のデータを予備ディスク装置に復旧し、前記故障
ディスク装置以外のディスク装置と前記予備ディスク装
置とでディスクアレイを再構成して該再構成ディスクア
レイで運用業務を続行する機能を有するディスクアレイ
装置において、前記予備ディスク装置のデータを、前記
故障ディスク装置と交換された正常なディスク装置に復
旧し、前記再構成ディスクアレイ中の前記予備ディスク
装置以外のディスク装置と前記正常なディスク装置とで
ディスクアレイを再度、再構成する。より具体的には、
前記故障ディスク装置が正常なディスク装置に交換され
たとき、前記再構成ディスクアレイに対する運用業務を
停止することなく且つ前記再構成ディスクアレイの冗長
状態を維持しながら、前記予備ディスク装置のデータを
前記交換後の正常なディスク装置に復旧する第１の段階
と、前記交換後の正常なディスク装置を前記再構成ディ
スクアレイに組み込むと共に、前記予備ディスク装置を
前記再構成ディスクアレイから切り離す第２の段階とを
含むことを特徴とする。According to the present invention, there is provided a disk array comprising at least one spare disk unit in addition to a plurality of disk units constituting a disk array. When a failure occurs in any of the disk devices, the data of the failed disk device is restored to the spare disk device without stopping the operation, and the disk array is reconfigured with the disk devices other than the failed disk device and the spare disk device In the disk array device having the function of continuing the operation in the reconfigured disk array, the data of the spare disk device is restored to a normal disk device replaced with the failed disk device, and the reconfigured disk array is restored. Disk arrays other than the spare disk unit in the disk array and the normal disk units Every time, to reconstruct. More specifically,
When the failed disk device is replaced with a normal disk device, the data of the spare disk device is transferred to the spare disk device without stopping the operation of the reconstructed disk array and maintaining the redundant state of the reconstructed disk array. A first stage of restoring to a normal disk device after replacement, and a second stage of incorporating the normal disk device after replacement into the reconfigurable disk array and separating the spare disk device from the reconfigurable disk array And characterized in that:

【００１０】ここで、第１の段階においては、前記再構
成ディスクアレイに対する運用業務にかかる書き込み処
理は、前記再構成ディスクアレイを構成する全ディスク
装置と前記交換後の正常なディスク装置とを対象にする
と共に、前記再構成ディスクアレイ中の予備ディスク装
置と前記交換後の正常なディスク装置に対しては同じデ
ータを二重に書き込むようにし、前記再構成ディスクア
レイに対する運用業務にかかる読み込み処理は、再構成
ディスクアレイを構成する全ディスク装置を対象にす
る。そして、これら運用業務にかかる書き込み処理およ
び読み込み処理と排他制御を行って、前記再構成ディス
クアレイ中の予備ディスク装置の全てのデータを前記交
換後の正常なディスク装置に複写する処理を行う。この
ような複写処理は、前記再構成ディスクアレイ中の予備
ディスク装置の全データを読み込んで前記交換後の正常
なディスク装置に書き戻すことにより行うことができ、
また前記再構成ディスクアレイの全ディスク装置の全デ
ータを読み込んで前記再構成ディスクアレイを構成する
全ディスク装置と前記交換後の正常なディスク装置に書
き戻すことにより行うこともできる。更に、上記運用業
務にかかる書き込み処理および読み込み処理と排他制御
を行って、前記再構成ディスクアレイ中の前記予備ディ
スク装置以外の全ディスク装置のデータを読み込んでデ
ータを再構築し、該再構築したデータのうち前記予備デ
ィスク装置に対応するデータを前記交換後の正常なディ
スク装置に書き戻す処理を行うようにすることも可能で
ある。Here, in the first stage, the write processing relating to the operation of the reconfigured disk array is performed on all the disk devices constituting the reconfigured disk array and the normal disk device after the replacement. In addition, the same data is written in duplicate to the spare disk device in the reconfigured disk array and the normal disk device after the replacement. , For all disk devices constituting the reconfigured disk array. Then, a write process and a read process related to these operation tasks and an exclusive control are performed, and a process of copying all data of the spare disk device in the reconfigurable disk array to a normal disk device after the replacement is performed. Such a copying process can be performed by reading all the data of the spare disk device in the reconfigured disk array and writing it back to the normal disk device after the replacement,
Further, it can be performed by reading all data of all the disk devices of the reconfigurable disk array and writing back to all the disk devices constituting the reconfigurable disk array and the normal disk device after the replacement. Further, by performing the write process and the read process and the exclusive control related to the operation job, the data of all the disk devices other than the spare disk device in the reconfigured disk array are read, and the data is reconstructed. It is also possible to perform a process of writing back the data corresponding to the spare disk device to the normal disk device after the replacement.

【００１１】[0011]

【発明の実施の形態】次に本発明の実施の形態の例につ
いて図面を参照して詳細に説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, embodiments of the present invention will be described in detail with reference to the drawings.

【００１２】図１は本発明を適用したディスクアレイ装
置の一例を示すブロック図である。この例のディスクア
レイ装置１は、ディスク部２とディスクアレイ制御部３
とで構成され、ホストプロセッサ等の上位装置４に接続
されている。FIG. 1 is a block diagram showing an example of a disk array device to which the present invention is applied. The disk array device 1 of this example includes a disk unit 2 and a disk array control unit 3
And is connected to the host device 4 such as a host processor.

【００１３】ディスク部２は、ディスクアレイを構成す
る磁気ディスク装置等の複数のディスク装置２１−１〜
２１−ｎと１台以上の予備ディスク装置２２とから構成
されている。ディスク装置２１−１〜２１−ｎは、１つ
或いは複数のディスクアレイを構成するためのディスク
装置であり、予備ディスク装置２２は何れかのディスク
アレイ中のディスク装置が故障した場合にその代替とし
て一時的に使用するディスク装置である。また、本実施
例では、各ディスク装置に当該ディスク装置がシステム
に対して挿抜されたことを物理的，電気的あるいは光学
的に検出する検出器２３が設けられ、その検出信号をデ
ィスクアレイ制御部３に送ることにより、ディスク装置
が交換されたことを自動で検出している。なお、ディス
ク装置が交換されたことを人手でディスクアレイ制御部
３に通知するように構成することも可能である。The disk unit 2 includes a plurality of disk devices 21-1 to 21-1 such as magnetic disk devices constituting a disk array.
21-n and one or more spare disk devices 22. The disk devices 21-1 to 21-n are disk devices for configuring one or a plurality of disk arrays, and the spare disk device 22 is used as a substitute when a disk device in any disk array fails. This is a disk device used temporarily. Further, in this embodiment, each disk device is provided with a detector 23 for physically, electrically or optically detecting that the disk device is inserted into or removed from the system. 3 automatically detects that the disk device has been replaced. It is also possible to manually notify the disk array controller 3 that the disk device has been replaced.

【００１４】ディスクアレイ制御部３は、業務処理手段
３１と、Ｉ／Ｏ処理手段３２と、復旧手段３３と、制御
情報記憶手段３４と、排他制御手段３５と、ディスク交
換検出手段３６とを主要部として備えている。なお、デ
ィスクアレイ制御部３はマイクロプロセッサ等のプロセ
ッサを主要な構成要素として実現でき、その場合、その
プロセッサのマイクロプログラムは、フロッピィディス
ク媒体，ＣＤ−ＲＯＭ等の機械読み取り可能な記録媒体
ＭＥＭに記録されており、電源投入時にそのプロセッサ
にロードされ、そのプロセッサ上に上記各手段３１〜３
６を実現する。The disk array control unit 3 mainly includes a business processing unit 31, an I / O processing unit 32, a recovery unit 33, a control information storage unit 34, an exclusive control unit 35, and a disk replacement detection unit 36. It is provided as a part. The disk array controller 3 can be realized by using a processor such as a microprocessor as a main component. In this case, the microprogram of the processor is recorded on a machine-readable recording medium MEM such as a floppy disk medium or a CD-ROM. When the power is turned on, the processor is loaded into the processor and the above means 31 to 3 are loaded on the processor.
6 is realized.

【００１５】制御情報記憶手段３４は、各ディスクアレ
イ（論理的なディスク装置）がどのディスク装置２１−
１〜２１−ｎ，２２で構成されているかといったディス
クアレイ構成情報や、各ディスクアレイに対する運用業
務にかかるデータ書き込み処理，データ読み込み処理の
アクセス方法等を記憶する手段である。The control information storage means 34 stores which disk device 21-
This is a means for storing disk array configuration information such as whether the disk array is composed of 1 to 21-n, 22 and an access method for data write processing and data read processing relating to operation work for each disk array.

【００１６】業務処理手段３１は、上位装置４とディス
ク部２との間のＩ／Ｏ処理を制御する手段である。業務
処理手段３１は、上位装置４から或るディスクアレイに
対するデータ書き込み処理，データ読み込み処理が要求
されると、制御情報記憶手段３４中の制御情報を参照し
てディスクアレイ構成およびアクセス方法を認識し、Ｉ
／Ｏ処理手段３２を通じてディスク部２をアクセスし、
結果を上位装置４に転送する。The job processing means 31 is a means for controlling I / O processing between the host device 4 and the disk unit 2. When the host device 4 requests data write processing and data read processing for a certain disk array, the business processing unit 31 refers to the control information in the control information storage unit 34 to recognize the disk array configuration and the access method. , I
Access the disk unit 2 through the / O processing means 32,
The result is transferred to the host device 4.

【００１７】復旧手段３３は、Ｉ／Ｏ処理手段３２から
故障ディスク装置の発生が通知されたときにＩ／Ｏ処理
手段３２を通じてディスク部２をアクセスすることによ
りホットスワップ機能にかかる復旧制御を実施し、また
ディスク装置交換時にはＩ／Ｏ処理手段３２を通じてデ
ィスク部２をアクセスすることにより、交換されたディ
スク装置にデータを復旧する制御等を行う手段である。
復旧手段３３は、それらの制御に際し、制御情報記憶手
段３４の内容を更新することにより、ディスクアレイの
再構成やアクセス方法の変更等を適宜実施する。The recovery unit 33 performs recovery control for the hot swap function by accessing the disk unit 2 through the I / O processing unit 32 when the occurrence of a failed disk device is notified from the I / O processing unit 32. In addition, when the disk device is replaced, the disk unit 2 is accessed through the I / O processing unit 32 to perform control for restoring data to the replaced disk device.
The recovery unit 33 updates the contents of the control information storage unit 34 during the control to appropriately reconfigure the disk array or change the access method.

【００１８】排他制御手段３５は、業務処理手段３１に
よるディスク部２へのアクセスと復旧手段３３によるデ
ィスク部２へのアクセスとの競合を調停する手段であ
る。例えばディスク部２全体の記憶領域を１Ｍバイト等
の所定のサイズに論理的に分割し、個々の分割領域を業
務処理手段３１および復旧手段３３が同時にアクセスし
ないように排他制御を行う。業務処理手段３１および復
旧手段３３はアクセスに先立ってアクセス先領域のロッ
クを排他制御手段３５に要求し、成功したらアクセスを
実施し、失敗したら待ち合わせを行って再度ロック要求
から処理を繰り返す。The exclusive control unit 35 is a unit that arbitrates competition between access to the disk unit 2 by the business processing unit 31 and access to the disk unit 2 by the recovery unit 33. For example, the storage area of the entire disk unit 2 is logically divided into a predetermined size such as 1 Mbyte, and exclusive control is performed so that the business processing unit 31 and the recovery unit 33 do not simultaneously access each divided area. Prior to the access, the business processing unit 31 and the recovery unit 33 request the exclusive control unit 35 to lock the access destination area. If the access control unit succeeds, the access is executed. If the access control unit fails, the process is repeated and the process is repeated from the lock request.

【００１９】Ｉ／Ｏ処理手段３２は、業務処理手段３１
および復旧手段３３から要求されるディスク部２に対す
るＩ／Ｏを実際に処理する部分である。また、何れかの
ディスク装置に使用不可能な故障を検出した場合にはそ
の旨を復旧手段３３に通知する機能を有している。The I / O processing means 32 includes the business processing means 31
And a part for actually processing I / O to the disk unit 2 requested by the recovery unit 33. Further, when an unusable failure is detected in any of the disk devices, a function of notifying the recovery unit 33 of the failure is provided.

【００２０】ディスク交換検出手段３６は、ディスク部
２に設けられた各検出器２３の出力を監視し、ディスク
装置がシステムから一旦外され、その後取り付けられた
ことをもって、当該ディスク装置が交換されたことを検
出する手段である。検出時には交換されたディスク装置
を復旧手段３３に通知する。The disk replacement detection means 36 monitors the output of each detector 23 provided in the disk unit 2, and once the disk device is once removed from the system and then attached, the disk device is replaced. This is a means for detecting the fact. Upon detection, the replaced disk device is notified to the recovery unit 33.

【００２１】図２はディスクアレイの制御状態の遷移を
フローチャート形式で示したものである。FIG. 2 is a flow chart showing the control state transition of the disk array.

【００２２】最初は、ディスクアレイを構成する何れか
のディスク装置に使用不可能な故障が発生するまで、予
備ディスク装置２２を使用しない本来のディスクアレイ
構成にて冗長状態で通常運転を行う（Ｓ１）。この状態
をＡとする。First, normal operation is performed in a redundant state in the original disk array configuration without using the spare disk unit 22 until an unusable failure occurs in any of the disk units constituting the disk array (S1). ). This state is referred to as A.

【００２３】何れかのディスク装置に故障が発生すると
（Ｓ２でＹＥＳ）、ホットスワップ機能が働き、故障デ
ィスク装置に代えて予備ディスク装置がディスクアレイ
構成に組み入れられ、予備ディスク装置のデータが完全
に復旧するまで予備ディスク装置を使用した縮退運転を
行う（Ｓ３，Ｓ４）。この状態をＢとする。If a failure occurs in any of the disk devices (YES in S2), the hot swap function operates, a spare disk device is incorporated in the disk array configuration in place of the failed disk device, and the data of the spare disk device is completely removed. Until the recovery, the degenerate operation using the spare disk device is performed (S3, S4). This state is referred to as B.

【００２４】予備ディスク装置のデータ復旧が完了する
と、故障ディスク装置が交換されるまで、予備ディスク
装置を使用した冗長状態で通常運転を行う（Ｓ５）。こ
の状態をＣとする。When data recovery of the spare disk device is completed, normal operation is performed in a redundant state using the spare disk device until the failed disk device is replaced (S5). This state is C.

【００２５】故障ディスク装置が正常なディスク装置と
交換されると（Ｓ６でＹＥＳ）、交換ディスク装置のデ
ータが完全に復旧するまで、予備ディスク装置，交換デ
ィスク装置を使用した冗長状態で拡張運転を行う（Ｓ
７，Ｓ８）。この状態をＤとする。そして、交換ディス
ク装置のデータ復旧が完了すると、再び状態Ａの状態に
戻る。When the failed disk device is replaced with a normal disk device (YES in S6), the extended operation is performed in a redundant state using the spare disk device and the replacement disk device until the data of the replacement disk device is completely restored. Do (S
7, S8). This state is D. When the data recovery of the replacement disk device is completed, the state returns to the state A again.

【００２６】以下、各状態Ａ〜Ｄ毎にその詳細を説明す
る。なお、説明の便宜上、ディスク装置２１−１〜２１
−５の５台のディスク装置から構成されているディスク
アレイを例にする。この５台のディスク装置は上位装置
４から転送されるデータ以外に、任意の１台のディスク
装置のデータが失われた場合でも論理的なディスク装置
としてはデータが失われないように冗長データとしての
パリティデータが格納されている。ＲＡＩＤとしては例
えばＲＡＩＤ３，ＲＡＩＤ５を使用することができる
が、他のＲＡＩＤに対しても適用可能である。Hereinafter, details of each of the states A to D will be described. For convenience of explanation, the disk devices 21-1 to 21-1
An example is a disk array composed of five disk devices of −5. These five disk devices are used as redundant data so that data is not lost as a logical disk device even if data of any one disk device is lost in addition to the data transferred from the higher-level device 4. Is stored. As the RAID, for example, RAID3 and RAID5 can be used, but the present invention is also applicable to other RAIDs.

【００２７】○状態Ａ図３に、状態Ａにおけるディスクアレイ構成とそれに対
するアクセス方法を示す。状態Ａでは、ディスクアレイ
がディスク装置２１−１〜２１−５で構成されているこ
と、ディスクアレイに対する書き込み処理および読み込
み処理は全ディスク装置２１−１〜２１−５に対して行
うことが、制御情報記憶手段３４中のディスクアレイ構
成情報およびアクセス方法情報に設定される。このた
め、業務処理手段３１は、図３の実線の矢印に示すよう
に、上位装置４からの要求に従ってＩ／Ｏ処理手段３２
を通じてディスクアレイに対して書き込み処理を行うと
きは全ディスク装置２１−１〜２１−５を対象とし、ま
た読み込み処理を行うときも全ディスク装置２１−１〜
２１−５を対象とする。State A FIG. 3 shows a disk array configuration in state A and an access method therefor. In the state A, the disk array is composed of the disk devices 21-1 to 21-5, and the writing process and the reading process for the disk array can be performed for all the disk devices 21-1 to 21-5. The information is set in the disk array configuration information and the access method information in the information storage unit 34. For this reason, as shown by the solid arrow in FIG.
When writing processing to the disk array is performed through all the disk devices 21-1 to 21-5, all the disk devices 21-1 to 21-5 are also used when performing reading processing.
21-5 is targeted.

【００２８】○状態Ｂ図４に、状態Ｂにおけるディスクアレイ構成とそれに対
するアクセス方法を示す。ディスクアレイ中の何れかの
ディスク装置、例えばディスク装置２１−５に使用不可
能な故障が発生したことがＩ／Ｏ処理手段３２で検出さ
れると、復旧手段３３は、制御情報記憶手段３４中の当
該ディスクアレイ構成情報から故障ディスク装置２１−
５を取り去ることで、故障ディスク装置２１−５をディ
スクアレイ構成から切り離し、代わりに予備ディスク装
置２２を当該ディスクアレイ構成情報に追加すること
で、予備ディスク装置２２をディスクアレイ構成に組み
入れる。ここで、どのディスク装置が待機状態の予備デ
ィスク装置であるかは復旧手段３３自身が管理してお
り、そのような予備ディスク装置が複数存在する場合に
は、何れかを選択して使用中とする。また、復旧手段３
３は制御情報記憶手段３４中のアクセス方法の指定を変
更し、ディスクアレイに対する書き込み処理は再構成さ
れたディスクアレイの全ディスク装置２１−１〜２１−
４，２２に対して行い、読み込み処理はディスク装置２
１−１〜２１−４だけに対して行ってディスクアレイシ
ステムの特徴であるデータの自動復元機能を使用してデ
ータ修復するアクセス方法を設定する。State B FIG. 4 shows a disk array configuration in state B and an access method therefor. When the I / O processing unit 32 detects that an unusable failure has occurred in any of the disk devices in the disk array, for example, the disk device 21-5, the recovery unit 33 stores the information in the control information storage unit 34. From the failed disk device 21-
By removing the disk drive 5, the failed disk drive 21-5 is separated from the disk array configuration, and the spare disk drive 22 is added to the disk array configuration information instead, thereby incorporating the spare disk drive 22 into the disk array configuration. Here, the recovery unit 33 itself manages which disk device is the standby disk device in the standby state, and when there are a plurality of such spare disk devices, one of them is selected and used. I do. Recovery means 3
3 changes the designation of the access method in the control information storage means 34, and the write processing for the disk array is performed for all the disk devices 21-1 to 21- of the reconfigured disk array.
4 and 22 and the reading process is performed on the disk device 2
An access method for performing data restoration using only an automatic restoration function of data, which is a feature of the disk array system, is set for only 1-1 to 21-4.

【００２９】これにより、業務処理手段３１は、図４の
実線の矢印に示すように、上位装置４からの要求に従っ
てＩ／Ｏ処理手段３２を通じてディスクアレイに対して
書き込み処理を行うときは全ディスク装置２１−１〜２
１−４，２２を対象とし、他方、読み込み処理を行うと
きはディスク装置２１−１〜２１−４を対象としてデー
タの自動復元機能を使用してデータを修復する。As a result, as shown by a solid arrow in FIG. 4, the business processing means 31 performs all disk writing when performing write processing to the disk array through the I / O processing means 32 in accordance with a request from the host device 4. Apparatus 21-1 to 2
On the other hand, when performing read processing, data is restored using the automatic data restoration function for the disk devices 21-1 to 21-4.

【００３０】同時に復旧手段３３は、図４の破線の矢印
に示すように、Ｉ／Ｏ処理手段３２を通じて、ディスク
装置２１−１〜２１−４の全データをデータの自動復元
機能を使用して読み込み、次いで、この読み込んだデー
タをディスク装置２１−１〜２１−４の元の位置と予備
ディスク装置２２の該当する位置に書き込むことによ
り、予備ディスク２２のデータの復旧を行う。このと
き、ディスク装置２１−１〜２１−４，２２の容量を幾
つかの領域に分割し、１つの分割領域ごとに排他制御手
段３５でロックをかけてその領域のアクセス中に業務処
理手段３１が同領域をアクセスしないようにし、１つの
分割領域へのアクセスが終了するごとにその領域のロッ
クを解除し、次の分割領域を処理する。At the same time, the recovery means 33 restores all the data in the disk devices 21-1 to 21-4 through the I / O processing means 32 by using an automatic data recovery function, as indicated by the dashed arrow in FIG. The data on the spare disk 22 is recovered by writing the read data to the original positions of the disk devices 21-1 to 21-4 and the corresponding positions of the spare disk device 22. At this time, the capacity of each of the disk devices 21-1 to 21-4 and 22 is divided into several areas, and the exclusive control means 35 locks each of the divided areas, and the business processing means 31 Does not access the same area, unlocks the area each time access to one divided area is completed, and processes the next divided area.

【００３１】○状態Ｃ図５に、状態Ｃにおけるディスクアレイ構成とそれに対
するアクセス方法を示す。復旧手段３３は、予備ディス
ク装置２２のデータ復旧が完了すると、制御情報記憶手
段３４中のアクセス方法を、当該ディスクアレイに対す
る書き込み処理および読み込み処理を全ディスク装置２
１−１〜２１−４，２２に対して行うように変更する。
これにより、業務処理手段３１は、図５の実線の矢印に
示すように、上位装置４からの要求に従ってＩ／Ｏ処理
手段３２を通じてディスクアレイに対して書き込み処理
を行うときは全ディスク装置２１−１〜２１−４，２２
を対象とし、また読み込み処理を行うときも全ディスク
装置２１−１〜２１−４，２２を対象とする。この状態
は、状態Ａと同じであり、ただディスク装置２１−５の
代わりに予備ディスク装置２２が使用されている点が相
違している。State C FIG. 5 shows a disk array configuration in state C and an access method therefor. When the data recovery of the spare disk device 22 is completed, the recovery means 33 changes the access method in the control information storage means 34 to the write processing and the read processing for the disk array,
This is changed so as to be performed on 1-1 to 21-4 and 22.
As a result, as shown by the solid arrow in FIG. 5, the business processing unit 31 performs write processing on the disk array through the I / O processing unit 32 in accordance with a request from the higher-level device 4 when all the disk devices 21- 1-21-4,22
, And all the disk devices 21-1 to 21-4, 22 are also subjected to the reading process. This state is the same as the state A, except that the spare disk unit 22 is used instead of the disk unit 21-5.

【００３２】また、復旧手段３３は状態Ｃへ移行する
と、図示しないコンソール等にメッセージを表示し、利
用者に故障ディスク装置２１−５を正常なディスク装置
に交換するよう促す。故障ディスク装置２１−５の交換
はシステムの電源を遮断することなしに可能である。When the recovery means 33 shifts to the state C, a message is displayed on a console or the like (not shown) to urge the user to replace the failed disk device 21-5 with a normal disk device. The replacement of the failed disk device 21-5 is possible without shutting off the power supply of the system.

【００３３】○状態Ｄ図６に、状態Ｄにおけるディスクアレイ構成とそれに対
するアクセス方法を示す。ディスク交換検出手段３６か
ら故障ディスク２１−５が正常なディスク装置２４に交
換されたことが通知されると、復旧手段３３は、制御情
報記憶手段３４の当該ディスクアレイ構成情報を更新
し、交換ディスク装置２４を予備ディスク装置２２との
二重化のペアとしてディスクアレイ構成に組み込む。ま
た、アクセス方法を変更し、ディスクアレイに対する書
き込み処理はディスクアレイを構成する全ディスク装置
２１−１〜２１−４，２２，２４に対して行い且つディ
スク装置２２とディスク装置２４とには同じデータを二
重書きするように指定し、読み込み処理はディスク装置
２１−１〜２１−４と二重化ファイルの一方の予備ディ
スク装置２２とを対象とするように指定する。State D FIG. 6 shows a disk array configuration in state D and an access method therefor. When the disk replacement detecting unit 36 notifies that the failed disk 21-5 has been replaced with the normal disk unit 24, the recovery unit 33 updates the disk array configuration information in the control information storage unit 34, and The device 24 is incorporated in the disk array configuration as a duplex pair with the spare disk device 22. In addition, the access method is changed, and the writing process for the disk array is performed for all the disk devices 21-1 to 21-4, 22 and 24 constituting the disk array, and the same data is stored in the disk devices 22 and 24. Is specified to be written in duplicate, and the read processing is specified to target the disk devices 21-1 to 21-4 and one of the spare disk devices 22 of the duplicated file.

【００３４】これにより、業務処理手段３１は、図６の
実線の矢印に示すように、上位装置４からの要求に従っ
てＩ／Ｏ処理手段３２を通じてディスクアレイに対して
書き込み処理を行うときは全ディスク装置２１−１〜２
１−４，２２，２４に対して行い、且つ予備ディスク装
置２２と交換ディスク装置２４とに対しては同じデータ
を二重書きする。また、読み込み処理を行うときはディ
スク装置２１−１〜２１−４，２２を対象とする。As a result, as shown by the solid arrow in FIG. 6, the business processing means 31 performs all the disk writing when performing the write processing to the disk array through the I / O processing means 32 in accordance with the request from the host device 4. Apparatus 21-1 to 2
The same data is duplicately written to the spare disk unit 22 and the replacement disk unit 24. When the reading process is performed, the disk devices 21-1 to 21-4 and 22 are targeted.

【００３５】同時に復旧手段３３は、図６の白ヌキ矢印
に示すように、Ｉ／Ｏ処理手段３２を通じて、予備ディ
スク装置２２の全データを交換ディスク装置２４に複写
することで、交換ディスク装置２４のデータの復旧を行
う。つまり、交換ディスク装置２４の内容を予備ディス
ク装置２２の内容に一致させる。At the same time, the recovery means 33 copies all the data of the spare disk device 22 to the replacement disk device 24 through the I / O processing device 32 as shown by the white arrow in FIG. Data recovery. That is, the contents of the replacement disk device 24 are made to match the contents of the spare disk device 22.

【００３６】図７に予備ディスク装置２２の全データを
交換ディスク装置２４に複写する処理の一実施例を示
す。この実施例では、同図の破線の矢印に示すように、
予備ディスク装置２２の全データを順次に読み込み、交
換ディスク装置２４の同じ位置に書き戻すことにより、
交換ディスク装置２４の内容を予備ディスク装置２２の
内容に一致させている。このとき、予備ディスク装置２
２および交換ディスク装置２４の全領域を幾つかの領域
に分割し、１つの分割領域ごとに排他制御手段３５でロ
ックをかけてその領域のアクセス中に業務処理手段３１
が同領域をアクセスしないようにし、１つの分割領域へ
のアクセスが終了する毎にその領域のロックを解除し、
次の分割領域を処理する。FIG. 7 shows an embodiment of processing for copying all data in the spare disk unit 22 to the exchange disk unit 24. In this embodiment, as indicated by the dashed arrow in FIG.
By sequentially reading all data in the spare disk device 22 and writing it back to the same position in the replacement disk device 24,
The contents of the replacement disk device 24 match the contents of the spare disk device 22. At this time, the spare disk device 2
2 and the whole area of the exchange disk unit 24 are divided into several areas, and the exclusive control means 35 locks the divided areas for each divided area, and the business processing means 31
Prevents access to the same area, unlocks the area each time access to one divided area ends,
Process the next divided area.

【００３７】図８に予備ディスク装置２２の全データを
交換ディスク装置２４に複写する処理の別の実施例を示
す。この実施例では、復旧手段３３は同図の破線の矢印
に示すように、Ｉ／Ｏ処理手段３２を通じて、交換ディ
スク装置２４以外の全ディスク装置２１−１〜２１−
４，２２の全データを順次に読み込み、この読み込んだ
データをディスク装置２１−１〜２１−４，２２の元の
位置と交換ディスク装置２４の該当する位置に書き込む
ことにより、交換ディスク装置２４の内容を予備ディス
ク装置２２の内容に一致させている。このとき、ディス
ク装置２１−１〜２１−４，２２，２４の容量を幾つか
の領域に分割し、１つの分割領域ごとに排他制御手段３
５でロックをかけてその領域のアクセス中に業務処理手
段３１が同領域をアクセスしないようにし、１つの分割
領域へのアクセスが終了する毎にその領域のロックを解
除し、次の分割領域を処理する。FIG. 8 shows another embodiment of the processing for copying all the data in the spare disk unit 22 to the replacement disk unit 24. In this embodiment, the recovery unit 33 transmits all the disk devices 21-1 to 21-other than the replacement disk device 24 through the I / O processing unit 32, as indicated by the dashed arrows in FIG.
4 and 22 are sequentially read, and the read data is written to the original positions of the disk devices 21-1 to 21-4 and 22 and the corresponding positions of the replacement disk device 24. The contents match the contents of the spare disk device 22. At this time, the capacity of each of the disk devices 21-1 to 21-4, 22, and 24 is divided into several areas, and the exclusive control unit 3 is provided for each divided area.
In step 5, a lock is applied to prevent the business processing unit 31 from accessing the area while accessing the area. Each time the access to one divided area is completed, the area is unlocked, and the next divided area is unlocked. To process.

【００３８】上述のようにして交換ディスク装置２４の
データ復旧が完了すると、復旧手段３３は制御情報記憶
手段３４中の当該ディスクアレイ構成情報から予備ディ
スク装置２２を取り去ることで、予備ディスク装置２２
をディスクアレイ構成から切り離し、当該ディスクアレ
イをディスク装置２１−１〜２１−４，２４からなるデ
ィスクアレイに再構成する。また、アクセス方法を、デ
ィスクアレイに対する書き込み処理および読み込み処理
は全ディスク装置２１−１〜２１−４，２４に対して行
う方法に変更する。これにより、業務処理手段３１は、
図９の実線の矢印に示すように、上位装置４からの要求
に従ってＩ／Ｏ処理手段３２を通じてディスクアレイに
対して書き込み処理を行うときは全ディスク装置２１−
１〜２１−４，２４を対象とし、また読み込み処理を行
うときも全ディスク装置２１−１〜２１−４，２２を対
象とする。また復旧手段３３は、予備ディスク装置２２
を以後に発生する可能性のある障害ディスク装置の代替
用として、再び待機状態として管理する。When the data recovery of the replacement disk unit 24 is completed as described above, the recovery unit 33 removes the spare disk unit 22 from the disk array configuration information in the control information storage unit 34, and
Is separated from the disk array configuration, and the disk array is reconfigured into a disk array including the disk devices 21-1 to 21-4 and 24. Further, the access method is changed to a method in which write processing and read processing for the disk array are performed for all the disk devices 21-1 to 21-4 and 24. Thereby, the business processing means 31
As shown by the solid arrow in FIG. 9, when the write processing is performed on the disk array through the I / O processing means 32 in accordance with the request from the upper level apparatus 4, all the disk apparatuses 21
1 to 21-4 and 24, and all the disk devices 21-1 to 21-4 and 22 are also subjected to the reading process. Further, the recovery means 33 is provided for the spare disk device 22.
Is managed again as a standby state as a substitute for a failed disk device that may occur later.

【００３９】図１０は、状態Ｄにおける交換ディスク装
置２４のデータの復旧方法の別の実施例を示す。図６で
は、予備ディスク装置２２の内容を交換ディスク装置２
４に複写することでデータ復旧を行ったが、本実施例で
は、予備ディスク装置２２を使わず、ディスクアレイシ
ステムが本来有するデータの自動復旧機能を使用してデ
ータの復旧を行う。即ち、復旧手段３３は、図１０の破
線の矢印に示すように、Ｉ／Ｏ処理手段３２を通じて、
ディスク装置２１−１〜２１−４の全データを読み込ん
でデータの自動復元機能によってデータを再構築し、次
いで、この再構築したデータをディスク装置２１−１〜
２１−４の元の位置と予備ディスク装置２２および交換
ディスク装置２４の同一の位置に書き込むことにより、
交換ディスク装置２４の内容を予備ディスク装置２２の
内容に一致させる。このとき、ディスク装置２１−１〜
２１−４，２２，２４の容量を幾つかの領域に分割し、
１つの分割領域ごとに排他制御手段３５でロックをかけ
てその領域のアクセス中に業務処理手段３１が同領域を
アクセスしないようにし、１つの分割領域へのアクセス
が終了するごとにその領域のロックを解除し、次の分割
領域を処理する。FIG. 10 shows another embodiment of a method for restoring data of the exchange disk device 24 in the state D. In FIG. 6, the contents of the spare disk device 22 are
In this embodiment, the data is recovered by using the automatic data recovery function inherent in the disk array system without using the spare disk device 22. That is, the recovery unit 33 transmits through the I / O processing unit 32, as indicated by the dashed arrow in FIG.
All the data of the disk devices 21-1 to 21-4 are read, and the data is reconstructed by the automatic data restoration function. Then, the reconstructed data is stored in the disk devices 21-1 to 21-4.
By writing to the original position of 21-4 and the same position of the spare disk device 22 and the replacement disk device 24,
The contents of the replacement disk device 24 are matched with the contents of the spare disk device 22. At this time, the disk devices 21-1 to 21-1
The capacity of 21-4, 22, 24 is divided into several areas,
The exclusive control unit 35 locks each divided area so that the business processing unit 31 does not access the area while accessing the area, and locks the area every time the access to one divided area is completed. Is canceled, and the next divided area is processed.

【００４０】なお、ディスク装置２１−１〜２１−４か
ら読み込んだデータに基づき自動復元したデータのうち
予備ディスク装置２２に対応するデータだけを、交換デ
ィスク装置２４に書き込むようにしても良い。It should be noted that only the data corresponding to the spare disk device 22 among the data automatically restored based on the data read from the disk devices 21-1 to 21-4 may be written to the replacement disk device 24.

【００４１】図１１は復旧手段３３の処理例を示すフロ
ーチャートである。ディスク装置２１−１〜２１−５か
ら構成されるディスクアレイが通常運転されている際
に、ディスクアレイ中の例えばディスク装置２１−５に
使用不可能な故障が発生したことがＩ／Ｏ処理手段３２
で検出されると（Ｓ１１でＹＥＳ）、復旧手段３３は、
制御情報記憶手段３４中の当該ディスクアレイ構成情報
から故障ディスク装置２１−５を取り去ることでディス
クアレイ構成から切り離し、代わりに現在待機中となっ
ている予備ディスク装置２２を当該ディスクアレイ構成
情報に追加してディスクアレイ構成に組み入れ、また、
制御情報記憶手段３４中のアクセス方法を、ディスクア
レイに対する書き込み処理は再構成されたディスクアレ
イの全ディスク装置２１−１〜２１−４，２２に対して
行い、読み込み処理はディスク装置２１−１〜２１−４
だけに対して行ってディスクアレイシステムの特徴であ
るデータの自動復元機能を使用してデータ復元するアク
セス方法に変更する（Ｓ１２）。同時に復旧手段３３
は、予備ディスク２２のデータの復旧を開始する（Ｓ１
３）。そして、予備ディスク装置２２のデータ復旧が完
了すると、復旧手段３３は、制御情報記憶手段３４中の
アクセス方法を、当該ディスクアレイに対する書き込み
処理および読み込み処理を全ディスク装置２１−１〜２
１−４，２２に対して行うように変更する（Ｓ１４）。FIG. 11 is a flowchart showing a processing example of the restoration means 33. During normal operation of the disk array composed of the disk devices 21-1 to 21-5, an I / O processing unit detects that an unusable failure has occurred in, for example, the disk device 21-5 in the disk array. 32
(YES in S11), the recovery unit 33
By removing the failed disk device 21-5 from the disk array configuration information in the control information storage means 34, the disk device is disconnected from the disk array configuration, and the spare disk device 22 currently in standby is added to the disk array configuration information instead. Into the disk array configuration,
The access method in the control information storage means 34 is performed for all the disk devices 21-1 to 21-4 and 22 of the reconfigured disk array in the writing process for the disk array, and for the reading process for the disk devices 21-1 to 21-4 in the reconfigured disk array. 21-4
(S12). At the same time, recovery means 33
Starts the recovery of the data on the spare disk 22 (S1
3). Then, when the data recovery of the spare disk device 22 is completed, the recovery unit 33 changes the access method in the control information storage unit 34 to the write processing and the read processing for the disk array,
The process is changed to be performed for 1-4 and 22 (S14).

【００４２】その後、利用者によって故障ディスク装置
が交換され、ディスク交換検出手段３６から故障ディス
ク２１−５が正常なディスク装置２４に交換されたこと
が通知されると（Ｓ１５でＹＥＳ）、復旧手段３３は、
交換ディスク装置２４が物理的なフォーマットが必要な
ディスクである場合（Ｓ１６でＹＥＳ）、ステップＳ１
７で交換ディスクに対して物理フォーマット処理を行っ
てステップＳ１８に進み、その必要のないディスクの場
合は直ちにステップＳ１８へ進む。ステップＳ１８で
は、復旧手段３３は、制御情報記憶手段３４のディスク
アレイ構成情報を更新し、交換ディスク装置２４を予備
ディスク装置２２との二重化のペアとしてディスクアレ
イ構成に組み込み、また、アクセス方法を変更し、ディ
スクアレイに対する書き込み処理はディスクアレイを構
成する全ディスク装置２１−１〜２１−４，２２，２４
に対して行い且つディスク装置２２とディスク装置２４
とには同じデータを二重書きするように指定し、読み込
み処理はディスク装置２１−１〜２１−４と二重化ファ
イルの一方の予備ディスク装置２２とを対象とするよう
に指定する。次に、復旧手段３３は、交換ディスク装置
２４のデータの復旧を行う（Ｓ１９）。そして、交換デ
ィスク装置２４のデータ復旧が完了すると、復旧手段３
３は制御情報記憶手段３４中の当該ディスクアレイ構成
情報から予備ディスク装置２２を取り去ることでディス
クアレイ構成から切り離し、当該ディスクアレイをディ
スク装置２１−１〜２１−４，２４からなるディスクア
レイに再構成し、更にアクセス方法を、ディスクアレイ
に対する書き込み処理および読み込み処理は全ディスク
装置２１−１〜２１−４，２４に対して行うアクセス方
法に変更する（Ｓ２０）。Thereafter, when the user replaces the failed disk device and the disk replacement detecting means 36 notifies that the failed disk 21-5 has been replaced with the normal disk device 24 (YES in S15), the recovery means 33 is
If the replacement disk device 24 is a disk that requires physical formatting (YES in S16), step S1
In step 7, a physical formatting process is performed on the replacement disk, and the flow advances to step S18. If the disk is unnecessary, the flow immediately advances to step S18. In step S18, the recovery unit 33 updates the disk array configuration information in the control information storage unit 34, incorporates the replacement disk unit 24 into the disk array configuration as a duplex pair with the spare disk unit 22, and changes the access method. However, the writing process to the disk array is performed by all the disk devices 21-1 to 21-4, 22, 24 constituting the disk array
And the disk drive 22 and the disk drive 24
Specifies that the same data is to be double-written, and that the read processing is to be performed for the disk devices 21-1 to 21-4 and one of the spare disk devices 22 of the duplicated file. Next, the recovery unit 33 recovers the data of the replacement disk device 24 (S19). When the data recovery of the replacement disk device 24 is completed, the recovery means 3
3 removes the spare disk device 22 from the disk array configuration information in the control information storage means 34 to separate it from the disk array configuration, and recycles the disk array into a disk array composed of the disk devices 21-1 to 21-4 and 24. Then, the access method is changed to an access method in which write processing and read processing for the disk array are performed for all the disk devices 21-1 to 21-4 and 24 (S20).

【００４３】以上のように、故障ディスク装置が正常な
ディスク装置に交換されたとき、予備ディスク装置のデ
ータを交換後の正常なディスク装置に復旧し、予備ディ
スク装置以外のディスク装置と前記交換後のディスク装
置とでディスクアレイを再構成することで、ディスクア
レイを構成するディスク装置の物理的な配置関係を常に
初期の状態に維持することができる。また、交換ディス
ク装置のデータ復旧時、運用業務にかかる書き込み処理
及び読み込み処理を停止していないので、運用業務を停
止する必要がない。As described above, when the failed disk device is replaced with a normal disk device, the data of the spare disk device is restored to the normal disk device after the replacement, and the disk device other than the spare disk device is replaced with the normal disk device after the replacement. By reconfiguring the disk array with this disk device, the physical arrangement of the disk devices constituting the disk array can always be maintained in the initial state. Further, at the time of data recovery of the exchange disk device, since the writing process and the reading process related to the operation are not stopped, it is not necessary to stop the operation.

【００４４】更に、交換ディスク装置のデータを復旧す
る方法としては、図１２に示すように、故障ディスク装
置が交換ディスク装置２４に交換された場合に、予備デ
ィスク装置２２をディスクアレイ構成から切り離してデ
ィスクアレイをディスク装置２１−１〜２１−４で縮退
運転させ、その縮退運転中に予備ディスク装置２２のデ
ータを交換ディスク装置２４に複写し、複写完了後、交
換ディスク装置２４をディスクアレイ構成に組み入れる
方法が考えられる。しかし、この方法では、せっかく予
備ディスク装置２２を用いてディスクアレイを冗長状態
に復帰させたにもかかわらず、故障ディスク装置交換後
に再びディスクアレイが縮退状態になってしまい、交換
ディスク装置に復旧処理が完了するまでの間、ディスク
アレイ装置の信頼性が著しく低下する。これに対し上記
各実施例では、交換ディスク装置のデータ復旧処理中、
再構成ディスクアレイの冗長状態は維持されており縮退
状態にはならないので、ディスクアレイ装置の信頼性が
低下することがない。Further, as a method for restoring the data of the replacement disk device, as shown in FIG. 12, when the failed disk device is replaced with the replacement disk device 24, the spare disk device 22 is separated from the disk array configuration. The disk array is degraded by the disk devices 21-1 to 21-4. During the degraded operation, the data of the spare disk device 22 is copied to the replacement disk device 24. After the copying is completed, the replacement disk device 24 is changed to a disk array configuration. There are several ways to incorporate it. However, according to this method, even though the disk array is returned to the redundant state by using the spare disk unit 22, the disk array becomes degraded again after the replacement of the failed disk unit. Until the operation is completed, the reliability of the disk array device is significantly reduced. In contrast, in each of the above embodiments, during the data recovery process of the replacement disk device,
Since the redundant state of the reconfigured disk array is maintained and does not enter the degenerated state, the reliability of the disk array device does not decrease.

【００４５】[0045]

【発明の効果】以上説明したように本発明によれば以下
のような効果を得ることができる。As described above, according to the present invention, the following effects can be obtained.

【００４６】故障ディスク装置が正常なディスク装置に
交換されたとき、予備ディスク装置のデータを交換後の
正常なディスク装置に復旧し、予備ディスク装置以外の
ディスク装置と前記交換後のディスク装置とでディスク
アレイを再構成しているため、ディスクアレイを構成す
るディスク装置の物理的な配置関係を常に一定に維持す
ることができる。When the failed disk device is replaced with a normal disk device, the data of the spare disk device is restored to the normal disk device after the replacement, and the disk devices other than the spare disk device and the replaced disk device are used. Since the disk array is reconfigured, the physical arrangement relationship of the disk devices constituting the disk array can always be kept constant.

【００４７】予備ディスク装置のデータを交換後の正常
なディスク装置に復旧する際、運用業務を停止する必要
がないため、運用業務にはほとんど支障が生じない。When the data of the spare disk device is restored to the normal disk device after the exchange, the operation job does not need to be stopped, so that the operation job is hardly affected.

【００４８】予備ディスク装置のデータを交換後の正常
なディスク装置に復旧する際、再構成ディスクアレイの
冗長状態を維持しているので、ディスクアレイ装置の信
頼性が低下しない。When the data of the spare disk device is restored to the normal disk device after the replacement, the reliability of the disk array device does not decrease because the redundant state of the reconfigured disk array is maintained.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明を適用したディスクアレイ装置の一例を
示すブロック図である。FIG. 1 is a block diagram showing an example of a disk array device to which the present invention has been applied.

【図２】ディスクアレイの制御状態の遷移をフローチャ
ート形式で示した図である。FIG. 2 is a diagram showing a transition of a control state of the disk array in a flowchart form.

【図３】状態Ａにおけるディスクアレイ構成とそれに対
するアクセス方法を示す図である。FIG. 3 is a diagram showing a disk array configuration in a state A and an access method to the disk array configuration.

【図４】状態Ｂにおけるディスクアレイ構成とそれに対
するアクセス方法を示す図である。FIG. 4 is a diagram showing a disk array configuration in a state B and an access method to the disk array configuration.

【図５】状態Ｃにおけるディスクアレイ構成とそれに対
するアクセス方法を示す図である。FIG. 5 is a diagram showing a disk array configuration in a state C and an access method to the disk array configuration.

【図６】状態Ｄにおけるディスクアレイ構成とそれに対
するアクセス方法を示す図である。FIG. 6 is a diagram showing a disk array configuration in a state D and an access method to the disk array configuration.

【図７】予備ディスク装置の全データを交換ディスク装
置に複写する処理の一実施例を示す図である。FIG. 7 is a diagram showing an embodiment of a process of copying all data of a spare disk device to a replacement disk device.

【図８】予備ディスク装置の全データを交換ディスク装
置に複写する処理の別の実施例を示す図である。FIG. 8 is a diagram showing another embodiment of a process of copying all data of a spare disk device to a replacement disk device.

【図９】交換ディスク装置のデータ復旧後のディスクア
レイ構成とそれに対するアクセス方法を示す図である。FIG. 9 is a diagram showing a disk array configuration of a replacement disk device after data recovery and an access method therefor.

【図１０】状態Ｄにおける交換ディスク装置のデータの
復旧方法の別の実施例を示す図である。FIG. 10 is a diagram showing another embodiment of the method for restoring data of the replacement disk device in the state D.

【図１１】復旧手段の処理例を示すフローチャートであ
る。FIG. 11 is a flowchart illustrating a processing example of a recovery unit.

【図１２】交換ディスク装置のデータを復旧する通常の
方法の説明図である。FIG. 12 is an explanatory diagram of a normal method for restoring data of a replacement disk device.

【図１３】ディスクアレイ装置におけるディスク装置の
配置構成の例を示す平面図である。FIG. 13 is a plan view showing an example of an arrangement configuration of disk devices in the disk array device.

【符号の説明】[Explanation of symbols]

１…ディスクアレイ装置２…ディスク部３…ディスクアレイ制御部４…上位装置２１−１〜２１−ｎ…ディスク装置２２…予備ディスク装置２３…検出器２４…交換ディスク装置３１…業務処理手段３２…Ｉ／Ｏ処理手段３３…復旧手段３４…制御情報記憶手段３５…排他制御手段３６…ディスク交換検出手段 DESCRIPTION OF SYMBOLS 1 ... Disk array apparatus 2 ... Disk part 3 ... Disk array control part 4 ... Host apparatus 21-1 to 21-n ... Disk apparatus 22 ... Spare disk apparatus 23 ... Detector 24 ... Exchange disk apparatus 31 ... Business processing means 32 ... I / O processing means 33 recovery means 34 control information storage means 35 exclusive control means 36 disk replacement detection means

Claims

Translated fromJapanese

【特許請求の範囲】[Claims]

【請求項１】ディスクアレイを構成する複数のディス
ク装置に加えて１台以上の予備ディスク装置を備え、デ
ィスクアレイを構成する何れかのディスク装置に故障が
発生した場合に運用業務を停止することなく故障ディス
ク装置のデータを予備ディスク装置に復旧し、前記故障
ディスク装置以外のディスク装置と前記予備ディスク装
置とでディスクアレイを再構成して該再構成ディスクア
レイで運用業務を続行する機能を有するディスクアレイ
装置において、前記予備ディスク装置のデータを、前記故障ディスク装
置と交換された正常なディスク装置に復旧し、前記再構
成ディスクアレイ中の前記予備ディスク装置以外のディ
スク装置と前記正常なディスク装置とでディスクアレイ
を再度、再構成することを特徴とするディスクアレイ装
置の管理方法。1. A system in which one or more spare disk devices are provided in addition to a plurality of disk devices constituting a disk array, and an operation job is stopped when a failure occurs in any of the disk devices constituting the disk array. Recovers the data of the failed disk device to the spare disk device, reconfigures the disk array with the disk devices other than the failed disk device and the spare disk device, and continues the operation with the reconfigured disk array. In the disk array device, the data of the spare disk device is restored to a normal disk device replaced with the failed disk device, and the disk devices other than the spare disk device in the reconfigured disk array and the normal disk device are restored. A disk array device characterized by reconfiguring the disk array again Management method.

【請求項２】ディスクアレイを構成する複数のディス
ク装置に加えて１台以上の予備ディスク装置を備え、デ
ィスクアレイを構成する何れかのディスク装置に故障が
発生した場合に運用業務を停止することなく故障ディス
ク装置のデータを予備ディスク装置に復旧し、前記故障
ディスク装置以外のディスク装置と前記予備ディスク装
置とでディスクアレイを再構成して該再構成ディスクア
レイで運用業務を続行する機能を有するディスクアレイ
装置において、前記故障ディスク装置が正常なディスク装置に交換され
たとき、前記再構成ディスクアレイに対する運用業務を
停止することなく且つ前記再構成ディスクアレイの冗長
状態を維持しながら、前記予備ディスク装置のデータを
前記交換後の正常なディスク装置に復旧する第１の段階
と、前記交換後の正常なディスク装置を前記再構成ディスク
アレイに組み込むと共に、前記予備ディスク装置を前記
再構成ディスクアレイから切り離す第２の段階とを含む
ことを特徴とするディスクアレイ装置の管理方法。2. A system in which one or more spare disk devices are provided in addition to a plurality of disk devices constituting a disk array, and operation operation is stopped when a failure occurs in any of the disk devices constituting the disk array. Recovers the data of the failed disk device to the spare disk device, reconfigures the disk array with the disk devices other than the failed disk device and the spare disk device, and continues the operation with the reconfigured disk array. In the disk array device, when the failed disk device is replaced with a normal disk device, the spare disk is replaced without stopping the operation of the reconfigured disk array and maintaining the redundant state of the reconfigured disk array. A first stage of restoring device data to a normal disk device after the replacement; With incorporation of the normal disk device after the replacement to the reconfiguration disk array management method for a disk array device which comprises a second step of separating the spare disk device from said reconfigurable disk array.

【請求項３】前記第１の段階においては、前記再構成
ディスクアレイに対する運用業務にかかる書き込み処理
は、前記再構成ディスクアレイを構成する全ディスク装
置と前記交換後の正常なディスク装置とを対象にすると
共に、前記再構成ディスクアレイ中の予備ディスク装置
と前記交換後の正常なディスク装置に対しては同じデー
タを二重に書き込むようにし、前記再構成ディスクアレ
イに対する運用業務にかかる読み込み処理は、再構成デ
ィスクアレイを構成する全ディスク装置を対象にし、こ
れら運用業務にかかる書き込み処理および読み込み処理
と排他制御を行って前記再構成ディスクアレイ中の予備
ディスク装置の全てのデータを前記交換後の正常なディ
スク装置に複写する処理を行うことを特徴とする請求項
２記載のディスクアレイ装置の管理方法。3. In the first stage, a write process related to an operation task for the reconfigured disk array is performed on all the disk devices constituting the reconfigured disk array and the normal disk devices after the replacement. In addition, the same data is written in duplicate to the spare disk device in the reconfigured disk array and the normal disk device after the replacement. For all the disk devices constituting the reconfigured disk array, write and read processes and exclusive control related to these operation tasks are performed, and all data of the spare disk devices in the reconfigured disk array after the replacement are exchanged. 3. The disk storage device according to claim 2, wherein a process of copying to a normal disk device is performed. Ray device management method.

【請求項４】前記再構成ディスクアレイ中の予備ディ
スク装置の全データを読み込んで前記交換後の正常なデ
ィスク装置に書き戻すことにより、前記予備ディスク装
置のデータを前記交換後の正常なディスク装置に複写す
ることを特徴とする請求項３記載のディスクアレイ装置
の管理方法。4. The normal disk device after the replacement by reading all the data of the spare disk device in the reconfigured disk array and writing it back to the normal disk device after the replacement. 4. The method for managing a disk array device according to claim 3, wherein the data is copied to a disk array device.

【請求項５】前記再構成ディスクアレイの全ディスク
装置の全データを読み込んで前記再構成ディスクアレイ
を構成する全ディスク装置と前記交換後の正常なディス
ク装置に書き戻すことにより、前記予備ディスク装置の
データを前記交換後の正常なディスク装置に複写するこ
とを特徴とする請求項３記載のディスクアレイ装置の管
理方法。5. The spare disk device by reading all data of all the disk devices of the reconfigured disk array and writing it back to all the disk devices constituting the reconfigured disk array and the normal disk device after the replacement. 4. The method according to claim 3, wherein said data is copied to a normal disk device after said replacement.

【請求項６】前記第１の段階においては、前記再構成
ディスクアレイに対する運用業務にかかる書き込み処理
は、前記再構成ディスクアレイを構成する全ディスク装
置と前記交換後の正常なディスク装置とを対象にすると
共に、前記再構成ディスクアレイ中の予備ディスク装置
と前記交換後の正常なディスク装置に対しては同じデー
タを二重に書き込むようにし、前記再構成ディスクアレ
イに対する運用業務にかかる読み込み処理は、再構成デ
ィスクアレイを構成する全ディスク装置を対象にし、こ
れら運用業務にかかる書き込み処理および読み込み処理
と排他制御を行って前記再構成ディスクアレイ中の前記
予備ディスク装置以外の全ディスク装置のデータを読み
込んでデータを再構築し、該再構築したデータのうち前
記予備ディスク装置に対応するデータを前記交換後の正
常なディスク装置に書き戻す処理を行うことを特徴とす
る請求項２記載のディスクアレイ装置の管理方法。6. In the first stage, a write process related to an operation for the reconfigured disk array is performed on all disk devices constituting the reconfigured disk array and the normal disk devices after the replacement. In addition, the same data is written in duplicate to the spare disk device in the reconfigured disk array and the normal disk device after the replacement. The data of all the disk devices other than the spare disk device in the reconfigured disk array are subjected to the write process and the read process and the exclusive control related to these operation tasks for all the disk devices constituting the reconfigured disk array. Reading and reconstructing the data, the spare disk unit of the reconstructed data 3. The management method for a disk array device according to claim 2, wherein a process of writing back the data corresponding to the data to the normal disk device after the replacement is performed.

【請求項７】ディスクアレイを構成する複数のディス
ク装置に加えて１台以上の予備ディスク装置を備え、デ
ィスクアレイを構成する何れかのディスク装置に故障が
発生した場合に運用業務を停止することなく故障ディス
ク装置のデータを予備ディスク装置に復旧し、前記故障
ディスク装置以外のディスク装置と前記予備ディスク装
置とでディスクアレイを再構成して該再構成ディスクア
レイで運用業務を続行する機能を有するディスクアレイ
装置におけるディスクアレイ制御部を構成するコンピュ
ータに、前記故障ディスク装置が正常なディスク装置に交換され
たとき、前記再構成ディスクアレイに対する運用業務を
停止することなく且つ前記再構成ディスクアレイの冗長
状態を維持しながら、前記予備ディスク装置のデータを
前記交換後の正常なディスク装置に復旧する第１のステ
ップと、前記交換後の正常なディスク装置を前記再構成ディスク
アレイに組み込むと共に、前記予備ディスク装置を前記
再構成ディスクアレイから切り離す第２のステップとを
実行させるプログラムを記録した機械読み取り可能な記
録媒体。7. A system in which one or more spare disk devices are provided in addition to a plurality of disk devices constituting a disk array, and operation operations are stopped when a failure occurs in any of the disk devices constituting the disk array. Recovers the data of the failed disk device to the spare disk device, reconfigures the disk array with the disk devices other than the failed disk device and the spare disk device, and continues the operation with the reconfigured disk array. A computer constituting a disk array control unit in the disk array device, when the failed disk device is replaced with a normal disk device, without stopping the operation of the reconfigured disk array and without redundancy of the reconfigured disk array. Exchange the data of the spare disk unit while maintaining the status A first step of restoring to a normal disk device, and a second step of incorporating the replaced normal disk device into the reconfigurable disk array and disconnecting the spare disk device from the reconfigurable disk array. A machine-readable recording medium that records a program to be executed.