JP2000242434A

Movatterモバイル変換

Info

Publication number: JP2000242434A
Application number: JP11344260A
Authority: JP
Inventors: Naoto Matsunami; 直人松並; Takashi Oeda; 高大枝; Akira Yamamoto; 山本　　彰; Yasuyuki Ajimatsu; 康行味松; Masahiko Sato; 雅彦佐藤
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1998-12-22
Filing date: 1999-12-03
Publication date: 2000-09-08

Abstract

(57)【要約】【課題】計算機システムの規模、要求などに応じた記憶
装置システムを構築でき、将来における記憶装置システ
ムの拡張、信頼性の向上を容易に実現できるようにす
る。【解決手段】記憶装置システム１は、データを保持する
記憶装置とそれを制御する制御装置を有する複数のサブ
セット１０とサブセット１０とホスト３０との間に配置
されるスイッチ装置２０を有する。スイッチ装置２０
は、記憶装置システム１の構成を管理する管理情報を保
持する管理テーブルを有し、管理情報に従ってホスト３
０が出力するフレーム情報に含まれるアドレス情報を変
換してフレーム情報をサブセット１０に振り分ける。(57) [Summary] A storage device system can be constructed according to the scale, requirements, and the like of a computer system, and expansion and reliability improvement of the storage device system in the future can be easily realized. A storage device system includes a plurality of subsets having a storage device that holds data and a control device that controls the storage device, and a switch device disposed between the subset and a host. Switch device 20
Has a management table for holding management information for managing the configuration of the storage device system 1, and according to the management information, the host 3
0 converts the address information included in the frame information output and distributes the frame information to the subset 10.

Description

Translated fromJapanese

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、複数のディスク装
置を制御するディスク制御システムの実現方法に関し、
特に、ディスク制御システムの高速化、低コスト化、コ
ストパフォーマンスの向上の方法に関する。The present invention relates to a method for implementing a disk control system for controlling a plurality of disk devices,
In particular, the present invention relates to a method for increasing the speed, reducing the cost, and improving the cost performance of a disk control system.

【０００２】[0002]

【従来の技術】計算機システムに用いられる記憶装置シ
ステムとして、複数のディスク装置を制御するディスク
アレイシステムがある。ディスクアレイシステムについ
ては、例えば、“A Case for Redundant Arrays of Ine
xpensive Disks (RAID)”; InProc. ACM SIGMOD, June
1988（カリフォルニア大学バークレー校発行）に開示さ
れている。ディスクアレイは、複数のディスク装置を並
列に動作させることで、ディスク装置を単体で用いた記
憶装置システムに比べ高速化を実現する技術である。2. Description of the Related Art As a storage device system used in a computer system, there is a disk array system for controlling a plurality of disk devices. For the disk array system, for example, see “A Case for Redundant Arrays of Ine
xpensive Disks (RAID) ”; InProc. ACM SIGMOD, June
1988 (published by the University of California, Berkeley). The disk array is a technology that operates a plurality of disk devices in parallel to achieve higher speed as compared with a storage device system using a single disk device.

【０００３】複数のディスクアレイシステムを、複数の
ホストと相互に接続する方法として、ファイバチャネル
（Fibre Channel）のFabricを使用した方法がある。こ
の方法を適用した計算機システムの例が、日経エレクト
ロニクス1995.7.3（no.639）「シリアルSCSIがいよいよ
市場へ」P.79 図３に示されている。ここに開示される
計算機システムでは、複数のホストコンピュータ（以下
では単にホストと呼ぶ）と複数のディスクアレイシステ
ムが、それぞれ、ファイバチャネルを介してファブリッ
ク装置に接続される。ファブリック装置は、ファイバチ
ャネルのスイッチであり、ファブリック装置に接続する
任意の装置間の転送路の接続を行う。ファブリック装置
はファイバチャネルのパケットである「フレーム」の転
送に対し透過であり、ホストとディスクアレイシステム
は、互いにファブリック装置を意識することなく２点間
で通信を行う。As a method for interconnecting a plurality of disk array systems with a plurality of hosts, there is a method using Fiber Channel Fabric. An example of a computer system to which this method is applied is shown in Nikkei Electronics 1995.7.3 (No. 639) "Serial SCSI is finally on the market" P.79 FIG. In the computer system disclosed herein, a plurality of host computers (hereinafter simply referred to as hosts) and a plurality of disk array systems are respectively connected to a fabric device via a fiber channel. The fabric device is a fiber channel switch, and connects a transfer path between arbitrary devices connected to the fabric device. The fabric device is transparent to the transfer of a “frame” that is a fiber channel packet, and the host and the disk array system communicate between two points without being aware of the fabric device.

【０００４】[0004]

【発明が解決しようとする課題】従来のディスクアレイ
システムでは、大容量化のためディスク装置の台数を増
やし、高性能化のため台数に見合った性能を有するコン
トローラを実現しようとすると、コントローラの内部バ
スの性能限界や、転送制御を行うプロセッサの性能限界
が顕在化する。このような問題に対処するために、内部
バスを拡張し、プロセッサ数を増加することが行われて
いる。しかし、このような対処の仕方は、多数のバス制
御によるコントローラ構成の複雑化や、プロセッサ間の
共有データの排他制御等による制御ソフトの複雑化とオ
ーバヘッドの増加を招く。このため、コストを非常に上
昇させるとともに、性能は頭打ちになり、その結果、コ
ストパフォーマンスが悪化する。また、このような装置
は、大規模なシステムでは、そのコストに見合った性能
が実現できるものの、規模がそれほど大きくないシステ
ムには見合わない、拡張性が制限される、開発期間の増
大と開発コストの上昇を招くといった課題がある。In the conventional disk array system, if the number of disk drives is increased for increasing the capacity and a controller having a performance corresponding to the number is to be realized for higher performance, the internal capacity of the controller is increased. The performance limit of a bus and the performance limit of a processor that performs transfer control become apparent. To cope with such a problem, an internal bus is expanded to increase the number of processors. However, such a countermeasure causes a complicated controller configuration by controlling a large number of buses, a complicated control software due to exclusive control of shared data between processors, and an increase in overhead. For this reason, the cost is greatly increased, and the performance is leveled off. As a result, the cost performance deteriorates. In addition, such a device can achieve performance commensurate with the cost of a large-scale system, but cannot meet the cost of a system of a small size, has limited scalability, and has an increased development time and There is a problem that the cost rises.

【０００５】複数のディスクアレイシステムを並べファ
ブリック装置で相互接続することによって、システム全
体としての大容量化、高性能化を行うことが可能であ
る。しかし、この方法では、ディスクアレイシステム間
に関連性は全くなく、特定のディスクアレイシステムに
アクセスが集中したとしてもそれを他の装置に分散する
ことができないので、実使用上の高性能化が実現できな
い。また、ホストから見た論理的なディスク装置（論理
ユニットと呼ぶ）の容量は、１台のディスクアレイシス
テムの容量に制限されるので、論理ユニットの大容量化
は実現できない。By arranging a plurality of disk array systems and interconnecting them with a fabric device, it is possible to increase the capacity and performance of the entire system. However, in this method, there is no relevance between the disk array systems, and even if access is concentrated on a specific disk array system, it cannot be distributed to other devices. I can't. Further, since the capacity of a logical disk device (referred to as a logical unit) as viewed from the host is limited to the capacity of one disk array system, the capacity of the logical unit cannot be increased.

【０００６】ディスクアレイシステム全体を高信頼化し
ようとした際に、ホストが備えているミラーリング機能
を用いて２台のディスクアレイシステムによるミラー構
成を実現することができるが、ホストによるミラーリン
グのための制御オーバヘッドが発生し、システム性能が
制限されるという課題がある。また、多数のディスクア
レイシステムがシステム内に個別に存在すると、システ
ム管理者が管理するための負荷が増加する。このため、
多数の保守人員、複数台分の保守費用が必要になる等、
管理コストが増加する。さらに、複数のディスクアレイ
システム、ファブリック装置は、それぞれ独立した装置
であるので、各種設定は、それぞれの装置毎に異なる方
法で実施する必要がある。このため、管理者のトレーニ
ングや、操作時間の増大にともない運用コストが増大す
る。When trying to make the entire disk array system highly reliable, a mirror configuration using two disk array systems can be realized by using the mirroring function of the host. There is a problem that control overhead occurs and system performance is limited. In addition, when a large number of disk array systems exist individually in the system, a load for management by a system administrator increases. For this reason,
A large number of maintenance personnel, maintenance costs for multiple units are required,
Administration costs increase. Further, since the plurality of disk array systems and the fabric devices are independent devices, various settings need to be performed by different methods for each device. For this reason, the operation cost increases with the training of the administrator and the increase in the operation time.

【０００７】本発明の目的は、これら従来技術における
課題を解決し、計算機システムの規模、要求などに応じ
た記憶装置システムを構築でき、将来における記憶装置
システムの拡張、信頼性の向上などに容易に対応するこ
とのできる記憶装置システムを実現することにある。SUMMARY OF THE INVENTION An object of the present invention is to solve these problems in the prior art, to construct a storage system according to the scale and requirements of a computer system, and to easily expand the storage system and improve reliability in the future. Another object of the present invention is to realize a storage device system that can cope with the above.

【０００８】[0008]

【課題を解決するための手段】本発明の記憶装置システ
ムは、データを保持する記憶媒体を有する記憶装置と、
この記憶装置を制御する制御装置とを有する複数の記憶
装置サブシステム、複数の記憶装置サブシステムに保持
されるデータを使用する計算機に接続された第１のイン
タフェースノード、各々が記憶装置サブシステムのいず
れかに接続された複数の第２のインタフェースノード、
及び第１のインタフェースノード及び複数の第２のイン
タフェースノードが接続され、第１のインタフェースノ
ードと複数の第２のインタフェースノードとの間でフレ
ームの転送を行う転送手段を有する。A storage device system according to the present invention includes a storage device having a storage medium for holding data;
A plurality of storage subsystems having a control device for controlling the storage subsystem; a first interface node connected to a computer using data held in the plurality of storage subsystems; A plurality of second interface nodes connected to either,
And a first interface node and a plurality of second interface nodes are connected to each other, and a transfer unit for transferring a frame between the first interface node and the plurality of second interface nodes is provided.

【０００９】好ましくは、第１のインタフェースノード
は、記憶装置システムの構成情報を格納した構成管理テ
ーブルと、計算機から送られてくるフレームに応答し
て、該フレームを解析し、構成管理テーブルに保持され
た構成情報に基づいてそのフレームの転送先に関する情
報変換して転送手段に転送する。Preferably, the first interface node analyzes the frame in response to a frame sent from a computer and stores the configuration information in a configuration management table storing configuration information of the storage device system. The information on the transfer destination of the frame is converted based on the obtained configuration information and transferred to the transfer unit.

【００１０】また、フレームの転送に際して、第１のイ
ンタフェースノードは、そのフレームを受け取るべきノ
ードのノードアドレス情報をフレームに付加する。転送
手段はフレームに付加されたノードアドレス情報に従っ
てフレームを転送する。第２のインタフェースノード
は、転送手段から受け取ったフレームからノードアドレ
ス情報を除いてフレームを再形成し、目的の記憶装置サ
ブシステムに転送する。In transferring a frame, the first interface node adds node address information of a node that should receive the frame to the frame. The transfer means transfers the frame according to the node address information added to the frame. The second interface node re-creates the frame received from the transfer unit by removing the node address information, and transfers the frame to the target storage subsystem.

【００１１】本発明のある態様において、記憶装置シス
テムは、転送手段に接続する管理プロセッサを有する。
管理プロセッサは、オペレータからの指示に従って、構
成管理テーブルに構成情報を設定する。構成情報には、
計算機からのアクセスを制限する情報が含まれる。In one embodiment of the present invention, the storage device system has a management processor connected to the transfer means.
The management processor sets the configuration information in the configuration management table according to an instruction from the operator. The configuration information includes
Information that restricts access from the computer is included.

【００１２】[0012]

【発明の実施の形態】［第１実施形態］図１は、本発明
が適用されたディスクアレイシステムを用いたコンピュ
ータシステムの一実施形態における構成図である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS [First Embodiment] FIG. 1 is a block diagram showing an embodiment of a computer system using a disk array system to which the present invention is applied.

【００１３】1はディスクアレイシステム、３０はディ
スクアレイシステムが接続されるホストコンピュータ
（ホスト）である。ディスクアレイシステム1は、ディ
スクアレイサブセット１０、ディスクアレイスイッチ２
０、ディスクアレイシステム全体の設定管理を行うディ
スクアレイシステム構成管理手段７０、ディスクアレイ
スイッチ２０とディスクアレイシステム構成管理手段７
０との間、およびディスクアレイサブセット１０ディス
クアレイシステム構成管理手段７０との間の通信インタ
フェース（通信Ｉ／Ｆ）８０を有する。ホスト３０とデ
ィスクアレイシステム1とは、ホストインタフェース
（ホストＩ／Ｆ）３１で接続されており、ホストＩ／Ｆ
３１はディスクアレイシステム1のディスクアレイスイ
ッチ２０に接続する。ディスクアレイシステム1の内部
において、ディスクアレイスイッチ２０とディスクアレ
イサブセット１０は、ディスクアレイインタフェース
（ディスクアレイＩ／Ｆ２１）で接続される。1 is a disk array system, and 30 is a host computer (host) to which the disk array system is connected. The disk array system 1 includes a disk array subset 10 and a disk array switch 2
0, a disk array system configuration management unit 70 for managing settings of the entire disk array system, a disk array switch 20 and a disk array system configuration management unit 7
0, and a communication interface (communication I / F) 80 with the disk array subset 10 and the disk array system configuration management means 70. The host 30 and the disk array system 1 are connected by a host interface (host I / F) 31, and the host I / F
31 is connected to the disk array switch 20 of the disk array system 1. Inside the disk array system 1, the disk array switch 20 and the disk array subset 10 are connected by a disk array interface (disk array I / F 21).

【００１４】ホスト３０、ディスクアレイサブセット１
０は、図では、各々４台示されているが、この台数に関
しては制限はなく任意である。ホスト３０とディスクア
レイサブセット１０の台数が異なっても構わない。ま
た、ディスクアレイスイッチ２０は、本実施形態では図
示の通り二重化されている。各ホスト３０および各ディ
スクアレイサブセット１０は、それぞれ別々のホストＩ
／Ｆ３１、ディスクアレイＩ／Ｆ２１で二重化されたデ
ィスクアレイスイッチ２０の双方に接続されている。こ
れは、一方のディスクアレイスイッチ２０、ホストＩ／
Ｆ３１、あるいはディスクアレイＩ／Ｆ２１が故障して
も他方を使用することでホスト３０からディスクアレイ
システム1へのアクセスを可能とし、高い可用性を実現
するためである。しかし、このような二重化は必ずしも
必須ではなく、システムに要求される信頼性レベルに応
じて選択可能である。Host 30, Disk Array Subset 1
Although 0 is shown in the figure for each of the four units, there is no limitation on the number, and the number is arbitrary. The number of hosts 30 and the number of disk array subsets 10 may be different. In this embodiment, the disk array switch 20 is duplicated as shown. Each host 30 and each disk array subset 10 have separate host I
/ F31 and the disk array switch 20 duplexed by the disk array I / F 21. This is because one of the disk array switches 20 and the host I /
This is because even if the F31 or the disk array I / F 21 fails, the other can be used to access the disk array system 1 from the host 30, thereby realizing high availability. However, such duplication is not always necessary, and can be selected according to the reliability level required for the system.

【００１５】図２は、ディスクアレイサブセット１０の
一構成例を示す構成図である。１０１は上位システム
（ホスト１０）からのコマンドを解釈してキャッシュヒ
ットミス判定を実施し、上位システムとキャッシュ間の
データ転送を制御する上位アダプタ、１０２はディスク
データアクセス高速化のためのキャッシュ、および、マ
ルチプロセッサ間の共有データを格納する共有メモリ
（以下キャッシュ・共有メモリと呼ぶ）、１０４はディ
スクアレイサブセット10内に格納される複数のディスク
ユニットである。１０３はディスクユニット１０４を制
御し、ディスクユニット１０４とキャッシュ間のデータ
転送を制御する下位アダプタである。１０６はディスク
アレイサブセット構成管理手段であり、ディスクアレイ
システム1全体を管理するディスクアレイシステム構成
管理手段７０と通信Ｉ／Ｆ８０を介して通信し、構成パ
ラメータの設定や、障害情報の通報等の管理を行う。FIG. 2 is a configuration diagram showing an example of the configuration of the disk array subset 10. As shown in FIG. Reference numeral 101 denotes a host adapter that interprets a command from the host system (host 10) to determine a cache hit / miss and controls data transfer between the host system and the cache. 102 denotes a cache for accelerating disk data access. A shared memory (hereinafter referred to as cache / shared memory) 104 for storing shared data between multiprocessors; and 104, a plurality of disk units stored in the disk array subset 10. A lower adapter 103 controls the disk unit 104 and controls data transfer between the disk unit 104 and the cache. Reference numeral 106 denotes a disk array subset configuration management unit that communicates with the disk array system configuration management unit 70 that manages the entire disk array system 1 via the communication I / F 80, and manages configuration parameter setting, failure information reporting, and the like. I do.

【００１６】上位アダプタ１０１、キャッシュ・共有メ
モリ１０２、下位アダプタ１０３はそれぞれ二重化され
ている。この理由は上記ディスクアレイスイッチ２０の
二重化と同様、高可用性を実現するためであり必須では
ない。また、各ディスクユニット１０４は、二重化され
た下位アダプタ１０３のいずれからも制御可能である。
本実施形態では、低コスト化の観点から同一のメモリ手
段をキャッシュと共有メモリに共用しているが、これら
は勿論分離することも可能である。The upper adapter 101, the cache / shared memory 102, and the lower adapter 103 are each duplicated. The reason for this is to realize high availability as in the case of the above-mentioned duplexing of the disk array switch 20, and is not essential. Further, each disk unit 104 can be controlled from any of the duplexed lower adapters 103.
In the present embodiment, the same memory means is shared between the cache and the shared memory from the viewpoint of cost reduction, but these can of course be separated.

【００１７】上位アダプタ１０１は、上位アダプタ１０
１の制御を実行する上位ＭＰＵ１０１０、上位システ
ム、すなわちディスクアレイスイッチ２０との接続Ｉ／
ＦであるディスクアレイＩ／Ｆ２１を制御するディスク
アレイＩ／Ｆコントローラ１０１１、キャッシュ・共有
メモリ１０２と上位ＭＰＵ１０１０とディスクアレイＩ
／Ｆコントローラ１０１１との間の通信、データ転送を
行う上位バス１０１２を含む。The upper adapter 101 is a host adapter 10
1 which controls the upper MPU 1010 and the upper system, ie, the connection I / O with the disk array switch 20.
A disk array I / F controller 1011 for controlling the disk array I / F 21 as the F, the cache / shared memory 102, the upper MPU 1010, and the disk array I
/ F controller 1011 and an upper bus 1012 for performing data transfer and communication with the controller 1011.

【００１８】図では各上位アダプタ１０１毎に1台のデ
ィスクアレイＩ／Ｆコントローラ１０１１が示されてい
るが、１つの上位アダプタに対し、複数のディスクアレ
イＩ／Ｆコントローラ１０１１を設けてもよい。In the figure, one disk array I / F controller 1011 is shown for each upper adapter 101, but a plurality of disk array I / F controllers 1011 may be provided for one upper adapter.

【００１９】下位アダプタ１０３は、下位アダプタ１０
３の制御を実行する下位ＭＰＵ１０３０、ディスク１０
４とのインタフェースであるディスクＩ／Ｆを制御する
ディスクＩ／Ｆコントローラ１０３１、キャッシュ・共
有メモリ１０２と下位ＭＰＵ１０３０とディスクＩ／Ｆ
コントローラ１０３１との間の通信、データ転送を行う
下位バス１０３２を含む。The lower adapter 103 includes the lower adapter 10
MPU 1030 that executes the control of the third disk 10
I / F controller 1031, which controls the disk I / F which is an interface with the C.4, the cache / shared memory 102, the lower MPU 1030, and the disk I / F
A lower bus 1032 for performing communication and data transfer with the controller 1031 is included.

【００２０】図では各下位アダプタ１０３毎に4台のデ
ィスクＩ／Ｆコントローラ１０３１が示されているが、
その数は任意であり、ディスクアレイの構成や、接続す
るディスク台数に応じて変更可能である。In the figure, four disk I / F controllers 1031 are shown for each lower adapter 103.
The number is arbitrary and can be changed according to the configuration of the disk array and the number of disks to be connected.

【００２１】図３は、ディスクアレイスイッチ２０の一
構成例を示す構成図である。２００はディスクアレイス
イッチ全体の制御および管理を行うプロセッサである管
理プロセッサ（ＭＰ）、２０１はｎ×ｎの相互スイッチ
経路を構成するクロスバスイッチ、２０２はディスクア
レイＩ／Ｆ２１毎に設けられるディスクアレイＩ／Ｆノ
ード、２０３はホストＩ／Ｆ３１毎に設けられるホスト
Ｉ／Ｆノード、２０４はディスクアレイシステム構成管
理手段７０との間の通信を行う通信コントローラであ
る。２０２０はディスクアレイＩ／Ｆノード２０２とク
ロスバスイッチ２０１を接続するパス、２０３０はホス
トＩ／Ｆノード２０３とクロスバスイッチ２０１を接続
するパス、２０４０は他のディスクアレイスイッチ２０
と接続し、クラスタを構成するためのクラスタ間Ｉ／
Ｆ、２０５０はＭＰ２００とクロスバスイッチ２０１を
接続するためのパスである。FIG. 3 is a configuration diagram showing an example of the configuration of the disk array switch 20. Reference numeral 200 denotes a management processor (MP) which is a processor for controlling and managing the entire disk array switch; 201, a crossbar switch constituting an n × n mutual switch path; and 202, a disk array I / F provided for each disk array I / F 21. A / F node, 203 is a host I / F node provided for each host I / F 31, and 204 is a communication controller that performs communication with the disk array system configuration management means 70. Reference numeral 2020 denotes a path connecting the disk array I / F node 202 and the crossbar switch 201, reference numeral 2030 denotes a path connecting the host I / F node 203 and the crossbar switch 201, and reference numeral 2040 denotes another disk array switch 20.
And I / O between clusters to configure a cluster
F and 2050 are paths for connecting the MP 200 and the crossbar switch 201.

【００２２】図４はクロスバスイッチ２０１の構造を示
す構成図である。２０１０はクロスバスイッチ２０１に
接続するパス２０２０、２０３０、２０５０、およびク
ラスタ間Ｉ／Ｆ２０４０を接続するポートであるスイッ
チングポート（ＳＷＰ）である。ＳＷＰ２０１０はすべ
て同一の構造を有し、あるＳＷＰから他のＳＷＰへの転
送経路のスイッチング制御を行う。図では１つのＳＷＰ
についてのみ転送経路を示しているが、すべてのＳＷＰ
間で同様の転送経路が存在する。FIG. 4 is a block diagram showing the structure of the crossbar switch 201. Reference numeral 2010 denotes a switching port (SWP) which is a port connecting the paths 2020, 2030, and 2050 connected to the crossbar switch 201 and the inter-cluster I / F 2040. The SWPs 2010 all have the same structure, and perform switching control of a transfer path from one SWP to another SWP. In the figure, one SWP
, The transfer path is shown only for all SWPs.
A similar transfer path exists between them.

【００２３】図５は、ホストＩ／Ｆノード２０３の一構
成例を示す構成図である。本実施形態では、具体的に説
明をするためにホストＩ／Ｆ３１とディスクアレイＩ／
Ｆ２１の両方にファイバチャネルを使用するものと仮定
する。もちろんホストＩ／Ｆ３１とディスクアレイＩ／
Ｆ２１として、ファイバチャネル以外のインタフェース
を適用することも可能である。ホストＩ／Ｆノード２０
３とディスクアレイＩ／Ｆノード２０２の両方に同一の
インタフェースを使用することで、両者を同一構造にで
きる。本実施形態においては、ディスクアレイＩ／Ｆノ
ード２０２も図に示すホストＩ／Ｆノード２０３と同様
に構成される。以下では、ホストＩ／Ｆノード２０３を
例に説明を行う。FIG. 5 is a configuration diagram showing an example of the configuration of the host I / F node 203. In this embodiment, the host I / F 31 and the disk array I / F 31
Assume that Fiber Channel is used for both F21. Of course, host I / F 31 and disk array I / F
As F21, an interface other than the fiber channel can be applied. Host I / F node 20
3 and the disk array I / F node 202, by using the same interface, both can have the same structure. In the present embodiment, the disk array I / F node 202 is configured similarly to the host I / F node 203 shown in FIG. Hereinafter, the host I / F node 203 will be described as an example.

【００２４】２０２１は受信したファイバチャネルフレ
ーム（以下単にフレームと呼ぶ）をどのノードに転送す
るかを検索する検索プロセッサ（ＳＰ）、２０２２はホ
スト３０（ディスクアレイＩ／Ｆノード２０２の場合
は、ディスクアレイサブセット１０）との間でフレーム
を送受信するインタフェースコントローラ（ＩＣ）、２
０２２はＩＣ２０２３が受信したフレームに対しＳＰ２
０２１が検索した結果に基づいて変換を施すスイッチン
グコントローラ（ＳＣ）、２０２４はＳＣ２０２１が変
換したフレームを他のノードに転送するためにクロスバ
スイッチ２０１を通過できる形式にパケット化するパケ
ット生成部（ＳＰＧ）、２０２５は受信したフレームを
一時的に格納するフレームバッファ（ＦＢ）、２０２６
は一つのホストからのディスクアレイアクセス要求コマ
ンド（以下単にコマンドと呼ぶ）に対応した複数のフレ
ーム列であるエクスチェンジ（Exchange）を識別するた
めのエクスチェンジ番号を管理するエクスチェンジテー
ブル（ＥＴ）、２０２７は複数のディスクアレイサブセ
ット１０の構成情報を格納するディスクアレイ構成管理
テーブル（ＤＣＴ）である。Reference numeral 2021 denotes a search processor (SP) for searching for a node to which a received fiber channel frame (hereinafter, simply referred to as a frame) is to be transferred, and reference numeral 2022 denotes a host 30 (in the case of the disk array I / F node 202, a disk An interface controller (IC) for transmitting and receiving frames to and from the array subset 10), 2
022 is SP2 for the frame received by the IC 2023.
A switching controller (SC) 2021 performs conversion based on the search result, and a packet generation unit (SPG) 2024 converts the frame converted by the SC 2021 into a format that can pass through the crossbar switch 201 to transfer the frame to another node. , 2025 are frame buffers (FB) for temporarily storing received frames, 2026
Is an exchange table (ET) that manages an exchange number for identifying an exchange (Exchange) that is a plurality of frame strings corresponding to a disk array access request command (hereinafter simply referred to as a command) from one host. 3 is a disk array configuration management table (DCT) for storing configuration information of the disk array subset 10 of FIG.

【００２５】ディスクアレイスイッチ２０の各構成部
は、すべてハードウェアロジックで構成されることが性
能上望ましい。しかし、求められる性能を満足できるな
らば、汎用プロセッサを用いたプログラム制御によりＳ
Ｐ２０２１やＳＣ２０２２の機能を実現することも可能
である。It is desirable in terms of performance that all components of the disk array switch 20 are constituted by hardware logic. However, if the required performance can be satisfied, S
It is also possible to realize the functions of P2021 and SC2022.

【００２６】各ディスクアレイサブセット１０は、各々
が有するディスクユニット１０４を１または複数の論理
的なディスクユニットとして管理している。この論理的
なディスクユニットを論理ユニット（ＬＵ）と呼ぶ。Ｌ
Ｕは、物理的なディスクユニット１０４と１対１で対応
する必要はなく、１台のディスクユニット１０４に複数
のＬＵが構成され、あるいは、複数のディスクユニット
１０４で１つのＬＵが構成されても構わない。Each disk array subset 10 manages its own disk unit 104 as one or more logical disk units. This logical disk unit is called a logical unit (LU). L
The U does not need to correspond one-to-one with the physical disk unit 104, and a plurality of LUs may be configured in one disk unit 104, or a single LU may be configured in a plurality of disk units 104. I do not care.

【００２７】ディスクアレイサブセット１０の外部から
見た場合、１つのＬＵは、１台のディスク装置として認
識される。本実施形態では、ディスクアレイスイッチ２
０によりさらに論理的なＬＵが構成され、ホスト３０
は、このＬＵに対してアクセスするように動作する。本
明細書では、１つのＬＵでホスト３０から認識される１
つのＬＵが構成される場合、ホスト３０により認識され
るＬＵを独立ＬＵ（ＩＬＵ）、複数のＬＵでホスト３０
から認識される１つのＬＵが構成される場合、ホスト３
０により認識されるＬＵを統合ＬＵ（ＣＬＵ）と呼ぶ。When viewed from the outside of the disk array subset 10, one LU is recognized as one disk device. In the present embodiment, the disk array switch 2
0 constitutes a further logical LU, and the host 30
Operates to access this LU. In this specification, one LU recognized by the host 30 by one LU
When one LU is configured, an LU recognized by the host 30 is an independent LU (ILU), and a plurality of LUs
When one LU recognized from is configured, the host 3
The LU recognized by 0 is called an integrated LU (CLU).

【００２８】図１２に、４つのディスクアレイサブセッ
トのＬＵで１つの統合ＬＵが構成される場合における各
階層間でのアドレス空間の対応関係を示す。図におい
て、１０００は、一例として、ホスト“＃２”からみた
ディスクアレイシステム１の１つの統合ＬＵにおけるア
ドレス空間、１１００は、ディスクアレイサブセット１
０のＬＵのアドレス空間、１２００はディスクユニット
１０４（ここでは、ディスクアレイサブセット“＃０”
についてのみ図示されている）のアドレス空間を示して
いる。FIG. 12 shows the correspondence of the address space between the respective layers when one integrated LU is constituted by the LUs of the four disk array subsets. In the figure, reference numeral 1000 denotes, for example, an address space in one integrated LU of the disk array system 1 viewed from the host “# 2”, and 1100 denotes a disk array subset 1
0 LU address space 1200 is the disk unit 104 (here, the disk array subset “# 0”).
(Only shown in FIG. 2).

【００２９】各ディスクアレイサブセット１０のＬＵ
は、ここでは、４台のディスクユニット１０４によりＲ
ＡＩＤ５（ＲｅｄｕｎｄａｎｔＡｒｒａｙｓｏｆ
ＩｎｅｘｐｅｎｓｉｖｅＤｉｓｋｓＬｅｖｅｌ
５）型ディスクアレイとして構成されるものとする。各
ディスクアレイサブセット１０は、それぞれｎ０、ｎ
１、ｎ２、ｎ３の容量を有するＬＵを持つ。ディスクア
レイスイッチ２０は、これら４つのＬＵの持つアドレス
空間を（ｎ０＋ｎ１＋ｎ２＋ｎ３）の容量を有するアド
レス空間に統合し、ホスト３０から認識される統合ＬＵ
を実現する。The LU of each disk array subset 10
Here, R is calculated by four disk units 104.
AID5 (Redundant Arrays of
Inexpensive Disks Level
5) It is configured as a type disk array. Each disk array subset 10 has n0, n
It has LUs with capacities of 1, n2 and n3. The disk array switch 20 integrates the address space of these four LUs into an address space having a capacity of (n0 + n1 + n2 + n3), and the integrated LU recognized by the host 30.
To achieve.

【００３０】本実施形態では、例えば、ホスト＃２が領
域Ａ１００１をアクセスする場合、領域Ａ１００１を指
定したアクセス要求は、ディスクアレイスイッチ２０に
よりディスクアレイサブセット＃０のＬＵの領域Ａ′１
１０１をアクセスするための要求に変換されてディスク
アレイサブセット＃０に転送される。ディスクアレイサ
ブセット＃０は、領域Ａ′１１０１をさらに、ディスク
ユニット１０４上の領域Ａ″１２０１にマッピングして
アクセスを行う。アドレス空間１０００とアドレス空間
１１００との間のマッピングは、ディスクアレイスイッ
チ２０が有するＤＣＴ２０７に保持された構成情報に基
づき行われる。この処理の詳細については後述する。な
お、ディスクアレイサブセット内におけるマッピングに
ついては、既によく知られた技術であり、本明細書では
詳細な説明については省略する。In this embodiment, for example, when the host # 2 accesses the area A1001, an access request designating the area A1001 is sent by the disk array switch 20 to the LU area A'1 of the disk array subset # 0.
The request 101 is converted into a request to access 101 and transferred to the disk array subset # 0. The disk array subset # 0 performs access by mapping the area A ′ 1101 to the area A ″ 1201 on the disk unit 104. The mapping between the address space 1000 and the address space 1100 is performed by the disk array switch 20. This processing is performed based on the configuration information held in the DCT 207. The details of this processing will be described later, and the mapping in the disk array subset is a well-known technique, and will be described in detail in this specification. Is omitted.

【００３１】本実施形態において、ＤＣＴ２０７は、シ
ステム構成テーブルとサブセット構成テーブルを含む。
図６は、システム構成テーブルの構成を、図７は、サブ
セット構成テーブルの構成を示す。In this embodiment, the DCT 207 includes a system configuration table and a subset configuration table.
FIG. 6 shows the configuration of the system configuration table, and FIG. 7 shows the configuration of the subset configuration table.

【００３２】図７に示すように、システム構成テーブル
２０２７０は、ホストＬＵの構成を示す情報を保持する
ホストＬＵ構成テーブル20271、及びディスクアレイス
イッチ２０のディスクアレイＩ／Ｆノード２０２とディ
スクアレイサブセット１０との接続関係を示すディスク
アレイＩ／Ｆノード構成テーブル20272を有する。As shown in FIG. 7, the system configuration table 20270 includes a host LU configuration table 20271 holding information indicating the configuration of the host LU, the disk array I / F node 202 of the disk array switch 20, and the disk array subset 10 And a disk array I / F node configuration table 20272 indicating the connection relationship with.

【００３３】ホストＬＵ構成テーブル20271は、ホスト
３０からみたＬＵごとに、そのＬＵを識別する番号であ
るHost-LU No.、ＬＵの属性を示すLU Type、CLU Clas
s、及びCLU Stripe Size、ホストＬＵの状態を示す情報
であるCondition、ホストＬＵを構成するディスクアレ
イサブセット１０のＬＵに関する情報であるＬＵ情報
（LU Info.）を有する。The host LU configuration table 20271 includes, for each LU viewed from the host 30, a Host-LU No., which is a number for identifying the LU, an LU Type indicating the attribute of the LU, and a CLU Clas.
s, CLU Stripe Size, Condition indicating information on the status of the host LU, and LU information (LU Info.) indicating information on LUs of the disk array subset 10 constituting the host LU.

【００３４】LU Typeは、このホストＬＵがＣＬＵであ
るか、ＩＬＵであるかといったＬＵの種類を示す情報で
ある。CLU Classは、LU TypeによりこのホストＬＵがＣ
ＬＵであることが示される場合に、そのクラスが“Join
ed”、“mirrored”、及び“Striped”のいずれである
かを示す情報である。“Joined”は、図１１により説明
したように、いくつかのＬＵを連結して１つの大きな記
憶空間を持つＣＬＵが構成されていることを示す。“Mi
rrored”は、第６実施形態として後述するように、２つ
のＬＵにより二重化されたＬＵであることを示す。“St
riped”は、第７実施形態として後述するように、複数
のＬＵで構成され、データがこれら複数のＬＵに分散し
て格納されたＬＵであることを示す。CLU Stripe Size
は、CLU Classにより「Striped」であることが示される
場合に、ストライピングサイズ（データの分散の単位と
なるブロックのサイズ）を示す。LU Type is information indicating the type of LU such as whether the host LU is a CLU or an ILU. CLU Class indicates that this host LU is C
If the class is indicated as LU, the class is "Join
It is information indicating any of “ed”, “mirrored”, and “Striped.” “Joined” has one large storage space by connecting several LUs as described with reference to FIG. Indicates that a CLU has been configured.
“rrored” indicates that the LU is duplicated by two LUs, as described later in the sixth embodiment.
The “riped” indicates that the LU is composed of a plurality of LUs and is an LU in which data is distributed and stored in the plurality of LUs, as described later in the seventh embodiment. CLU Stripe Size
Indicates the striping size (the size of a block serving as a unit of data distribution) when the CLU Class indicates “Striped”.

【００３５】Conditionにより示される状態には、“Nor
mal”、“Warning”、“Fault”、及び“Not Defined”
の４種類がある。“Normal”はこのホストＬＵが正常な
状態であることを示す。“Warning”は、このホストＬ
Ｕを構成するＬＵに対応するいずれかのディスクユニッ
トに障害が発生している等の理由により縮退運転が行わ
れていることを示す。“Fault”は、ディスクアレイサ
ブセット１０の故障などによりこのホストＬＵを運転す
ることができないことを示す。“Not Defined”は、対
応するHost-LU No.のホストＬＵが定義されていないこ
とを示す。The condition indicated by Condition includes “Nor
mal ”,“ Warning ”,“ Fault ”, and“ Not Defined ”
There are four types. “Normal” indicates that this host LU is in a normal state. “Warning” is the host L
This indicates that the degenerate operation is being performed, for example, because a failure has occurred in any disk unit corresponding to the LU configuring U. “Fault” indicates that the host LU cannot be operated due to a failure of the disk array subset 10 or the like. “Not Defined” indicates that the host LU of the corresponding Host-LU No. is not defined.

【００３６】LU Infoは、このホストＬＵを構成するＬ
Ｕについて、そのＬＵが属するディスクアレイサブセッ
ト１０を特定する情報、ディスクアレイサブセット内で
のＬＵＮ、及びそのサイズを示す情報を含む。ホストＬ
ＵがＩＬＵの場合には、唯一のＬＵに関する情報が登録
される。ホストＬＵがＣＬＵの場合には、それを構成す
る全てのＬＵについて、それぞれのＬＵに関する情報が
登録される。例えば、図において、Host-LU No.が
“０”であるHost-LUは、ディスクアレイサブセット
“＃０”のＬＵＮ“０”、ディスクアレイサブセット
“＃１”のＬＵＮ“０”、ディスクアレイサブセット
“＃２”のＬＵＮ“０”、ディスクアレイサブセット
“＃３”のＬＵＮ“０”の4つのＬＵから構成されるＣ
ＬＵであり、そのＣＬＵクラスが“Joined”であるＣＬ
Ｕであることが分かる。LU Info is an L that constitutes this host LU.
For U, the information includes information specifying the disk array subset 10 to which the LU belongs, LUNs in the disk array subset, and information indicating the size. Host L
When U is ILU, information on only one LU is registered. When the host LU is a CLU, information on each LU is registered for all the LUs that make up the CLU. For example, in the figure, a Host-LU whose Host-LU No. is “0” is LUN “0” of the disk array subset “# 0”, LUN “0” of the disk array subset “# 1”, and disk array subset C composed of four LUs: LUN “0” of “# 2” and LUN “0” of disk array subset “# 3”
CL which is LU and whose CLU class is "Joined"
It turns out that it is U.

【００３７】ディスクアレイＩ／Ｆノード構成テーブル
20272は、ディスクアレイＩ／Ｆ２１が接続するディス
クアレイサブセット１０のポートごとに、どのディスク
アレイスイッチ２０のディスクアレイＩ／Ｆノード２０
２が接続されるかを示す情報を保持する。Disk array I / F node configuration table
Reference numeral 20272 denotes a disk array I / F node 20 of any disk array switch 20 for each port of the disk array subset 10 to which the disk array I / F 21 connects.
2 is stored.

【００３８】具体的には、ディスクアレイサブセット１
０を特定するSubset No.、ポートを特定するSubset Por
t No.、そのポートに接続するディスクアレイスイッチ
２０を特定するSwitch No.、及びそのディスクアレイス
イッチ２０のディスクアレイＩ／Ｆノード２０２を特定
するI/F Node No.を有する。ディスクアレイサブセット
１０が複数のポートを備えている場合には、そのポート
毎に情報が設定される。Specifically, the disk array subset 1
Subset No. specifying 0, Subset Por specifying port
t No., Switch No. for specifying the disk array switch 20 connected to the port, and I / F Node No. for specifying the disk array I / F node 202 of the disk array switch 20. When the disk array subset 10 has a plurality of ports, information is set for each of the ports.

【００３９】サブセット構成テーブルは、図７に示すよ
うに、各ディスクアレイサブセット１０に対応する複数
のテーブル202720〜202723を有する。各テーブルは、デ
ィスクアレイサブセット１０内で構築されたＲＡＩＤグ
ループの構成を示す情報を保持するＲＡＩＤグループ構
成テーブル202730と、ディスクアレイサブセット１０内
に構築されたＬＵの構成を示す情報を保持するＬＵ構成
テーブル202740を含む。As shown in FIG. 7, the subset configuration table has a plurality of tables 202720 to 202723 corresponding to each disk array subset 10. Each table includes a RAID group configuration table 202730 that holds information indicating the configuration of a RAID group built in the disk array subset 10, and an LU configuration that holds information indicating the configuration of an LU built in the disk array subset 10. Includes table 202740.

【００４０】ＲＡＩＤグループ構成テーブル202730は、
ＲＡＩＤグループに付加された番号を示すGroup No.、
そのＲＡＩＤグループのレベルを示すLevel、そのＲＡ
ＩＤグループを構成するディスクの数を示す情報である
Disks、そのＲＡＩＤグループがＲＡＩＤレベル０，５
等のストライピングされた構成の場合、そのストライプ
サイズを示すStripe Sizeを情報として含む。例えば、
図に示されるテーブルにおいて、ＲＡＩＤグループ
“０”は、４台のディスクユニットにより構成されたＲ
ＡＩＤグループであり、ＲＡＩＤレベルが５、ストライ
プサイズがＳ０である。The RAID group configuration table 202730 includes:
Group No. indicating the number added to the RAID group,
Level indicating the level of the RAID group, the RA
This is information indicating the number of disks constituting the ID group.
Disks, the RAID groups of which are RAID levels 0 and 5
In the case of such a striped configuration, the information includes a Stripe Size indicating the stripe size. For example,
In the table shown in the figure, the RAID group “0” is an R group composed of four disk units.
This is an AID group, the RAID level is 5, and the stripe size is S0.

【００４１】ＬＵ構成テーブル202740は、ＬＵに付加さ
れた番号（ＬＵＮ）を示すLU No.、このＬＵがどのＲＡ
ＩＤグループに構成されているのかを示すRAID Group、
ＬＵの状態を示すCondition、このＬＵのサイズ（容
量）を示すSize、このＬＵがディスクアレイサブセット
１０のどのポートからアクセス可能なのかを示すPort、
及びその代替となるポートを示すAlt. Portを情報とし
て含む。Conditionで示される状態は、ホストＬＵにつ
いてのConditionと同様、“Normal”、“Warning”、
“Fault”、“Not Defined”の４種類がある。Alt. Por
tに設定された情報により特定されるポートは、Portに
設定された情報で特定されるポートに障害が発生したと
きに用いられるが、単に複数のポートから同一のＬＵを
アクセスするために用いることもできる。The LU configuration table 202740 contains an LU number indicating the number (LUN) added to the LU, and which RA
RAID Group that indicates whether it is configured as an ID group,
Condition indicating the status of the LU, Size indicating the size (capacity) of the LU, Port indicating from which port of the disk array subset 10 the LU can be accessed,
And Alt. Port indicating an alternative port are included as information. The state indicated by Condition is “Normal”, “Warning”,
There are four types: “Fault” and “Not Defined”. Alt. Por
The port specified by the information set in t is used when a failure occurs in the port specified by the information set in Port, but it is used simply to access the same LU from multiple ports. Can also.

【００４２】図８は、ファイバチャネルにおけるフレー
ムの構成図である。ファイバチャネルのフレーム４０
は、フレームの先頭を示すＳＯＦ（Start Of Frame）４
００、フレームヘッダ４０１、転送の実態データを格納
する部位であるフレームペイロード４０２、３２ビット
のエラー検出コードであるＣＲＣ（Cyclic RedundancyC
heck）４０３、フレームの最後尾を示すＥＯＦ（End Of
Frame）４０４を含む。フレームヘッダ４０１は、図９
に示すような構造になっており、フレーム転送元のＩＤ
（S_ID）、フレーム転送先のＩＤ（D_ID)、エクスチェ
ンジの起動元、応答先が指定するそれぞれのエクスチェ
ンジＩＤ（OX_ID、RX_ID）、エクスチェンジ中のフレー
ムグループを指定するシーケンスのＩＤ（SEQ_ID）等が
格納されている。FIG. 8 is a configuration diagram of a frame in the fiber channel. Fiber Channel frame 40
Is SOF (Start Of Frame) 4 indicating the beginning of the frame.
00, a frame header 401, a frame payload 402 which stores actual data of transfer, and a CRC (Cyclic Redundancy C) which is a 32-bit error detection code.
heck) 403, EOF (End Of) indicating the end of the frame
Frame) 404. The frame header 401 is shown in FIG.
And the ID of the frame transfer source
(S_ID), ID of frame transfer destination (D_ID), exchange ID (OX_ID, RX_ID) specified by exchange start source and response destination, ID of sequence (SEQ_ID) specifying frame group during exchange, etc. are stored. Have been.

【００４３】本実施形態では、ホスト３０により発行さ
れるフレームには、S_IDとしてホスト３０に割り当てら
れたＩＤが、また、D_IDとしてディスクアレイスイッチ
２０のポートに割り当てられたＩＤが使用される。一つ
のホストコマンドに対し、１ペアのエクスチェンジＩＤ
（OX_ID、RX_ID）が割り当てられる。複数のデータフレ
ームを同一のエクスチェンジに対し発行する必要がある
ときは、その全データフレームに対して同一のSEQ_IDが
割り当てられ、おのおのはシーケンスカウント（SEQ_CN
T）で識別される。フレームペイロード４０２の最大長
は２１１０バイトであり、フレーム種毎に格納される内
容が異なる。例えば、後述するFCP_CMDフレームの場
合、図１０に示すように、ＳＣＳＩのLogical Unit Num
ber（ＬＵＮ）、Command Description Block（ＣＤＢ）
等が格納される。ＣＤＢは、ディスク（ディスクアレ
イ）アクセスに必要なコマンドバイト、転送開始論理ア
ドレス（ＬＢＡ）、転送長（ＬＥＮ）を含む。In this embodiment, the frame issued by the host 30 uses the ID assigned to the host 30 as the S_ID, and the ID assigned to the port of the disk array switch 20 as the D_ID. One pair of exchange ID for one host command
(OX_ID, RX_ID) are assigned. When multiple data frames need to be issued for the same exchange, the same SEQ_ID is assigned to all the data frames, and each data frame is assigned a sequence count (SEQ_CN
T). The maximum length of the frame payload 402 is 2110 bytes, and the content stored differs for each frame type. For example, in the case of an FCP_CMD frame to be described later, as shown in FIG.
ber (LUN), Command Description Block (CDB)
Are stored. The CDB includes a command byte, a transfer start logical address (LBA), and a transfer length (LEN) required for accessing a disk (disk array).

【００４４】以下、本実施形態のディスクアレイシステ
ムの動作を説明する。Hereinafter, the operation of the disk array system of the present embodiment will be described.

【００４５】ディスクアレイシステムを使用するのに先
立ち、ディスクアレイスイッチ２０に対して、ディスク
アレイサブセット１０の構成情報を設定する必要があ
る。システム管理者は、管理端末５からディスクアレイ
システム構成手段７０を介して、すべてのディスクアレ
イサブセット１０およびディスクアレイスイッチ２０の
構成設定情報を獲得する。管理者は、管理端末５から所
望のシステム構成になるよう論理ユニットの構成設定、
RAIDレベルの設定、障害発生時の交代パスの設定等、各
種設定に必要な設定情報を入力する。ディスクアレイシ
ステム構成管理手段７０は、その設定情報を受け、各デ
ィスクアレイサブセット１０およびディスクアレイスイ
ッチ２０に設定情報を転送する。なお、管理端末５にお
ける設定情報の入力については第５実施形態にて別途説
明する。Prior to using the disk array system, it is necessary to set the configuration information of the disk array subset 10 in the disk array switch 20. The system administrator acquires the configuration setting information of all the disk array subsets 10 and the disk array switches 20 from the management terminal 5 via the disk array system configuration means 70. The administrator sets the configuration of the logical unit from the management terminal 5 so as to obtain a desired system configuration,
Enter setting information required for various settings, such as setting of RAID level and setting of alternate path when a failure occurs. The disk array system configuration management means 70 receives the setting information and transfers the setting information to each disk array subset 10 and the disk array switch 20. The input of the setting information in the management terminal 5 will be described separately in a fifth embodiment.

【００４６】ディスクアレイスイッチ２０では、通信コ
ントローラ２０４が設定情報を獲得し、ＭＰ２００によ
り各ディスクアレイサブセット１０のアドレス空間情報
等の構成情報が設定される。ＭＰ２００は、クロスバス
イッチ２０１経由で各ホストＩ／Ｆノード２０３および
ディスクアレイＩ／Ｆノード２０２に、ディスクアレイ
サブセット１０の構成情報を配信する。In the disk array switch 20, the communication controller 204 acquires the setting information, and the MP 200 sets configuration information such as address space information of each disk array subset 10. The MP 200 distributes the configuration information of the disk array subset 10 to each host I / F node 203 and the disk array I / F node 202 via the crossbar switch 201.

【００４７】各ノード２０３、および２０２はこの情報
を受信すると、ＳＰ２０２１により構成情報をＤＣＴ２
０２７に格納する。ディスクアレイサブセット１０で
は、ディスクアレイサブセット構成管理手段１０６が、
設定情報を獲得し、共有メモリ１０２に格納する。各上
位ＭＰＵ１０１０および下位ＭＰＵ１０３０は、共有メ
モリ１０２上の設定情報を参照し、各々の構成管理を実
施する。When each of the nodes 203 and 202 receives this information, the SP 2021 converts the configuration information into the DCT2
027. In the disk array subset 10, the disk array subset configuration management means 106
The setting information is acquired and stored in the shared memory 102. Each of the upper MPU 1010 and the lower MPU 1030 refers to the setting information on the shared memory 102 and performs the respective configuration management.

【００４８】以下では、ホスト“＃２”がディスクアレ
イシステム1に対し、リードコマンドを発行した場合の
動作を説明する。図１１に、ホストからのリード動作時
にファイバチャネルを通して転送されるフレームのシー
ケンスを示す模式図を、図１３にこのときのディスクア
レイスイッチのホストＩ／Ｆノード２０３における動作
のフローチャートを示す。The operation when the host “# 2” issues a read command to the disk array system 1 will be described below. FIG. 11 is a schematic diagram showing a sequence of a frame transferred through the fiber channel at the time of a read operation from the host, and FIG. 13 is a flowchart of the operation in the host I / F node 203 of the disk array switch at this time.

【００４９】なお、以下の説明では、ホスト“＃２”
が、図１２における記憶領域Ａ１００１をアクセスする
ことを仮定する。記憶領域Ａ１００１に対応する実際の
記憶領域Ａ″は、ディスクアレイサブセット“＃０”の
ＬＵＮ＝０のＬＵを構成するディスクユニット＃２のア
ドレス空間内に存在するものとする。また、アドレス空
間１０００を構成するＬＵを定義しているホストＬＵ構
成テーブル20271のLU Typeには「ＣＬＵ」が、CLU Clas
sには「Joined」が設定されているものとする。In the following description, the host "# 2"
Access the storage area A1001 in FIG. The actual storage area A "corresponding to the storage area A1001 is assumed to exist in the address space of the disk unit # 2 constituting the LU of LUN = 0 of the disk array subset"# 0 ". In the LU type of the host LU configuration table 20271 that defines the LUs constituting "CLU", "CLU" is
It is assumed that "Joined" is set in s.

【００５０】データのリード時、ホスト３０は、リード
コマンドを格納したコマンドフレーム「FCP_CMD」をデ
ィスクアレイスイッチ２０に発行する（図１１矢印
（ａ））。ディスクアレイスイッチ２０のホストＩ／Ｆ
ノード“＃２”は、ＩＣ２０２３によりホストＩ／Ｆ３
１経由でコマンドフレーム「FCP_CMD」を受信する（ス
テップ20001）。ＩＣ２０２３は、ＳＣ２０２２にコマ
ンドフレームを転送する。ＳＣ２０２２は、受け取った
コマンドフレームを一旦ＦＢ２０２５に格納する。この
際、ＳＣ２０２２は、コマンドフレームのＣＲＣを計算
し、受信情報が正しいことを検査する。ＣＲＣの検査に
誤りがあれば、ＳＣ２０２２は、その旨をＩＣ２０２３
に通知する。ＩＣ２０２３は、誤りの通知をＳＣ２０２
２から受けると、ホストＩ／Ｆ３１を介してホスト３０
にＣＲＣエラーを報告する。（ステップ20002）。When reading data, the host 30 issues a command frame “FCP_CMD” storing a read command to the disk array switch 20 (arrow (a) in FIG. 11). Host I / F of disk array switch 20
The node “# 2” is connected to the host I / F 3 by the IC 2023.
Then, a command frame “FCP_CMD” is received via Step 1 (Step 20001). The IC 2023 transfers the command frame to the SC 2022. The SC 2022 temporarily stores the received command frame in the FB 2025. At this time, the SC 2022 calculates the CRC of the command frame and checks that the received information is correct. If there is an error in the CRC check, the SC 2022 notifies the IC 2023 accordingly.
Notify. The IC 2023 sends the error notification to the SC 202
2 from the host 30 via the host I / F 31
To report a CRC error. (Step 20002).

【００５１】ＣＲＣが正しい場合、ＳＣ２０２２は、Ｆ
Ｂ２０２５に保持したフレームをリードし、それがコマ
ンドフレームであることを認識してフレームヘッダ４０
１を解析する（ステップ20003）。そして、ＳＣ２０２
２は、ＳＰ２０２１に指示し、S_ID、D_ID、OX_ID等の
エクスチェンジ情報をＥＴ２０２６に登録する（ステッ
プ20004）。If the CRC is correct, SC 2022 returns F
The frame held in B2025 is read, and it is recognized that the frame is a command frame.
1 is analyzed (step 20003). And SC202
2 instructs the SP 2021 to register exchange information such as S_ID, D_ID, and OX_ID in the ET 2026 (step 20004).

【００５２】次に、ＳＣ２０２２は、フレームペイロー
ド４０２を解析し、ホスト３０により指定されたＬＵＮ
およびＣＤＢを取得する（ステップ20005）。ＳＰ２０
２１は、ＳＣ２０２２の指示により、ＤＣＴ２０２７を
検索し、ディスクアレイサブセット１０の構成情報を得
る。具体的には、ＳＰ２０２１は、ホストＬＵ構成テー
ブル20271を検索し、受信したフレームペイロード４０
２に格納されたＬＵＮと一致するHost-LU No.を有する
情報を見つける。ＳＰ２０２１は、LU Type、CLU Class
に設定された情報からホストＬＵの構成を認識し、LU I
nfo.に保持されている情報に基づきアクセスすべきディ
スクサブセット１０とその中のＬＵのＬＵＮ、及びこの
ＬＵ内でのＬＢＡを判別する。次に、ＳＰ２０２１は、
サブセット構成テーブル202720のＬＵ構成テーブル2027
40を参照し、目的のディスクアレイサブセット１０の接
続ポートを確認し、ディスクアレイＩ／Ｆノード構成テ
ーブル20272からそのポートに接続するディスクアレイ
Ｉ／Ｆノード２０２のノードNo.を得る。ＳＰ２０２１
は、このようにして得たディスクアレイサブセット１０
を識別する番号、ＬＵＮ、ＬＢＡ等の変換情報をＳＣ２
０２２に報告する。（ステップ20006）。Next, the SC 2022 analyzes the frame payload 402 and checks the LUN specified by the host 30.
And a CDB (step 20005). SP20
21 searches the DCT 2027 according to the instruction of the SC 2022, and obtains the configuration information of the disk array subset 10. Specifically, the SP 2021 searches the host LU configuration table 20271, and receives the received frame payload 40
2. Find the information having the Host-LU No. that matches the LUN stored in 2. SP2021 is LU Type, CLU Class
Recognizes the configuration of the host LU from the information set in the
Based on the information held in the nfo., the disk subset 10 to be accessed, the LUN of the LU in the disk subset 10, and the LBA in this LU are determined. Next, SP 2021
LU configuration table 2027 of subset configuration table 202720
With reference to 40, the connection port of the target disk array subset 10 is confirmed, and the node number of the disk array I / F node 202 connected to the port is obtained from the disk array I / F node configuration table 20272. SP2021
Is the disk array subset 10 thus obtained.
The conversion information such as a number for identifying the
022. (Step 20006).

【００５３】次に、ＳＣ２０２２は、獲得した変換情報
を使用しフレームペイロード４０２のＬＵＮとＣＤＢの
なかのＬＢＡを変換する。また、フレームヘッダ４０１
のD_IDを対応するディスクアレイサブセット１０のホス
トＩ／Ｆコントローラ１０１１のD_IDに変換する。な
お、この時点ではS_IDは書き換えない（ステップ２００
０７）。Next, the SC 2022 uses the obtained conversion information to convert the LUN of the frame payload 402 and the LBA in the CDB. Also, the frame header 401
Is converted into the D_ID of the host I / F controller 1011 of the corresponding disk array subset 10. At this point, the S_ID is not rewritten (step 200).
07).

【００５４】ＳＣ２０２２は、変換後のコマンドフレー
ムと、対象ディスクアレイサブセット１０に接続するデ
ィスクアレイＩ／Ｆノード番号を、ＳＰＧ２０２４に転
送する。ＳＰＧ２０２４は、受け取った変換後のコマン
ドフレームに対し、図１４に示すような簡単な拡張ヘッ
ダ６０１を付加したパケットを生成する。このパケット
をスイッチングパケット（ＳＰａｃｋｅｔ）６０と呼
ぶ。S Packet６０の拡張ヘッダ６０１には、転送元（自
ノード）番号、転送先ノード番号、及び転送長が付加含
まれる。ＳＰＧ２０２４は、生成したS Packet６０をク
ロスバスイッチ２０１に送信する（ステップ20008）。The SC 2022 transfers the converted command frame and the disk array I / F node number connected to the target disk array subset 10 to the SPG 2024. The SPG 2024 generates a packet in which a simple extension header 601 as shown in FIG. 14 is added to the received converted command frame. This packet is called a switching packet (S Packet) 60. The extension header 601 of the S Packet 60 additionally includes a transfer source (own node) number, a transfer destination node number, and a transfer length. The SPG 2024 transmits the generated S Packet 60 to the crossbar switch 201 (Step 20008).

【００５５】クロスバスイッチ２０１は、ホストＩ／Ｆ
ノード“＃２”と接続するＳＷＰ２０１０によりS Pack
et６０を受信する。ＳＷＰ２０１０は、S Packet６０の
拡張ヘッダ６０１を参照し、転送先のノードが接続する
ＳＷＰへのスイッチ制御を行って経路を確立し、S Pack
et６０を転送先のディスクアレイＩ／Ｆノード２０２
（ここでは、ディスクアレイＩ／Ｆノード“＃０”）に
転送する。ＳＷＰ２０１０は、経路の確立をS Packet６
０の受信の度に実施し、S Packet６０の転送が終了した
ら、その経路を解放する。ディスクアレイＩ／Ｆノード
“＃０”では、ＳＰＧ２０２４がS Packet６０を受信
し、拡張ヘッダ６０１を外してコマンドフレームの部分
をＳＣ２０２２に渡す。The crossbar switch 201 is connected to the host I / F
SPack by SWP2010 connected to node "# 2"
Receive et60. The SWP 2010 refers to the extension header 601 of the SPacket 60, performs switch control to the SWP to which the transfer destination node connects, establishes a path, and
et60 is the destination disk array I / F node 202
(Here, the data is transferred to the disk array I / F node “# 0”). The SWP 2010 determines that a route has been
The process is performed every time the packet is received, and when the transfer of the S Packet 60 is completed, the route is released. In the disk array I / F node “# 0”, the SPG 2024 receives the S Packet 60, removes the extension header 601 and passes the command frame portion to the SC 2022.

【００５６】ＳＣ２０２２は、受け取ったコマンドフレ
ームのフレームヘッダのS_IDに自分のＩＤを書き込む。
次にＳＣ２０２２は、ＳＰ２０２１に対し、コマンドフ
レームのS_ID、D_ID、OX_ID等のエクスチェンジ情報、
及びフレーム転送元ホストＩ／Ｆノード番号をＥＴ２０
２６に登録するよう指示し、ＩＣ２０２３にコマンドフ
レームを転送する。ＩＣ２０２３は、フレームヘッダ４
０１の情報に従い、接続するディスクアレイサブセット
１０（ここでは、ディスクアレイサブセット“＃０”）
にコマンドフレームを転送する（図１１矢印（ｂ））。The SC 2022 writes its own ID in the S_ID of the frame header of the received command frame.
Next, the SC 2022 gives the SP 2021 exchange information such as S_ID, D_ID, and OX_ID of the command frame,
And the frame transfer source host I / F node number as ET20
26, and the command frame is transferred to the IC 2023. The IC 2023 has a frame header 4
01, the disk array subset 10 to be connected (here, the disk array subset “# 0”)
(FIG. 11 arrow (b)).

【００５７】ディスクアレイサブセット“＃０”は、変
換後のコマンドフレーム「FCP_CMD」をディスクアレイ
Ｉ／Ｆコントローラ１０１１で受信する。上位ＭＰＵ１
０１０は、コマンドフレームのフレームペイロード４０
２に格納されたＬＵＮとＣＤＢを取得し、指定された論
理ユニットのＬＢＡからＬＥＮ長のデータをリードする
コマンドであると認識する。In the disk array subset “# 0”, the converted command frame “FCP_CMD” is received by the disk array I / F controller 1011. Upper MPU1
010 is the frame payload 40 of the command frame.
2. The LUN and CDB stored in No. 2 are acquired, and the command is read to read LEN-length data from the LBA of the specified logical unit.

【００５８】上位ＭＰＵ１０１０は、共有メモリ１０２
に格納されたキャッシュ管理情報を参照し、キャッシュ
ヒットミス／ヒット判定を行う。ヒットすればキャッシ
ュ１０２からデータ転送を実施する。ミスの場合、ディ
スクユニットからデータをリードする必要があるので、
ＲＡＩＤ５の構成に基づくアドレス変換を実施し、キャ
ッシュ空間を確保する。そして、ディスクユニット２か
らのリード処理に必要な処理情報を生成し、下位ＭＰＵ
１０３０に処理を引き継ぐべく、共有メモリ１０２に処
理情報を格納する。The upper MPU 1010 has the shared memory 102
The cache hit miss / hit is determined by referring to the cache management information stored in. If there is a hit, data transfer from the cache 102 is performed. In the case of a mistake, it is necessary to read data from the disk unit,
The address conversion based on the configuration of RAID5 is performed to secure the cache space. Then, it generates the processing information necessary for the read processing from the disk unit 2 and
Processing information is stored in the shared memory 102 so that the processing can be taken over to 1030.

【００５９】下位ＭＰＵ１０３０は、共有メモリ１０２
に処理情報が格納されたことを契機に処理を開始する。
下位ＭＰＵ１０３０は、適切なディスクＩ／Ｆコントロ
ーラ１０３１を特定し、ディスクユニット２へのリード
コマンドを生成して、ディスクＩ／Ｆコントローラ１０
３１にコマンドを発行する。ディスクＩ／Ｆコントロー
ラ１０３１は、ディスクユニット2からリードしたデー
タをキャッシュ１０２の指定されたアドレスに格納して
下位ＭＰＵ１０３０に終了報告を通知する。下位ＭＰＵ
１０３０は、処理が正しく終了したことを上位ＭＰＵ１
０１０に通知すべく共有メモリ１０２に処理終了情報を
格納する。The lower MPU 1030 has the shared memory 102
The processing is started when the processing information is stored in the.
The lower MPU 1030 specifies the appropriate disk I / F controller 1031, generates a read command to the disk unit 2, and
31 is issued. The disk I / F controller 1031 stores the data read from the disk unit 2 at the specified address of the cache 102, and notifies the lower MPU 1030 of the end report. Lower MPU
1030, the upper MPU 1 indicates that the processing has been completed correctly.
010, the processing end information is stored in the shared memory 102.

【００６０】上位ＭＰＵ１０１０は、共有メモリ１０２
に処理終了情報が格納されたことを契機に処理を再開
し、ディスクアレイＩ／Ｆコントローラ１０１１にリー
ドデータ準備完了を通知する。ディスクアレイＩ／Ｆコ
ントローラ１０１１は、ディスクアレイスイッチ２０の
当該ディスクアレイＩ／Ｆノード“＃０”に対し、ファ
イバチャネルにおけるデータ転送準備完了フレームであ
る「FCP_XFER_RDY」を発行する（図１１矢印（ｃ））。The upper MPU 1010 has the shared memory 102
When the processing end information is stored in the disk array I / F, the processing is restarted and the disk array I / F controller 1011 is notified of read data preparation completion. The disk array I / F controller 1011 issues “FCP_XFER_RDY”, which is a data transfer preparation completion frame in the fiber channel, to the disk array I / F node “# 0” of the disk array switch 20 (see arrow (c) in FIG. 11). )).

【００６１】ディスクアレイＩ／Ｆノード“＃０”で
は、データ転送準備完了フレーム「FCP_XFER_RDY」を受
信すると、ＳＣ２０２２が、ディスクアレイサブセット
２０から受信した応答先エクスチェンジＩＤ（RX_ID）
を獲得し、S_ID、D_ID、OX_IDを指定して、ＳＰ２０２
１に指示しＥＴ２０２６の当該エクスチェンジ情報にRX
_IDを登録する。ＳＣ２０２２は、データ転送準備完了
フレームの転送先（コマンドフレームの転送元）のホス
トＩ／Ｆノード番号を獲得する。ＳＣ２０２２は、この
フレームのS_IDを無効化し、ＳＰＧ２０２４に転送す
る。ＳＰＧ２０２４は、先に述べたようにしてS Packet
を生成し、クロスバスイッチ２０１経由で対象ホストＩ
／Ｆノード“＃２”に転送する。When the disk array I / F node “# 0” receives the data transfer preparation completion frame “FCP_XFER_RDY”, the SC 2022 sends the response destination exchange ID (RX_ID) received from the disk array subset 20.
, And specify S_ID, D_ID, OX_ID, and SP202
1 and RX to the relevant exchange information of ET2026
Register _ID. The SC 2022 acquires the host I / F node number of the transfer destination of the data transfer preparation completion frame (the transfer source of the command frame). The SC 2022 invalidates the S_ID of this frame and transfers it to the SPG 2024. The SPG 2024 uses the S Packet as described above.
To the target host I via the crossbar switch 201.
/ F node "# 2".

【００６２】ホストＩ／Ｆノード“＃２”では、ＳＰＧ
２０２４がデータ転送準備完了フレームのS Packetを受
信すると、S Packetの拡張ヘッダを外し「FCP_XFER_RD
Y」を再生してＳＣ２０２２に渡す（ステップ20011）。
ＳＣ２０２２は、ＳＰ２０２１に指示しＥＴ２０２６を
サーチして該当するエクスチェンジを特定する（ステッ
プ20012）。In the host I / F node “# 2”, the SPG
When the data transfer preparation frame 2024 receives the S Packet of the data transfer preparation completion frame, the extension header of the S Packet is removed and “FCP_XFER_RD
"Y" is reproduced and transferred to the SC 2022 (step 20011).
The SC 2022 instructs the SP 2021 to search for the ET 2026 and specify the corresponding exchange (step 20012).

【００６３】次に、ＳＣ２０２２は、フレームが「FCP_
XFER_RDY」であるかどうか調べ（ステップ20013）、「F
CP_XFER_EDY」であれば、ＥＴ２０２６の応答先エクス
チェンジＩＤ（RX_ID）の更新をＳＰ２０２１に指示す
る。応答先エクスチェンジＩＤとしては、このフレーム
に付加されていた値が使用される（ステップ20014）。
そして、ＳＣ２０２２は、フレームヘッダ４０１のS_I
D、D_IDをホストＩ／Ｆノード２０３のＩＤとホスト３
０のＩＤを用いた適切な値に変換する（ステップ2001
5）。これらの処理によりフレームヘッダ４０１は、ホ
スト“＃２”に対するフレームに変換される。ＩＣ２０
２３は、ホスト“＃２”に対し、このデータ転送準備完
了フレーム「FCP_XFER_RDY」を発行する（図１１の矢印
（ｄ）：ステップ20016）。Next, the SC 2022 determines that the frame is “FCP_
XFER_RDY ”(step 20003), and
If “CP_XFER_EDY”, it instructs the SP 2021 to update the response destination exchange ID (RX_ID) of the ET 2026. As the response destination exchange ID, the value added to this frame is used (step 20004).
Then, the SC 2022 determines the S_I of the frame header 401.
D and D_ID are the ID of the host I / F node 203 and the host 3
Is converted to an appropriate value using an ID of 0 (step 2001).
Five). Through these processes, the frame header 401 is converted into a frame for the host “# 2”. IC20
23 issues the data transfer preparation completion frame “FCP_XFER_RDY” to the host “# 2” (arrow (d) in FIG. 11: step 20016).

【００６４】ディスクアレイサブセット“＃０”のディ
スクアレイＩ／Ｆコントローラ１０１１は、データ転送
を行うため、データフレーム「FCP_DATA」を生成し、デ
ィスクアレイスイッチ２０に転送する（図１１矢印
（ｅ））。フレームペイロードの転送長には制限がある
ため、１フレームで転送できる最大のデータ長は２ＫＢ
である。データ長がこれを越える場合は、必要数だけデ
ータフレームを生成し発行する。すべてのデータフレー
ムには同一のSEQ_IDが割り当てられる。データフレーム
の発行は、同一のSEQ_IDに対し複数のフレームが生成さ
れることを除き（すなわちSEQ_CNTが変化する）、デー
タ転送準備完了フレームの場合と同様である。The disk array I / F controller 1011 of the disk array subset “# 0” generates a data frame “FCP_DATA” for data transfer and transfers it to the disk array switch 20 (arrow (e) in FIG. 11). . Since the transfer length of the frame payload is limited, the maximum data length that can be transferred in one frame is 2 KB
It is. If the data length exceeds this, a required number of data frames are generated and issued. The same SEQ_ID is assigned to all data frames. Issuing a data frame is similar to issuing a data transfer ready frame, except that multiple frames are generated for the same SEQ_ID (ie, SEQ_CNT changes).

【００６５】ディスクアレイスイッチ２０は、データ転
送準備完了フレームの処理と同様に、データフレーム
「FCP_DATA」のフレームヘッダ４０１の変換を実施す
る。ただし、データフレームの転送の場合、RX_IDが既
に確立されているので、データ転送準備完了フレームの
処理におけるステップ20014の処理はスキップされる。
フレームヘッダ４０１の変換後、ディスクアレイスイッ
チ２０は、ホスト“＃２”にデータフレームを転送する
（図１１矢印（ｆ））。The disk array switch 20 converts the frame header 401 of the data frame “FCP_DATA” in the same manner as the processing of the data transfer preparation completion frame. However, in the case of transferring a data frame, since the RX_ID has already been established, the process of step 20014 in the process of the data transfer preparation completion frame is skipped.
After the conversion of the frame header 401, the disk array switch 20 transfers the data frame to the host “# 2” (arrow (f) in FIG. 11).

【００６６】次に、ディスクアレイサブセット“＃０”
のディスクアレイＩ／Ｆコントローラ１０１１は、終了
ステータス転送を行うため、ステータスフレーム「FCP_
RSP」を生成し、ディスクアレイスイッチ２０に対し発
行する（図１１矢印（ｇ））。ディスクアレイスイッチ
２０では、データ転送準備完了フレームの処理と同様
に、ＳＰＧ２０２４がS Packetから拡張ヘッダを外し
「FCP_RSP」ステータスフレームを再現し（ステップ200
21）、ＳＰ２０２１によりＥＴ２０２６を検索しエクス
チェンジ情報を獲得する（ステップ20022）。ＳＣ２０
２２は、その情報に基づきフレームを変換する（ステッ
プの20023）。変換されたフレームは、ＩＣ２０２３に
よりホスト“＃２”に転送される（図１１矢印（ｈ）：
ステップ20024）。最後にＳＰ２０２１は、ＥＴ２０２
６からエクスチェンジ情報を削除する（ステップ2002
5）。Next, the disk array subset "# 0"
The disk array I / F controller 1011 of the first embodiment transmits the end status by transmitting the status frame “FCP_
RSP ”is generated and issued to the disk array switch 20 (arrow (g) in FIG. 11). In the disk array switch 20, the SPG 2024 removes the extension header from the S Packet and reproduces the "FCP_RSP" status frame in the same manner as the processing of the data transfer preparation completion frame (step 200).
21), the ET 2026 is searched by the SP 2021 to obtain exchange information (step 20022). SC20
22 converts the frame based on the information (step 20023). The converted frame is transferred to the host “# 2” by the IC 2023 (arrow (h) in FIG. 11):
Step 20004). Finally, SP2021 is the ET202
Exchange information is deleted from Step 6 (Step 2002)
Five).

【００６７】以上のようにしてディスクアレイからのリ
ード処理が行われる。ディスクアレイシステム１に対す
るライト処理についてもデータフレームの転送方向が逆
転するのみで、上述したリード処理と同様の処理が行わ
れる。The read processing from the disk array is performed as described above. In the write processing for the disk array system 1, the same processing as the above-described read processing is performed only by reversing the data frame transfer direction.

【００６８】図３に示したように、ディスクアレイスイ
ッチ２０は、クロスバスイッチ２０１にクラスタ間Ｉ／
Ｆ２０４０を備えている。図１に示したシステム構成で
は、クラスタ間Ｉ／Ｆ２０４０は使用されていない。本
実施形態のディスクアレイスイッチ２０は、クラスタ間
Ｉ／Ｆ２０４０を利用して図１５に示すように、他のデ
ィスクアレイスイッチと相互に接続されることができ
る。As shown in FIG. 3, the disk array switch 20 controls the inter-cluster I /
F2040. In the system configuration shown in FIG. 1, the inter-cluster I / F 2040 is not used. The disk array switch 20 of this embodiment can be mutually connected to other disk array switches by using the inter-cluster I / F 2040 as shown in FIG.

【００６９】本実施形態におけるディスクアレイスイッ
チ２０単独では、ホスト３０とディスクアレイサブセッ
ト１０を合計８台までしか接続できないが、クラスタ間
Ｉ／Ｆ２０４０を利用して複数のディスクアレイスイッ
チを相互接続し、接続できるホスト１０とディスクアレ
イの数を増やすことができる。例えば、図１５に示すシ
ステムでは、４台のディスクアレイスイッチ２０を使っ
てホスト３０とディスクアレイサブセット１０を合計３
２台まで接続でき、これらの間で相互にデータ転送が可
能になる。The disk array switch 20 alone in this embodiment can connect only up to eight hosts 30 and the disk array subsets 10 in total. However, a plurality of disk array switches are interconnected by using the inter-cluster I / F 2040. The number of connectable hosts 10 and disk arrays can be increased. For example, in the system shown in FIG. 15, the host 30 and the disk array subset 10 are
Up to two units can be connected, and data can be transferred between them.

【００７０】このように、本実施形態では、ディスク容
量や性能の必要性に合わせて、ディスクアレイサブセッ
トやホストの接続台数を増加していくことができる。ま
た、必要な転送帯域分のホストＩ／Ｆを用いてホスト−
ディスクアレイシステム間を接続することができるの
で、容量、性能、接続台数の拡張性を大幅に向上させる
ことができる。As described above, in this embodiment, the number of disk array subsets and the number of connected hosts can be increased in accordance with the necessity of disk capacity and performance. Also, the host-I / F for the necessary transfer band is used to
Since the disk array systems can be connected, the scalability of capacity, performance, and the number of connected devices can be greatly improved.

【００７１】以上説明した実施形態によれば、１台のデ
ィスクアレイサブセットの性能が、内部のＭＰＵや内部
バスで制限されたとしても、複数のディスクアレイサブ
セットを用いて、ディスクアレイスイッチによりホスト
とディスクアレイサブセット間を相互接続することがで
きる。これにより、ディスクアレイシステムトータルと
して高い性能を実現することができる。ディスクアレイ
サブセットの性能が比較的低いものであっても、複数の
ディスクアレイサブセットを用いることで高性能化を実
現できる。したがって、低コストのディスクアレイサブ
セットをコンピュータシステムの規模に合わせて必要な
台数だけ接続することができ、規模に応じた適切なコス
トでディスクアレイシステムを構築することが可能とな
る。According to the above-described embodiment, even if the performance of one disk array subset is limited by the internal MPU and the internal bus, the host can be connected to the host by the disk array switch using a plurality of disk array subsets. Interconnections between the disk array subsets can be made. Thereby, high performance can be realized as a whole of the disk array system. Even if the performance of the disk array subset is relatively low, high performance can be realized by using a plurality of disk array subsets. Therefore, a required number of low-cost disk array subsets can be connected according to the scale of the computer system, and a disk array system can be constructed at an appropriate cost according to the scale.

【００７２】また、ディスク容量の増大や性能の向上が
必要になったときは、ディスクアレイサブセットを必要
なだけ追加すればよい。さらに、複数のディスクアレイ
スイッチを用いて任意の数のホスト及びディスクアレイ
サブセットを接続できるので、容量、性能、接続台数の
いずれをも大幅に向上させることができ、高い拡張性を
有するシステムが実現できる。When it is necessary to increase the disk capacity and the performance, it is sufficient to add as many disk array subsets as necessary. Furthermore, since any number of hosts and disk array subsets can be connected using multiple disk array switches, capacity, performance, and the number of connected units can be significantly improved, realizing a highly scalable system. it can.

【００７３】さらにまた、本実施形態によれば、ディス
クアレイサブセットとして、従来のディスクアレイシス
テムそのものの縮小機を用いることができるので、既に
開発した大規模な制御ソフトウェア資産をそのまま利用
でき、開発コストの低減と開発期間の短縮を実現するこ
とができる。Furthermore, according to the present embodiment, since a reduction device of the conventional disk array system itself can be used as a disk array subset, a large-scale control software asset already developed can be used as it is, and the development cost can be reduced. And development time can be reduced.

【００７４】［第２実施形態］図１６は、本発明の第２
の実施形態におけるコンピュータシステムの構成図であ
る。本実施形態は、ディスクアレイスイッチのホストＩ
／Ｆノードにおいて、フレームヘッダ４０１のみを変換
し、フレームペイロード４０２は操作しない点、及び、
ディスクアレイスイッチ、ホストＩ／Ｆ、ディスクアレ
イＩ／Ｆが二重化されていない点で第１実施形態と構成
上相違する。したがって、各部の構成は、第１実施形態
と大きく変わるところがなく、その詳細については説明
を省略する。[Second Embodiment] FIG. 16 shows a second embodiment of the present invention.
FIG. 3 is a configuration diagram of a computer system according to the embodiment. In this embodiment, the host I of the disk array switch
/ F node converts only the frame header 401 and does not manipulate the frame payload 402, and
The configuration differs from the first embodiment in that the disk array switch, host I / F, and disk array I / F are not duplicated. Therefore, the configuration of each unit is not largely different from that of the first embodiment, and the details will not be described.

【００７５】図１６において、各ディスクアレイサブセ
ット１０は、複数の論理ユニット（ＬＵ）１１０で構成
されている。各ＬＵ１１０は、独立ＬＵとして構成され
る。一般に、各ディスクアレイサブセット１０内のＬＵ
１１０に割り当てられるＬＵＮは、０から始まる連続番
号である。このため、ホスト３０に対して、ディスクア
レイシステム1内のすべてのＬＵ１１０のＬＵＮを連続
的に見せる場合には、第１実施形態と同様に、フレーム
ペイロード４０２のＬＵＮフィールドを変換する必要が
ある。本実施形態では、各ディスクアレイサブセット１
０のＬＵＮをそのままホスト３０に見せることで、フレ
ームペイロード４０２の変換を不要とし、ディスクアレ
イスイッチの制御を簡単なものとしている。In FIG. 16, each disk array subset 10 is composed of a plurality of logical units (LUs) 110. Each LU 110 is configured as an independent LU. Generally, LUs in each disk array subset 10
LUNs assigned to 110 are consecutive numbers starting from 0. Therefore, when the LUNs of all the LUs 110 in the disk array system 1 are continuously displayed to the host 30, it is necessary to convert the LUN field of the frame payload 402, as in the first embodiment. In the present embodiment, each disk array subset 1
By showing the LUN of 0 to the host 30 as it is, the conversion of the frame payload 402 becomes unnecessary and the control of the disk array switch is simplified.

【００７６】本実施形態のディスクアレイスイッチ２０
は、ホストＩ／Ｆノード２０３ごとに特定のディスクア
レイサブセット１０をアクセスできるものと仮定する。
この場合、一つのホストＩ／Ｆ３１を使うと、１台のデ
ィスクアレイサブセット１０にあるＬＵ１１０のみがア
クセス可能である。１台のホストから複数のディスクア
レイサブセット１０のＬＵ１１０をアクセスしたい場合
には、そのホストを複数のホストＩ／Ｆノード２０３に
接続する。また、複数のホスト３０から１台のディスク
アレイサブセット１０のＬＵ１１０をアクセスできるよ
うにする場合は、同一のホストＩ／Ｆノード２０３にル
ープトポロジーや、ファブリックトポロジー等を用い、
複数のホスト３０を接続する。このように構成すると、
１台のホスト３０から１つのＬＵ１１０をアクセスする
際に、ホストＩ／Ｆノード２０３のD_ID毎にディスクア
レイサブセット１０が確定することになるため、各ＬＵ
のＬＵＮをそのままホスト３０に見せることが可能であ
る。The disk array switch 20 of this embodiment
Assume that a particular disk array subset 10 can be accessed for each host I / F node 203.
In this case, if one host I / F 31 is used, only the LU 110 in one disk array subset 10 can be accessed. When a single host wants to access the LUs 110 of a plurality of disk array subsets 10, the host is connected to a plurality of host I / F nodes 203. When a plurality of hosts 30 can access the LU 110 of one disk array subset 10, a loop topology, a fabric topology, or the like is used for the same host I / F node 203.
A plurality of hosts 30 are connected. With this configuration,
When one LU 110 is accessed from one host 30, the disk array subset 10 is determined for each D_ID of the host I / F node 203.
Can be shown to the host 30 as it is.

【００７７】本実施形態では、上述した理由により、ホ
スト３０に、各ディスクアレイサブセット１０内のＬＵ
１１０のＬＵＮをそのままホスト３０に見せているた
め、ディスクアレイスイッチ２０におけるＬＵＮの変換
は不要となる。このため、ディスクアレイスイッチ２０
は、ホスト３０からフレームを受信すると、フレームヘ
ッダ４０１のみを第１実施例と同様にして変換し、フレ
ームペイロード４０２は変換せずにディスクアレイサブ
セット１０に転送する。本実施形態における各部の動作
は、フレームペイロード４０２の変換が行われないこと
を除くと第１実施形態と同様であるので、ここでは詳細
な説明を省略する。本実施形態によれば、ディスクアレ
イスイッチ２０の開発を容易にできる。In the present embodiment, the LU in each disk array subset 10 is
Since the LUN of 110 is shown to the host 30 as it is, the conversion of the LUN in the disk array switch 20 becomes unnecessary. Therefore, the disk array switch 20
Receives the frame from the host 30, converts only the frame header 401 in the same manner as in the first embodiment, and transfers the frame payload 402 to the disk array subset 10 without conversion. The operation of each unit in the present embodiment is the same as that of the first embodiment except that the conversion of the frame payload 402 is not performed, and thus detailed description is omitted here. According to the present embodiment, the development of the disk array switch 20 can be facilitated.

【００７８】［第３実施形態］第２実施形態では、ディ
スクアレイスイッチのホストＩ／Ｆノードにおいて、フ
レームヘッダのみを変換しているが、以下に説明する第
３実施形態ではフレームヘッダも含め、フレームの変換
を行わない形態について説明する。本実施形態のコンピ
ュータシステムは、図１に示す第１実施形態におけるコ
ンピュータシステムと同様に構成される。[Third Embodiment] In the second embodiment, only the frame header is converted in the host I / F node of the disk array switch. However, in the third embodiment described below, including the frame header, A mode in which frame conversion is not performed will be described. The computer system of the present embodiment has the same configuration as the computer system of the first embodiment shown in FIG.

【００７９】第１、および第２実施形態では、ホスト３
０に対し、ディスクアレイサブセット１０の台数や、Ｌ
Ｕ１１０の構成等、ディスクアレイシステム1の内部構
成を隠蔽している。このため、ホスト３０からはディス
クアレイシステム1が全体で１つの記憶装置として見え
る。これに対し、本実施形態では、ディスクアレイサブ
セット１０をそのままホスト３０に公開し、ホスト３０
がフレームヘッダのD_IDとして直接ディスクアレイサブ
セットのポートのＩＤを使えるようにする。これによ
り、ディスクアレイスイッチは、フレームヘッダの情報
に従ってフレームの転送を制御するだけで済み、従来技
術におけるファイバチャネルのファブリック装置と同等
のスイッチ装置をディスクアレイスイッチ２０に替えて
利用することができる。In the first and second embodiments, the host 3
0, the number of disk array subsets 10 and L
The internal configuration of the disk array system 1 such as the configuration of U110 is hidden. Therefore, the disk array system 1 appears to the host 30 as one storage device as a whole. In contrast, in the present embodiment, the disk array subset 10 is disclosed to the host 30 as it is,
Can use the ID of the port of the disk array subset directly as the D_ID of the frame header. As a result, the disk array switch only needs to control the transfer of the frame according to the information of the frame header, and can use a switch device equivalent to the fiber channel fabric device in the related art in place of the disk array switch 20.

【００８０】ディスクアレイシステム構成管理手段７０
は、ディスクアレイサブセット１０の通信コントローラ
１０６、及びディスクアレイスイッチ２０の通信手段２
０４と通信して各ディスクアレイサブセット１０及びデ
ィスクアレイスイッチ２０の構成情報を獲得し、あるい
は、設定する。Disk array system configuration management means 70
Are the communication controller 106 of the disk array subset 10 and the communication means 2 of the disk array switch 20
04 to obtain or set the configuration information of each disk array subset 10 and disk array switch 20.

【００８１】ディスクアレイスイッチ２０は、基本的に
は図３に示す第１実施形態におけるディスクアレイスイ
ッチと同様の構成を有する。しかし、本実施形態では、
ホスト３０が発行するフレームのフレームヘッダの情報
をそのまま使ってフレームの転送を制御するため、第１
実施形態、あるいは第２実施形態でディスクアレイスイ
ッチ２０のホストＩ／Ｆノード２０３、ディスクアレイ
Ｉ／Ｆノード２０２が有するＤＣＴ２０２７や、ＳＣ２
０２２、ＳＰＧ２０２４等により実現されるフレームヘ
ッダ等の変換の機能は不要となる。ディスクアレイスイ
ッチ２０が有するクロスバスイッチ２０１は、フレーム
ヘッダの情報に従ってホストＩ／Ｆノード２０３、及び
ディスクアレイＩ／Ｆノード２０２の間でファイバチャ
ネルのフレームの転送を行う。The disk array switch 20 has basically the same configuration as the disk array switch in the first embodiment shown in FIG. However, in this embodiment,
In order to control the transfer of the frame using the information of the frame header of the frame issued by the host 30 as it is, the first
In the embodiment or the second embodiment, the host I / F node 203 of the disk array switch 20, the DCT 2027 of the disk array I / F node 202, or the SC2
022, the function of converting a frame header or the like realized by the SPG2024 or the like is not required. The crossbar switch 201 of the disk array switch 20 transfers a fiber channel frame between the host I / F node 203 and the disk array I / F node 202 according to the information of the frame header.

【００８２】本実施形態では、ディスクアレイシステム
の構成をディスクアレイシステム構成管理手段７０で一
括して管理するために、ディスクアレイ管理用テーブル
（以下、このテーブルもＤＣＴと呼ぶ）をディスクアレ
イシステム構成管理手段７０に備える。ディスクアレイ
システム構成管理手段７０が備えるＤＣＴは、図６、７
に示す、システム構成テーブル20270とサブセット構成
テーブル202720〜202723の２つのテーブル群を含む。な
お、本実施形態では、ホストＬＵは全てＩＬＵとして構
成されるため、ホストＬＵ構成テーブル20271のLU Type
は全て「ＩＬＵ」となり、CLU Class、CLU Stripe Size
は意味をなさない。In this embodiment, a disk array management table (hereinafter also referred to as a DCT) is used to manage the configuration of the disk array system collectively by the disk array system configuration management means 70. It is provided in the management means 70. The DCT included in the disk array system configuration management means 70 is shown in FIGS.
, Two table groups of a system configuration table 20270 and subset configuration tables 202720 to 202723. In this embodiment, since all the host LUs are configured as ILUs, the LU Type in the host LU configuration table 20271
Are all “ILU”, CLU Class, CLU Stripe Size
Does not make sense.

【００８３】管理者は、管理端末５を操作してディスク
アレイシステム構成管理手段７０と通信し、ディスクア
レイサブセット１０のディスク容量、ディスクユニット
の台数等の情報を得て、ディスクアレイサブセット１０
のＬＵ１１０の設定、ＲＡＩＤレベルの設定等を行う。
次に管理者は、管理端末５によりディスクアレイシステ
ム構成管理手段７０と通信し、ディスクアレイスイッチ
２０を制御して、各ホスト３０とディスクアレイサブセ
ット２０間の関係情報を設定する。The administrator operates the management terminal 5 to communicate with the disk array system configuration management means 70 to obtain information such as the disk capacity of the disk array subset 10 and the number of disk units.
Of the LU 110, setting of the RAID level, and the like.
Next, the administrator uses the management terminal 5 to communicate with the disk array system configuration management means 70, controls the disk array switch 20, and sets the relationship information between each host 30 and the disk array subset 20.

【００８４】以上の操作により、ディスクアレイシステ
ム1の構成が確立し、ホスト３０から管理者が望む通り
にＬＵ１１０が見えるようになる。ディスクアレイ構成
管理手段７０は以上の設定情報を保存し、管理者からの
操作に応じ構成の確認や、構成の変更を行うことができ
る。With the above operation, the configuration of the disk array system 1 is established, and the LU 110 can be seen from the host 30 as desired by the administrator. The disk array configuration management means 70 stores the above setting information, and can confirm the configuration or change the configuration according to the operation of the administrator.

【００８５】本実施形態によれば、ひとたびディスクア
レイシステム1を構成すれば、管理者からディスクアレ
イスイッチ２０の存在を認識させることが無く、複数の
ディスクアレイサブシステムを１台のディスクアレイシ
ステムと同様に扱うことができる。また、本実施形態に
よれば、ディスクアレイスイッチ２０とディスクアレイ
サブセット１０は、同一の操作環境によって統一的に操
作することができ、その構成確認や、構成変更も容易に
なる。さらに、本実施形態によれば、従来使用していた
ディスクアレイシステムを本実施形態におけるディスク
アレイシステムに置き換える場合に、ホスト３０の設定
を変更することなく、ディスクアレイシステム1の構成
をそれまで使用していたディスクアレイシステムの構成
に合わせることができ、互換性を維持できる。According to the present embodiment, once the disk array system 1 is configured, a plurality of disk array subsystems can be combined with one disk array system without making the administrator aware of the existence of the disk array switch 20. Can be treated similarly. Further, according to the present embodiment, the disk array switch 20 and the disk array subset 10 can be operated in a unified manner in the same operation environment, and the configuration confirmation and configuration change become easy. Further, according to the present embodiment, when replacing the disk array system used conventionally with the disk array system according to the present embodiment, the configuration of the disk array system 1 can be used without changing the setting of the host 30. It can be adapted to the configuration of the disk array system that has been used, and can maintain compatibility.

【００８６】［第４実施形態］以上説明した第１から第
３の実施形態では、ホストＩ／Ｆにファイバチャネルを
使用している。以下に説明する実施形態では、ファイバ
チャネル以外のインタフェースが混在した形態について
説明する。[Fourth Embodiment] In the first to third embodiments described above, a fiber channel is used for the host I / F. In the embodiment described below, a mode in which interfaces other than Fiber Channel are mixed will be described.

【００８７】図１７は、ホストＩ／ＦがパラレルＳＣＳ
Ｉである場合のホストＩ／Ｆノード２０３内部のＩＣ２
０２３の一構成例を示す。20230はパラレルＳＣＳＩの
プロトコル制御を行うＳＣＳＩプロトコルコントローラ
（ＳＰＣ）、20233はファイバチャネルのプロトコル制
御を行うファイバチャネルプロトコルコントローラ（Ｆ
ＰＣ）、20231はパラレルＳＣＳＩとファイバチャネル
のシリアルＳＣＳＩをプロトコル変換するプロトコル変
換プロセッサ（ＰＥＰ）、20232はプロトコル変換中デ
ータを一時保存するバッファ（ＢＵＦ）である。FIG. 17 shows that the host I / F is a parallel SCS.
IC2 inside the host I / F node 203 in the case of I
023 shows a configuration example. Reference numeral 20230 denotes a SCSI protocol controller (SPC) for controlling a protocol of a parallel SCSI, and reference numeral 20233 denotes a fiber channel protocol controller (F) for controlling a protocol of a fiber channel.
PC, 20231 is a protocol conversion processor (PEP) for converting the protocol between parallel SCSI and fiber channel serial SCSI, and 20232 is a buffer (BUF) for temporarily storing data during protocol conversion.

【００８８】本実施形態において、ホスト３０は、ディ
スクアレイＩ／Ｆノード２０３に対してＳＣＳＩコマン
ドを発行する。リードコマンドの場合、ＳＰＣ20230
は、これをＢＵＦ20232に格納し、ＰＥＰ20231に割り込
みでコマンドの受信を報告する。ＰＥＰ20231は、ＢＵ
Ｆ20232に格納されたコマンドを利用し、ＦＰＣ20233へ
のコマンドに変換し、ＦＰＣ20233に送る。ＦＰＣ20233
は、このコマンドを受信すると、フレーム形式に変換
し、ＳＣ２０２２に引き渡す。この際、エクスチェンジ
ＩＤ、シーケンスＩＤ、ソースＩＤ、デスティネイショ
ンＩＤは、以降の処理が可能なようにＰＥＰ20231によ
り付加される。あとのコマンド処理は、第１実施形態と
同様に行われる。In this embodiment, the host 30 issues a SCSI command to the disk array I / F node 203. In case of read command, SPC20230
Stores this in the BUF 20232 and reports the reception of the command to the PEP 20231 by interruption. PEP20231 is a BU
Using the command stored in the F20232, the command is converted into a command for the FPC20233 and sent to the FPC20233. FPC20233
Receives this command, converts it into a frame format, and delivers it to SC2022. At this time, the exchange ID, sequence ID, source ID, and destination ID are added by the PEP 20231 so that the subsequent processing can be performed. Subsequent command processing is performed in the same manner as in the first embodiment.

【００８９】ディスクアレイサブセット１０は、データ
の準備が完了すると、データ転送準備完了フレームの発
行、データ転送、正常終了後ステータスフレームの発行
を実施する。ディスクアレイサブセット１０からＩＣ２
０２３までの間では、フレームヘッダ４０１やフレーム
ペイロード４０２が必要に応じ変換されながら、各種フ
レームの転送が行われる。ＩＣ２０２３のＦＰＣ20233
は、データ転送準備完了フレームを受信し、続いてデー
タを受信してＢＵＦ20232に格納し、続けて正常に転送
が終わったならば、ステータスフレームを受信し、ＰＴ
Ｐ20231に割り込みをかけてデータの転送完了を報告す
る。ＰＴＰ20231は、割り込みを受けると、ＳＰＣ20230
を起動し、ホスト３０に対しデータ転送を開始するよう
指示する。ＳＰＣ20230はホスト３０にデータを送信
し、正常終了を確認するとＰＴＰ20231に対し割り込み
で正常終了を報告する。When data preparation is completed, the disk array subset 10 issues a data transfer preparation completion frame, performs data transfer, and issues a status frame after normal termination. Disk array subset 10 to IC2
Until 023, various frames are transferred while the frame header 401 and the frame payload 402 are converted as necessary. FPC20233 of IC2023
Receives the data transfer ready frame, subsequently receives the data and stores it in the BUF 20232, and if the transfer is normally completed, receives the status frame and
An interrupt is issued to P20231 to report the data transfer completion. When the PTP 20231 receives the interrupt, the STP 20230
And instructs the host 30 to start data transfer. The SPC 20230 transmits the data to the host 30 and, upon confirming the normal end, reports the normal end to the PTP 20231 by interruption.

【００９０】ここでは、ファイバチャネル以外のホスト
Ｉ／Ｆの例としてパラレルＳＣＳＩを示したが、他のイ
ンタフェース、例えば、メインフレームへのホストＩ／
ＦであるESCON等に対しても同様に適用することが可能
である。ディスクアレイスイッチ２０のホストＩ／Ｆノ
ード２０３として、例えば、ファイバチャネル、パラレ
ルＳＣＳＩ、及びESCONに対応したホストＩ／Ｆノード
を設けることで、１台のディスクアレイシステム1に、
メインフレームと、パーソナルコンピュータ、ワークス
テーション等のいわゆるオープンシステムの両方を混在
させて接続することが可能である。本実施形態では、デ
ィスクアレイＩ／Ｆとしては、第１から第３実施形態と
同様、ファイバチャネルを用いているが、ディスクアレ
イＩ／Ｆに対しても任意のＩ／Ｆを使用することが可能
である。Here, the parallel SCSI is shown as an example of the host I / F other than the Fiber Channel, but other interfaces, for example, the host I / F to the mainframe,
The same can be applied to ESCON or the like which is F. As a host I / F node 203 of the disk array switch 20, for example, by providing a host I / F node corresponding to Fiber Channel, parallel SCSI, and ESCON, one disk array system 1
It is possible to mix and connect both a mainframe and a so-called open system such as a personal computer and a workstation. In the present embodiment, a fiber channel is used as the disk array I / F, as in the first to third embodiments. However, an arbitrary I / F may be used for the disk array I / F. It is possible.

【００９１】［第５実施形態］次に、ディスクアレイシ
ステム1の構成管理の方法について、第５実施形態とし
て説明する。図１８は、本実施形態のシステム構成図で
ある。本実施形態では、ホスト３０が4台設けられてい
る。ホスト“＃０”、“＃１”とディスクアレイシステ
ム1の間のＩ／Ｆ３０はファイバチャネル、ホスト“＃
２”とディスクアレイシステム1の間は、パラレルＳＣ
ＳＩ（Ultra SCSI）、ホスト“＃３”とディスクアレイ
システム1の間は、パラレルＳＣＳＩ（Ultra2 SCSI）で
接続されている。[Fifth Embodiment] Next, a method of managing the configuration of the disk array system 1 will be described as a fifth embodiment. FIG. 18 is a system configuration diagram of the present embodiment. In the present embodiment, four hosts 30 are provided. The I / F 30 between the hosts "# 0" and "# 1" and the disk array system 1 is a fiber channel and the host "#
2 "and the disk array system 1 are parallel SC
The SI (Ultra SCSI), the host "# 3" and the disk array system 1 are connected by parallel SCSI (Ultra 2 SCSI).

【００９２】パラレルＳＣＳＩのディスクアレイスイッ
チ２０への接続は第４実施形態と同様に行われる。ディ
スクアレイシステム1は、４台のディスクアレイサブセ
ット３０を有する。ディスクアレイサブセット“＃０”
には４つの独立ＬＵ、ディスクアレイサブセット“＃
１”には２つの独立ＬＵがそれぞれ構成されている。デ
ィスクアレイサブセット“＃２”と“＃３”で１つの統
合ＬＵが構成されている。本実施形態では、第１実施形
態と同様、ホスト３０に対しディスクアレイサブセット
１０を隠蔽し、ファイバチャネルのフレームを変換する
ものとする。各ＬＵに割り当てられるＬＵＮは、ディス
クアレイサブセット“＃０”のＬＵから順に、ＬＵＮ＝
０、１、２、・・・６までの７つである。The connection of the parallel SCSI to the disk array switch 20 is performed in the same manner as in the fourth embodiment. The disk array system 1 has four disk array subsets 30. Disk array subset “# 0”
Has four independent LUs, disk array subset “#”
1 ", two independent LUs are respectively configured. One integrated LU is configured by the disk array subsets"# 2 "and"# 3 ". In the present embodiment, as in the first embodiment, It is assumed that the disk array subset 10 is concealed from the host 30 and the frame of the Fiber Channel is converted, and the LUN assigned to each LU is LUN = LU in the disk array subset “# 0” in order.
0, 1, 2,...

【００９３】図１９は、管理端末５の表示画面上に表示
される画面の一例である。図は、ホストＩ／Ｆ３１と各
論理ユニット（ＬＵ）との対応を示した論理接続構成画
面である。FIG. 19 is an example of a screen displayed on the display screen of the management terminal 5. The figure is a logical connection configuration screen showing the correspondence between the host I / F 31 and each logical unit (LU).

【００９４】論理接続構成画面５０には、各ホストＩ／
Ｆ３１に関する情報３１００、各ＬＵ１１０に関する情
報11000、ディスクアレイサブセット１０とＬＵ１１０
の関係等が表示される。ホストＩ／Ｆ３１に関する情報
としては、Ｉ／Ｆ種類、Ｉ／Ｆ速度、ステータス等が含
まれる。ＬＵ１１０に関する情報としては、格納サブセ
ット番号、ＬＵＮ、容量、ＲＡＩＤレベル、ステータ
ス、情報、等が表示される。管理者はこの画面を参照す
ることで、容易にディスクアレイシステム１の構成を管
理することができる。The logical connection configuration screen 50 displays each host I /
Information 3100 on F31, information 11000 on each LU110, disk array subset 10 and LU110
Are displayed. Information on the host I / F 31 includes an I / F type, an I / F speed, a status, and the like. As information on the LU 110, a storage subset number, LUN, capacity, RAID level, status, information, and the like are displayed. The administrator can easily manage the configuration of the disk array system 1 by referring to this screen.

【００９５】論理接続構成画面５０上で、ホストＩ／Ｆ
とＬＵの間に引かれている線は、各ホストＩ／Ｆ３１を
経由してアクセス可能なＬＵ１１０を示している。ホス
トＩ／Ｆから線の引かれていないＬＵ１１０に対して、
そのホストＩ／Ｆに接続するホスト３０からはアクセス
できない。ホスト３０によって、扱うデータ形式が異な
り、また使用者も異なることから、セキュリティ維持
上、適切なアクセス制限を設けることが不可欠である。
そこで、システムを設定する管理者が、この画面を用い
て、各ＬＵ１１０とホストＩ／Ｆとの間のアクセス許可
をあたえるか否かによって、アクセス制限を実施する。
図において、例えば、ＬＵ“＃０”は、ホストＩ／Ｆ
“＃０”および“＃１”からアクセス可能であるが、ホ
ストＩ／Ｆ“＃２”、“＃３”からはアクセスできな
い。ＬＵ“＃４”は、ホストＩ／Ｆ“＃２”からのみア
クセス可能である。On the logical connection configuration screen 50, the host I / F
The line drawn between and LU indicates the LU 110 that can be accessed via each host I / F 31. For the LU 110 with no line drawn from the host I / F,
It cannot be accessed from the host 30 connected to the host I / F. Since the data format handled and the user differ depending on the host 30, it is indispensable to provide an appropriate access restriction for maintaining security.
Therefore, the administrator who sets up the system uses this screen to restrict access depending on whether or not to grant access permission between each LU 110 and the host I / F.
In the figure, for example, LU “# 0” is a host I / F
It can be accessed from “# 0” and “# 1”, but cannot be accessed from the host I / Fs “# 2” and “# 3”. The LU “# 4” is accessible only from the host I / F “# 2”.

【００９６】このようなアクセス制限を実現するためア
クセス制限情報は、ディスクアレイシステム構成管理手
段７０からディスクアレイスイッチ２０に対して送信さ
れる。ディスクアレイスイッチ２０に送られたアクセス
制限情報は、各ホストＩ／Ｆノード２０３に配信され、
各ホストＩ／Ｆノード２０３のＤＣＴ２０２７に登録さ
れる。ホストにより、アクセスが制限されたＬＵに対す
るＬＵ存在有無の検査コマンドが発行された場合、各ホ
ストＩ／Ｆノード２０３は、ＤＣＴ２０２７の検査を行
い、検査コマンドに対し応答しないか、あるいは、エラ
ーを返すことで、そのＬＵは、ホストからは認識されな
くなる。ＬＵ存在有無の検査コマンドとしては、ＳＣＳ
Ｉプロトコルの場合、Test Unit Readyコマンドや、Inq
uiryコマンドが一般に用いられる。この検査なしに、リ
ード／ライトが実施されることはないため、容易にアク
セスの制限をかけることが可能である。The access restriction information is transmitted from the disk array system configuration management means 70 to the disk array switch 20 to realize such access restriction. The access restriction information sent to the disk array switch 20 is distributed to each host I / F node 203,
It is registered in the DCT 2027 of each host I / F node 203. When the host issues a check command for the presence / absence of an LU for an access-restricted LU, each host I / F node 203 checks the DCT 2027 and does not respond to the check command or returns an error. As a result, the LU is not recognized by the host. The check command for the presence or absence of LU is SCS.
In case of I protocol, Test Unit Ready command or Inq
The uiry command is commonly used. Since read / write is not performed without this inspection, access can be easily restricted.

【００９７】本実施形態ではホストＩ／Ｆ３１毎にアク
セス制限をかけているが、これを拡張することで、ホス
ト３０毎にアクセス制限をかけることも容易に実現でき
る。また、ホストＩ／Ｆ３１、ホスト３０、あるいは、
アドレス空間を特定して、リードのみ可、ライトのみ
可、リード／ライトとも可、リード／ライトとも不可と
いった、コマンドの種別に応じたアクセス制限をかける
こともできる。この場合、アクセス制限情報としてホス
トＩ／Ｆ番号、ホストＩＤ、アドレス空間、制限コマン
ド等を指定してディスクアレイスイッチ２０に制限を設
定する。In the present embodiment, the access is restricted for each host I / F 31. However, by extending this, it is possible to easily realize the access restriction for each host 30. Also, the host I / F 31, the host 30, or
By specifying the address space, it is also possible to restrict access according to the type of command, such as read only, write only, read / write, and read / write. In this case, a restriction is set in the disk array switch 20 by specifying a host I / F number, a host ID, an address space, a restriction command, and the like as access restriction information.

【００９８】次に、新たなディスクアレイサブセット１
０の追加について説明する。ディスクアレイサブセット
１０を新規に追加する場合、管理者は、ディスクアレイ
スイッチ２０の空いているディスクアレイＩ／Ｆノード
２０２に追加するディスクアレイサブセット１０を接続
する。つづけて、管理者は、管理端末５を操作し、論理
接続構成画面５０に表示されている「最新状態を反映」
ボタン５００１を押下する。この操作に応答して、未設
定のディスクアレイサブセットを表す絵が画面上に表示
される（図示せず）。このディスクアレイサブセットの
絵が選択されるすると、ディスクアレイサブセットの設
定画面が現れる。管理者は、表示された設定画面上で、
新規に追加されたディスクアレイサブセットの各種設定
を実施する。ここで設定される項目にはＬＵの構成、Ｒ
ＡＩＤレベル等がある。続けて、図１９の論理接続構成
図の画面に切り替えると、新規ディスクアレイサブセッ
トとＬＵが現れる。以降、ホストＩ／Ｆ３１毎に対する
アクセス制限を設定し、「設定実行」ボタン５００２を
押下すると、ディスクアレイスイッチ２０に対し、アク
セス制限情報、およびディスクアレイサブセット、ＬＵ
の情報が転送され、設定が実行される。Next, a new disk array subset 1
The addition of 0 will be described. When newly adding the disk array subset 10, the administrator connects the disk array subset 10 to be added to the empty disk array I / F node 202 of the disk array switch 20. Subsequently, the administrator operates the management terminal 5 to display the “reflect the latest state” displayed on the logical connection configuration screen 50.
A button 5001 is pressed. In response to this operation, a picture representing the unset disk array subset is displayed on the screen (not shown). When the picture of the disk array subset is selected, a setting screen for the disk array subset appears. On the displayed setting screen, the administrator
Perform various settings for the newly added disk array subset. The items set here include the LU configuration and R
There are AID levels and the like. Subsequently, when switching to the screen of the logical connection configuration diagram of FIG. 19, a new disk array subset and LU appear. After that, the access restriction for each host I / F 31 is set, and when the “execute setting” button 5002 is pressed, the access restriction information, the disk array subset, and the LU are sent to the disk array switch 20.
Is transferred and the setting is executed.

【００９９】各ディスクアレイサブセット１０にＬＵ１
１０を追加する際の手順も上述した手順で行われる。ま
た、ディスクアレイサブセット、およびＬＵの削除につ
いてもほぼ同様の手順で行われる。異なる点は、管理者
が各削除部位を画面上で選択して「削除」ボタン５００
３を押下し、適切な確認が行われたのち、実行される点
である。以上のように、管理端末７０を用いることで、
管理者はディスクアレイシステム全体を一元的に管理で
きる。LU1 is assigned to each disk array subset 10.
The procedure for adding 10 is also performed in the manner described above. The deletion of the disk array subset and the LU is performed in substantially the same procedure. The difference is that the administrator selects each part to be deleted on the screen and clicks the “Delete” button 500.
This is the point that the operation is performed after the user presses 3 and an appropriate confirmation is performed. As described above, by using the management terminal 70,
The administrator can centrally manage the entire disk array system.

【０１００】［第６実施形態］次に、ディスクアレイス
イッチ２０によるミラーリングの処理について、第６実
施形態として説明する。ここで説明するミラーリングと
は、２台のディスクアレイサブセットの２つの独立ＬＵ
により二重書きをサポートする方法であり、ディスクア
レイサブセットのコントローラまで含めた二重化であ
る。従って、信頼性は、ディスクのみの二重化とは異な
る。[Sixth Embodiment] Next, a mirroring process performed by the disk array switch 20 will be described as a sixth embodiment. The mirroring described here refers to two independent LUs of two disk array subsets.
Is a method that supports dual writing, and is a duplex including the controller of the disk array subset. Therefore, reliability is different from disk-only duplication.

【０１０１】本実施形態におけるシステムの構成は図１
に示すものと同じである。図１に示す構成おいて、ディ
スクアレイサブセット“＃０”と“＃１”は全く同一の
ＬＵ構成を備えており、この２つのディスクアレイサブ
セットがホスト３０からは１つのディスクアレイとして
見えるものとする。便宜上、ミラーリングされたディス
クアレイサブセットのペアの番号を“＃０１”と呼ぶ。
また、各ディスクアレイサブセットのＬＵ“＃０”とＬ
Ｕ“＃１”によってミラーリングペアが形成され、この
ＬＵのペアを便宜上、ＬＵ“＃０１”と呼ぶ。ＤＣＴ２
０２７のホストＬＵ構成テーブル20271上でＬＵ＃０１
を管理するための情報は、CLU Classに「Mirrored」が
設定され、LU Info.として、ＬＵ＃０とＬＵ＃１に関す
る情報が設定される。その他の各部の構成は第１実施形
態と同様である。The system configuration according to the present embodiment is shown in FIG.
Is the same as that shown in FIG. In the configuration shown in FIG. 1, the disk array subsets “# 0” and “# 1” have exactly the same LU configuration, and these two disk array subsets appear to the host 30 as one disk array. I do. For convenience, the pair number of the mirrored disk array subset is referred to as “# 01”.
Also, LU “# 0” and L of each disk array subset
A mirroring pair is formed by U “# 1”, and this LU pair is called LU “# 01” for convenience. DCT2
LU # 01 in the host LU configuration table 20271 of LU 027
Is set to “Mirrored” in the CLU Class, and information about LU # 0 and LU # 1 is set as LU Info. The other components are the same as in the first embodiment.

【０１０２】本実施形態における各部の動作は、第１実
施例とほぼ同様である。以下、第１実施形態と相違する
点について、ディスクアレイスイッチ２０のホストＩ／
Ｆノード２０３の動作を中心に説明する。図２０は、本
実施形態におけるライト動作時に転送されるフレームの
シーケンスを示す模式図、図２１、２２は、ライト動作
時におけるホストＩ／Ｆノード２０３による処理の流れ
を示すフローチャートである。The operation of each section in this embodiment is almost the same as in the first embodiment. Hereinafter, the difference between the first embodiment and the first embodiment will be described.
The operation of the F node 203 will be mainly described. FIG. 20 is a schematic diagram showing a sequence of frames transferred at the time of a write operation in the present embodiment, and FIGS. 21 and 22 are flowcharts showing the flow of processing by the host I / F node 203 at the time of a write operation.

【０１０３】ライト動作時、ホスト３０が発行したライ
トコマンドフレーム（FCP_CMD）は、ＩＣ２０２３によ
り受信される（図２０の矢印（ａ）：ステップ2100
1）。ＩＣ２０２３により受信されたライトコマンドフ
レームは、第１実施形態で説明したリード動作時におけ
るステップ20002 20005と同様に処理される（ステップ
21002 - 21005）。At the time of the write operation, the write command frame (FCP_CMD) issued by the host 30 is received by the IC 2023 (arrow (a) in FIG. 20: step 2100).
1). The write command frame received by the IC 2023 is processed in the same manner as in Steps 20002 and 20005 in the read operation described in the first embodiment (Step 20002).
21002-21005).

【０１０４】ＳＣ２０２２は、ＳＰ２０２１を使ってＤ
ＣＴ２０２７を検索し、ミラー化されたディスクアレイ
サブセット“＃０１”のＬＵ“＃０１”へのライトアク
セス要求であることを認識する（ステップ21006）。Ｓ
Ｃ２０２２は、ＦＢ２０２５上に、受信したコマンドフ
レームの複製を作成する（ステップ21007）。ＳＣ２０
２２は、ＤＣＴ２０２７に設定されている構成情報に基
づいてコマンドフレームの変換を行い、ＬＵ“＃０”と
ＬＵ“＃１”の両者への別々のコマンドフレームを作成
する（ステップ21008）。ここで、ＬＵ“＃０”を主Ｌ
Ｕ、ＬＵ“＃１”を従ＬＵと呼び、コマンドフレームに
もそれぞれ主コマンドフレーム、従コマンドフレームと
呼ぶ。そして、両者別々にＥＴ２０２６にエクスチェン
ジ情報を格納し、ディスクアレイサブセット“＃０”お
よびディスクアレイサブセット“＃１”に対し作成した
コマンドフレームを発行する（図２０の矢印（ｂ０）
（ｂ１）：ステップ21009）。The SC 2022 uses the SP 2021 to
The CT 2027 is searched to recognize that it is a write access request to the LU “# 01” of the mirrored disk array subset “# 01” (step 21006). S
The C2022 creates a copy of the received command frame on the FB 2025 (step 21007). SC20
22 converts the command frame based on the configuration information set in the DCT 2027, and creates separate command frames for both LU “# 0” and LU “# 1” (step 21008). Here, LU “# 0” is the main L
U and LU “# 1” are referred to as a slave LU, and the command frames are also referred to as a main command frame and a slave command frame, respectively. Then, the exchange information is separately stored in the ET 2026, and the command frames created for the disk array subset “# 0” and the disk array subset “# 1” are issued (arrow (b0) in FIG. 20).
(B1): Step 21009).

【０１０５】各ディスクアレイサブセット“＃０”、
“＃１”は、コマンドフレームを受信し、それぞれ独立
にデータ転送準備完了フレーム（FCP_XFER_RDY）をディ
スクアレイスイッチ２０に送信する（図２０の矢印（ｃ
０）（ｃ１））。ディスクアレイスイッチ２０では、ホ
ストＩ／Ｆノード２０３が、第１実施形態におけるリー
ド動作のステップ20011 20013と同様の処理により転送
されてきたデータ転送準備完了フレームを処理する（ス
テップ21011 - 21013）。Each disk array subset “# 0”,
“# 1” receives a command frame and independently transmits a data transfer preparation completion frame (FCP_XFER_RDY) to the disk array switch 20 (arrow (c) in FIG. 20).
0) (c1)). In the disk array switch 20, the host I / F node 203 processes the data transfer preparation completion frame transferred by the same processing as in the steps 20011 20013 of the read operation in the first embodiment (steps 21011 to 21013).

【０１０６】各ディスクアレイサブセットからのデータ
転送準備完了フレームがそろった段階で（ステップ2101
4）、ＳＣ２０２２は、主データ転送準備完了フレーム
に対する変換を実施し（ステップ21015）、ＩＣ２０２
３により変換後のフレームをホスト３０に送信する（図
２０の矢印（ｄ）：ステップ21015）。When the data transfer ready frames from each disk array subset are completed (step 2101)
4), SC 2022 performs conversion on the main data transfer ready frame (step 21015), and IC 202
Then, the converted frame is transmitted to the host 30 in step 3 (arrow (d) in FIG. 20: step 21015).

【０１０７】ホスト３０は、データ転送準備完了フレー
ムを受信した後、ライトデータ送信のため、データフレ
ーム（FCP_DATA）をディスクアレイスイッチ２０に送信
する（図２０の矢印（ｅ））。ホスト３０からのデータ
フレームは、ＩＣ２０２３により受信されると（ステッ
プ21031）、リードコマンドフレームやライトコマンド
フレームと同様に、ＦＢ２０２５に格納され、ＣＲＣ検
査、フレームヘッダの解析が行われる（ステップ2103
2、21033）。フレームヘッダの解析結果に基づき、ＥＴ
２０２６がＳＰ２０２１により検索され、エクスチェン
ジ情報が獲得される（ステップ21034）。After receiving the data transfer preparation completion frame, the host 30 transmits a data frame (FCP_DATA) to the disk array switch 20 for write data transmission (arrow (e) in FIG. 20). When the data frame from the host 30 is received by the IC 2023 (step 21031), it is stored in the FB 2025 as in the case of the read command frame and the write command frame, and the CRC check and the analysis of the frame header are performed (step 2103).
2, 21033). ET based on the analysis result of the frame header
2026 is searched by the SP 2021, and exchange information is obtained (step 21034).

【０１０８】ＳＣ２０２２は、ライトコマンドフレーム
のときと同様に複製を作成し（ステップ21035）、その
一方をディスクアレイサブセット“＃０”内のＬＵ“＃
０”に、他方をディスクアレイサブセット“＃１”内の
ＬＵ“＃１”に向けて送信する（図２０の矢印（ｆ０）
（ｆ１）：ステップ21037）。The SC 2022 creates a duplicate as in the case of the write command frame (step 21035), and assigns one of the duplicates to the LU “#” in the disk array subset “# 0”.
0 "to the LU"# 1 "in the disk array subset"# 1 "(arrow (f0) in FIG. 20).
(F1): Step 21037).

【０１０９】ディスクアレイサブセット“＃０”、“＃
１”は、各々、データフレームを受信し、ディスクユニ
ット１０４に対しそれぞれライトし、ステータスフレー
ム（FCP_RSP）をディスクアレイスイッチ２０に送信す
る。Disk array subsets “# 0”, “#”
1 "receives a data frame, writes the data frame to the disk unit 104, and transmits a status frame (FCP_RSP) to the disk array switch 20.

【０１１０】ＳＣ２０２２は、ディスクアレイサブセッ
ト“＃０”、“＃１”それぞれからステータスフレーム
を受信すると、それらのステータスフレームから拡張ヘ
ッダを外してフレームヘッダを再現し、ＥＴ２０２６か
らエクスチェンジ情報を獲得する（ステップ21041、210
42）。When the SC 2022 receives the status frames from the disk array subsets “# 0” and “# 1”, the SC 2022 removes the extension headers from those status frames and reproduces the frame headers, and obtains exchange information from the ET 2026 ( Step 21041, 210
42).

【０１１１】ディスクアレイサブセット“＃０”、“＃
１”の両者からのステータスフレームが揃うと（ステッ
プ21043）、ステータスが正常終了であることを確認の
うえ、ＬＵ“＃０”からの主ステータスフレームに対す
る変換を行い（ステップ21044）、従ステータスフレー
ム消去する（ステップ21045）。そして、ＩＣ２０２３
は、正常終了を報告するためのコマンドフレームをホス
トに送信する（図２０の矢印（ｈ）：ステップ2104
6）。最後にＳＰ２０２１は、ＥＴ２０２６のエクスチ
ェンジ情報を消去する（ステップ21047）。The disk array subsets “# 0” and “#”
When the status frames from both “1” are prepared (step 21043), after confirming that the status is normally completed, conversion to the main status frame from LU “# 0” is performed (step 21044). Erase (step 21045), and IC 2023
Sends a command frame for reporting normal termination to the host (arrow (h) in FIG. 20: step 2104)
6). Finally, the SP 2021 deletes the exchange information of the ET 2026 (step 21047).

【０１１２】以上でミラーリング構成におけるライト処
理が終了する。ミラーリングされたＬＵ“＃０１”に対
するリード処理は、データの転送方向が異なるだけで、
上述したライト処理とほぼ同様に行われるが、ライトと
は異なり、２台のディスクアレイサブセットにリードコ
マンドを発行する必要はなく、どちらか一方に対してコ
マンドフレームを発行すればよい。たとえば、常に主Ｌ
Ｕに対してコマンドフレームを発行してもよいが、高速
化のため、主／従双方のＬＵに対して、交互にコマンド
フレームを発行するなどにより、負荷を分散すると有効
である。Thus, the write processing in the mirroring configuration ends. The read processing for the mirrored LU “# 01” differs only in the data transfer direction.
The write process is performed in substantially the same manner as described above, but unlike the write process, it is not necessary to issue a read command to two disk array subsets, and it is sufficient to issue a command frame to one of them. For example, always L
Although a command frame may be issued to U, it is effective to distribute the load by alternately issuing command frames to both the master and slave LUs for speeding up.

【０１１３】上述した処理では、ステップ21014、及び
ステップ21043で２台のディスクアレイサブセット“＃
０”、“＃１”の応答を待ち、両者の同期をとって処理
が進められる。このような制御では、双方のディスクア
レイサブセットでの処理の成功が確認されてから処理が
進むため、エラー発生時の対応が容易になる。その一方
で、全体の処理速度が、どちらか遅いほうの応答に依存
してしまうため、性能が低下するという欠点がある。In the processing described above, the two disk array subsets “#” are set in steps 21014 and 21043.
0 ”and“ # 1 ”, and the process proceeds in synchronization with the two.In such control, the process proceeds after the success of the process in both disk array subsets is confirmed. On the other hand, when an error occurs, it is easy to deal with it, but on the other hand, the overall processing speed depends on the response which is slower, so that there is a disadvantage that the performance is reduced.

【０１１４】この問題を解決するため、ディスクアレイ
スイッチにおいて、ディスクアレイサブセットの応答を
待たずに次の処理に進んだり、ディスクアレイサブセッ
トのどちらか一方からの応答があった時点で次の処理に
進む「非同期型」の制御をすることも可能である。非同
期型の制御を行った場合のフレームシーケンスの一例
を、図２０において破線矢印で示す。In order to solve this problem, the disk array switch proceeds to the next processing without waiting for the response of the disk array subset, or proceeds to the next processing when there is a response from one of the disk array subsets. It is also possible to perform a “non-synchronous” control. An example of a frame sequence when the asynchronous control is performed is indicated by a broken arrow in FIG.

【０１１５】破線矢印で示されるフレームシーケンスで
は、ステップ21016で行われるホストへのデータ転送準
備完了フレームの送信が、ステップ21009の処理の後、
ディスクアレイサブセット１０からのデータ転送準備完
了フレームを待たずに実施される。この場合、ホストに
送信されるデータ転送準備完了フレームは、ディスクア
レイスイッチ２０のＳＣ２０２２により生成される（破
線矢印（ｄ′））。In the frame sequence indicated by the broken arrow, the transmission of the data transfer preparation completion frame to the host performed in step 21016 is performed after the processing in step 21009.
This is performed without waiting for a data transfer preparation completion frame from the disk array subset 10. In this case, the data transfer preparation completion frame transmitted to the host is generated by the SC 2022 of the disk array switch 20 (broken arrow (d ')).

【０１１６】ホスト３０からは、破線矢印（ｅ′）で示
されるタイミングでデータフレームがディスクアレイス
イッチ２０に転送される。ディスクアレイスイッチ２０
では、このデータフレームが一旦ＦＢ２０２５に格納さ
れる。ＳＣ２０２２は、ディスクアレイサブセット１０
からのデータ転送準備完了フレームの受信に応答して、
データ転送準備完了フレームが送られてきたディスクア
レイサブセット１０に対し、ＦＢ２０２５に保持された
データフレームを転送する（破線矢印（ｆ０′）、（ｆ
１′））。The data frame is transferred from the host 30 to the disk array switch 20 at the timing indicated by the dashed arrow (e '). Disk array switch 20
Then, this data frame is temporarily stored in the FB 2025. SC 2022 is a disk array subset 10
In response to receiving the data transfer ready frame from
The data frame held in the FB 2025 is transferred to the disk array subset 10 to which the data transfer preparation completion frame has been sent (dashed arrow (f0 '), (f
1 ')).

【０１１７】ディスクアレイスイッチ２０からホスト３
０への終了報告は、双方のディスクアレイサブシステム
１０からの報告（破線矢印（ｇ０′）、（ｇ０′））が
あった時点でおこなわれる（破線矢印（ｈ′））。この
ような処理により、図２０に示される時間Ｔａの分だけ
処理時間を短縮することが可能である。From the disk array switch 20 to the host 3
The end report to 0 is made when there are reports (broken arrows (g0 ') and (g0')) from both disk array subsystems 10 (broken arrows (h ')). Through such processing, the processing time can be reduced by the time Ta shown in FIG.

【０１１８】ディスクアレイスイッチ２０とディスクア
レイサブセット１０間のフレーム転送の途中でエラーが
発生した場合、以下の処理が実施される。If an error occurs during the frame transfer between the disk array switch 20 and the disk array subset 10, the following processing is performed.

【０１１９】実行中の処理がライト処理の場合、エラー
が発生したＬＵに対し、リトライ処理が行われる。リト
ライが成功すれば、処理はそのまま継続される。あらか
じめ設定された規定の回数のリトライが失敗した場合、
ディスクアレイスイッチ２０は、このディスクアレイサ
ブセット１０（もしくはＬＵ）に対するアクセスを禁止
し、そのことを示す情報をＤＣＴ２０２７に登録する。
また、ディスクアレイスイッチ２０は、ＭＰ２００、通
信コントローラ２０４を経由して、ディスクシステム構
成手段７０にそのことを通知する。If the process being executed is a write process, a retry process is performed on the LU in which an error has occurred. If the retry succeeds, the process continues. If the specified number of retries fails,
The disk array switch 20 prohibits access to the disk array subset 10 (or LU), and registers information indicating this in the DCT 2027.
Further, the disk array switch 20 notifies the disk system configuration unit 70 of the fact via the MP 200 and the communication controller 204.

【０１２０】ディスクシステム構成手段７０は、この通
知に応答して管理端末５にアラームを発行する。これに
より管理者は、トラブルが発生したことを認識できる。
その後、ディスクアレイスイッチ２０は、正常なディス
クアレイサブセットを用いて運転を継続する。ホスト３
０は、エラーが発生したことを認識することはなく、処
理を継続できる。The disk system configuration means 70 issues an alarm to the management terminal 5 in response to this notification. This allows the administrator to recognize that a trouble has occurred.
Thereafter, the disk array switch 20 continues operation using the normal disk array subset. Host 3
A value of 0 does not recognize that an error has occurred, and the processing can be continued.

【０１２１】本実施形態によれば、２台のディスクアレ
イサブシステムでミラー構成を実現できるので、ディス
クの耐障害性を上げることことができる。また、ディス
クアレイコントローラ、ディスクアレイＩ／Ｆ、及びデ
ィスクアレイＩ／Ｆノードの耐障害性を上げることがで
き、内部バスの二重化等するくとなくディスクアレイシ
ステム全体の信頼性を向上させることができる。According to the present embodiment, since a mirror configuration can be realized by two disk array subsystems, the fault tolerance of the disk can be improved. Further, the fault tolerance of the disk array controller, the disk array I / F, and the disk array I / F node can be improved, and the reliability of the entire disk array system can be improved without duplication of the internal bus. it can.

【０１２２】［第７実施形態］次に、３台以上のディス
クアレイサブセット１０を統合し、１台の論理的なディ
スクアレイサブセットのグループを構成する方法につい
て説明する。本実施形態では、複数のディスクアレイサ
ブセット１０にデータを分散して格納する。これによ
り、ディスクアレイサブセットへのアクセスを分散さ
せ、特定のディスクアレイサブセットへのアクセスの集
中を抑止することで、トータルスループットを向上させ
る。本実施形態では、ディスクアレイスイッチによりこ
のようなストライピング処理を実施する。[Seventh Embodiment] Next, a method of integrating three or more disk array subsets 10 to form one logical disk array subset group will be described. In the present embodiment, data is distributed and stored in a plurality of disk array subsets 10. As a result, access to the disk array subset is distributed, and concentration of access to a specific disk array subset is suppressed, thereby improving the total throughput. In the present embodiment, such striping processing is performed by the disk array switch.

【０１２３】図２３は、本実施形態におけるディスクア
レイシステム1のアドレスマップである。ディスクアレ
イサブセット１０のアドレス空間は、ストレイプサイズ
Ｓでストライピングされている。ホストから見たディス
クアレイシステム１のアドレス空間は、ストライプサイ
ズＳ毎に、ディスクアレイサブセット“＃０”、“＃
１”、“＃２”、“＃３”に分散されている。ストライ
プサイズＳのサイズは任意であるが、あまり小さくない
方がよい。ストライプサイズＳが小さすぎると、アクセ
スすべきデータが複数のストライプに属するストライプ
またぎが発生したときに、その処理にオーバヘッドが発
生するおそれがある。ストライプサイズＳを大きくする
と、ストライプまたぎが発生する確率が減少するので性
能向上のためには好ましい。ＬＵの数は任意に設定する
ことができる。FIG. 23 is an address map of the disk array system 1 in this embodiment. The address space of the disk array subset 10 is striped with a stripe size S. The address space of the disk array system 1 as viewed from the host includes disk array subsets “# 0” and “#” for each stripe size S.
1 ","# 2 ", and"# 3 ". The size of the stripe size S is arbitrary, but should not be too small.If the stripe size S is too small, there are a plurality of data to be accessed. When a stripe crossover belonging to the stripe of the above-mentioned stripe occurs, there is a possibility that overhead will be generated in the processing.When the stripe size S is increased, the probability of occurrence of the stripe crossover decreases, which is preferable for improving the performance. The number can be set arbitrarily.

【０１２４】以下、本実施形態におけるホストＩ／Ｆノ
ード２０３の動作について、図２４に示す動作フローチ
ャートを参照しつつ第１実施形態との相違点に着目して
説明する説明する。なお、本実施形態では、ＤＣＴ２０
２７のホストＬＵ構成テーブル20271上で、ストライピ
ングされたホストＬＵに関する情報のCLU Classには「S
triped」が、CLU Stripe Sizeにはストライプサイズ
「Ｓ」が設定される。Hereinafter, the operation of the host I / F node 203 in this embodiment will be described with reference to the operation flowchart shown in FIG. 24 and focusing on the differences from the first embodiment. In the present embodiment, the DCT 20
27, the CLU Class of the information on the striped host LU is “S
“triped” and the stripe size “S” are set in the CLU Stripe Size.

【０１２５】ホスト３０がコマンドフレームを発行する
と、ディスクアレイスイッチ２０は、ホストＩ／Ｆノー
ド２０３のＩＣ２０２３でこれを受信する（ステップ２
２００１）、ＳＣ２０２２は、ＩＣ２０２３からこのコ
マンドフレームを受け取り、ＳＰ２０２１を使ってＤＣ
Ｔ２０２７を検索し、ストライピングする必要があるこ
とを認識する（ステップ２２００５）。When the host 30 issues a command frame, the disk array switch 20 receives the command frame with the IC 2023 of the host I / F node 203 (step 2).
2001), the SC 2022 receives the command frame from the IC 2023, and uses the SP 2021 to
T2027 is searched to recognize that striping is necessary (step 22005).

【０１２６】次に、ＳＣ２０２２は、ＳＰ２０２１によ
りＤＣＴ２０２７を検索し、ストライプサイズＳを含む
構成情報から、アクセスの対象となるデータが属するス
トライプのストライプ番号を求め、このストライプがど
のディスクアレイサブセット１０に格納されているか特
定する（ステップ22006）。この際、ストライプまたぎ
が発生する可能性があるが、この場合の処理については
後述する。ストライプまたぎが発生しない場合、ＳＰ２
０２１の計算結果に基づき、ＳＣ２０２２はコマンドフ
レームに対し変換を施し（ステップ22007）、エクスチ
ェンジ情報をＥＴ２０２６に格納する（ステップ2200
8）。以降は、第１実施形態と同様の処理が行われる。Next, the SC 2022 searches the DCT 2027 by the SP 2021 to obtain the stripe number of the stripe to which the data to be accessed belongs from the configuration information including the stripe size S, and stores this stripe in any disk array subset 10 Is specified (step 22006). At this time, there is a possibility that a stripe jump may occur, and the processing in this case will be described later. If no stripe straddling occurs, SP2
On the basis of the calculation result of 021, the SC 2022 converts the command frame (step 22007), and stores the exchange information in the ET 2026 (step 2200).
8). Thereafter, the same processing as in the first embodiment is performed.

【０１２７】ストライプまたぎが発生した場合、ＳＰ２
０２１は、２つのコマンドフレームを生成する。この生
成は、例えば、ホスト３０が発行したコマンドフレーム
を複製することで行われる。生成するコマンドフレーム
のフレームヘッダ、フレームペイロード等は、新規に設
定する。第６実施形態と同様、ＳＣ２０２２でコマンド
フレームの複製を作成した後、変換を実施することも可
能であるが、ここでは、ＳＰ２０２１により新規に作成
されるものとする。ＳＣ２０２２は、２つのコマンドフ
レームが生成されると、これらを各ディスクアレイサブ
セット１０に送信する。When a stripe crossover occurs, SP2
021 generates two command frames. This generation is performed, for example, by copying a command frame issued by the host 30. A frame header, a frame payload, and the like of the generated command frame are newly set. As in the sixth embodiment, it is possible to execute the conversion after creating a copy of the command frame in the SC 2022, but it is assumed here that the command frame is newly created by the SP 2021. When two command frames are generated, the SC 2022 transmits them to each disk array subset 10.

【０１２８】この後、第１実施形態と同様にデータ転送
が実施される。ここで、本実施形態では、第１実施形
態、あるいは第６実施形態と異なり、データ自体を１台
のホスト３０と２台のディスクアレイサブセット１０間
で転送する必要がある。たとえば、リード処理の場合、
２台のディスクアレイサブセット１０から転送されるデ
ータフレームは、すべてホスト３０に転送する必要があ
る。この際ＳＣ２０２２は、各ディスクアレイサブセッ
ト１０から転送されてくるデータフレームに対し、ＥＴ
２０２６に登録されたエクスチェンジ情報に従い、適切
な順番で、適切なエクスチェンジ情報を付加してホスト
３０に送信する。Thereafter, data transfer is performed as in the first embodiment. Here, in the present embodiment, unlike the first embodiment or the sixth embodiment, it is necessary to transfer the data itself between one host 30 and two disk array subsets 10. For example, for lead processing,
All data frames transferred from the two disk array subsets 10 need to be transferred to the host 30. At this time, the SC 2022 performs ET on the data frame transferred from each disk array subset 10.
According to the exchange information registered in 2026, appropriate exchange information is added in an appropriate order and transmitted to the host 30.

【０１２９】ライト処理の場合は、コマンドフレームの
場合と同様、２つのデータフレームに分割して、該当す
るディスクアレイサブセット１０に転送する。なお、デ
ータフレームの順序制御は、ホスト、あるいはディスク
アレイサブセットがアウトオブオーダー（Out of Orde
r）機能と呼ばれる、順不同処理に対応しているならば
必須ではない。In the case of the write process, the data is divided into two data frames and transferred to the corresponding disk array subset 10, as in the case of the command frame. Note that the order control of the data frame is performed when the host or the disk array subset is out of order.
r) It is not essential if it supports out-of-order processing called functions.

【０１３０】最後に、すべてのデータ転送が完了し、デ
ィスクアレイスイッチ２０が２つのステータスフレーム
をディスクアレイサブセット１０から受信すると、ＳＰ
２０２１（あるいはＳＣ２０２２）は、ホスト３０への
ステータスフレームを作成し、これをＩＣ２０２３によ
りホスト３０に送信する。Finally, when all data transfer is completed and the disk array switch 20 receives two status frames from the disk array subset 10, SP
The 2021 (or SC 2022) creates a status frame for the host 30 and transmits this to the host 30 by the IC 2023.

【０１３１】本実施形態によれば、アクセスを複数のデ
ィスクアレイサブセットに分散することができるので、
トータルとしてスループットを向上させることができる
とともに、アクセスレイテンシも平均的に低減させるこ
とが可能である。According to this embodiment, access can be distributed to a plurality of disk array subsets.
The throughput can be improved as a whole, and the access latency can be reduced on average.

【０１３２】［第８実施形態］次に、２台のディスクア
レイシステム（またはディスクアレイサブセット）間に
おける複製の作成について、第８実施形態として説明す
る。ここで説明するようなシステムは、２台のディスク
アレイシステムの一方を遠隔地に配置し、天災等による
他方のディスクアレイシステムの障害に対する耐性を備
える。このような災害に対する対策をディザスタリカバ
リと呼び、遠隔地のディスクアレイシステムとの間で行
われる複製の作成のことをリモートコピーと呼ぶ。[Eighth Embodiment] Next, the creation of a copy between two disk array systems (or disk array subsets) will be described as an eighth embodiment. In the system described here, one of the two disk array systems is arranged at a remote location, and the system is resistant to a failure of the other disk array system due to a natural disaster or the like. Such a countermeasure against a disaster is called disaster recovery, and creation of a copy with a remote disk array system is called remote copy.

【０１３３】第６実施形態で説明したミラーリングで
は、地理的にほぼ同一の場所に設置されたディスクアレ
イサブセット１０でミラーを構成するので、ディスクア
レイＩ／Ｆ２１はファイバチャネルでよい。しかし、リ
モートコピーを行うディスクアレイ（ディスクアレイサ
ブセット）が１０ｋｍを越える遠隔地に設置される場
合、中継なしでファイバチャネルによりフレームを転送
する事ができない。ディザスタリカバリに用いられる場
合、お互いの間の距離は通常数百ｋｍ以上となる、この
ため、ファイバチャネルでディスクアレイ間を接続する
ことは実用上不可能であり、ＡＴＭ（Asynchronous Tra
nsfer Mode）等による高速公衆回線や衛星通信等が用い
られる。In the mirroring described in the sixth embodiment, since the mirror is configured by the disk array subsets 10 installed at almost the same location geographically, the disk array I / F 21 may be a fiber channel. However, when a disk array (disk array subset) for performing remote copy is installed in a remote place exceeding 10 km, frames cannot be transferred by fiber channel without relay. When used for disaster recovery, the distance between each other is usually several hundred km or more. Therefore, it is practically impossible to connect the disk arrays with a fiber channel.
nsfer Mode) or a high-speed public line, satellite communication, or the like.

【０１３４】図２５は、本実施形態におけるディザスタ
リカバリシステムの構成例である。FIG. 25 shows an example of the configuration of a disaster recovery system according to this embodiment.

【０１３５】８１はサイトＡ、８２はサイトＢであり、
両サイトは、地理的な遠隔地に設置される。9は公衆回
線であり、ＡＴＭパケットがここを通過する。サイトＡ
８１、およびサイトＢ８２は、それぞれディスクアレイ
システム1を有する。ここでは、サイトＡ８１が通常使
用される常用サイトであり、サイトＢ８２はサイトＡ８
１が災害等でダウンしたときに使用されるリモートディ
ザスタリカバリサイトである。Reference numeral 81 denotes a site A, 82 denotes a site B,
Both sites will be located in geographically remote locations. Reference numeral 9 denotes a public line through which an ATM packet passes. Site A
81 and site B82 each have a disk array system 1. Here, site A81 is a regular site that is normally used, and site B82 is site A8.
1 is a remote disaster recovery site used when a disaster occurs.

【０１３６】サイトＡ８１のディスクアレイシステム１
０のディスクアレイサブセット“＃０”、“＃１”の内
容は、サイトＢ８２のディスクアレイシステム１０のリ
モートコピー用ディスクアレイサブセット“＃０”、
“＃１”にコピーされる。ディスクアレイスイッチ２０
のＩ／Ｆノードのうち、リモートサイトに接続するもの
はＡＴＭを用いて公衆回線9に接続されている。このノ
ードをＡＴＭノード２０５と呼ぶ。ＡＴＭノード２０５
は、図５に示すホストＩ／Ｆノードと同様に構成され、
ＩＣ２０２３がＡＴＭ−ファイバチャネルの変換を行
う。この変換は、第４実施形態におけるＳＣＳＩ−ファ
イバチャネルの変換と同様の方法により実現される。Disk Array System 1 at Site A81
The contents of the disk array subsets “# 0” and “# 1” of the disk array subset “# 0”,
Copied to “# 1”. Disk array switch 20
Of the I / F nodes connected to the remote site are connected to the public line 9 using ATM. This node is called an ATM node 205. ATM node 205
Is configured similarly to the host I / F node shown in FIG.
The IC 2023 performs ATM-Fibre Channel conversion. This conversion is realized by a method similar to the SCSI-Fibre Channel conversion in the fourth embodiment.

【０１３７】本実施形態におけるリモートコピーの処理
は、第６実施形態におけるミラーリングの処理と類似す
る。以下、第６実施形態におけるミラーリングの処理と
異なる点について説明する。The remote copy processing according to the present embodiment is similar to the mirroring processing according to the sixth embodiment. Hereinafter, points different from the mirroring processing in the sixth embodiment will be described.

【０１３８】ホスト３０がライトコマンドフレームを発
行すると、サイトＡ８１のディスクアレイシステム１０
は、第６実施形態における場合と同様にフレームの二重
化を実施し、その一方を自身のディスクアレイサブセッ
ト10に転送する。他方のフレームは、ＡＴＭノード２０
５によりファイバチャネルフレームからＡＴＭパケット
に変換され、公衆回線9を介してサイトＢ８２に送られ
る。When the host 30 issues a write command frame, the disk array system 10 in the site A 81
Performs the frame duplication as in the sixth embodiment, and transfers one of the frames to its own disk array subset 10. The other frame is the ATM node 20
5, the packet is converted from the fiber channel frame into an ATM packet, and sent to the site B82 via the public line 9.

【０１３９】サイトＢ８２では、ディスクアレイスイッ
チ２０のＡＴＭノード２０５がこのパケットを受信す
る。ＡＴＭノード２０５のＩＣ２０２３は、ＡＴＭパケ
ットからファイバチャネルフレームを再現し、ＳＣ２０
２２に転送する。ＳＣ２０２２は、ホスト３０からライ
トコマンドを受信したときと同様にフレーム変換を施
し、リモートコピー用のディスクアレイサブセットに転
送する。以降、データ転送準備完了フレーム、データフ
レーム、ステータスフレームのすべてにおいて、ＡＴＭ
ノード２０５においてファイバチャネル−ＡＴＭ変換を
行い、同様のフレーム転送処理を実施することにより、
リモートコピーが実現できる。At the site B82, the ATM node 205 of the disk array switch 20 receives this packet. The IC 2023 of the ATM node 205 reproduces the Fiber Channel frame from the ATM packet and
22. The SC 2022 performs frame conversion in the same manner as when a write command is received from the host 30, and transfers the frame to the remote copy disk array subset. Thereafter, in all of the data transfer ready frame, data frame, and status frame, the ATM
By performing Fiber Channel-ATM conversion at the node 205 and performing the same frame transfer processing,
Remote copy can be realized.

【０１４０】ホスト３０がリードコマンドフレームを発
行した際には、ディスクアレイスイッチ２０は、自サイ
トのディスクアレイサブセット１０に対してのみコマン
ドフレームを転送し、自サイトのディスクアレイサブセ
ット１０からのみデータをリードする。このときの動作
は、第１実施形態と同一となる。When the host 30 issues a read command frame, the disk array switch 20 transfers the command frame only to the disk array subset 10 at its own site, and transfers data only from the disk array subset 10 at its own site. Lead. The operation at this time is the same as in the first embodiment.

【０１４１】本実施形態によれば、ユーザデータをリア
ルタイムでバックアップし、天災等によるサイト障害、
ディスクアレイシステム障害に対する耐性を備えること
ができる。According to this embodiment, user data is backed up in real time,
It is possible to provide resistance to a disk array system failure.

【０１４２】［第９実施形態］次に、一台のディスクア
レイサブセット１０に包含される複数のＬＵの統合につ
いて説明する。例えば、メインフレーム用のディスク装
置は、過去のシステムとの互換性を維持するために、論
理ボリュームのサイズの最大値が２ＧＢに設定されてい
る。このようなディスクアレイシステムをオープンシス
テムでも共用する場合、ＬＵは論理ボリュームサイズの
制限をそのまま受けることになり、小サイズのＬＵが多
数ホストから見えることになる。このような方法では、
大容量化が進展した場合に運用が困難になるという問題
が生じる。そこで、ディスクアレイスイッチ２０の機能
により、この論理ボリューム（すなわちＬＵ）を統合し
て一つの大きな統合ＬＵを構成することを考える。本実
施形態では、統合ＬＵの作成をディスクアレイスイッチ
２０で実施する。[Ninth Embodiment] Next, the integration of a plurality of LUs included in one disk array subset 10 will be described. For example, in a mainframe disk device, the maximum value of the logical volume size is set to 2 GB in order to maintain compatibility with past systems. When such a disk array system is shared even by an open system, LUs are subject to the restriction of the logical volume size as they are, and many small-sized LUs can be seen from many hosts. In such a way,
There is a problem that operation becomes difficult when the capacity is increased. Thus, it is considered that the logical volumes (that is, LUs) are integrated by the function of the disk array switch 20 to form one large integrated LU. In the present embodiment, the creation of the integrated LU is performed by the disk array switch 20.

【０１４３】本実施形態におけるＬＵの統合は、第１実
施形態における複数のディスクアレイサブセット１０に
よる統合ＬＵの作成と同一である。相違点は、同一のデ
ィスクアレイサブセット１０内の複数ＬＵによる統合で
あることだけである。ディスクアレイシステムとしての
動作は、第１実施形態と全く同一となる。The integration of LUs in this embodiment is the same as the creation of integrated LUs by the plurality of disk array subsets 10 in the first embodiment. The only difference is that the integration is performed by a plurality of LUs in the same disk array subset 10. The operation of the disk array system is exactly the same as in the first embodiment.

【０１４４】このように、同一のディスクアレイサブセ
ット１０に包含される複数のＬＵを統合して一つの大き
なＬＵを作成することで、ホストから多数のＬＵを管理
する必要がなくなり、運用性に優れ、管理コストを低減
したディスクアレイシステムを構築できる。As described above, by integrating a plurality of LUs included in the same disk array subset 10 to create one large LU, there is no need to manage a large number of LUs from the host, resulting in excellent operability. Thus, a disk array system with reduced management costs can be constructed.

【０１４５】［第１０実施形態］次に、ディスクアレイ
スイッチ１０による交代パスの設定方法について、図２
６を参照しつつ説明する。[Tenth Embodiment] Next, an alternate path setting method by the disk array switch 10 will be described with reference to FIG.
This will be described with reference to FIG.

【０１４６】図２６に示された計算機システムにおける
各部の構成は、第１の実施形態と同様である。ここで
は、２台のホスト３０が、各々異なるディスクアレイＩ
／Ｆ２１を用いてディスクアレイサブセット１０をアク
セスするとように構成していると仮定する。図では、デ
ィスクアレイサブセット、ディスクアレイスイッチ２０
のホストＩ／Ｆノード２０３およびディスクアレイＩ／
Ｆノード２０２は、ここでの説明に必要な数しか示され
ていない。The configuration of each unit in the computer system shown in FIG. 26 is the same as in the first embodiment. Here, two hosts 30 are connected to different disk arrays I
It is assumed that the disk array subset 10 is accessed using / F21. In the figure, the disk array subset, the disk array switch 20
Host I / F node 203 and disk array I / F
Only the number of F-nodes 202 required for the description here is shown.

【０１４７】ディスクアレイサブセット１０は、図２と
同様の構成を有し、２つのディスクアレイＩ／Ｆコント
ローラはそれぞれ１台のディスクアレイスイッチ２０に
接続している。ディスクアレイスイッチ２０の各ノード
のＤＣＴ２２７には、ディスクアレイＩ／Ｆ２１の交代
パスが設定される。交代パスとは、ある一つのパスに障
害が発生した場合にもアクセス可能になるように設けら
れる代替のパスのことである。ここでは、ディスクアレ
イＩ／Ｆ“＃０”の交替パスをディスクアレイＩ／Ｆ
“＃１”、ディスクアレイＩ／Ｆ“＃１”の交替パスを
ディスクアレイＩ／Ｆ“＃０”として設定しておく。同
様に、ディスクアレイサブセット１０内の上位アダプタ
間、キャッシュ・交代メモリ間、下位アダプタ間のそれ
ぞれについても交代パスを設定しておく。The disk array subset 10 has a configuration similar to that of FIG. 2, and two disk array I / F controllers are connected to one disk array switch 20, respectively. An alternate path of the disk array I / F 21 is set in the DCT 227 of each node of the disk array switch 20. An alternate path is an alternative path that is provided so that it can be accessed even when a failure occurs in one path. Here, the alternate path of the disk array I / F “# 0” is set as the disk array I / F.
An alternate path between “# 1” and the disk array I / F “# 1” is set as the disk array I / F “# 0”. Similarly, an alternate path is set for each of the upper-level adapters, the cache / alternate memory, and the lower-level adapters in the disk array subset 10.

【０１４８】次に、図２６に示すように、ディスクアレ
イサブセット１の上位アダプタ“＃１”に接続するディ
スクアレイＩ／Ｆ２１が断線し、障害が発生したと仮定
して、交替パスの設定動作を説明する。このとき、障害
が発生したディスクアレイＩ／Ｆ２１を利用しているホ
スト“＃１”は、ディスクアレイサブセット１０にアク
セスできなくなる。ディスクアレイスイッチ２０は、デ
ィスクアレイサブセット１０との間のフレーム転送の異
常を検出し、リトライ処理を実施しても回復しない場
合、このパスに障害が発生したと認識する。Next, as shown in FIG. 26, assuming that the disk array I / F 21 connected to the upper adapter “# 1” of the disk array subset 1 has been disconnected and a fault has occurred, an alternate path setting operation is performed. Will be described. At this time, the host “# 1” using the failed disk array I / F 21 cannot access the disk array subset 10. The disk array switch 20 detects an abnormality in frame transfer with the disk array subset 10 and, if recovery is not possible even after retry processing, recognizes that a failure has occurred in this path.

【０１４９】パスの障害が発生すると、ＳＰ２０２１
は、ＤＣＴ２０２７にディスクアレイＩ／Ｆ“＃１”に
障害が発生したことを登録し、交代パスとしてディスク
アレイＩ／Ｆ“＃０”を使用することを登録する。以
降、ホストＩ／Ｆノード２０３のＳＣ２０２２は、ホス
ト“＃１”からのフレームをディスクアレイＩ／Ｆ“＃
０”に接続するディスクアレイＩ／Ｆノード２０２に転
送するように動作する。When a path failure occurs, the SP 2021
Registers in the DCT 2027 that a failure has occurred in the disk array I / F "# 1" and that the disk array I / F "# 0" is to be used as an alternate path. Thereafter, the SC 2022 of the host I / F node 203 transmits the frame from the host “# 1” to the disk array I / F “#”.
It operates to transfer the data to the disk array I / F node 202 connected to “0”.

【０１５０】ディスクアレイサブセット１０の上位アダ
プタ１０１は、ホスト“＃１”からのコマンドを引き継
いで処理する。また、ディスクアレイスイッチ２０は、
ディスクアレイシステム構成管理手段７０に障害の発生
を通知し、ディスクアレイシステム構成管理手段７０に
より管理者に障害の発生が通報される。The host adapter 101 of the disk array subset 10 takes over the command from the host “# 1” and processes it. In addition, the disk array switch 20
The occurrence of the failure is notified to the disk array system configuration management means 70, and the occurrence of the failure is reported to the administrator by the disk array system configuration management means 70.

【０１５１】本実施形態によれば、パスに障害が発生し
た際の交替パスへの切り替えを、ホスト側に認識させる
ことなく行うことができ、ホスト側の交代処理設定を不
要にできる。これにより、システムの可用性を向上させ
ることができる。According to the present embodiment, switching to an alternate path when a failure occurs in a path can be performed without causing the host to recognize the path, and the setting of the alternate processing on the host can be made unnecessary. Thereby, the availability of the system can be improved.

【０１５２】以上説明した各実施形態では、記憶メディ
アとして、すべてディスク装置を用いたディスクアレイ
システムについて説明した。しかし、本発明は、これに
限定されるものではなく、記憶メディアとしてディスク
装置に限らず、光ディスク装置、テープ装置、ＤＶＤ装
置、半導体記憶装置等を用いた場合にも同様に適用でき
る。In each of the embodiments described above, a disk array system using all disk devices as storage media has been described. However, the present invention is not limited to this, and can be similarly applied to a case where an optical disk device, a tape device, a DVD device, a semiconductor storage device, or the like is used as a storage medium, without being limited to a disk device.

【０１５３】[0153]

【発明の効果】本発明によれば、計算機システムの規
模、要求などに応じた記憶装置システムの拡張、信頼性
の向上などを容易に実現することのできる記憶装置シス
テムを実現することができる。According to the present invention, it is possible to realize a storage system capable of easily realizing expansion of the storage system and improvement of reliability according to the scale and requirements of the computer system.

【図面の簡単な説明】[Brief description of the drawings]

【図１】第１実施形態のコンピュータシステムの構成図
である。FIG. 1 is a configuration diagram of a computer system according to a first embodiment.

【図２】第１実施形態のディスクアレイサブセットの構
成図である。FIG. 2 is a configuration diagram of a disk array subset according to the first embodiment.

【図３】第１実施形態のディスクアレイスイッチの構成
図である。FIG. 3 is a configuration diagram of a disk array switch according to the first embodiment.

【図４】第１実施形態におけるディスクアレイスイッチ
のクロスバスイッチの構成図である。FIG. 4 is a configuration diagram of a crossbar switch of the disk array switch according to the first embodiment.

【図５】第１実施形態におけるディスクアレイスイッチ
のホストＩ／Ｆノードの構成図である。FIG. 5 is a configuration diagram of a host I / F node of the disk array switch according to the first embodiment.

【図６】システム構成テーブルの構成図である。FIG. 6 is a configuration diagram of a system configuration table.

【図７】サブセット構成テーブルの構成図である。FIG. 7 is a configuration diagram of a subset configuration table.

【図８】ファイバチャネルのフレームの構成図である。FIG. 8 is a configuration diagram of a fiber channel frame.

【図９】ファイバチャネルのフレームヘッダの構成図で
ある。FIG. 9 is a configuration diagram of a fiber channel frame header.

【図１０】ファイバチャネルのフレームペイロードの構
成図である。FIG. 10 is a configuration diagram of a fiber channel frame payload.

【図１１】ホストからのリード動作時にファイバチャネ
ルを通して転送されるフレームのシーケンスを示す模式
図である。FIG. 11 is a schematic diagram showing a sequence of a frame transferred through a fiber channel at the time of a read operation from a host.

【図１２】ホストＬＵ、各ディスクアレイサブセットの
ＬＵ、及び各ディスクユニットの対応関係を示す模式図
である。FIG. 12 is a schematic diagram showing the correspondence between host LUs, LUs of each disk array subset, and disk units.

【図１３】ライト処理時のホストＩ／Ｆノードにおける
処理のフローチャートである。FIG. 13 is a flowchart of a process in the host I / F node at the time of a write process.

【図１４】スイッチングパケットの構成図である。FIG. 14 is a configuration diagram of a switching packet.

【図１５】複数のディスクアレイスイッチをクラスタ接
続したディスクアレイシステムの構成図である。FIG. 15 is a configuration diagram of a disk array system in which a plurality of disk array switches are connected in a cluster.

【図１６】第２実施形態におけるコンピュータシステム
の構成図である。FIG. 16 is a configuration diagram of a computer system according to a second embodiment.

【図１７】第４実施形態におけるディスクアレイスイッ
チのインタフェースコントローラの構成図である。FIG. 17 is a configuration diagram of an interface controller of a disk array switch according to a fourth embodiment.

【図１８】第５実施形態におけるコンピュータシステム
の構成図である。FIG. 18 is a configuration diagram of a computer system according to a fifth embodiment.

【図１９】論理接続構成画面の表示例を示す画面構成図
である。FIG. 19 is a screen configuration diagram illustrating a display example of a logical connection configuration screen.

【図２０】第６実施形態におけるフレームシーケンスを
示す模式図である。FIG. 20 is a schematic diagram showing a frame sequence in the sixth embodiment.

【図２１】第６実施形態のミラーリングライト処理時の
ホストＩ／Ｆノードにおける処理のフローチャートであ
る。FIG. 21 is a flowchart of a process in a host I / F node during a mirroring write process according to a sixth embodiment.

【図２２】第６実施形態のミラーリングライト処理時の
ホストＩ／Ｆノードにおける処理のフローチャートであ
る。FIG. 22 is a flowchart of a process in a host I / F node during a mirroring write process according to a sixth embodiment.

【図２３】第７実施形態におけるホストＬＵと各ディス
クアレイサブセットのＬＵとの対応関係を示す模式図で
ある。FIG. 23 is a schematic diagram showing the correspondence between host LUs and LUs of each disk array subset in the seventh embodiment.

【図２４】第７実施形態におけるホストＩ／Ｆノードの
処理を示すフローチャートである。FIG. 24 is a flowchart illustrating processing of a host I / F node according to the seventh embodiment.

【図２５】第８実施形態におけるディザスタリカバリシ
ステムの構成図である。FIG. 25 is a configuration diagram of a disaster recovery system according to an eighth embodiment.

【図２６】交替パスの設定についての説明図である。FIG. 26 is an explanatory diagram of setting of a replacement path.

【符号の説明】[Explanation of symbols]

１…ディスクアレイシステム、５…管理端末、１０…デ
ィスクアレイサブセット、２０…ディスクアレイスイッ
チ、３０…ホストコンピュータ、７０…ディスクアレイ
システム構成管理手段、２００…管理プロセッサ、２０
１…クロスバスイッチ、２０２…ディスクアレイＩ／Ｆ
ノード、２０３…ホストＩ／Ｆノード、２０４…通信コ
ントローラ。DESCRIPTION OF SYMBOLS 1 ... Disk array system, 5 ... Management terminal, 10 ... Disk array subset, 20 ... Disk array switch, 30 ... Host computer, 70 ... Disk array system configuration management means, 200 ... Management processor, 20
1: Crossbar switch, 202: Disk array I / F
Node 203: Host I / F node 204: Communication controller

───────────────────────────────────────────────────── フロントページの続き (72)発明者山本彰神奈川県川崎市麻生区王禅寺1099番地株式会社日立製作所システム開発研究所内 (72)発明者味松康行神奈川県川崎市麻生区王禅寺1099番地株式会社日立製作所システム開発研究所内 (72)発明者佐藤雅彦神奈川県小田原市国府津2880番地株式会社日立製作所ストレージシステム事業部内 ──────────────────────────────────────────────────続き Continued on the front page (72) Inventor Akira Yamamoto 1099 Ozenji Temple, Aso-ku, Kawasaki City, Kanagawa Prefecture Inside Hitachi, Ltd.System Development Laboratory (72) Inventor Yasuyuki Ajimatsu 1099 Ozenji Temple, Aso-ku, Kawasaki City, Kanagawa Prefecture (72) Inventor Masahiko Sato 2880 Kozu, Odawara City, Kanagawa Prefecture Storage Systems Division, Hitachi, Ltd.