Movatterモバイル変換


[0]ホーム

URL:


CN102662787A - Method for protecting system disk RAID (redundant array of independent disks) - Google Patents

Method for protecting system disk RAID (redundant array of independent disks)
Download PDF

Info

Publication number
CN102662787A
CN102662787ACN201210116418XACN201210116418ACN102662787ACN 102662787 ACN102662787 ACN 102662787ACN 201210116418X ACN201210116418X ACN 201210116418XACN 201210116418 ACN201210116418 ACN 201210116418ACN 102662787 ACN102662787 ACN 102662787A
Authority
CN
China
Prior art keywords
raid
user
detection module
information
state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210116418XA
Other languages
Chinese (zh)
Inventor
孙磊
李瑞东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
IEIT Systems Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co LtdfiledCriticalInspur Electronic Information Industry Co Ltd
Priority to CN201210116418XApriorityCriticalpatent/CN102662787A/en
Publication of CN102662787ApublicationCriticalpatent/CN102662787A/en
Pendinglegal-statusCriticalCurrent

Links

Images

Landscapes

Abstract

Translated fromChinese

本发明提供一种保护系统盘raid的方法,系统包括检测模块和操作模块,检测模块检测系统盘raid的状态,操作模块,根据检测模块反馈的状态关闭或恢复系统的运行,检测模块根据收集到raid信息,发出强制的恢复指令给操作模块,从而强制用户进行恢复操作,否则系统将保持关闭,具体步骤如下:系统先利用raid卡的API提供的接口信息来获取掉盘的raiddegrade信息,包括通过接收raid卡发出的报警日志来判断状态或通过提供的API接口由检测模块自己去检测raid状态,检测模块在获取到raid掉盘信息后,自动触发下一个操作,给正在使用系统的用户发出警告,如果用户不及时处理,那么操作模块将对用户系统强制暂停,进行关机操作,如果这时候用户去强行开机,操作模块拒绝开机,只有当用户真正的把坏掉的硬盘更换后,恢复raid状态后,用户才可以正常开机。

Figure 201210116418

The invention provides a method for protecting system disk raid. The system includes a detection module and an operation module. The detection module detects the state of the system disk raid. Raid information, send a mandatory recovery command to the operation module, thereby forcing the user to perform recovery operations, otherwise the system will remain closed, the specific steps are as follows: the system first uses the interface information provided by the raid card API to obtain the raiddegrade information of the lost disk, including through Receive the alarm log sent by the raid card to judge the status or use the provided API interface to detect the raid status by the detection module itself. After the detection module obtains the information of the raid disk failure, it will automatically trigger the next operation and issue a warning to the users who are using the system , if the user does not deal with it in time, the operation module will force the user system to suspend and shut down the system. If the user tries to force the system to start at this time, the operation module will refuse to start the system. Only when the user actually replaces the broken hard disk, the raid state will be restored. After that, the user can boot normally.

Figure 201210116418

Description

Translated fromChinese
一种保护系统盘raid的方法A method of protecting system disk raid

技术领域technical field

本发明涉及计算机存储测试领域,具体涉及一种保护系统盘raid的方法。The invention relates to the field of computer storage testing, in particular to a method for protecting system disk raid.

背景技术Background technique

目前诸多系统盘raid都存在一个致命的问题,就是如果磁盘损坏不及时修复的话,很可能会引起系统的彻底崩溃。而在raid发生故障时,一般存储都会有告警方式,比如控制器蜂鸣器报警、盘柜会有红灯报警、并会伴随日至报警,但机器如果是在远程机房,用户又没有及时的去查看日志,没有及时的去修复故障,那么很可能会造成raid的进一步降级,导致不可恢复性的损坏.本发明就是解决这种问题,通过对raid状态的识别,来强制用户进行raid修复。At present, many system disk raids have a fatal problem, that is, if the disk damage is not repaired in time, it may cause a complete system crash. When the raid fails, the general storage will have alarm methods, such as the controller buzzer alarm, the cabinet will have a red light alarm, and will be accompanied by an alarm at the end of the day, but if the machine is in a remote computer room, the user does not have timely warning To check the log, if the failure is not repaired in time, it is likely to cause further degradation of the raid, resulting in irrecoverable damage. The present invention solves this problem by forcing the user to repair the raid by identifying the raid status.

发明内容Contents of the invention

本发明的目的是提供一种保护系统盘raid的方法。The purpose of the invention is to provide a method for protecting system disk raid.

本发明的目的是按以下方式实现的,系统包括检测模块和操作模块,检测模块检测系统盘raid的状态,操作模块,根据检测模块反馈的状态关闭或恢复系统的运行,检测模块根据收集到raid信息,发出强制的恢复指令给操作模块,从而强制用户进行恢复操作,否则系统将保持关闭,具体步骤如下:The purpose of the present invention is achieved in the following manner, the system includes a detection module and an operation module, the detection module detects the state of the system disk raid, and the operation module closes or resumes the operation of the system according to the state fed back by the detection module, and the detection module collects the raid according to information, send a forced recovery command to the operation module, thereby forcing the user to perform the recovery operation, otherwise the system will remain closed, the specific steps are as follows:

系统先利用raid卡的API提供的接口信息来获取掉盘的raid degrade信息,包括通过接收raid卡发出的报警日志来判断状态或通过提供的API接口由检测模块自己去检测raid状态,检测模块在获取到raid掉盘信息后,自动触发下一个操作,给正在使用系统的用户发出警告,如果用户不及时处理,那么操作模块将对用户系统强制暂停,进行关机操作,如果这时候用户去强行开机,操作模块拒绝开机,只有当用户真正的把坏掉的硬盘更换后,恢复raid状态后,用户才可以正常开机。The system first uses the interface information provided by the API of the raid card to obtain the raid degrade information of the lost disk, including judging the status by receiving the alarm log sent by the raid card, or detecting the raid status by the detection module itself through the provided API interface. After obtaining the raid disk loss information, it will automatically trigger the next operation and issue a warning to the user who is using the system. If the user does not deal with it in time, the operation module will force the user system to suspend and shut down the system. If the user tries to force the system to start , the operation module refuses to boot, and only after the user actually replaces the broken hard disk and restores the raid state, the user can boot normally.

本发明的有益效果是:解决了由于用户疏忽或者网络管理员素质不高,由于疏于管理,所导致的无法挽回的数据损失。因为本发明的办法是强制用户进行故障处理和恢复,这大大加强了系统的可靠性和稳定性简化了raid错误报警机制,以往在发生raid错误时,总有一大队的日志等着去解读,然后在判断怎么去做。而本发明将这些过程简化,只需要用户按提示进行故障恢复就行。The beneficial effect of the present invention is that it solves the irreparable data loss caused by negligence of users or low quality of network administrators and negligent management. Because the method of the present invention is to force the user to carry out troubleshooting and recovery, this greatly strengthens the reliability and stability of the system and simplifies the raid error alarm mechanism. In the past, when a raid error occurred, there was always a large team of logs waiting to be interpreted, judging what to do. However, the present invention simplifies these processes, and only needs the user to perform fault recovery according to the prompts.

附图说明Description of drawings

图1是系统流程图;Fig. 1 is a system flow chart;

图2是系统结构示意图。Figure 2 is a schematic diagram of the system structure.

具体实施方式Detailed ways

参照说明书附图对本发明的方法作以下详细地说明。The method of the present invention is described in detail below with reference to the accompanying drawings.

本发明的一种保护系统盘raid的方法,是首先利用raid卡的API提供的接口信息来获取掉盘的raid degrade信息,这个可以通过多种方式:可以通过接收raid卡发出的报警日志来判断状态;当然也可以通过提供的API接口由检测模块自己去检测raid状态.检测模块在获取到raid掉盘信息后,并不像常规的操作那样来抛出故障,而是自动触发下一个操作,也就是给正在使用系统的用户发出警告,如果用户不及时处理,那么操作模块将对用户系统强制暂停,进行关机操作,如果这时候用户去强行开机,操作模块都是不允许去开机的,只有当用户真正的把坏掉的硬盘更坏后,恢复raid状态后,用户才可以正常开机。A method for protecting the system disk raid of the present invention is to firstly utilize the interface information provided by the API of the raid card to obtain the raid degrade information of the lost disk. This can be done in a variety of ways: it can be judged by receiving the alarm log sent by the raid card Status; of course, the detection module can also detect the raid status by itself through the provided API interface. After the detection module obtains the raid disk loss information, it does not throw a fault like a conventional operation, but automatically triggers the next operation. That is to issue a warning to users who are using the system. If the user does not deal with it in time, the operation module will forcefully suspend the user system and perform a shutdown operation. After the user actually makes the broken hard disk worse and restores the raid state, the user can start the system normally.

实施例Example

如图所示,通过raid卡的API接口信息,可以获取目前raid的状态信息,如果是degrade的状态,检测模块在收到信息后,会发送强制关闭系统的指令给操作模块。As shown in the figure, the current status information of the raid can be obtained through the API interface information of the raid card. If it is in the degrade state, the detection module will send an instruction to forcibly shut down the system to the operation module after receiving the information.

操作模块对系统执行强制关闭的操作,来迫使用户进行系统恢复;如果不恢复用户将无法进入系统,等用户恢复后,通过RAID卡API反馈的新的raid状态信息,检测模块来发动开启系统的指令,这时候用户才可以进入系统.The operation module executes the operation of forcibly shutting down the system to force the user to restore the system; if the user does not restore the system, the user will not be able to enter the system. After the user recovers, the new raid status information fed back by the RAID card API will be used by the detection module to initiate the start of the system. command, the user can enter the system at this time.

可以通过在OS下内嵌检测模块和操作模块软件来实现,检测模块负责与raid卡进行交互连接,时时检测和交换raid信息。操作模块和OS进行绑定,并根据检测模块的输入进行对应的操作。It can be realized by embedding the detection module and operation module software under the OS. The detection module is responsible for interactive connection with the raid card, and detects and exchanges raid information from time to time. The operation module is bound with the OS, and performs corresponding operations according to the input of the detection module.

除说明书所述的技术特征外,均为本专业技术人员的已知技术。Except for the technical features described in the instructions, all are known technologies by those skilled in the art.

Claims (1)

The interface message that system utilizes the API of raid card to provide is earlier obtained the raid degrade information of dish, comprises through receiving the alarm log that the raid card sends coming the judgement state or removing to detect the raid state through the api interface that provides by detection module oneself, and detection module is getting access to after raid falls dish information; Automatically trigger next operation; The user who just gives at using system gives a warning, if the untimely processing of user, operational module will be to the custom system mandatory pause so; Carry out power-off operation; If at this time the user goes to start shooting by force, the start of operational module refusal has only after the hard disk that the real handle of user breaks down is changed; After recovering the raid state, the user just can normal boot-strap.
CN201210116418XA2012-04-202012-04-20Method for protecting system disk RAID (redundant array of independent disks)PendingCN102662787A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201210116418XACN102662787A (en)2012-04-202012-04-20Method for protecting system disk RAID (redundant array of independent disks)

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201210116418XACN102662787A (en)2012-04-202012-04-20Method for protecting system disk RAID (redundant array of independent disks)

Publications (1)

Publication NumberPublication Date
CN102662787Atrue CN102662787A (en)2012-09-12

Family

ID=46772286

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201210116418XAPendingCN102662787A (en)2012-04-202012-04-20Method for protecting system disk RAID (redundant array of independent disks)

Country Status (1)

CountryLink
CN (1)CN102662787A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103049345A (en)*2012-12-102013-04-17北京百度网讯科技有限公司Magnetic disk state transition detection method and device based on asynchronous communication mechanism
CN103207820A (en)*2013-02-052013-07-17北京百度网讯科技有限公司Method and device for fault positioning of hard disk on basis of raid card log
CN103995772A (en)*2014-06-102014-08-20浪潮电子信息产业股份有限公司RAID card log completely-storing method based on LINUX operation system
CN104679623A (en)*2013-11-292015-06-03中国移动通信集团公司Server hard disk maintaining method, system and server monitoring equipment
CN105045689A (en)*2015-06-252015-11-11浪潮电子信息产业股份有限公司Method for monitoring and alarming hard disks by using RAID card batch detection
CN106021065A (en)*2016-05-192016-10-12浪潮电子信息产业股份有限公司Method for automatically detecting bad track information of raid disk under linux

Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20030204788A1 (en)*2002-04-292003-10-30International Business Machines CorporationPredictive failure analysis for storage networks
CN1808365A (en)*2005-01-172006-07-26英业达股份有限公司 Automatic Reconstruction Method of Disk Redundant Array Device
US20080040540A1 (en)*2006-08-112008-02-14Intel CorporationOn-disk caching for raid systems
CN201546683U (en)*2009-09-282010-08-11高建军Radio-alarming self-stopping device of pumping unit when in breaking or disconnection of polish rod, sand plugging or wax plugging

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20030204788A1 (en)*2002-04-292003-10-30International Business Machines CorporationPredictive failure analysis for storage networks
CN1808365A (en)*2005-01-172006-07-26英业达股份有限公司 Automatic Reconstruction Method of Disk Redundant Array Device
US20080040540A1 (en)*2006-08-112008-02-14Intel CorporationOn-disk caching for raid systems
CN201546683U (en)*2009-09-282010-08-11高建军Radio-alarming self-stopping device of pumping unit when in breaking or disconnection of polish rod, sand plugging or wax plugging

Cited By (8)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103049345A (en)*2012-12-102013-04-17北京百度网讯科技有限公司Magnetic disk state transition detection method and device based on asynchronous communication mechanism
CN103049345B (en)*2012-12-102015-11-25北京百度网讯科技有限公司Based on Disk State transition detection method and the device of asynchronous mechanism
CN103207820A (en)*2013-02-052013-07-17北京百度网讯科技有限公司Method and device for fault positioning of hard disk on basis of raid card log
CN103207820B (en)*2013-02-052016-06-29北京百度网讯科技有限公司The Fault Locating Method of hard disk and device based on raid card log
CN104679623A (en)*2013-11-292015-06-03中国移动通信集团公司Server hard disk maintaining method, system and server monitoring equipment
CN103995772A (en)*2014-06-102014-08-20浪潮电子信息产业股份有限公司RAID card log completely-storing method based on LINUX operation system
CN105045689A (en)*2015-06-252015-11-11浪潮电子信息产业股份有限公司Method for monitoring and alarming hard disks by using RAID card batch detection
CN106021065A (en)*2016-05-192016-10-12浪潮电子信息产业股份有限公司Method for automatically detecting bad track information of raid disk under linux

Similar Documents

PublicationPublication DateTitle
CN107179957B (en)Physical machine fault classification processing method and device and virtual machine recovery method and system
CN102662787A (en)Method for protecting system disk RAID (redundant array of independent disks)
CN102279775B (en)Method for processing failure of hard disk under Linux system
CN104536855B (en)Fault detection method and device
US20190278651A1 (en)Methods And Systems For Detecting And Capturing Host System Hang Events
CN101221508B (en) Method and device for starting equipment
KR101712172B1 (en)The preliminary diagnosis and analysis and recovery system of computer error, and method thereof
CN102880522B (en)Hardware fault-oriented method and device for correcting faults in key files of system
CN106682162B (en)Log management method and device
CN105045689A (en)Method for monitoring and alarming hard disks by using RAID card batch detection
US8984335B2 (en)Core diagnostics and repair
CN107329877B (en)Air ticket business monitoring and executing system and method
CN106776128A (en)Method for ensuring normal start of Linux operating system
CN103019885A (en)Method and system for monitoring embedded Linux-based hard disc bad track
CN104714863A (en)Method for completely storing Raid card logs on basis of Linux operation system after system crashes
CN107590042A (en)A kind of server method for testing open/close machine and system based on linux system
CN110187841A (en)A kind of method, apparatus and storage server of system management memory disk
CN111581021B (en)Application program starting abnormality repairing method, device, equipment and storage medium
CN101145983B (en)A self-diagnosis and self-discovery subsystem and method of network management system
CN105095017A (en)Partition table based hard disk repair method and apparatus
CN103049343B (en)Operating system blue screen restoration methods and device
CN102541682A (en)Method for restoring abnormal programs in embedded system quickly and automatically
CN102369513A (en)Method for improving stability of computer system and computer system
CN105217389B (en)The method and system of elevator spare part remote management
CN101324859A (en)Method for detecting hard disk hardware fault

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
C02Deemed withdrawal of patent application after publication (patent law 2001)
WD01Invention patent application deemed withdrawn after publication

Application publication date:20120912


[8]ページ先頭

©2009-2025 Movatter.jp