

技术领域technical field
本发明涉及计算机存储测试领域,具体涉及一种保护系统盘raid的方法。The invention relates to the field of computer storage testing, in particular to a method for protecting system disk raid.
背景技术Background technique
目前诸多系统盘raid都存在一个致命的问题,就是如果磁盘损坏不及时修复的话,很可能会引起系统的彻底崩溃。而在raid发生故障时,一般存储都会有告警方式,比如控制器蜂鸣器报警、盘柜会有红灯报警、并会伴随日至报警,但机器如果是在远程机房,用户又没有及时的去查看日志,没有及时的去修复故障,那么很可能会造成raid的进一步降级,导致不可恢复性的损坏.本发明就是解决这种问题,通过对raid状态的识别,来强制用户进行raid修复。At present, many system disk raids have a fatal problem, that is, if the disk damage is not repaired in time, it may cause a complete system crash. When the raid fails, the general storage will have alarm methods, such as the controller buzzer alarm, the cabinet will have a red light alarm, and will be accompanied by an alarm at the end of the day, but if the machine is in a remote computer room, the user does not have timely warning To check the log, if the failure is not repaired in time, it is likely to cause further degradation of the raid, resulting in irrecoverable damage. The present invention solves this problem by forcing the user to repair the raid by identifying the raid status.
发明内容Contents of the invention
本发明的目的是提供一种保护系统盘raid的方法。The purpose of the invention is to provide a method for protecting system disk raid.
本发明的目的是按以下方式实现的,系统包括检测模块和操作模块,检测模块检测系统盘raid的状态,操作模块,根据检测模块反馈的状态关闭或恢复系统的运行,检测模块根据收集到raid信息,发出强制的恢复指令给操作模块,从而强制用户进行恢复操作,否则系统将保持关闭,具体步骤如下:The purpose of the present invention is achieved in the following manner, the system includes a detection module and an operation module, the detection module detects the state of the system disk raid, and the operation module closes or resumes the operation of the system according to the state fed back by the detection module, and the detection module collects the raid according to information, send a forced recovery command to the operation module, thereby forcing the user to perform the recovery operation, otherwise the system will remain closed, the specific steps are as follows:
系统先利用raid卡的API提供的接口信息来获取掉盘的raid degrade信息,包括通过接收raid卡发出的报警日志来判断状态或通过提供的API接口由检测模块自己去检测raid状态,检测模块在获取到raid掉盘信息后,自动触发下一个操作,给正在使用系统的用户发出警告,如果用户不及时处理,那么操作模块将对用户系统强制暂停,进行关机操作,如果这时候用户去强行开机,操作模块拒绝开机,只有当用户真正的把坏掉的硬盘更换后,恢复raid状态后,用户才可以正常开机。The system first uses the interface information provided by the API of the raid card to obtain the raid degrade information of the lost disk, including judging the status by receiving the alarm log sent by the raid card, or detecting the raid status by the detection module itself through the provided API interface. After obtaining the raid disk loss information, it will automatically trigger the next operation and issue a warning to the user who is using the system. If the user does not deal with it in time, the operation module will force the user system to suspend and shut down the system. If the user tries to force the system to start , the operation module refuses to boot, and only after the user actually replaces the broken hard disk and restores the raid state, the user can boot normally.
本发明的有益效果是:解决了由于用户疏忽或者网络管理员素质不高,由于疏于管理,所导致的无法挽回的数据损失。因为本发明的办法是强制用户进行故障处理和恢复,这大大加强了系统的可靠性和稳定性简化了raid错误报警机制,以往在发生raid错误时,总有一大队的日志等着去解读,然后在判断怎么去做。而本发明将这些过程简化,只需要用户按提示进行故障恢复就行。The beneficial effect of the present invention is that it solves the irreparable data loss caused by negligence of users or low quality of network administrators and negligent management. Because the method of the present invention is to force the user to carry out troubleshooting and recovery, this greatly strengthens the reliability and stability of the system and simplifies the raid error alarm mechanism. In the past, when a raid error occurred, there was always a large team of logs waiting to be interpreted, judging what to do. However, the present invention simplifies these processes, and only needs the user to perform fault recovery according to the prompts.
附图说明Description of drawings
图1是系统流程图;Fig. 1 is a system flow chart;
图2是系统结构示意图。Figure 2 is a schematic diagram of the system structure.
具体实施方式Detailed ways
参照说明书附图对本发明的方法作以下详细地说明。The method of the present invention is described in detail below with reference to the accompanying drawings.
本发明的一种保护系统盘raid的方法,是首先利用raid卡的API提供的接口信息来获取掉盘的raid degrade信息,这个可以通过多种方式:可以通过接收raid卡发出的报警日志来判断状态;当然也可以通过提供的API接口由检测模块自己去检测raid状态.检测模块在获取到raid掉盘信息后,并不像常规的操作那样来抛出故障,而是自动触发下一个操作,也就是给正在使用系统的用户发出警告,如果用户不及时处理,那么操作模块将对用户系统强制暂停,进行关机操作,如果这时候用户去强行开机,操作模块都是不允许去开机的,只有当用户真正的把坏掉的硬盘更坏后,恢复raid状态后,用户才可以正常开机。A method for protecting the system disk raid of the present invention is to firstly utilize the interface information provided by the API of the raid card to obtain the raid degrade information of the lost disk. This can be done in a variety of ways: it can be judged by receiving the alarm log sent by the raid card Status; of course, the detection module can also detect the raid status by itself through the provided API interface. After the detection module obtains the raid disk loss information, it does not throw a fault like a conventional operation, but automatically triggers the next operation. That is to issue a warning to users who are using the system. If the user does not deal with it in time, the operation module will forcefully suspend the user system and perform a shutdown operation. After the user actually makes the broken hard disk worse and restores the raid state, the user can start the system normally.
实施例Example
如图所示,通过raid卡的API接口信息,可以获取目前raid的状态信息,如果是degrade的状态,检测模块在收到信息后,会发送强制关闭系统的指令给操作模块。As shown in the figure, the current status information of the raid can be obtained through the API interface information of the raid card. If it is in the degrade state, the detection module will send an instruction to forcibly shut down the system to the operation module after receiving the information.
操作模块对系统执行强制关闭的操作,来迫使用户进行系统恢复;如果不恢复用户将无法进入系统,等用户恢复后,通过RAID卡API反馈的新的raid状态信息,检测模块来发动开启系统的指令,这时候用户才可以进入系统.The operation module executes the operation of forcibly shutting down the system to force the user to restore the system; if the user does not restore the system, the user will not be able to enter the system. After the user recovers, the new raid status information fed back by the RAID card API will be used by the detection module to initiate the start of the system. command, the user can enter the system at this time.
可以通过在OS下内嵌检测模块和操作模块软件来实现,检测模块负责与raid卡进行交互连接,时时检测和交换raid信息。操作模块和OS进行绑定,并根据检测模块的输入进行对应的操作。It can be realized by embedding the detection module and operation module software under the OS. The detection module is responsible for interactive connection with the raid card, and detects and exchanges raid information from time to time. The operation module is bound with the OS, and performs corresponding operations according to the input of the detection module.
除说明书所述的技术特征外,均为本专业技术人员的已知技术。Except for the technical features described in the instructions, all are known technologies by those skilled in the art.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201210116418XACN102662787A (en) | 2012-04-20 | 2012-04-20 | Method for protecting system disk RAID (redundant array of independent disks) |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201210116418XACN102662787A (en) | 2012-04-20 | 2012-04-20 | Method for protecting system disk RAID (redundant array of independent disks) |
| Publication Number | Publication Date |
|---|---|
| CN102662787Atrue CN102662787A (en) | 2012-09-12 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201210116418XAPendingCN102662787A (en) | 2012-04-20 | 2012-04-20 | Method for protecting system disk RAID (redundant array of independent disks) |
| Country | Link |
|---|---|
| CN (1) | CN102662787A (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103049345A (en)* | 2012-12-10 | 2013-04-17 | 北京百度网讯科技有限公司 | Magnetic disk state transition detection method and device based on asynchronous communication mechanism |
| CN103207820A (en)* | 2013-02-05 | 2013-07-17 | 北京百度网讯科技有限公司 | Method and device for fault positioning of hard disk on basis of raid card log |
| CN103995772A (en)* | 2014-06-10 | 2014-08-20 | 浪潮电子信息产业股份有限公司 | RAID card log completely-storing method based on LINUX operation system |
| CN104679623A (en)* | 2013-11-29 | 2015-06-03 | 中国移动通信集团公司 | Server hard disk maintaining method, system and server monitoring equipment |
| CN105045689A (en)* | 2015-06-25 | 2015-11-11 | 浪潮电子信息产业股份有限公司 | Method for monitoring and alarming hard disks by using RAID card batch detection |
| CN106021065A (en)* | 2016-05-19 | 2016-10-12 | 浪潮电子信息产业股份有限公司 | Method for automatically detecting bad track information of raid disk under linux |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030204788A1 (en)* | 2002-04-29 | 2003-10-30 | International Business Machines Corporation | Predictive failure analysis for storage networks |
| CN1808365A (en)* | 2005-01-17 | 2006-07-26 | 英业达股份有限公司 | Automatic Reconstruction Method of Disk Redundant Array Device |
| US20080040540A1 (en)* | 2006-08-11 | 2008-02-14 | Intel Corporation | On-disk caching for raid systems |
| CN201546683U (en)* | 2009-09-28 | 2010-08-11 | 高建军 | Radio-alarming self-stopping device of pumping unit when in breaking or disconnection of polish rod, sand plugging or wax plugging |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030204788A1 (en)* | 2002-04-29 | 2003-10-30 | International Business Machines Corporation | Predictive failure analysis for storage networks |
| CN1808365A (en)* | 2005-01-17 | 2006-07-26 | 英业达股份有限公司 | Automatic Reconstruction Method of Disk Redundant Array Device |
| US20080040540A1 (en)* | 2006-08-11 | 2008-02-14 | Intel Corporation | On-disk caching for raid systems |
| CN201546683U (en)* | 2009-09-28 | 2010-08-11 | 高建军 | Radio-alarming self-stopping device of pumping unit when in breaking or disconnection of polish rod, sand plugging or wax plugging |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103049345A (en)* | 2012-12-10 | 2013-04-17 | 北京百度网讯科技有限公司 | Magnetic disk state transition detection method and device based on asynchronous communication mechanism |
| CN103049345B (en)* | 2012-12-10 | 2015-11-25 | 北京百度网讯科技有限公司 | Based on Disk State transition detection method and the device of asynchronous mechanism |
| CN103207820A (en)* | 2013-02-05 | 2013-07-17 | 北京百度网讯科技有限公司 | Method and device for fault positioning of hard disk on basis of raid card log |
| CN103207820B (en)* | 2013-02-05 | 2016-06-29 | 北京百度网讯科技有限公司 | The Fault Locating Method of hard disk and device based on raid card log |
| CN104679623A (en)* | 2013-11-29 | 2015-06-03 | 中国移动通信集团公司 | Server hard disk maintaining method, system and server monitoring equipment |
| CN103995772A (en)* | 2014-06-10 | 2014-08-20 | 浪潮电子信息产业股份有限公司 | RAID card log completely-storing method based on LINUX operation system |
| CN105045689A (en)* | 2015-06-25 | 2015-11-11 | 浪潮电子信息产业股份有限公司 | Method for monitoring and alarming hard disks by using RAID card batch detection |
| CN106021065A (en)* | 2016-05-19 | 2016-10-12 | 浪潮电子信息产业股份有限公司 | Method for automatically detecting bad track information of raid disk under linux |
| Publication | Publication Date | Title |
|---|---|---|
| CN107179957B (en) | Physical machine fault classification processing method and device and virtual machine recovery method and system | |
| CN102662787A (en) | Method for protecting system disk RAID (redundant array of independent disks) | |
| CN102279775B (en) | Method for processing failure of hard disk under Linux system | |
| CN104536855B (en) | Fault detection method and device | |
| US20190278651A1 (en) | Methods And Systems For Detecting And Capturing Host System Hang Events | |
| CN101221508B (en) | Method and device for starting equipment | |
| KR101712172B1 (en) | The preliminary diagnosis and analysis and recovery system of computer error, and method thereof | |
| CN102880522B (en) | Hardware fault-oriented method and device for correcting faults in key files of system | |
| CN106682162B (en) | Log management method and device | |
| CN105045689A (en) | Method for monitoring and alarming hard disks by using RAID card batch detection | |
| US8984335B2 (en) | Core diagnostics and repair | |
| CN107329877B (en) | Air ticket business monitoring and executing system and method | |
| CN106776128A (en) | Method for ensuring normal start of Linux operating system | |
| CN103019885A (en) | Method and system for monitoring embedded Linux-based hard disc bad track | |
| CN104714863A (en) | Method for completely storing Raid card logs on basis of Linux operation system after system crashes | |
| CN107590042A (en) | A kind of server method for testing open/close machine and system based on linux system | |
| CN110187841A (en) | A kind of method, apparatus and storage server of system management memory disk | |
| CN111581021B (en) | Application program starting abnormality repairing method, device, equipment and storage medium | |
| CN101145983B (en) | A self-diagnosis and self-discovery subsystem and method of network management system | |
| CN105095017A (en) | Partition table based hard disk repair method and apparatus | |
| CN103049343B (en) | Operating system blue screen restoration methods and device | |
| CN102541682A (en) | Method for restoring abnormal programs in embedded system quickly and automatically | |
| CN102369513A (en) | Method for improving stability of computer system and computer system | |
| CN105217389B (en) | The method and system of elevator spare part remote management | |
| CN101324859A (en) | Method for detecting hard disk hardware fault |
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication | Application publication date:20120912 |