CN102170460A - Cluster storage system and data storage method thereof - Google Patents

Cluster storage system and data storage method thereof

Info

Publication number
CN102170460A
Authority
CN
China
Prior art keywords
file
data
value
preferred value
storage device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011100575768A
Other languages
Chinese (zh)
Inventor
刘家驹
张立强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Beijing Electronic Information Industry Co Ltd
Original Assignee
Inspur Beijing Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Beijing Electronic Information Industry Co Ltd
Priority to CN2011100575768A
Publication of CN102170460A
Legal status: Pending

Abstract

The invention discloses a cluster storage system and a data storage method thereof, and relates to cluster storage systems. The disclosed system comprises a shared storage device, and a data grading server and a DRBD located on a meta node, wherein: the data grading server determines the priority value of each file in the shared storage device and uploads the data of files whose priority value is greater than a set value to the DRBD; the DRBD receives and stores the data of the files uploaded by the data grading server. Embodiments of the invention adopt a hybrid storage architecture that combines the low cost and large capacity of centralized storage with the high reliability of distributed storage, and build a data extraction model to classify and place data, which eases data management, improves the disaster tolerance of the whole cluster, and provides an effective guarantee for the safe operation of electronic information systems.

Figure 201110057576

Description

Cluster storage system and data storage method thereof
Technical field
The present invention relates to cluster storage systems, and in particular to a cluster storage system and a data storage method thereof.
Background
At present, the traditional HA (high availability) architecture adopted by many enterprises and institutions has certain defects. For example, two minicomputers paired with one disk array form a cluster system in which all information data is stored on that disk array, so only one copy exists; once the disk array fails, the whole business system faces the danger of going down. Distributed storage can guarantee backups, but its disk utilization is low and its capacity is limited by cost. Thus, to achieve high availability of the business, high availability of storage must be guaranteed first; in other words, a business system that lacks highly available storage cannot achieve real high availability. In view of this situation, a high-availability storage solution is proposed.
Summary of the invention
The technical problem to be solved by the invention is how to improve the disaster tolerance of a cluster system. To that end, a cluster storage system and a data storage method thereof are provided.
To solve the above problem, the invention discloses a cluster storage system comprising a shared storage device, and a data grading server and a DRBD located on a meta node, wherein:
The data grading server determines the priority value of each file in the shared storage device, and uploads the data of files whose priority value is greater than a set value to the DRBD;
The DRBD receives and stores the data of the files uploaded by the data grading server.
Preferably, in the above system, the data grading server determining the priority value of each file in the shared storage device means:
The data grading server takes the sum of the parameter values of a file in the shared storage device as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
Alternatively, in the above system, the data grading server determining the priority value of each file in the shared storage device means:
The data grading server determines a weight for each parameter value of a file in the shared storage device, takes the product of each parameter value and its corresponding weight as a priority value calculation parameter, and takes the sum of all priority value calculation parameters as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
Preferably, at least two meta nodes in the above cluster storage system each have a data grading server and a DRBD.
The invention also discloses a data storage method for the cluster storage system described above, comprising:
The cluster storage system determines the priority value of each file in the shared storage device, and stores only the files whose priority value is greater than the set value in the Distributed Replicated Block Device (DRBD).
Preferably, in the above method, the cluster storage system determining the priority value of each file in the shared storage device means:
The cluster storage system takes the sum of the parameter values of a file in the shared storage device as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
Alternatively, in the above method, the cluster storage system determining the priority value of each file in the shared storage device means:
The cluster storage system determines a weight for each parameter value of a file in the shared storage device, takes the product of each parameter value and its corresponding weight as a priority value calculation parameter, and takes the sum of all priority value calculation parameters as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
Preferably, at least two meta nodes in the above cluster storage system have a DRBD.
Embodiments of the invention adopt a hybrid storage architecture that combines the low cost and large capacity of centralized storage with the high reliability of distributed storage, and build a data extraction model to classify and place data, which eases data management, improves the disaster tolerance of the whole cluster, and provides an effective guarantee for the safe operation of electronic information systems.
Description of drawings
Fig. 1 is a schematic structural diagram of the cluster storage system in Embodiment 1.
Embodiment
The technical solution of the invention is described in further detail below with reference to the drawing and specific embodiments. It should be noted that, provided they do not conflict, the embodiments in this application and the features in the embodiments may be combined with one another arbitrarily.
At present, two storage modes are widely used in cluster storage systems. The first is centralized storage, in which the storage becomes a single point of failure. The second is distributed storage, in which disk utilization is too low and the data placement strategy is uniform, so the data cannot be managed effectively. On this basis, the applicant considers adopting a hybrid storage architecture that combines the low cost and large capacity of centralized storage with the high reliability of distributed storage, and building a data extraction model at the same time, so as to ease data management and improve the disaster tolerance of the whole cluster.
Specifically, multipath access to the shared storage device by each node in the cluster, together with failover, is implemented by modifying the /etc/multipath.conf configuration file. DRBD devices are installed on at least two meta nodes and mirror the entire device synchronously over the network, somewhat like a network RAID. That is, when a user writes data to the file system on the local DRBD device, the data is simultaneously sent to another host on the network and recorded there in the file system in the same form, which achieves the effect of distributed storage. This satisfies the storage requirement of massive data while partly satisfying the requirement of data security, and it improves disk utilization and balances cost.
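The synchronous mirroring described above can be pictured with a toy in-memory model. This is only a sketch of the idea (a write reaches both hosts before it is acknowledged); the class and method names are hypothetical and it does not use or represent the real DRBD interface.

```python
class BlockDevice:
    """Hypothetical in-memory stand-in for one host's block device."""

    def __init__(self, n_blocks):
        self.blocks = [b""] * n_blocks

    def write(self, index, data):
        self.blocks[index] = data


class MirroredDevice:
    """Toy model of synchronous mirroring between a local and a peer host."""

    def __init__(self, local, peer):
        self.local = local
        self.peer = peer

    def write(self, index, data):
        # The write is applied locally and to the peer before it is
        # acknowledged; the second call stands in for the network transfer.
        self.local.write(index, data)
        self.peer.write(index, data)
        return True

    def read_after_failover(self, index):
        # If the local host is lost, the peer holds the same data in the same form.
        return self.peer.blocks[index]


local, peer = BlockDevice(4), BlockDevice(4)
mirror = MirroredDevice(local, peer)
mirror.write(2, b"cluster.conf contents")
```

Because every acknowledged write exists on both hosts, the peer can serve the data unchanged when the local host fails, which is the property the text relies on.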
Embodiment 1
Based on the above idea, this embodiment provides a cluster storage system whose architecture is shown in Fig. 1. It comprises a data grading server, a Distributed Replicated Block Device (DRBD), and a shared storage device. In this embodiment the shared storage device is a shared array. The shared array meets the requirement of service-level HA, ensuring that service is not interrupted when a node fails; the DRBD meets the requirement of storage-level HA, ensuring that important data is not lost. As Fig. 1 shows, all nodes are connected to the shared array, and the two meta nodes, besides being connected to the shared array, also have DRBD installed.
The data grading server is located on the two meta nodes. It is mainly responsible for building a data extraction model for the files in the shared array to determine the priority value of each file, and for uploading to the DRBD the data of files in the shared array whose priority value is greater than the set value;
Specifically, on the basis of customer surveys, the data grading server takes the sum of the parameter values of a file as the priority value of the file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
In a preferred scheme, in addition to summing the parameter values of a file to obtain the priority value, the data grading server also takes the weight of each parameter into account: it determines a weight for each parameter value of a file in the shared storage device, takes the product of each parameter value and its corresponding weight as a priority value calculation parameter, and takes the sum of all priority value calculation parameters as the priority value of that file. For example, denote the size value of the file data as x, the read frequency value of the file data as y, the modification frequency value of the file data as z, and the level value of the user to whom the file corresponds as v; the data extraction model is then established, that is, the priority value of the file is determined as follows:
f = ax + by + cz + dv
where a, b, c and d are the weights of the respective parameters, which can be determined by sample training;
f is the priority value of the file.
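The formula above can be sketched in a few lines of Python. The names `FileParams`, `priority_value` and `goes_to_drbd`, and all the numbers, are illustrative assumptions and do not come from the patent; only the weighted sum and the comparison against the set value are taken from the text.

```python
from dataclasses import dataclass


@dataclass
class FileParams:
    """Parameter values of one file, using the symbols from the text."""
    x: float  # size value of the file data
    y: float  # read frequency value of the file data
    z: float  # modification frequency value of the file data
    v: float  # level value of the user to whom the file corresponds


def priority_value(p, weights=(1.0, 1.0, 1.0, 1.0)):
    """Return f = a*x + b*y + c*z + d*v.

    With the default weights of 1 this reduces to the plain sum of the
    parameter values, the first variant described in the text.
    """
    a, b, c, d = weights
    return a * p.x + b * p.y + c * p.z + d * p.v


def goes_to_drbd(p, weights, set_value):
    """A file is uploaded to DRBD only if its value exceeds the set value."""
    return priority_value(p, weights) > set_value


# Hypothetical numbers, chosen only for illustration.
params = FileParams(x=2.0, y=5.0, z=1.0, v=3.0)
weights = (0.1, 0.4, 0.3, 0.2)
f = priority_value(params, weights)  # 0.2 + 2.0 + 0.3 + 0.6, about 3.1
```

With a set value of 3.0 this example file would be uploaded to the DRBD; with a set value of 4.0 it would stay in the shared array.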
In addition, information that is critical to the operation of the whole cluster is also treated as data of files above the set value and is likewise uploaded to the DRBD. In this way, if the cluster crashes or the array is damaged, the cost of data loss is minimized and the cluster can resume operation in the shortest time, which achieves the goal of improved disaster tolerance.
The data of files whose priority value is less than the set value remains in the shared array.
The DRBD stores the data of the files uploaded by the data grading server.
To improve the disaster tolerance of the cluster storage system, the DRBD is generally located on the meta nodes.
Thus, a fibre channel switch connects each node to the shared storage device (the shared array in this embodiment). /etc/corosync/corosync.conf is configured, and a high-availability cluster in active/active mode is established with Pacemaker, so that every node becomes a potential standby resource node. Two servers with large memory are selected as meta nodes, and a high-availability cluster in active/passive mode is established by setting up DRBD and its configuration file. A single cluster thus contains both the active/active mode and the active/passive mode, which realizes the hybrid architecture.
The shared array stores the data of files whose priority value is less than the set value.
In this embodiment, on the basis of a large number of experiments and sampling statistics, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds are taken as parameter values; a data extraction model is designed and implemented in code, and data is placed automatically, which meets the requirement of improved disaster tolerance. An index is also maintained to make it easy to look up data and to record data migrations. In addition, important configuration files and device information in the cluster are backed up in the DRBD so that the cluster can recover quickly after a crash. Authorizing users and restricting their access rights to the DRBD improves the security of the cluster.
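The index, the migration record and the access restriction mentioned above could take many forms; the patent does not specify one. The following is a minimal sketch under that caveat, in which `PlacementIndex`, the tier labels "shared" and "drbd", and the user names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class PlacementIndex:
    """Hypothetical index of where each file lives, with a migration log."""
    locations: dict = field(default_factory=dict)   # file name -> tier
    migrations: list = field(default_factory=list)  # (file, old tier, new tier)
    authorized: set = field(default_factory=set)    # users allowed to access DRBD

    def record(self, name, tier):
        """Store the current tier of a file and log a move between tiers."""
        old = self.locations.get(name)
        if old is not None and old != tier:
            self.migrations.append((name, old, tier))
        self.locations[name] = tier

    def lookup(self, name, user):
        """Return the tier holding the file, enforcing DRBD authorization."""
        tier = self.locations[name]
        if tier == "drbd" and user not in self.authorized:
            raise PermissionError(f"{user} may not access files held in DRBD")
        return tier


index = PlacementIndex(authorized={"admin"})
index.record("server.conf", "shared")
index.record("server.conf", "drbd")  # logged as a migration
```

Keeping the log separate from the location map lets a lookup stay a single dictionary access while the history of moves remains available for auditing.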
Embodiment 2
Based on the above cluster storage system, this embodiment proposes a data storage method for a cluster storage system. Its core is to screen the data: important data (the data of files whose priority value is greater than the set value) is placed in the Distributed Replicated Block Device (DRBD), and ordinary data (the data of files whose priority value is less than the set value) is placed in the shared storage device (the shared array in this embodiment). Even if the shared array is damaged, the loss caused by data loss is thus kept to a minimum. Important system information (such as server configuration files, administrator information and log information, as determined by the administrator) can also be backed up in the DRBD so that the cluster can recover quickly after a crash.
Specifically, the method comprises: the cluster storage system determines the priority value of each file in the shared storage device, and stores only the files whose priority value is greater than the set value in the DRBD. The DRBD is generally located on each meta node.
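The screening step can be sketched as a simple partition of files by their computed value. The function name `place_files` and the sample values are hypothetical. The text does not say what happens when a value equals the set value exactly; this sketch assumes such a file stays in the shared array, since only values greater than the set value go to the DRBD.

```python
def place_files(priorities, set_value):
    """Split files into the DRBD tier and the shared-array tier.

    priorities maps a file name to its already computed value f.
    Returns (drbd_files, shared_files), each sorted by name.
    """
    drbd = sorted(name for name, f in priorities.items() if f > set_value)
    # Assumption: a value equal to the set value is not "greater than" it,
    # so the file remains in the shared array.
    shared = sorted(name for name, f in priorities.items() if f <= set_value)
    return drbd, shared


# Hypothetical values, for illustration only.
drbd_files, shared_files = place_files(
    {"admin.db": 9.2, "server.conf": 7.5, "tmp.log": 1.3, "video.bin": 5.0},
    set_value=5.0,
)
```

Every file lands in exactly one tier, so the two lists together always cover the whole input.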
Specifically, the cluster storage system determining the priority value of each file in the shared storage device means:
The sum of the parameter values of a file in the shared storage device is taken as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
In some preferred schemes, the cluster storage system determines a weight for each parameter value of a file in the shared storage device, takes the product of each parameter value and its corresponding weight as a priority value calculation parameter, and takes the sum of all priority value calculation parameters as the priority value of that file. For example, denote the size value of the file data as x, the read frequency value of the file data as y, the modification frequency value of the file data as z, and the level value of the user to whom the file corresponds as v; the data extraction model is then established, that is, the priority value of the file is determined as follows:
f = ax + by + cz + dv
where a, b, c and d are the weights of the respective parameters, which can be determined by sample training;
f is the priority value of the file.
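The text says only that the weights can be determined by sample training and does not name a method. As one possible illustration, the sketch below uses a perceptron-style update on labelled samples; this choice of algorithm, the function name `train_weights`, and the sample data are all assumptions, not the patent's procedure.

```python
def train_weights(samples, labels, epochs=1000, lr=0.1):
    """Fit weights (a, b, c, d) and a set value with perceptron updates.

    samples: list of (x, y, z, v) tuples, assumed scaled to [0, 1].
    labels:  True where an administrator judged the file important.
    Returns (weights, set_value) such that f > set_value predicts "important".
    """
    w = [0.0, 0.0, 0.0, 0.0]
    set_value = 0.0
    for _ in range(epochs):
        mistakes = 0
        for params, important in zip(samples, labels):
            f = sum(wi * pi for wi, pi in zip(w, params))
            if (f > set_value) != important:
                # Move the decision boundary toward the misclassified sample.
                sign = 1.0 if important else -1.0
                w = [wi + lr * sign * pi for wi, pi in zip(w, params)]
                set_value -= lr * sign
                mistakes += 1
        if mistakes == 0:
            break  # every training sample is classified correctly
    return w, set_value


# Hypothetical, linearly separable training samples.
samples = [(0.2, 0.9, 0.8, 0.9), (0.5, 0.8, 0.7, 1.0), (0.1, 0.7, 0.9, 0.8),
           (0.9, 0.1, 0.1, 0.2), (0.4, 0.2, 0.0, 0.1), (0.7, 0.3, 0.2, 0.3)]
labels = [True, True, True, False, False, False]
weights, set_value = train_weights(samples, labels)
```

A perceptron only converges when the labelled samples are linearly separable, as they are in this toy data; real survey data may call for a different fitting method.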
As can be seen from the above, embodiments of the invention use DRBD together with a shared storage device to grade the data and store it separately, which improves the disaster tolerance of the whole system while balancing storage security against cost.
The above are only preferred embodiments of the invention and are not intended to limit it; for a person skilled in the art, the invention may have various modifications and variations. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the invention shall fall within its scope of protection.

Claims (8)

1. A cluster storage system comprising a shared storage device, characterized in that the system further comprises a data grading server and a Distributed Replicated Block Device (DRBD) located on a meta node, wherein:
The data grading server determines the priority value of each file in the shared storage device, and uploads the data of files whose priority value is greater than a set value to the DRBD;
The DRBD receives and stores the data of the files uploaded by the data grading server.
2. The system of claim 1, characterized in that
the data grading server determining the priority value of each file in the shared storage device means:
The data grading server takes the sum of the parameter values of a file in the shared storage device as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
3. The system of claim 1, characterized in that
the data grading server determining the priority value of each file in the shared storage device means:
The data grading server determines a weight for each parameter value of a file in the shared storage device, takes the product of each parameter value and its corresponding weight as a priority value calculation parameter, and takes the sum of all priority value calculation parameters as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
4. The system of claim 1, 2 or 3, characterized in that
at least two meta nodes in the cluster storage system each have a data grading server and a DRBD.
5. A data storage method for the cluster storage system of claim 1, characterized in that the method comprises:
The cluster storage system determines the priority value of each file in the shared storage device, and stores only the files whose priority value is greater than the set value in the Distributed Replicated Block Device (DRBD).
6. The method of claim 5, characterized in that
the cluster storage system determining the priority value of each file in the shared storage device means:
The cluster storage system takes the sum of the parameter values of a file in the shared storage device as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
7. The method of claim 5, characterized in that
the cluster storage system determining the priority value of each file in the shared storage device means:
The cluster storage system determines a weight for each parameter value of a file in the shared storage device, takes the product of each parameter value and its corresponding weight as a priority value calculation parameter, and takes the sum of all priority value calculation parameters as the priority value of that file, wherein the parameter values of the file comprise one or more of the following:
The size value of the file data, the read frequency value of the file data, the modification frequency value of the file data, and the level value of the user to whom the file corresponds.
8. The method of claim 5, 6 or 7, characterized in that
at least two meta nodes in the cluster storage system have a DRBD.

Priority Applications (1)

Application number: CN2011100575768A
Priority date: 2011-03-10
Filing date: 2011-03-10
Title: Cluster storage system and data storage method thereof
Status: Pending

Publications (1)

Publication number: CN102170460A
Publication date: 2011-08-31

Family ID: 44491436

Country Status (1)

Country: CN
Link: CN102170460A (en)

Legal Events

C06: Publication
PB01: Publication
C10: Entry into substantive examination
SE01: Entry into force of request for substantive examination
C12: Rejection of a patent application after its publication
RJ01: Rejection of invention patent application after publication

Application publication date: 2011-08-31

