Movatterモバイル変換


[0]ホーム

URL:


CN113553325B - A method and system for synchronizing aggregate objects in an object storage system - Google Patents

A method and system for synchronizing aggregate objects in an object storage system
Download PDF

Info

Publication number
CN113553325B
CN113553325BCN202110676401.9ACN202110676401ACN113553325BCN 113553325 BCN113553325 BCN 113553325BCN 202110676401 ACN202110676401 ACN 202110676401ACN 113553325 BCN113553325 BCN 113553325B
Authority
CN
China
Prior art keywords
site
objects
information
data information
aggregation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110676401.9A
Other languages
Chinese (zh)
Other versions
CN113553325A (en
Inventor
赵煜
陶桐桐
李欢欢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Jinan data Technology Co ltd
Original Assignee
Inspur Jinan data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Jinan data Technology Co ltdfiledCriticalInspur Jinan data Technology Co ltd
Priority to CN202110676401.9ApriorityCriticalpatent/CN113553325B/en
Publication of CN113553325ApublicationCriticalpatent/CN113553325A/en
Application grantedgrantedCritical
Publication of CN113553325BpublicationCriticalpatent/CN113553325B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

Translated fromChinese

本发明实施例中提供了一种对象存储系统中聚合对象的同步方法和系统,具体包括:从站点发送http请求,获取主站点的data_log中的桶分片信息;从站点通过所述桶分片信息中对象的索引信息,分析所述对象是否需要同步;若需要同步,则从站点发送get请求,获取主站点的所述对象;主站点根据所述对象的聚合特性读取所述对象的数据信息,并将所述对象的数据信息返回从站点;从站点按照预设的存储规则将所述对象的数据信息同步到从站点的存储集群中。本发明实施例提供的聚合对象的同步方法和系统,解决了对象存储系统中,站点间同步时同步特性和聚合特性不兼容的问题,提高了存储产品的适用性,提高了对象存储的竞争力。

The embodiment of the present invention provides a method and system for synchronizing aggregated objects in an object storage system, specifically including: sending an http request from a site to obtain bucket sharding information in the data_log of the master site; the slave site analyzes whether the object needs to be synchronized through the index information of the object in the bucket sharding information; if synchronization is required, sending a get request from the site to obtain the object of the master site; the master site reads the data information of the object according to the aggregation characteristics of the object, and returns the data information of the object to the slave site; the slave site synchronizes the data information of the object to the storage cluster of the slave site according to the preset storage rules. The method and system for synchronizing aggregated objects provided by the embodiment of the present invention solves the problem of incompatibility between synchronization characteristics and aggregation characteristics during synchronization between sites in an object storage system, improves the applicability of storage products, and improves the competitiveness of object storage.

Description

Method and system for synchronizing aggregate objects in object storage system
Technical Field
The invention relates to the technical field of storage systems, in particular to a method and a system for synchronizing aggregate objects in an object storage system.
Background
With the development of technologies such as cloud computing and mobile internet, objects in an object storage system show an explosive growth trend, the number of objects is large, and the operation is frequent.
In an object storage system, an aggregate object is to aggregate multiple small objects smaller than 512k into one large object. Only the metadata information of the small object is stored in the small object, the data information of the small object is stored in the aggregation large object, and the aggregation characteristic modifies the distribution rule of the object in the storage pool.
When the stations are synchronized, the slave stations read the object metadata information and the data information of the master station and write the object metadata information and the data information in the heads of the small objects in the large objects due to the aggregation characteristic, and when the stations are synchronized, the data information of the objects is read out, so that the problem that the synchronization characteristic and the aggregation characteristic are incompatible is caused.
Disclosure of Invention
The embodiment of the invention provides a method and a system for synchronizing an aggregate object in an object storage system, which are used for solving the problem that the synchronization characteristic and the aggregation characteristic are incompatible during the synchronization between sites in the object storage system.
The embodiment of the invention discloses the following technical scheme:
the first aspect of the present invention provides a method for synchronizing aggregate objects in an object storage system, the method comprising:
the slave station sends an http request and acquires barrel fragment information in data_log of the master station;
Analyzing whether the objects need to be synchronized or not by the slave station through index information of the objects in the barrel slicing information;
If synchronization is needed, a get request is sent from a slave station, and the object of the master station is obtained;
the master station reads the data information of the object according to the aggregation characteristic of the object, and returns the data information of the object to the slave station;
And the slave station synchronizes the data information of the object into a storage cluster of the slave station according to a preset storage rule.
Further, index information of the objects is distributed in the bucket fragment information according to a hash rule.
Further, the master site reads the data information of the object according to the aggregation characteristic of the object specifically:
if the object marks the aggregation characteristic, reading the data information of the object from an SSD storage pool of a master site;
And if the object is not marked with the aggregation characteristic, reading the metadata information of the object from an SSD storage pool of the master site, and acquiring the data information and the offset of the object from an HDD storage pool of the master site through the metadata information of the object.
Further, the preset storage rules comprise grading characteristics and aggregation characteristics;
A hierarchical property for storing objects in an SSD storage pool of the slave site, marking the objects with an aggregate property;
and the aggregation characteristic is used for aggregating the data information of the objects marked with the aggregation characteristic into one aggregation object and storing the aggregation object in an HDD storage pool of the site.
Further, the grading characteristic specifically includes:
storing metadata information for the object and data information for the object in an SSD storage pool of the secondary site;
if the capacity value of the object is smaller than a first preset value, marking aggregation characteristics for the object;
And if the capacity value of the object is larger than or equal to a first preset value, marking the aggregation characteristic for the object.
Further, the polymerization characteristics specifically include:
Reading the data information of a plurality of objects, the quantity of which is equal to a second preset value and the aggregation characteristic of which is marked, in the SSD storage pool according to a preset period;
Aggregating data information of a plurality of the objects into an aggregate object;
storing the data information and the offset of the aggregate object in an HDD storage pool of the slave site;
and deleting the data information of the plurality of objects in the SSD storage pool.
The second aspect of the present invention provides a synchronization system for aggregating objects in an object storage system, implemented based on the method, the system comprising:
The request sending module is used for sending http requests and get requests;
The main site is used for storing the object of the main site and the barrel fragment information in the data_log;
The synchronization judging module is used for judging whether the objects on the main site need synchronization or not;
A slave station for storing data information of an object to be synchronized to the slave station;
The preset storage rule module is used for presetting storage rules of the objects in the master site and the slave site;
And the object analysis module is used for analyzing index information in the barrel fragment information by reading the object in the master site and the barrel fragment information in the data_log through the request sending module and writing the object to be synchronized into the slave site.
Further, the preset storage rule module includes:
a hierarchical property unit for storing the object in an SSD storage pool of the slave station, marking the object with an aggregate property;
and the aggregation characteristic unit is used for aggregating the data information of the objects marked with the aggregation characteristic into one aggregation object and storing the aggregation object in the HDD storage pool of the slave site.
Further, the object analysis module includes:
An object reading unit for reading metadata information of the object in the main site, data information of the object and barrel fragment information in data_log according to the aggregation characteristic of the object;
an object writing unit for writing the object to be synchronized into the slave station;
And an object analysis unit which analyzes index information in the bucket fragment information.
Further, the process of reading the object by the object reading unit specifically includes:
if the object marks the aggregation characteristic, reading the data information of the object from an SSD storage pool of a master site;
And if the object is not marked with the aggregation characteristic, reading the metadata information of the object from an SSD storage pool of the master site, and acquiring the data information and the offset of the object from an HDD storage pool of the master site through the metadata information of the object.
The effects provided in the summary of the invention are merely effects of embodiments, not all effects of the invention, and one of the above technical solutions has the following advantages or beneficial effects:
The method for synchronizing the aggregate objects in the object storage system comprises the steps of obtaining bucket fragment information in data_log of a master site from a site, analyzing whether the objects need to be synchronized or not through index information of the objects in the bucket fragment information, reading the data information of the objects according to aggregation characteristics of the objects if the objects need to be synchronized, reading the data information of the objects from an SSD storage pool of the master site if the objects mark the aggregation characteristics, reading the metadata information of the objects from the SSD storage pool of the master site if the objects do not mark the aggregation characteristics, obtaining the data information and offset of the objects from an HDD storage pool of the master site through the metadata information of the objects, and returning the data information of the objects to a slave site by the master site, wherein the data information of the objects is synchronized to a storage cluster of the slave site according to a preset storage rule. Therefore, the invention solves the problem that the synchronization characteristic and the aggregation characteristic are incompatible when the stations are synchronized in the object storage system, improves the applicability of storage products and improves the competitiveness of object storage.
The object synchronizing system in the object storage system comprises a request sending module, a master station, a synchronization judging module, a slave station, a preset storage rule module and an object analysis module, wherein the request sending module sends http requests and get requests, the master station stores the object of the master station and bucket fragment information in data_log, the synchronization judging module judges whether the object on the master station needs to be synchronized, the slave station stores data information of the object which needs to be synchronized to the slave station, the preset storage rule module presets storage rules of the object in the master station and the slave station, and the object analysis module requests the sending module to read the object in the master station and the bucket fragment information in data_log, analyze index information in the bucket fragment information and write the object which needs to be synchronized to the slave station. If synchronization is needed, the master site reads the data information of the object according to the aggregation characteristic of the object, if the object marks the aggregation characteristic, the data information of the object is read from an SSD storage pool of the master site, if the object does not mark the aggregation characteristic, the metadata information of the object is read from the SSD storage pool of the master site, and the data information and the offset of the object are obtained from an HDD storage pool of the master site through the metadata information of the object. And the master station returns the data information of the object to the slave station, and the slave station synchronizes the data information of the object to a storage cluster of the slave station according to a preset storage rule. The system solves the problem that the synchronization characteristic and the aggregation characteristic are incompatible when the stations are synchronized in the object storage system, increases the compatibility of multiple stations and aggregation, improves the applicability of storage products, and improves the competitiveness of object storage.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are required to be used in the description of the embodiments or the prior art will be briefly described below, and it will be obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a block diagram of bucket fragmentation information in data_log of the method of the present invention;
FIG. 3 is a diagram of the storage structure of objects in a method site according to the present invention;
fig. 4 is a block diagram of the system structure of the present invention.
Detailed Description
In order to clearly illustrate the technical features of the present solution, the present invention will be described in detail below with reference to the following detailed description and the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different structures of the invention. In order to simplify the present disclosure, components and arrangements of specific examples are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and processes are omitted so as to not unnecessarily obscure the present invention.
A Memory system (Memory system) is a system in a computer, which is composed of various Memory devices for storing programs and data, control units, and devices (hardware) and algorithms (software) for managing information scheduling. The main memory of the computer can not meet the requirements of high access speed, large storage capacity and low cost at the same time, and the multi-level hierarchical memory with low speed and large capacity is needed to be arranged in the computer, so that the storage system with acceptable performance is formed by the optimal control scheduling algorithm and reasonable cost. The performance of the storage system is increasingly important in computers, the overall efficiency is affected by the quality of storage management and organization, and the requirements of modern information processing such as image processing, databases and knowledge bases on the storage system are high.
As the requirements of scientific computing and data processing on storage systems are increasing, there is a continuous need to improve existing storage technologies, research new storage media, and improve the structure and management of storage systems. Large scale integrated circuits and magnetic disks remain the primary storage medium.
The synchronization refers to data synchronization redundancy between two storage clusters, wherein one of the two storage clusters is normally used, and the other storage cluster is used for redundancy backup, so that one site immediately takes over the role of the other site after a disaster, and the daily used storage cluster is a master site, and the storage cluster used as a backup is a slave site.
Bucket shards (shard), which are index information of objects (objects) in a bucket. In the on bucket shards mode, one socket corresponds to one or more rados object (distributed objects)).
After the multi-site function is started, each object is uploaded or deleted, and the information of the barrel slice where the object is located is recorded in the data_log.
Embodiment one:
as shown in fig. 1, the method for synchronizing an aggregate object in an object storage system according to an embodiment of the present invention includes:
the slave station sends an http request and acquires barrel fragment information in data_log of the master station;
Analyzing whether the objects need to be synchronized or not by the slave station through index information of the objects in the barrel slicing information;
if synchronization is needed, a get request is sent from a slave station to acquire an object of a master station;
the master station reads the data information of the object according to the aggregation characteristic of the object and returns the data information of the object to the slave station;
and the slave station synchronizes the data information of the object into the storage cluster of the slave station according to a preset storage rule.
As shown in fig. 2, in order to improve the read-write performance of object storage, the index information of the object in the bucket is uniformly distributed on 128 bucket fragments according to the hash rule, so that the index information of the object is stored in the bucket fragments.
The main station reads the data information of the object according to the aggregation characteristic of the object specifically as follows:
If the object marks the aggregation characteristic, reading the data information of the object from an SSD storage pool of the master site;
If the object is not marked with the aggregation characteristic, the metadata information of the object is read from the SSD storage pool of the master site, and the data information and the offset (the offset is the starting position of the data information of the object stored in the HDD storage pool) of the object are obtained from the HDD storage pool of the master site through the metadata information of the object.
The preset storage rules comprise grading characteristics and aggregation characteristics;
A hierarchical property for storing objects in an SSD storage pool of the slave site, marking the objects with an aggregate property;
an aggregation characteristic for aggregating data information of a plurality of objects marked with the aggregation characteristic into one aggregate object, and storing the aggregate object in an HDD storage pool of the site.
According to the object storage system method without starting the grading characteristic, the files in mass storage data are in the unit of hundred million, and tens of billions of mass small files are stored in the same storage cluster according to a common file data storage mode. Such storage may have hundreds of billions of underlying objects. When the underlying storage object is higher, it can have an impact on the performance of the file system, recovery of failure scenario data, disk utilization, etc.
The hierarchical characteristics specifically comprise the steps of storing metadata information of objects and data information of the objects in an SSD storage pool of a slave station, marking the aggregation characteristics for the objects if the capacity value of the objects is smaller than a first preset value, not marking the aggregation characteristics for the objects if the capacity value of the objects is larger than or equal to the first preset value, and reading the data information of the objects without marking the aggregation characteristics from the SSD storage pool and writing the data information of the objects with the unlabeled aggregation characteristics into the HDD storage pool of the slave station. In this embodiment, the first preset value is set to 512k.
In this embodiment, when the storage cluster stores objects after turning on the hierarchy property, objects smaller than 512k are stored in the SSD storage pool, and objects larger than 512k are stored in the HDD storage pool.
The aggregation characteristic comprises the steps of reading data information of a plurality of objects, which are equal to a second preset value in number in an SSD storage pool and marked with the aggregation characteristic, according to a preset period, aggregating the data information of the plurality of objects into one aggregation object, storing the data information and the offset of the aggregation object in an HDD storage pool of a slave station, and deleting the data information of the plurality of objects in the SSD storage pool. In this embodiment, the second preset value is set to 1024.
The aggregation characteristic depends on the hierarchical characteristic, after the hierarchical aggregation characteristic is started, the storage cluster reads out the data information of the objects stored in the SSD storage pool, 1024 objects form an aggregation object, the aggregation object is written into the HDD storage pool, and then the data information of 1024 objects stored in the SSD storage pool is deleted.
FIG. 3 is a diagram showing the structure of the object storage in the website according to the method of the present invention. Each object contains two parts of information, namely data information data and metadata information xattr, wherein xattr records metadata information of the object, such as creation time, size, aggregation characteristics and other customized metadata information.
After the hierarchical aggregation characteristic is started, the object with the capacity value being more than or equal to 512k is stored in the site in the following process:
And reading the data information of the object without marking the aggregation characteristic from the SSD storage pool and writing the data information into the HDD storage pool of the station.
After the hierarchical aggregation characteristic is started, the process of storing the object with the capacity value smaller than 512k in the site is as follows:
Storing metadata information and data information of the object in an SSD storage pool of the site, and marking the object with an aggregation characteristic;
according to a preset period, the data information of 1024 objects marked with aggregation characteristics in the SSD storage pool is read out, the data information of 1024 objects is aggregated into one aggregation object, the data information and the offset of the aggregation object are stored in the HDD storage pool of the slave station, and the data information of a plurality of objects in the SSD storage pool is deleted.
Thus, in the method of the present invention, the storage clusters of the secondary sites comprise SSD storage pools and HDD storage pools, and the storage clusters of the primary sites also comprise SSD storage pools and HDD storage pools.
The method for synchronizing the aggregate objects in the object storage system comprises the steps of obtaining bucket fragment information in data_log of a master site from a site, analyzing whether the objects need to be synchronized or not through index information of the objects in the bucket fragment information, reading the data information of the objects according to aggregation characteristics of the objects if the objects need to be synchronized, reading the data information of the objects from an SSD storage pool of the master site if the objects mark the aggregation characteristics, reading the metadata information of the objects from the SSD storage pool of the master site if the objects do not mark the aggregation characteristics, obtaining the data information and offset of the objects from an HDD storage pool of the master site through the metadata information of the objects, and returning the data information of the objects to a slave site by the master site, wherein the data information of the objects is synchronized to a storage cluster of the slave site according to a preset storage rule. Therefore, the method solves the problem that the synchronization characteristic and the aggregation characteristic are incompatible when the stations are synchronized in the object storage system. The invention provides a synchronization method for an aggregate object in an object storage system, which increases multi-site and aggregate compatibility, improves the applicability of storage products and improves the competitiveness of object storage.
Embodiment two:
As shown in fig. 2, the synchronization system provided by the present invention is implemented based on the method, and the system includes:
The request sending module is used for sending http requests and get requests;
The main site is used for storing the object of the main site and the barrel fragment information in the data_log;
The synchronization judging module is used for judging whether the objects on the main site need synchronization or not;
A slave station for storing data information of an object to be synchronized to the slave station;
The preset storage rule module is used for presetting storage rules of the objects in the master site and the slave site;
and the object analysis module is used for analyzing index information in the barrel fragment information by reading the object in the master site and the barrel fragment information in the data_log through the request sending module and writing the object to be synchronized into the slave site.
In order to improve the read-write performance of object storage, index information of objects in a bucket is uniformly distributed on 128 bucket fragments according to a hash rule, so that the index information of the objects is stored in the bucket fragments.
The preset storage rule module comprises:
a hierarchical property unit for storing the object in an SSD storage pool of the slave station, marking the object with an aggregate property;
and an aggregation characteristic unit for aggregating data information of the plurality of objects marked with the aggregation characteristic into one aggregation object and storing the aggregation object in the HDD storage pool of the slave station.
The object storage system with the characteristic of no starting grading is characterized in that the files in mass storage data are in the unit of hundred million, and tens of billions of mass small files are stored in the same storage cluster according to a common file data storage mode. Such storage may have hundreds of billions of underlying objects. When the underlying storage object is higher, it can have an impact on the performance of the file system, recovery of failure scenario data, disk utilization, etc.
The hierarchical property unit is used for defining a hierarchical property, wherein the hierarchical property specifically comprises the steps of storing metadata information of an object and data information of the object in an SSD storage pool of a slave station;
if the capacity value of the object is smaller than a first preset value, the aggregation characteristic is marked for the object, if the capacity value of the object is larger than or equal to the first preset value, the aggregation characteristic is not marked for the object, and the data information of the object without the aggregation characteristic is read out from an SSD storage pool and written into the HDD storage pool of the site. In this embodiment, the first preset value is set to 512k.
In this embodiment, when the storage cluster stores objects after turning on the hierarchy property, objects smaller than 512k are stored in the SSD storage pool, and objects larger than 512k are stored in the HDD storage pool.
The aggregation characteristic unit is used for defining an aggregation characteristic, and specifically comprises the steps of reading out data information of a plurality of objects, the quantity of which is equal to a second preset value, in an SSD storage pool according to a preset period, wherein the data information of the objects is marked with the aggregation characteristic, aggregating the data information of the objects into one aggregation object, storing the data information and offset of the aggregation object in the HDD storage pool of a slave station, and deleting the data information of the objects in the SSD storage pool. In this embodiment, the second preset value is set to 1024.
The aggregation characteristic depends on the grading characteristic, and after the grading aggregation characteristic is started, the working process of the preset storage rule module is that the storage cluster reads out the data information of the objects stored in the SSD storage pool, 1024 objects form an aggregation object, the aggregation object is written into the HDD storage pool, and then the data information of 1024 objects stored in the SSD storage pool is deleted.
After the hierarchical aggregation characteristic is started, the process of storing the object with the capacity value larger than or equal to 512k in the site by the preset storage rule module is that metadata information and data information of the object are stored in an SSD storage pool of the site, the aggregation characteristic is not marked for the object, and the data information of the object with the unmarked aggregation characteristic is read out from the SSD storage pool and written into an HDD storage pool of the site.
After the hierarchical aggregation characteristic is started, a preset storage rule module stores metadata information and data information of objects with capacity value smaller than 512k in an SSD storage pool of the site, the aggregation characteristic is marked for the objects, data information of 1024 objects marked with the aggregation characteristic in the SSD storage pool is read out according to a preset period, the data information of 1024 objects is aggregated into one aggregation object, the data information and offset of the aggregation object are stored in an HDD storage pool of the slave site, and the data information of a plurality of objects in the SSD storage pool is deleted.
The object analysis module includes:
The object reading unit is used for reading metadata information of the objects in the main site, data information of the objects and barrel fragment information in the data_log according to the aggregation characteristics of the objects;
an object writing unit for writing the object to be synchronized into the slave station;
And an object analysis unit which analyzes index information in the bucket fragment information.
The process of reading the object by the object reading unit specifically comprises the following steps:
If the object marks the aggregation characteristic, reading the data information of the object from an SSD storage pool of the master site;
If the object is not marked with the aggregation characteristic, the metadata information of the object is read from the SSD storage pool of the master site, and the data information and the offset (the offset is the starting position of the data information of the object stored in the HDD storage pool) of the object are obtained from the HDD storage pool of the master site through the metadata information of the object.
The specific process of the work of the object analysis module is as follows:
When a slave station sends an http request to a master station through a request sending module, an object reading unit obtains barrel fragment information in data_log of the master station;
The object analysis unit analyzes index information in the barrel fragment information and sends the index information to the synchronous judgment module, and the synchronous judgment module judges whether the objects on the main site need to be synchronized or not;
If synchronization is needed, the slave station sends a get request to the master station through a request sending module, an object of the master station is obtained, and an object reading unit reads metadata information of the object and data information of the object in the master station according to aggregation characteristics of the object;
And the object writing unit writes the object which is read by the object reading unit and needs to be synchronized into the storage cluster of the slave station according to the grading characteristic and the aggregation characteristic which are defined by the preset storage rule module.
Thus, in the system of the present invention, the storage clusters of the secondary sites comprise SSD storage pools and HDD storage pools, and the storage clusters of the primary sites also comprise SSD storage pools and HDD storage pools.
The object synchronizing system in the object storage system comprises a request sending module, a master station, a synchronization judging module, a slave station, a preset storage rule module and an object analysis module, wherein the request sending module sends http requests and get requests, the master station stores the object of the master station and bucket fragment information in data_log, the synchronization judging module judges whether the object on the master station needs to be synchronized, the slave station stores data information of the object which needs to be synchronized to the slave station, the preset storage rule module presets storage rules of the object in the master station and the slave station, and the object analysis module requests the sending module to read the object in the master station and the bucket fragment information in data_log, analyze index information in the bucket fragment information and write the object which needs to be synchronized to the slave station. If synchronization is needed, the master site reads the data information of the object according to the aggregation characteristic of the object, if the object marks the aggregation characteristic, the data information of the object is read from an SSD storage pool of the master site, if the object does not mark the aggregation characteristic, the metadata information of the object is read from the SSD storage pool of the master site, and the data information and the offset of the object are obtained from an HDD storage pool of the master site through the metadata information of the object. And the master station returns the data information of the object to the slave station, and the slave station synchronizes the data information of the object to a storage cluster of the slave station according to a preset storage rule. The system solves the problem that the synchronization characteristic and the aggregation characteristic are incompatible when the stations are synchronized in the object storage system, increases the compatibility of multiple stations and aggregation, improves the applicability of storage products, and improves the competitiveness of object storage.
The foregoing is only a preferred embodiment of the present invention, and it will be apparent to those skilled in the art that numerous modifications and variations can be made without departing from the principles of the invention, and such modifications and variations are considered to be within the scope of the invention.

Claims (6)

Translated fromChinese
1.一种对象存储系统中聚合对象的同步方法,其特征在于,所述方法包括:1. A method for synchronizing aggregate objects in an object storage system, characterized in that the method comprises:从站点发送http请求,获取主站点的data_log中的桶分片信息;Send an http request from the site to obtain the bucket shard information in the data_log of the main site;从站点通过所述桶分片信息中对象的索引信息,分析所述对象是否需要同步;The slave site analyzes whether the object needs to be synchronized based on the index information of the object in the bucket shard information;若需要同步,则从站点发送get请求,获取主站点的所述对象;If synchronization is required, a get request is sent from the site to obtain the object from the master site;主站点根据所述对象的聚合特性读取所述对象的数据信息,并将所述对象的数据信息返回从站点;The master site reads the data information of the object according to the aggregation characteristics of the object, and returns the data information of the object to the slave site;从站点按照预设的存储规则将所述对象的数据信息同步到从站点的存储集群中;The slave site synchronizes the data information of the object to the storage cluster of the slave site according to the preset storage rules;主站点根据所述对象的聚合特性读取所述对象的数据信息具体为:The primary site reads the data information of the object according to the aggregation characteristics of the object as follows:若所述对象标记了聚合特性,则从主站点的SSD存储池中读取所述对象的数据信息;If the object is marked with an aggregation feature, data information of the object is read from the SSD storage pool of the primary site;若所述对象未标记聚合特性,则从主站点的SSD存储池中读取所述对象的元数据信息,并通过所述对象的元数据信息,从主站点的HDD存储池中获取所述对象的数据信息和偏移量;If the object is not marked with an aggregation feature, metadata information of the object is read from the SSD storage pool of the primary site, and data information and offset of the object are obtained from the HDD storage pool of the primary site through the metadata information of the object;所述预设的存储规则包括分级特性和聚合特性;The preset storage rules include classification characteristics and aggregation characteristics;分级特性,用于将对象存储在从站点的SSD存储池,为对象标记聚合特性;The tiering feature is used to store objects in the SSD storage pool of the slave site and tag the objects with aggregation features;聚合特性,用于将标记了聚合特性的多个所述对象的数据信息聚合为一个聚合对象,并将聚合对象存储在所述站点的HDD存储池中;An aggregation feature, used to aggregate data information of a plurality of objects marked with the aggregation feature into one aggregate object, and store the aggregate object in the HDD storage pool of the site;所述分级特性具体包括:The classification characteristics specifically include:将对象的元数据信息和对象的数据信息存储在从站点的SSD存储池中;The metadata information and data information of the object are stored in the SSD storage pool of the slave site;若所述对象的容量值小于第一预设值,则为所述对象标记聚合特性;If the capacity value of the object is less than a first preset value, marking the object with an aggregation characteristic;若所述对象的容量值大于等于第一预设值,则不为所述对象标记聚合特性;If the capacity value of the object is greater than or equal to a first preset value, then the aggregation characteristic is not marked for the object;所述聚合特性具体包括:The polymerization characteristics specifically include:按照预设的周期,将所述SSD存储池中数量等于第二预设值,且标记了聚合特性的多个所述对象的数据信息读取出来;According to a preset period, data information of a plurality of objects in the SSD storage pool, the number of which is equal to a second preset value and which are marked with an aggregation characteristic, is read out;将多个所述对象的数据信息聚合为一个聚合对象;Aggregate the data information of the plurality of objects into an aggregate object;将所述聚合对象的数据信息和偏移量存储在所述从站点的HDD存储池中;Storing the data information and offset of the aggregate object in the HDD storage pool of the slave site;删除所述SSD存储池中多个所述对象的数据信息。Delete data information of the plurality of objects in the SSD storage pool.2.根据权利要求1所述的一种对象存储系统中聚合对象的同步方法,其特征在于,所述对象的索引信息按照hash规则分布在所述桶分片信息中。2. According to a method for synchronizing aggregate objects in an object storage system according to claim 1, it is characterized in that the index information of the object is distributed in the bucket sharding information according to a hash rule.3.一种对象存储系统中聚合对象的同步系统,基于权利要求1-2任一项所述方法实现,其特征在于,所述系统包括:3. A synchronization system for aggregated objects in an object storage system, implemented based on the method according to any one of claims 1 to 2, characterized in that the system comprises:请求发送模块,用于发送http请求和get请求;Request sending module, used to send http requests and get requests;主站点,用于存储主站点的对象和data_log中的桶分片信息;The main site is used to store the objects of the main site and the bucket shard information in data_log;同步判断模块,用于判断主站点上的对象是否需要同步;A synchronization judgment module is used to judge whether the object on the primary site needs to be synchronized;从站点,用于存储需要同步到从站点的对象的数据信息;The slave site is used to store data information of objects that need to be synchronized to the slave site;预设存储规则模块,用于预设对象在主站点和从站点中的存储规则;A preset storage rule module is used to preset storage rules of objects in the master site and the slave site;对象分析模块,通过请求发送模块读取所述主站点中的对象和data_log中的桶分片信息,分析桶分片信息中的索引信息,将需要同步的对象写入从站点。The object analysis module reads the objects in the master site and the bucket shard information in the data_log through the request sending module, analyzes the index information in the bucket shard information, and writes the objects to be synchronized to the slave site.4.根据权利要求3所述的一种对象存储系统中聚合对象的同步系统,其特征在于,所述预设存储规则模块包括:4. The synchronization system for aggregated objects in an object storage system according to claim 3, wherein the preset storage rule module comprises:分级特性单元,用于将对象存储在从站点的SSD存储池,为对象标记聚合特性;A hierarchical feature unit, used to store objects in the SSD storage pool of the slave site and tag the objects with aggregate features;聚合特性单元,用于将标记了聚合特性的多个所述对象的数据信息聚合为一个聚合对象,并将聚合对象存储在所述从站点的HDD存储池中。The aggregation characteristic unit is used to aggregate the data information of the multiple objects marked with the aggregation characteristic into an aggregate object, and store the aggregate object in the HDD storage pool of the slave site.5.根据权利要求4所述的一种对象存储系统中聚合对象的同步系统,其特征在于,所述对象分析模块包括:5. The synchronization system for aggregated objects in an object storage system according to claim 4, wherein the object analysis module comprises:对象读取单元,根据所述对象的聚合特性,读取所述主站点中对象的元数据信息、对象的数据信息和data_log中的桶分片信息;An object reading unit, which reads metadata information of the object in the primary site, data information of the object, and bucket fragmentation information in data_log according to the aggregation characteristics of the object;对象写入单元,将需要同步的对象写入从站点;The object writing unit writes the objects that need to be synchronized to the slave site;对象分析单元,分析桶分片信息中的索引信息。The object analysis unit analyzes the index information in the bucket shard information.6.根据权利要求5所述的一种对象存储系统中聚合对象的同步系统,其特征在于,所述对象读取单元读取对象的过程,具体为:6. The synchronization system for aggregated objects in an object storage system according to claim 5, wherein the process of the object reading unit reading the object is specifically:若所述对象标记了聚合特性,则从主站点的SSD存储池中读取所述对象的数据信息;If the object is marked with an aggregation feature, data information of the object is read from the SSD storage pool of the primary site;若所述对象未标记聚合特性,则从主站点的SSD存储池中读取所述对象的元数据信息,并通过所述对象的元数据信息,从主站点的HDD存储池中获取所述对象的数据信息和偏移量。If the object is not marked with an aggregation feature, metadata information of the object is read from the SSD storage pool of the primary site, and data information and an offset of the object are obtained from the HDD storage pool of the primary site through the metadata information of the object.
CN202110676401.9A2021-06-182021-06-18 A method and system for synchronizing aggregate objects in an object storage systemActiveCN113553325B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202110676401.9ACN113553325B (en)2021-06-182021-06-18 A method and system for synchronizing aggregate objects in an object storage system

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202110676401.9ACN113553325B (en)2021-06-182021-06-18 A method and system for synchronizing aggregate objects in an object storage system

Publications (2)

Publication NumberPublication Date
CN113553325A CN113553325A (en)2021-10-26
CN113553325Btrue CN113553325B (en)2025-02-21

Family

ID=78130683

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202110676401.9AActiveCN113553325B (en)2021-06-182021-06-18 A method and system for synchronizing aggregate objects in an object storage system

Country Status (1)

CountryLink
CN (1)CN113553325B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN114546273B (en)*2022-02-222023-08-04苏州浪潮智能科技有限公司Method, system, device and storage medium for aggregation property compatible multi-site synchronization
CN116450739B (en)*2023-04-132025-08-19济南浪潮数据技术有限公司Multi-station data synchronization method, system, device and medium
CN116991949B (en)*2023-09-262024-01-23苏州元脑智能科技有限公司Data synchronization method, device, equipment, server and readable storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103177082A (en)*2013-02-212013-06-26用友软件股份有限公司Main server, slave server, index synchronizing system and index synchronizing method
CN108763473A (en)*2018-05-292018-11-06郑州云海信息技术有限公司A kind of the native object storage method and device of distributed storage

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US11531495B2 (en)*2014-04-212022-12-20David Lane SmithDistributed storage system for long term data storage
CN109241178A (en)*2018-07-202019-01-18华为技术有限公司The method and apparatus for managing memory space
CN112955873B (en)*2018-11-122024-03-26华为技术有限公司 Method for synchronizing mirrored file systems and storage devices thereof
CN109669929A (en)*2018-12-142019-04-23江苏瑞中数据股份有限公司Method for storing real-time data and system based on distributed parallel database
CN111008185B (en)*2019-10-292023-08-18厦门网宿有限公司Data sharing method, system and equipment
US11531642B2 (en)*2019-11-072022-12-20Netapp, Inc.Synchronous object placement for information lifecycle management

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103177082A (en)*2013-02-212013-06-26用友软件股份有限公司Main server, slave server, index synchronizing system and index synchronizing method
CN108763473A (en)*2018-05-292018-11-06郑州云海信息技术有限公司A kind of the native object storage method and device of distributed storage

Also Published As

Publication numberPublication date
CN113553325A (en)2021-10-26

Similar Documents

PublicationPublication DateTitle
CN106662981B (en)Storage device, program, and information processing method
CN113553325B (en) A method and system for synchronizing aggregate objects in an object storage system
US11422721B2 (en)Data storage scheme switching in a distributed data storage system
CN109947363B (en)Data caching method of distributed storage system
US9996557B2 (en)Database storage system based on optical disk and method using the system
CN103955530B (en)Data reconstruction and optimization method of on-line repeating data deletion system
US20130018855A1 (en)Data deduplication
CN108628542B (en) A file merging method and controller
CN110727406B (en) Data storage scheduling method and device
WO2017049764A1 (en)Method for reading and writing data and distributed storage system
US9189494B2 (en)Object file system
CN104050103B (en)A kind of buffer replacing method and system for data recovery
HK1219155A1 (en)Reduced redundancy in stored data
CN105069048A (en)Small file storage method, query method and device
CN103139300A (en)Virtual machine image management optimization method based on data de-duplication
US20240086362A1 (en)Key-value store and file system
CN103399823A (en)Method, equipment and system for storing service data
CN107015888A (en)A kind of data back up method and system based on submodule model
CN104050057B (en)Historical sensed data duplicate removal fragment eliminating method and system
RU2665272C1 (en)Method and apparatus for restoring deduplicated data
CN111444114A (en)Method, device and system for processing data in nonvolatile memory
CN108595589A (en)A kind of efficient access method of magnanimity science data picture
CN110134551B (en)Continuous data protection method and device
CN104079600A (en)File storage method, file storage device, file access client and metadata server system
CN108874315A (en)A kind of online data deduplicated file system data access performance optimization method

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp