Sharing scheduling method and system for scattered data
Technical Field
The invention relates to a scattered data sharing technology, in particular to a method and a system for sharing and scheduling scattered data.
Background
In the office work of enterprises and other organizations, the office computers used by different users are independent of one another, so the data held by those users is independent as well. To solve the data-sharing problem, the current practice is to set up a data sharing platform or a shared folder: files that need to be shared are uploaded to the platform or folder for others to use, and data sharing across a network relies on the sharing platform.
Disclosure of Invention
The invention aims to overcome the defects of the prior art by providing a method and a system for sharing and scheduling scattered data, which achieve cross-network data sharing without requiring the user to upload data, and which solve the problem that data formats must be cleaned in the traditional mode.
The purpose of the invention is realized by the following technical scheme:
A method for shared scheduling of scattered data, the method comprising:
1) installing a registration robot at each user terminal, and setting the address of the folder that the registration robot scans;
2) starting the registration robot, which monitors the corresponding folder synchronously through a hook function;
3) the data provider edits a file at the user terminal, proofreads the file registration information, and then puts the file into the folder;
4) the registration robot checks the files in the folder and verifies the registration information against the registration standard; if verification fails, it records the abnormal information and issues a warning; if verification passes, it writes the file registration information into the platform registry and proceeds to the next step;
5) generating an encapsulated HTTP download address for the file; if synchronous cloud upload is enabled, uploading the file to a cloud cache and registering and uploading the file information at the same time; if synchronous cloud upload is not enabled, publishing the file to the specified directory of the retrieval platform according to the registration information;
6) the data user searches for and downloads files by catalog classification; the scheduling center determines whether a copy of the file exists in the cloud; if so, the scheduling center returns the HTTP download address; if not, the scheduling center notifies the publisher's registration robot to return the HTTP download address; after the file is downloaded, the user's download information is recorded.
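By way of illustration only, the verification and publication logic of steps 4) and 5) might be sketched as follows. The registration standard is not specified here, so the sketch assumes it is a set of required, non-empty fields; the `Platform` class, the `register_file` function, the field names and the `example.invalid` address are all hypothetical and serve only to show the branching.

```python
from dataclasses import dataclass, field
from typing import Optional
from urllib.parse import quote

# Assumed registration standard: these fields must be present and non-empty.
REQUIRED_FIELDS = ("title", "category", "provider")


@dataclass
class Platform:
    """In-memory stand-in for the platform side (all names are hypothetical)."""
    registry: dict = field(default_factory=dict)     # platform registry
    cloud_cache: dict = field(default_factory=dict)  # files uploaded to the cloud
    catalog: dict = field(default_factory=dict)      # retrieval platform directories
    alerts: list = field(default_factory=list)       # recorded abnormal information


def register_file(platform: Platform, robot_id: str, name: str, data: bytes,
                  info: dict, cloud_sync: bool) -> Optional[str]:
    """Verify the registration information, register the file, and return its
    HTTP download address, or None if verification fails."""
    missing = [k for k in REQUIRED_FIELDS if not info.get(k)]
    if missing:
        # Verification failed: record the abnormal information as a warning.
        platform.alerts.append((robot_id, name, missing))
        return None

    key = (robot_id, name)
    platform.registry[key] = dict(info)
    # Placeholder address; a real robot would expose its own host and path.
    url = f"http://{robot_id}.example.invalid/files/{quote(name)}"

    if cloud_sync:
        # Synchronous cloud upload enabled: keep a copy in the cloud cache.
        platform.cloud_cache[key] = data
    else:
        # Otherwise publish under the directory named by the registration info.
        platform.catalog.setdefault(info["category"], []).append(key)
    return url
```

A file whose registration information lacks a required field yields `None` and an entry in `alerts`; a valid file yields a download address and appears either in `cloud_cache` or in `catalog`, depending on the `cloud_sync` setting.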
Compared with the traditional data sharing mode, the invention installs on each computer terminal a registration robot that automatically scans and identifies files and that scans a fixed folder in real time. The data provider can share data simply by putting it into that folder on the local computer, and the data user can gather the scattered data. For the enterprise, the effect is equivalent to accessing the data on each computer through the retrieval platform, which realizes centralized management of the data. When uploading data, the data provider can also edit a file's information through the file's custom column, so that information such as the file name can be identified effectively during scanning or downloading, which improves the visualization of the data.
Further, step 1) also sets whether the registration robot synchronizes files to the cloud.
Further, the file check in step 4) includes MD5 code or hash checking; after the check passes, if the file has the same name as an existing file, the version number is increased.
Further, the version number information comprises a timestamp label, uploading person information and a hash code.
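As a hedged illustration of the check and versioning described above, the following sketch computes an MD5 digest with the standard `hashlib` module and keeps a per-name version history. The `VersionInfo` record, the `register_version` function and the in-memory `history` dictionary are assumptions made for the example, not structures defined by the invention.

```python
import hashlib
import time
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VersionInfo:
    number: int       # increased for each file registered under the same name
    timestamp: float  # timestamp label
    uploader: str     # uploading person information
    digest: str       # hash code of the file content


def file_digest(data: bytes, algorithm: str = "md5") -> str:
    """Return the hex digest of the file content (MD5 by default)."""
    return hashlib.new(algorithm, data).hexdigest()


def register_version(history: dict, name: str, data: bytes, uploader: str,
                     now: Optional[float] = None) -> VersionInfo:
    """Append a new version record for `name`; a same-name file gets the next number."""
    versions = history.setdefault(name, [])
    info = VersionInfo(
        number=len(versions) + 1,
        timestamp=time.time() if now is None else now,
        uploader=uploader,
        digest=file_digest(data),
    )
    versions.append(info)
    return info
```

Registering `report.docx` twice produces version numbers 1 and 2, each carrying its own timestamp, uploader and digest. Passing `algorithm="sha256"` selects a different hash without changing the rest of the record.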
Furthermore, an encryption interface is reserved at the cloud end, and only the header file is encrypted when encryption is applied.
A shared scheduling system for scattered data, the system comprising:
a plurality of user terminals, at least one scheduling center and at least one cloud server, wherein the scheduling center and the cloud server are connected with the user terminals;
each user terminal is provided with a registration robot; the address of the folder that the registration robot scans is set, the registration robot is started, and the corresponding folder is monitored synchronously through a hook function;
a data provider edits a file through a user terminal, proofreads the file registration information, and then puts the file into the folder; the registration robot checks the file in the folder and verifies the registration information against the registration standard; if verification fails, it records the abnormal information and issues a warning; if verification passes, it writes the file registration information into the platform registry and generates an encapsulated HTTP download address for the file; if synchronous cloud upload is enabled, the file is uploaded to a cloud cache and the file information is registered and uploaded at the same time; if synchronous cloud upload is not enabled, the file is published to the specified directory of the retrieval platform according to the registration information;
the data user searches for and downloads files by catalog classification in the retrieval platform; the scheduling center determines whether a copy of the file exists in the cloud; if so, it returns the HTTP download address; if not, it notifies the publisher's registration robot to return the HTTP download address; after the file is downloaded, the user's download information is recorded.
Further, the user terminal is a computer, a tablet or a mobile phone.
Further, when the registration robot is installed, whether it synchronizes files to the cloud is also set.
Furthermore, the file check includes MD5 code or hash checking; after the check passes, if the file has the same name as an existing file, the version number is increased, and the version number information comprises a timestamp label, uploading person information and a hash code.
Furthermore, an encryption interface is reserved at the cloud end, and only the header file is encrypted when encryption is applied.
The invention has the following beneficial effects: the registration robots on the terminals identify, scan and upload files, which not only realizes cross-network data sharing but also solves the problem that data sharing requires format cleaning in the traditional mode. Data can be shared across networks simply by putting files into a fixed folder, and a data user can look up information such as a file's name and version number through the file information.
Drawings
FIG. 1 is a schematic flow diagram of a data provider;
FIG. 2 is a schematic diagram of a registration robot process;
FIG. 3 is a schematic flow chart of a user and a dispatch center;
FIG. 4 is a dialog box for editing a document.
Detailed Description
The technical solution of the present invention is further described in detail with reference to the following specific examples, but the scope of the present invention is not limited to the following.
Referring to FIG. 1 to FIG. 3, a method for shared scheduling of scattered data includes:
1) installing a registration robot at each user terminal, and setting the address of the folder that the registration robot scans; also setting whether the registration robot synchronizes files to the cloud, wherein each registration robot has an independent ID or code to distinguish the sources of the data;
2) starting the registration robot, which monitors the corresponding folder synchronously through a hook function;
3) the data provider edits a file at the user terminal, proofreads the file registration information, and then puts the file into the folder; referring to FIG. 4, the content of the document is edited and the name of the document is edited through the custom bar, so that the registration robot obtains this content automatically when it identifies and scans the file, rather than relying on the file name;
4) the registration robot checks the files in the folder, the check including MD5 code or hash checking; after the check passes, if the file has the same name as an existing file, the version number is increased; the registration information is then verified against the registration standard; if verification fails, the abnormal information is recorded and a warning is issued; if verification passes, the file registration information is written into the platform registry and the method proceeds to the next step;
5) generating an encapsulated HTTP download address for the file; if synchronous cloud upload is enabled, uploading the file to a cloud cache and registering and uploading the file information at the same time, an encryption interface being reserved at the cloud end and only the header file being encrypted when encryption is applied; if synchronous cloud upload is not enabled, publishing the file to the specified directory of the retrieval platform according to the registration information;
6) the data user searches for and downloads files by catalog classification in the retrieval platform; the scheduling center determines whether a copy of the file exists in the cloud; if so, it returns the HTTP download address; if not, it notifies the publisher's registration robot to return the HTTP download address; after the file is downloaded, the user's download information is recorded.
Optionally, in the method for sharing and scheduling distributed data, the version number information includes a timestamp tag + upload person information + hash code.
The embodiment further provides a sharing scheduling system for scattered data, which includes:
a plurality of user terminals, at least one scheduling center and at least one cloud server, wherein the scheduling center and the cloud server are connected with the user terminals;
each user terminal is provided with a registration robot; the address of the folder that the registration robot scans is set, the registration robot is started, and the corresponding folder is monitored synchronously through a hook function;
a data provider edits a file through a user terminal, proofreads the file registration information, and then puts the file into the folder; the registration robot checks the file in the folder and verifies the registration information against the registration standard; if verification fails, it records the abnormal information and issues a warning; if verification passes, it writes the file registration information into the platform registry and generates an encapsulated HTTP download address for the file; if synchronous cloud upload is enabled, the file is uploaded to a cloud cache and the file information is registered and uploaded at the same time; if synchronous cloud upload is not enabled, the file is published to the specified directory of the retrieval platform according to the registration information;
the data user searches for and downloads files by catalog classification in the retrieval platform; the scheduling center determines whether a copy of the file exists in the cloud; if so, it returns the HTTP download address; if not, it notifies the publisher's registration robot to return the HTTP download address; after the file is downloaded, the user's download information is recorded.
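The decision made by the scheduling center can be illustrated with a minimal sketch, given here under stated assumptions: the cloud copies are modeled as a dictionary of known addresses, each registration robot is modeled as a callable that returns an address for a file name, and the class, method and host names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class SchedulingCenter:
    """Toy scheduling center; every name here is an illustrative assumption."""
    cloud_urls: dict = field(default_factory=dict)    # (robot id, file name) -> cloud address
    robots: dict = field(default_factory=dict)        # robot id -> callable(file name) -> address
    download_log: list = field(default_factory=list)  # recorded user downloads

    def resolve(self, robot_id: str, name: str) -> str:
        """Return the HTTP download address for a requested file."""
        url = self.cloud_urls.get((robot_id, name))
        if url is None:
            # No cloud copy: ask the publisher's registration robot instead.
            url = self.robots[robot_id](name)
        return url

    def record_download(self, user: str, robot_id: str, name: str) -> None:
        """Record which user downloaded which file, after the download completes."""
        self.download_log.append((user, robot_id, name))
```

A file with a cloud copy resolves to the cloud address, while any other file resolves to whatever address the publishing robot reports. Keeping `record_download` separate from `resolve` reflects that the download information is recorded only after the file has been downloaded.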
Further, the user terminal is a computer, a tablet or a mobile phone. When the registration robot is installed, whether it synchronizes files to the cloud is also set. The file check includes MD5 code or hash checking; after the check passes, if the file has the same name as an existing file, the version number is increased, and the version number information comprises a timestamp label, uploading person information and a hash code. An encryption interface is reserved at the cloud end, and only the header file is encrypted when encryption is applied.
The foregoing illustrates preferred embodiments of the invention. It is to be understood that the invention is not limited to the precise form disclosed herein, and that various other combinations, modifications and environments may be resorted to within the scope of the concept disclosed herein, whether as described above or as apparent to those skilled in the relevant art. Modifications and variations may also be effected by those skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.