CN111031511A

Info

Publication number: CN111031511A
Application number: CN201911371100.4A
Authority: CN
Inventors: 高彬; 徐晓斌; 赵辉
Original assignee: Beijing University of Technology
Current assignee: Beijing University of Technology
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2020-04-17

Abstract

A variable-granularity real-time environmental data collection method for the Internet of Things, belonging to the field of wireless sensor networks (WSN). Nodes in a wireless sensor network are constrained by their size and carry limited energy, and data transmission accounts for a large share of a node's energy consumption, so reducing node energy consumption is an important problem in WSNs. In a WSN that collects normal data, the data collected by a node usually exhibits strong correlation. This makes it possible to predict data from its linear correlation, reducing the amount of data transmitted, lowering the energy spent on transmission, and prolonging the life cycle of the network. In this method, the same prediction model runs on the environmental sensor node and on the server sink node, and at any given moment the model produces the same predicted real-time data at both ends. The sensor node judges the deviation between the predicted value and the actual real-time data. If the prediction is satisfied, the sensor node's communication is suppressed, which reduces the amount of data transmitted and the overall energy consumption. If the prediction is not satisfied, the sensor node sends the data. The method also proposes a corresponding data encoding scheme to compress the data volume, and a data packet message format suited to the dual prediction model.

Description

Variable-granularity real-time environment data acquisition method for the Internet of Things

Technical Field

The invention relates to a variable-granularity real-time environment data acquisition method for the Internet of Things.

Background

The collection, processing and transmission of environmental data is one of the most important applications of the Internet of Things, and limited hardware, network and computing resources are the main bottlenecks restricting its development. Efficient, low-power environmental data collection has therefore become one of the most active research directions in the field, and prediction model schemes have attracted attention because they reduce the amount of communication between nodes to a certain extent.

Many existing error-based prediction model schemes first perform complex mathematical prediction operations and then perform error estimation to obtain a value that satisfies the error bound. In practice, actual environmental data changes slightly and slowly, and stays within the error interval for long periods. Performing the complex prediction operations while the data has not left the error interval consumes system resources unnecessarily. In addition, many prediction model schemes do not encode the data and transmit the actual values directly, which wastes communication bandwidth. A variable-granularity real-time environment data acquisition scheme for the Internet of Things is therefore provided.

Disclosure of Invention

The invention designs a real-time data collection method for the Internet of Things that saves the limited storage and computing resources of sensor nodes and makes reasonable use of the limited communication capacity between sensor nodes and sink nodes.

The hardware of the invention consists of sensor nodes and sink nodes. A sensor node is responsible for collecting, processing and sending data; a sink node is responsible for receiving and processing data and is the final destination of the data. The deployment is shown in FIG. 1.

The prediction model scheme effectively reduces the number of communications between sensor nodes and sink nodes, and thus the communication energy consumption between nodes. It is realized by running the same prediction algorithm on the sensor node and the sink node, which also share the same acquisition period. At each predetermined acquisition time, the sensor node collects and encodes the real environment data and compares it with the prediction data generated by the prediction model. If the prediction is satisfied, communication is suppressed; if not, the encoded data is sent. At the other end, the sink node updates its data pool with the received data if any arrived, or otherwise with the data generated by its prediction model, as shown in FIG. 2 and FIG. 3. At each predetermined acquisition time, the prediction algorithm runs on both the sensor node and the sink node and generates the same prediction result.

Real environment data usually lies in a finite data space, within (U_min, U_max). This data space is divided into intervals of length 2δ, where δ is the error allowed by the prediction model. According to equation (1), n interval segments are generated.

n = (U_max − U_min) / (2δ) (1)

A fixed-length coding mode is adopted, in which each interval segment corresponds to a binary code of length L. The length L is obtained from equation (2).

L = log₂ n (2)

Suppose there is a set of humidity data, in units of %, distributed over the range 0.00% to 7.99% with an error tolerance of 0.25%. After processing, the finite data space is divided into intervals, each mapped to a binary code, as shown in FIG. 4.
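The interval encoding described above can be illustrated with a minimal Python sketch. It assumes that equation (1) takes the form n = (U_max − U_min) / (2δ), rounded up; that the range 0.00% to 7.99% is treated as the half-open interval [0, 8); and that intervals are numbered in ascending order and given plain binary codes. The function names `code_length`, `encode` and `decode` are hypothetical, and the actual mapping in FIG. 4 may differ.

```python
import math

def code_length(u_min: float, u_max: float, delta: float) -> tuple[int, int]:
    """Return (n, L): the number of intervals and the fixed code length in bits."""
    n = math.ceil((u_max - u_min) / (2 * delta))   # assumed form of equation (1)
    length = max(1, (n - 1).bit_length())          # smallest L with 2**L >= n
    return n, length

def encode(value: float, u_min: float, u_max: float, delta: float) -> str:
    """Map a reading to the binary code of the interval that contains it."""
    n, length = code_length(u_min, u_max, delta)
    index = int((value - u_min) // (2 * delta))
    index = min(max(index, 0), n - 1)              # clamp out-of-range readings
    return format(index, f"0{length}b")

def decode(code: str, u_min: float, delta: float) -> tuple[float, float]:
    """Restore a code to the bounds of its data interval."""
    index = int(code, 2)
    low = u_min + index * 2 * delta
    return low, low + 2 * delta

print(code_length(0.0, 8.0, 0.25))   # (16, 4)
print(encode(1.6, 0.0, 8.0, 0.25))   # 0011
print(decode("0011", 0.0, 0.25))     # (1.5, 2.0)
```

Under these assumptions the humidity example yields 16 intervals of width 0.5% and 4-bit codes, so a reading is recovered only to within its interval, which is the error the user agreed to accept.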

Here a simple monitoring mechanism is used to build the prediction model, as shown in FIG. 5. It predicts the overall trend of change by monitoring the periodic change of each bit of the encoded data. For each bit, a bit space is created that stores a queue of the n most recent historical data periods, denoted [t_n, t_{n-1}, t_{n-2}, …, t_1]. A historical data period is the length of time for which the bit's state remained unchanged. The next change period T of each bit is predicted according to equation (3), as shown in FIG. 6.

T = (t_n + t_{n-1} + … + t_1) / n (3)

At each acquisition time, the prediction model uses the latest change time of each bit and the change period T, obtained by averaging the latest n historical data periods, to judge whether that bit needs to change at the current sampling time. The final overall prediction is assembled from the local predictions of the individual bits.
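The per-bit prediction can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: time is counted in acquisition cycles, the change period T is the plain average of the stored periods, and a bit is assumed to flip once the time elapsed since its latest change is at least T. The names `BitSpace`, `predict_bit` and `predict_code` are hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class BitSpace:
    state: int          # current state identifier of the bit (0 or 1)
    last_change: int    # acquisition time of the latest state change
    periods: list = field(default_factory=list)  # recent change periods

def predict_bit(bit: BitSpace, now: int) -> int:
    """Predict the state of one bit at acquisition time `now`."""
    if not bit.periods:
        return bit.state                        # no history yet: assume no change
    period = sum(bit.periods) / len(bit.periods)  # change period T (average)
    if now - bit.last_change >= period:         # assumed change condition
        return 1 - bit.state
    return bit.state

def predict_code(bits: list, now: int) -> str:
    """Assemble the overall prediction from the local per-bit predictions."""
    return "".join(str(predict_bit(b, now)) for b in bits)

bit = BitSpace(state=0, last_change=10, periods=[2, 2, 3, 4])  # T = 2.75
print(predict_bit(bit, 12))   # 0: only 2 cycles have elapsed
print(predict_bit(bit, 13))   # 1: 3 cycles have elapsed, which is >= 2.75
```

Because the function depends only on stored history and the current time, the sensor node and the sink node produce the same output as long as their bit spaces are kept in step.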

The data message is sent to the sink node by the sensor node. The message format adopts UTF-8 coding.

Start symbol	Message source	Message sequence number	Message body	Check code	End symbol
1 byte	2 bytes	1 byte	N bytes	2 bytes	1 byte

Start symbol: 0x23 (#).

Message source: the sensor ID. With 2 bytes, the entire network can accommodate 256 × 256 = 65,536 sensor nodes.

Message sequence number: marks the order of messages. For messages from the same node, it counts from 0x00 to 0xFF in sending order and restarts every 256 messages.

Message body: contains all the information carried by the message.

Check code: computed over the bytes from the start symbol to the end of the message body, using CRC-16/IBM.

End symbol: 0x25 (%).

For example, suppose the sink node receives the message 0x23FF0403646C0925. Its contents are: start symbol 0x23, message source 0xFF04, message sequence number 0x03, message body 0x64, check code 0x6C09, and end symbol 0x25.
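The message layout can be exercised with a short Python sketch. Several details are assumptions rather than statements from the source: CRC-16/IBM is taken to mean the common CRC-16/ARC parameter set (reflected polynomial 0xA001, initial value 0x0000), the message source and check code are treated as big-endian as they are read off in the example, and the receiver is assumed to know the frame boundaries already. The names `crc16_arc`, `build_frame` and `parse_frame` are hypothetical, and the sketch makes no claim about whether the check code in the example above validates under these assumptions.

```python
START, END = 0x23, 0x25   # '#' and '%'

def crc16_arc(data: bytes) -> int:
    """CRC-16/ARC: reflected polynomial 0xA001, initial value 0x0000."""
    crc = 0x0000
    for byte in data:
        crc ^= byte
        for _ in range(8):
            crc = (crc >> 1) ^ 0xA001 if crc & 1 else crc >> 1
    return crc

def build_frame(source: int, seq: int, body: bytes) -> bytes:
    """Assemble start symbol, source, sequence number, body, check code, end symbol."""
    head = bytes([START]) + source.to_bytes(2, "big") + bytes([seq & 0xFF]) + body
    return head + crc16_arc(head).to_bytes(2, "big") + bytes([END])

def parse_frame(frame: bytes) -> dict:
    """Split one complete frame into its fields and report the CRC comparison."""
    if len(frame) < 7 or frame[0] != START or frame[-1] != END:
        raise ValueError("malformed frame")
    check = int.from_bytes(frame[-3:-1], "big")
    return {
        "source": int.from_bytes(frame[1:3], "big"),
        "seq": frame[3],
        "body": frame[4:-3],
        "check": check,
        "crc_ok": check == crc16_arc(frame[:-3]),  # covers start symbol .. body
    }

fields = parse_frame(bytes.fromhex("23FF0403646C0925"))
print(hex(fields["source"]), fields["seq"], fields["body"].hex(), hex(fields["check"]))
# 0xff04 3 64 0x6c09
```

Taking the body as everything between the sequence number and the last three bytes lets the body contain the byte 0x25 without confusing the parser, at the cost of requiring the frame length to be known from the transport.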

Advantageous effects

1. Processing is based on the sensor's binary values, so no floating-point conversion is needed. Computation works on binary codes from beginning to end, which makes it faster and cheaper.

2. The user's error tolerance is applied at the beginning rather than at the end, so the data volume is reduced from the start and all subsequent computation is simplified.

3. The encoded data is what is predicted and transmitted, which reduces energy consumption.

Drawings

FIG. 1 is a schematic diagram of the device deployment;

FIG. 2 is a sink node workflow diagram;

FIG. 3 is a sensor node workflow diagram;

FIG. 4 is a schematic view of the process of converting data collected by a sensor into the corresponding encoded data;

FIG. 5 is a schematic diagram of the monitoring mechanism;

FIG. 6 is a state update flow chart.

Detailed Description

How the model works is described below with specific cases. At each predetermined acquisition time, the sensor node converts the collected environmental data into its corresponding code and compares it with the prediction data generated by the prediction model for the same moment. If the two are the same, the send operation is suppressed; otherwise the data is sent. The queue holding the historical data periods is then updated. Meanwhile, at the other end, the sink node checks whether it has received data for the predetermined acquisition time. If it has, the received data is used as the real-time data record. If it has not, the sink node generates a value from its own prediction model and uses it as the real-time data record; this value is the same as the one generated by the prediction model running on the sensor node. The sink node then updates its queue of historical data periods. Finally, because the data pool stores encoded data, the codes must be restored to data intervals according to the preset encoding rule when the data pool is used.

Suppose the sensor node collects data encoded as 0011 in real time, and the prediction model also generates 0011. The sensor node considers the prediction successful and suppresses the upload. Because the sensor node and the sink node run the same prediction model, the model at the sink node also generates 0011. When the predetermined acquisition period ends without the sink node having received any data, the sink node considers the generated data 0011 correct and stores it in the data pool. Both the sensor node and the sink node then update their prediction models with the data 0011.

Suppose instead the sensor node collects data encoded as 0100 in real time, while the prediction model generates 0011. The sensor node considers the prediction failed and uploads the data. At the sink node, the prediction model also produces 0011. Because the sink node receives the data 0100 within the predetermined acquisition period, it considers the prediction failed, discards the predicted data 0011, and stores 0100 in the data pool. Both the sensor node and the sink node then update their prediction models with the data 0100.
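The two cases above reduce to a pair of small decisions, sketched here in Python. The function names `sensor_step` and `sink_step` are hypothetical, the predicted code is passed in as a parameter rather than computed, and a lost packet is not modelled: the sketch assumes that silence always means the prediction held.

```python
from typing import Optional

def sensor_step(actual: str, predicted: str) -> Optional[str]:
    """Return the code to transmit, or None when transmission is suppressed."""
    return None if actual == predicted else actual

def sink_step(received: Optional[str], predicted: str) -> str:
    """Return the code the sink stores in its data pool for this period."""
    return predicted if received is None else received

data_pool = []
for actual, predicted in [("0011", "0011"), ("0100", "0011")]:
    sent = sensor_step(actual, predicted)        # sensor side
    data_pool.append(sink_step(sent, predicted)) # sink side
    print(actual, predicted, "suppressed" if sent is None else "sent")

print(data_pool)   # ['0011', '0100']
```

In both cases the sink ends up storing the code the sensor actually observed, and both ends can then update their models with that same code.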

0-1 state machine:

After the acquisition period ends, suppose the prediction model is updated with the data 0100. The data is decomposed in order into four bit state values, 0, 1, 0 and 0, and the information stored for each bit is then updated. Take the first bit, with state value 0, as an example. The value 0 is input into the first bit space of the model. This bit space stores a period queue of the latest n = 4 state changes, [2, 2, 3, 4]; the state identifier of the bit (0 or 1), that is, its previous data state; and the time point of its latest state change. The first element of [2, 2, 3, 4], 2, is the most recent data change period, and the last element, 4, is the oldest. If the input value 0 differs from the current bit identifier, that is, the identifier is 1, then the period queue is updated to [1, 2, 2, 3], the bit identifier is updated to 0, and the latest state change time point is updated. If the current bit identifier is already 0, the state has not changed: the bit identifier and the latest state change time point are left as they are, and only the period queue is changed, to [3, 2, 3, 4].
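The update rule in this example can be written as a minimal Python sketch. It assumes that each bit's stored information is a tuple of state identifier, latest change time and period queue with the newest period first, and that periods are counted in acquisition cycles. The names `update_bit` and `update_model` are hypothetical.

```python
def update_bit(state, last_change, periods, value, now):
    """Return the updated (state, last_change, periods) for one bit."""
    if value != state:
        # State changed: open a new period of length 1 and drop the oldest one.
        return value, now, [1] + periods[:-1]
    # State unchanged: the current (front) period grows by one cycle.
    return state, last_change, [periods[0] + 1] + periods[1:]

def update_model(model, code, now):
    """Apply the per-bit update for every bit of an encoded value such as '0100'."""
    return [update_bit(s, t, p, int(c), now) for (s, t, p), c in zip(model, code)]

# First bit of 0100 is 0; the stored identifier is 1, so the state has changed.
print(update_bit(1, 5, [2, 2, 3, 4], 0, 7))   # (0, 7, [1, 2, 2, 3])
# Stored identifier is already 0, so only the front period is extended.
print(update_bit(0, 5, [2, 2, 3, 4], 0, 7))   # (0, 5, [3, 2, 3, 4])
```

Both outputs reproduce the queues given in the example, [1, 2, 2, 3] for a state change and [3, 2, 3, 4] for an unchanged state.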

When the prediction model is required to generate data, it takes, for each bit, the average of that bit's change period queue as the bit's change period, estimates from it whether the bit changes at the current time point, and outputs the predicted state of the bit. The final overall prediction is assembled from the local predictions of the individual bits.

Claims

1. A variable-granularity real-time environment data acquisition method for the Internet of Things, wherein the hardware comprises sensor nodes and sink nodes, the sensor nodes being responsible for acquiring, processing and sending data, and the sink nodes being responsible for receiving and processing the data and being the final destination of the data, characterized in that: the same prediction model is deployed on the sensor node and on the sink node, and the data traffic between the two ends is reduced by predicting data; the sensor node and the sink node have the same acquisition period; at each predetermined acquisition time, the sensor node acquires and encodes environmental data, and the prediction model on the sensor node generates real-time prediction data; the sensor node compares the prediction data with the actually acquired data, and if they are the same, communication is suppressed; if they are different, the actually acquired data is sent; meanwhile, at the sink node, the prediction model deployed on the sink node also generates real-time prediction data at the predetermined acquisition time; after the acquisition time has passed, if the sink node has received data, the received data is taken as the real-time data, and if it has not, the prediction is considered satisfied and the prediction data is taken as the real-time data.

2. The acquisition method according to claim 1, characterized in that the encoded data generated by the sensor node is produced as follows:

the real environment data lies in a finite data space, within (U_min, U_max); the data space is divided into intervals of length 2δ, where δ is the error allowed by the prediction model, and n interval segments are generated according to formula (1),

n = (U_max − U_min) / (2δ) (1)

a fixed-length coding mode is adopted, in which each interval segment corresponds to a binary code of length L, where L is obtained from formula (2),

L = log₂ n (2).

3. The acquisition method according to claim 1, wherein the prediction model is used to predict the encoded data, and the prediction model uses a monitoring mechanism to generate a binary code of the same length as the encoded data; specifically, the change trend of the whole binary code is predicted by monitoring the periodic change of each bit of the binary code; for each bit, a queue [t_n, t_{n-1}, t_{n-2}, …, t_1] is established to store the historical data change periods, the latest change time of the bit is recorded, and the state identifier of the bit is stored; the average of the latest n historical data periods is then taken as the next change period T of the bit; when prediction data needs to be generated, the prediction model judges whether each bit needs to change at that moment, and the final overall prediction is generated from the local predictions of the individual bits, wherein the condition for the i-th bit to change is:

the time elapsed from the latest change time of the i-th bit to the current prediction time t_k reaches the average of the latest n historical data periods of the i-th bit.