Movatterモバイル変換


[0]ホーム

URL:


CN113514544A - Mobile robot pavement material identification method based on sound characteristics - Google Patents

Mobile robot pavement material identification method based on sound characteristics
Download PDF

Info

Publication number
CN113514544A
CN113514544ACN202011597385.6ACN202011597385ACN113514544ACN 113514544 ACN113514544 ACN 113514544ACN 202011597385 ACN202011597385 ACN 202011597385ACN 113514544 ACN113514544 ACN 113514544A
Authority
CN
China
Prior art keywords
sound data
road surface
surface material
pavement
mel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011597385.6A
Other languages
Chinese (zh)
Inventor
安毅
崔允浩
孙伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University of Technology
Original Assignee
Dalian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University of TechnologyfiledCriticalDalian University of Technology
Priority to CN202011597385.6ApriorityCriticalpatent/CN113514544A/en
Publication of CN113514544ApublicationCriticalpatent/CN113514544A/en
Pendinglegal-statusCriticalCurrent

Links

Images

Classifications

Landscapes

Abstract

Translated fromChinese

本发明属于移动机器人路况识别与自主导航技术领域,一种基于声音特征的移动机器人路面材质识别方法,包括以下步骤:(1)路面材质声音数据集的采集与建立,(2)路面材质声音数据预处理,(3)路面材质声音的梅尔倒谱特征提取,(4)构建深度卷积神经网络进行训练,(5)基于训练好的神经网络模型进行路面材质识别。本发明具有以下优点:一是,本发明从听觉的角度利用声音特征实现对路面材质的分类识别,能够有效弥补视觉模态的不足;二是,本发明基于梅尔倒谱特征实现了对路面材质声音特征的有效提取,并搭建了深度卷积神经网络模型,能够基于材质声音特征有效进行多材质路面识别。

Figure 202011597385

The invention belongs to the technical field of road condition recognition and autonomous navigation of mobile robots, and a method for recognizing road surface materials of mobile robots based on sound features includes the following steps: (1) collection and establishment of road material sound data sets, (2) road material sound data Preprocessing, (3) Mel cepstrum feature extraction of pavement material sounds, (4) building a deep convolutional neural network for training, (5) identifying pavement materials based on the trained neural network model. The present invention has the following advantages: firstly, the present invention utilizes sound features to classify and identify pavement materials from the perspective of hearing, which can effectively make up for the deficiencies of visual modalities; Effective extraction of material and sound features, and a deep convolutional neural network model is built, which can effectively identify multi-material road surfaces based on material and sound features.

Figure 202011597385

Description

Mobile robot pavement material identification method based on sound characteristics
Technical Field
The invention relates to a mobile robot pavement material identification method based on sound characteristics, and belongs to the technical field of mobile robot road condition identification and autonomous navigation.
Background
Currently, intelligent products are beginning to appear in every corner of people's life, and can replace people to complete various works, and one of the most widespread applications is intelligent mobile robots. So far, the intelligent mobile robot technology has become mature and perfect, and the functions and application range are continuously expanded. Mobile robots have been successfully used in many fields, such as education, scientific research, medical care, agriculture, industry, logistics, etc.
The autonomous navigation is one of basic functions required by the intelligent mobile robot, and the current road material is accurately known in the outdoor autonomous mobile driving process, which is very important for realizing the autonomous navigation function. The related personnel at home and abroad carry out a great deal of research on the road surface material identification direction, and the real-time, comprehensive, accurate, rapid and reliable road surface material identification technology is the key and difficult point of the current research. At present, the research on the quality classification of pavement materials is mainly divided into three aspects based on laser radar, cameras and other types of sensors (acceleration sensors, optical sensors and the like). Among them, the method based on the laser radar and the camera is to recognize the road surface environment by simulating human vision. However, when the mobile robot encounters different material pavements with similar colors and textures or special conditions such as low-visibility haze weather, correct judgment is difficult to make in a visual mode. Most of the other sensor-based methods focus on the identification of road parameters, including the identification of adhesion coefficients, the identification of road unevenness, the identification of soft and hard roads, and the like, and an effective road material identification method is lacked.
In order to accurately and effectively identify the road surface material, the invention explores a novel road surface material identification method of the mobile robot based on the sound characteristics. The classification and recognition of the road surface material are realized by analyzing and judging the sound information of the road surface material in consideration of analysis from the perspective of hearing. The method can effectively make up for the deficiency of the visual mode.
Disclosure of Invention
In order to overcome the defects in the prior art, the invention aims to provide a mobile robot pavement material identification method based on sound characteristics so as to improve the autonomous navigation capability of the mobile robot. The method comprises the steps of classifying and dividing common road surfaces, collecting sound data of the road surface material, establishing an effective road surface material sound data set, and preprocessing original road surface material sound data, wherein the preprocessing comprises operations of filtering and noise reduction, framing and windowing, end point detection and the like, so that effective fragments of the road surface material sound are obtained. And extracting frequency domain characteristics of sound data of different road surface materials, building a neural network, and performing characteristic learning by taking the sound characteristics of the road surface materials as input signals of the network, thereby realizing classification and identification of the different road surface materials. The invention enriches the technology and theory of pavement material identification, becomes a very beneficial attempt, and has important theoretical significance and application value.
In order to achieve the purpose of the invention and solve the problems existing in the prior art, the invention adopts the technical scheme that: a mobile robot pavement material identification method based on sound characteristics comprises the following steps:
step 1, collecting and establishing a road surface material sound data set: the pavement materials are divided into 10 categories, namely grassland pavement, ceramic tile pavement, asphalt pavement, natural stone pavement, cement pavement, rubber pavement, floor tile pavement, wood board pavement, cobblestone pavement and gravel pavement; collecting road surface material sound data by adopting a sound collector according to the rule that every 50 knocks of each type of road surface form a group, and establishing a road surface material sound data set S ═ SiI is more than or equal to 1 and less than or equal to 10, wherein i is the road surface material type, and S isiThe data is the sound data of the i-th type pavement material;
step 2, preprocessing sound data of the road surface material: through segmentation, filtering noise reduction, framing windowing and endpoint detection preprocessing operations, the quality of the sound data of the pavement material is improved, and noise influence is removed, and the method specifically comprises the following substeps:
(a) and (3) dividing: the recording of the road surface material sound is carried out according to a 50-time continuous knocking method, the original data needs to be further divided, the sound of knocking the road surface material each time is used as a single road surface material sound data, i-th class road surface material sound data Si={sijJ is more than or equal to 1 and less than or equal to 50, wherein sijJ is the knocking times corresponding to the road surface material sound data;
(b) filtering and denoising: filtering the collected road surface material sound data through a high-pass filter, describing through a formula (1),
Figure BDA0002866842640000031
where α is the filter coefficient, sij(t) is the original data at the current moment,
Figure BDA0002866842640000032
for the road surface material sound data at the time t after the filtering and noise reduction processing, the frequency spectrum can be balanced through a filter, the signal-to-noise ratio (SNR) is improved, the proportion of low-frequency information can be suppressed, and the proportion of high-frequency useful components is relatively increased;
(c) framing and windowing: on the whole, the overall road surface material sound characteristic of the knocking road surface has time-varying characteristic and is not stable road surface material sound data, but the recorded road surface material sound data is considered to be stable in a short time, in order to meet the Fourier transformation condition, the road surface material sound data needs to be divided into short-time stable road surface material sound data, the frame division and interception time period ranges from 10 ms to 30ms, in order to ensure that the characteristic parameter continuously and stably changes, some frames are uniformly reserved between two frames without overlapping parts, the change of each overlapping part is called frame shift, the value range is 1/4-1/2 of the frame length, and the road surface material sound data f after frame division and framing is between 1/4-1/2ijWindowing to reduce sub-frame artifactsInfluence, windowed road texture Sound data cij=fijXxw, wherein the window function w adopts a Hamming window;
(d) and (3) end point detection: determining the initial position and the end position of the effective road material sound data by setting different volume thresholds, and firstly calculating the windowed road material sound data cijVolume maximum value vmax inijAnd minimum value vminijAnd subtracting to obtain the transformation range value vdif of the road surface material sound dataijRespectively taking vmin from the lowest pavement material sound data as a starting pointij+vdifij×0.1、vminij+vdifij×0.01、vminij+vdifijTaking three different threshold values of x 0.05 as the end point position judgment of the road surface material sound data, starting to record the starting point when the threshold value is exceeded, judging whether the sound data at the next moment is larger than the threshold value or not until the end point lower than the threshold value is detected and recorded, and taking the road surface material sound data between the final starting point position and the final position lower than the threshold value as the effective road surface material sound data e for knocking the road surfaceij
Step 3, Mel cepstrum feature extraction of the road surface material sound: the Mel cepstrum feature is a most common feature extraction method in sound data, and has the advantages of strong anti-interference capability and capability of reflecting feature differences among different sound data, so that the sound data of the road surface material can be effectively distinguished, and the method specifically comprises the following substeps:
(a) road surface material sound data e obtained by detecting end pointsijCarrying out one-dimensional Fourier transform to obtain corresponding frequency domain signals
Figure BDA0002866842640000041
Wherein N represents the number of sampling points in each frame of pavement material sound data, k is the sampling point, and the power spectrum gij=|pij(k)|2
(b) The power spectrum g of each frame of road surface material sound dataijPutting the obtained mixture into a Mel triangular filter bank for filtering, and calculating logarithmic energy qijAnd is described by the following formula,
Figure BDA0002866842640000042
Figure BDA0002866842640000043
wherein h isijRepresenting the frequency response of the mel filters, M being the number of mel filters;
(c) to q isijDiscrete cosine transform is carried out to obtain the final Mel cepstrum characteristic coefficient, which is described by the following formula,
Figure BDA0002866842640000044
wherein melijRepresenting a Mel cepstrum characteristic coefficient, v representing the order of the Mel cepstrum coefficient, and N representing the number of sampling points in each frame of pavement material sound data;
(d) for the obtained mel-frequency cepstrum characteristic coefficient melijCarrying out artificial marking and constructing a training set;
step 4, constructing a deep convolution neural network for training: constructing a deep convolutional neural network, and based on Mel cepstrum characteristic coefficients mel of sound data of different pavement materialsijClassifying to realize the identification of the pavement material, and mainly comprising the following substeps:
(a) enhancing the characteristics, adding some auxiliary characteristics, such as chrominance frequency characteristics, Mel frequency spectrum characteristics, spectral contrast characteristics and hue centroid characteristics in audio, forming 193-dimensional characteristic vectors through characteristic splicing and fusion, finally forming 196-dimensional characteristic vectors through zero padding and alignment, further converting the characteristics into a two-dimensional matrix form (14 × 14), and performing dimension expansion on an input end in the network to obtain a 3D tensor (14 × 14 × 1);
(b) the encoding end comprises 1 convolutional layer Conv1, parameters areconvolutional kernel size 1 multiplied by 1,step length 1, same filling and output channel number 32, Batch Normalization (BN) processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; 3 lightweight depth separable convolutional layers DConv2, DConv3, DConv4 connected in series; the parameters of DConv2 are convolution kernel size 3 × 3,step size 1, same filling and output channel number 32, batch normalization processing is performed, and a Leaky _ ReLU activation function is used for activation; parameters of the pooling layer Maxpool1 are 2 × 2 maximum pooling, step size 2, no padding; the parameters of DConv3 are convolution kernel size 3 × 3,step size 1, same filling and output channel number 64, batch normalization processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; the parameters of DConv4 are convolution kernel size 3 × 3,step size 1, same filling and output channel number 64, batch normalization processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; parameters of the pooling layer Maxpool2 are 3 × 3 maximal pooling, step length 2 andfilling value 1, and then the parameters are flattened into 1024-dimensional feature vectors;
(c) the decoding end first performs Dropout operation to prevent overfitting, the ratio is set to 0.5, and then 3 full-connection layers Dense1, Dense2 and Dense3 are included; wherein the number of output neurons of the Dense1 is 512, and a Leaky _ ReLU activation function is adopted for activation; the number of output neurons of Dense2 is 128, a Leaky _ ReLU activation function is adopted for activation, and the number of output neurons of Dense3 is 10;
and 5, identifying the pavement material based on the trained neural network model.
The invention has the beneficial effects that: a mobile robot pavement material identification method based on sound characteristics comprises the following steps: (1) the method comprises the steps of (1) collecting and establishing a road surface material sound data set, (2) preprocessing road surface material sound data, (3) extracting Mel cepstrum characteristics of road surface material sounds, (4) constructing a deep convolution neural network for training, and (5) identifying the road surface material based on a trained neural network model. Compared with the prior art, the invention has the following advantages: firstly, the method realizes the classification and identification of the pavement material by using the sound characteristics from the perspective of hearing, and can effectively make up for the deficiency of visual mode; secondly, the method realizes effective extraction of the road surface material sound characteristics based on the Mel cepstrum characteristics, builds a deep convolution neural network model, and can effectively perform multi-material road surface identification based on the material sound characteristics.
Drawings
FIG. 1 is a flow chart of the method steps of the present invention.
FIG. 2 is a diagram of road surface texture images and corresponding texture audio data.
In the figure: (a) the sound data map is a sound data map of a grass road surface, (b) the sound data map is a sound data map of a grass road surface, (c) the sound data map is a sound data map of a tile road surface, (d) the sound data map is a sound data map of a tile road surface, (e) the sound data map is an asphalt road surface, (f) the sound data map is an asphalt road surface, (g) the sound data map is a sound data map of a natural stone road surface, (h) the sound data map is a sound data map of a natural stone road surface, (i) the sound data map is a sound data map of a cement road surface, (j) the sound data map is a sound data map of a cement road surface, (k) the sound data map of a rubber road surface, (l) the sound data map of a rubber road surface, (m) the sound data map of a road tile, (n) the sound data map of a road tile, (o) the sound data map of a wood road surface, (p) the sound data map of a wood road surface, (q) the sound data map of a cobblestone road surface, (r) the cobblestone pavement, (s road surface,(s) the sound data map of a gravel road surface, and (t) the sound data map of a gravel road surface.
FIG. 3 is a diagram of road surface material classification training accuracy based on a neural network.
Detailed Description
The invention will be further explained with reference to the drawings.
As shown in fig. 1, a method for identifying a road surface material of a mobile robot based on sound characteristics includes the following steps:
step 1, collecting and establishing a road surface material sound data set: the pavement materials are divided into 10 categories, namely grassland pavement, ceramic tile pavement, asphalt pavement, natural stone pavement, cement pavement, rubber pavement, floor tile pavement, wood board pavement, cobblestone pavement and gravel pavement; collecting road surface material sound data by adopting a sound collector according to the rule that every 50 knocks of each type of road surface form a group, and establishing a road surface material sound data set S ═ SiI is more than or equal to 1 and less than or equal to 10, wherein i is the road surface material type, and S isiThe i-th class road surface texture sound data, the road surface texture image and the corresponding sound data are shown in fig. 2.
Step 2, preprocessing sound data of the road surface material: through segmentation, filtering noise reduction, framing windowing and endpoint detection preprocessing operations, the quality of the sound data of the pavement material is improved, and noise influence is removed, and the method specifically comprises the following substeps:
(a) and (3) dividing: the recording of the road surface material sound is carried out according to a 50-time continuous knocking method, the original data needs to be further divided, the sound of knocking the road surface material each time is used as a single road surface material sound data, i-th class road surface material sound data Si={sijJ is more than or equal to 1 and less than or equal to 50, wherein sijJ is the knocking times corresponding to the road surface material sound data;
(b) filtering and denoising: filtering the collected road surface material sound data through a high-pass filter, describing through a formula (1),
Figure BDA0002866842640000071
where α is the filter coefficient, sij(t) is the original data at the current moment,
Figure BDA0002866842640000072
for the road surface material sound data at the time t after the filtering and noise reduction processing, the frequency spectrum can be balanced through a filter, the signal-to-noise ratio (SNR) is improved, the proportion of low-frequency information can be suppressed, and the proportion of high-frequency useful components is relatively increased;
(c) framing and windowing: on the whole, the overall road surface material sound characteristic of the knocking road surface has time-varying characteristic and is not stable road surface material sound data, but the recorded road surface material sound data is considered to be stable in a short time, in order to meet the Fourier transformation condition, the road surface material sound data needs to be divided into short-time stable road surface material sound data, the frame division and interception time period ranges from 10 ms to 30ms, in order to ensure that the characteristic parameter continuously and stably changes, some frames are uniformly reserved between two frames without overlapping parts, the change of each overlapping part is called frame shift, the value range is 1/4-1/2 of the frame length, and the road surface material sound data f after frame division and framing is between 1/4-1/2ijWindowing to reduce the influence of framing, windowed road surface material sound data cij=fijXxw, wherein the window function w adopts a Hamming window;
(d) and (3) end point detection: determining the initial position and the end position of the effective road material sound data by setting different volume thresholds, and firstly calculating the windowed road material sound data cijVolume maximum value vmax inijAnd minimum value vminijAnd subtracting to obtain the transformation range value vdif of the road surface material sound dataijRespectively taking vmin from the lowest pavement material sound data as a starting pointij+vdifij×0.1、vminij+vdifij×0.01、vminij+vdifijTaking three different threshold values of x 0.05 as the end point position judgment of the road surface material sound data, starting to record the starting point when the threshold value is exceeded, judging whether the sound data at the next moment is larger than the threshold value or not until the end point lower than the threshold value is detected and recorded, and taking the road surface material sound data between the final starting point position and the final position lower than the threshold value as the effective road surface material sound data e for knocking the road surfaceij
Step 3, Mel cepstrum feature extraction of the road surface material sound: the Mel cepstrum feature is a most common feature extraction method in sound data, and has the advantages of strong anti-interference capability and capability of reflecting feature differences among different sound data, so that the sound data of the road surface material can be effectively distinguished, and the method specifically comprises the following substeps:
(a) road surface material sound data e obtained by detecting end pointsijCarrying out one-dimensional Fourier transform to obtain corresponding frequency domain signals
Figure BDA0002866842640000081
Wherein N represents the number of sampling points in each frame of pavement material sound data, k is the sampling point, and the power spectrum gij=|pij(k)|2
(b) The power spectrum g of each frame of road surface material sound dataijPutting the obtained mixture into a Mel triangular filter bank for filtering, and calculating logarithmic energy qijAnd is described by the following formula,
Figure BDA0002866842640000082
Figure BDA0002866842640000083
wherein h isijRepresenting the frequency response of the mel filters, M being the number of mel filters;
(c) to q isijDiscrete cosine transform is carried out to obtain the final Mel cepstrum characteristic coefficient, which is described by the following formula,
Figure BDA0002866842640000084
wherein melijRepresenting a Mel cepstrum characteristic coefficient, v representing the order of the Mel cepstrum coefficient, and N representing the number of sampling points in each frame of pavement material sound data;
(d) for the obtained mel-frequency cepstrum characteristic coefficient melijCarrying out artificial marking and constructing a training set;
step 4, constructing a deep convolution neural network for training: constructing a deep convolutional neural network, and based on Mel cepstrum characteristic coefficients mel of sound data of different pavement materialsijClassifying to realize the identification of the pavement material, and mainly comprising the following substeps:
(a) enhancing the characteristics, adding some auxiliary characteristics, such as chrominance frequency characteristics, Mel frequency spectrum characteristics, spectral contrast characteristics and hue centroid characteristics in audio, forming 193-dimensional characteristic vectors through characteristic splicing and fusion, finally forming 196-dimensional characteristic vectors through zero padding and alignment, further converting the characteristics into a two-dimensional matrix form (14 × 14), and performing dimension expansion on an input end in the network to obtain a 3D tensor (14 × 14 × 1);
(b) the encoding end comprises 1 convolutional layer Conv1, parameters areconvolutional kernel size 1 multiplied by 1,step length 1, same filling and output channel number 32, Batch Normalization (BN) processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; 3 lightweight depth separable convolutional layers DConv2, DConv3, DConv4 connected in series; the parameters of DConv2 are convolution kernel size 3 × 3,step size 1, same filling and output channel number 32, batch normalization processing is performed, and a Leaky _ ReLU activation function is used for activation; parameters of the pooling layer Maxpool1 are 2 × 2 maximum pooling, step size 2, no padding; the parameters of DConv3 are convolution kernel size 3 × 3,step size 1, same filling and output channel number 64, batch normalization processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; the parameters of DConv4 are convolution kernel size 3 × 3,step size 1, same filling and output channel number 64, batch normalization processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; parameters of the pooling layer Maxpool2 are 3 × 3 maximal pooling, step length 2 andfilling value 1, and then the parameters are flattened into 1024-dimensional feature vectors;
(c) the decoding end first performs Dropout operation to prevent overfitting, the ratio is set to 0.5, and then 3 full-connection layers Dense1, Dense2 and Dense3 are included; wherein the number of output neurons of the Dense1 is 512, and a Leaky _ ReLU activation function is adopted for activation; the number of output neurons of Dense2 is 128, a Leaky _ ReLU activation function is adopted for activation, and the number of output neurons of Dense3 is 10;
and 5, identifying the road surface material based on the trained neural network model, wherein the result is shown in figure 3.

Claims (1)

1. A mobile robot pavement material identification method based on sound characteristics is characterized by comprising the following steps:
step 1, collecting and establishing a road surface material sound data set: the pavement materials are divided into 10 categories, namely grassland pavement, ceramic tile pavement, asphalt pavement, natural stone pavement, cement pavement, rubber pavement, floor tile pavement, wood board pavement, cobblestone pavement and gravel pavement; collecting road surface material sound data by adopting a sound collector according to the rule that every 50 knocks of each type of road surface form a group, and establishing a road surface material sound data set S ═ SiI is more than or equal to 1 and less than or equal to 10, wherein i is the road surface material type, and S isiThe data is the sound data of the i-th type pavement material;
step 2, preprocessing sound data of the road surface material: through segmentation, filtering noise reduction, framing windowing and endpoint detection preprocessing operations, the quality of the sound data of the pavement material is improved, and noise influence is removed, and the method specifically comprises the following substeps:
(a) and (3) dividing: the recording of the road surface material sound is carried out according to a 50-time continuous knocking method, the original data needs to be further divided, the sound of knocking the road surface material each time is used as a single road surface material sound data, i-th class road surface material sound data Si={sijJ is more than or equal to 1 and less than or equal to 50, wherein sijJ is the knocking times corresponding to the road surface material sound data;
(b) filtering and denoising: filtering the collected road surface material sound data through a high-pass filter, describing through a formula (1),
Figure FDA0002866842630000011
where α is the filter coefficient, sij(t) is the original data at the current moment,
Figure FDA0002866842630000012
for the road surface material sound data at the time t after the filtering and noise reduction processing, the frequency spectrum can be balanced through a filter, the signal-to-noise ratio (SNR) is improved, the proportion of low-frequency information can be suppressed, and the proportion of high-frequency useful components is relatively increased;
(c) framing and windowing: on the whole, the overall road surface material sound characteristic of the knocking road surface has time-varying characteristic and is not stable road surface material sound data, but the recorded road surface material sound data is considered to be stable in a short time, in order to meet the Fourier transformation condition, the road surface material sound data needs to be divided into short-time stable road surface material sound data, the frame division and interception time period ranges from 10 ms to 30ms, in order to ensure that the characteristic parameter continuously and stably changes, some frames are uniformly reserved between two frames without overlapping parts, the change of each overlapping part is called frame shift, the value range is 1/4-1/2 of the frame length, and the road surface material sound data f after frame division and framing is between 1/4-1/2ijPerforming windowing to reduce pointsInfluence of frame, sound data c of windowed road surface materialij=fijXxw, wherein the window function w adopts a Hamming window;
(d) and (3) end point detection: determining the initial position and the end position of the effective road material sound data by setting different volume thresholds, and firstly calculating the windowed road material sound data cijVolume maximum value vmax inijAnd minimum value vminijAnd subtracting to obtain the transformation range value vdif of the road surface material sound dataijRespectively taking vmin from the lowest pavement material sound data as a starting pointij+vdifij×0.1、vminij+vdifij×0.01、vminij+vdifijTaking three different threshold values of x 0.05 as the end point position judgment of the road surface material sound data, starting to record the starting point when the threshold value is exceeded, judging whether the sound data at the next moment is larger than the threshold value or not until the end point lower than the threshold value is detected and recorded, and taking the road surface material sound data between the final starting point position and the final position lower than the threshold value as the effective road surface material sound data e for knocking the road surfaceij
Step 3, Mel cepstrum feature extraction of the road surface material sound: the Mel cepstrum feature is a most common feature extraction method in sound data, and has the advantages of strong anti-interference capability and capability of reflecting feature differences among different sound data, so that the sound data of the road surface material can be effectively distinguished, and the method specifically comprises the following substeps:
(a) road surface material sound data e obtained by detecting end pointsijCarrying out one-dimensional Fourier transform to obtain corresponding frequency domain signals
Figure FDA0002866842630000021
Wherein N represents the number of sampling points in each frame of pavement material sound data, k is the sampling point, and the power spectrum gij=|pij(k)|2
(b) The power spectrum g of each frame of road surface material sound dataijPutting into Mel triangular filter bank for filtering, and calculating pairsNumber energy qijAnd is described by the following formula,
Figure FDA0002866842630000031
Figure FDA0002866842630000032
wherein h isijRepresenting the frequency response of the mel filters, M being the number of mel filters;
(c) to q isijDiscrete cosine transform is carried out to obtain the final Mel cepstrum characteristic coefficient, which is described by the following formula,
Figure FDA0002866842630000033
wherein melijRepresenting a Mel cepstrum characteristic coefficient, v representing the order of the Mel cepstrum coefficient, and N representing the number of sampling points in each frame of pavement material sound data;
(d) for the obtained mel-frequency cepstrum characteristic coefficient melijCarrying out artificial marking and constructing a training set;
step 4, constructing a deep convolution neural network for training: constructing a deep convolutional neural network, and based on Mel cepstrum characteristic coefficients mel of sound data of different pavement materialsijClassifying to realize the identification of the pavement material, and mainly comprising the following substeps:
(a) enhancing the characteristics, adding some auxiliary characteristics, such as chrominance frequency characteristics, Mel frequency spectrum characteristics, spectral contrast characteristics and hue centroid characteristics in audio, forming 193-dimensional characteristic vectors through characteristic splicing and fusion, finally forming 196-dimensional characteristic vectors through zero padding and alignment, further converting the characteristics into a two-dimensional matrix form (14 × 14), and performing dimension expansion on an input end in the network to obtain a 3D tensor (14 × 14 × 1);
(b) the encoding end comprises 1 convolutional layer Conv1, parameters are convolutional kernel size 1 multiplied by 1, step length 1, same filling and output channel number 32, Batch Normalization (BN) processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; 3 lightweight depth separable convolutional layers DConv2, DConv3, DConv4 connected in series; the parameters of DConv2 are convolution kernel size 3 × 3, step size 1, same filling and output channel number 32, batch normalization processing is performed, and a Leaky _ ReLU activation function is used for activation; parameters of the pooling layer Maxpool1 are 2 × 2 maximum pooling, step size 2, no padding; the parameters of DConv3 are convolution kernel size 3 × 3, step size 1, same filling and output channel number 64, batch normalization processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; the parameters of DConv4 are convolution kernel size 3 × 3, step size 1, same filling and output channel number 64, batch normalization processing is carried out, and a Leaky _ ReLU activation function is adopted for activation; parameters of the pooling layer Maxpool2 are 3 × 3 maximal pooling, step length 2 and filling value 1, and then the parameters are flattened into 1024-dimensional feature vectors;
(c) the decoding end first performs Dropout operation to prevent overfitting, the ratio is set to 0.5, and then 3 full-connection layers Dense1, Dense2 and Dense3 are included; wherein the number of output neurons of the Dense1 is 512, and a Leaky _ ReLU activation function is adopted for activation; the number of output neurons of Dense2 is 128, a Leaky _ ReLU activation function is adopted for activation, and the number of output neurons of Dense3 is 10;
and 5, identifying the pavement material based on the trained neural network model.
CN202011597385.6A2020-12-292020-12-29Mobile robot pavement material identification method based on sound characteristicsPendingCN113514544A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202011597385.6ACN113514544A (en)2020-12-292020-12-29Mobile robot pavement material identification method based on sound characteristics

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202011597385.6ACN113514544A (en)2020-12-292020-12-29Mobile robot pavement material identification method based on sound characteristics

Publications (1)

Publication NumberPublication Date
CN113514544Atrue CN113514544A (en)2021-10-19

Family

ID=78060872

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202011597385.6APendingCN113514544A (en)2020-12-292020-12-29Mobile robot pavement material identification method based on sound characteristics

Country Status (1)

CountryLink
CN (1)CN113514544A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN114234061A (en)*2021-12-202022-03-25北京工业大学 A neural network-based intelligent discrimination method for water leakage sound of water supply pipelines operating under pressure
CN114544772A (en)*2022-04-262022-05-27华南农业大学 Duck egg crack detection device and method based on convolutional neural network and sound spectrum
CN115422965A (en)*2022-07-222022-12-02上汽大众汽车有限公司Vehicle-interior noise-based vehicle running road surface identification method and system
CN116698977A (en)*2023-08-072023-09-05山东科技大学Coal gangue recognition sensor and recognition method based on sound signals

Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2009046359A2 (en)*2007-10-032009-04-09University Of Southern CaliforniaDetection and classification of running vehicles based on acoustic signatures
CN106821337A (en)*2017-04-132017-06-13南京理工大学A kind of sound of snoring source title method for having a supervision
CN107545902A (en)*2017-07-142018-01-05清华大学A kind of article Material Identification method and device based on sound characteristic
US20180151183A1 (en)*2016-11-292018-05-31Baidu Online Network Technology (Beijing) Co., Ltd.Method and device for searching according to speech based on artificial intelligence
CN108550375A (en)*2018-03-142018-09-18鲁东大学A kind of emotion identification method, device and computer equipment based on voice signal
CN110909637A (en)*2019-11-082020-03-24清华大学Outdoor mobile robot terrain recognition method based on visual-touch fusion
CN111399505A (en)*2020-03-132020-07-10浙江工业大学Mobile robot obstacle avoidance method based on neural network

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2009046359A2 (en)*2007-10-032009-04-09University Of Southern CaliforniaDetection and classification of running vehicles based on acoustic signatures
US20180151183A1 (en)*2016-11-292018-05-31Baidu Online Network Technology (Beijing) Co., Ltd.Method and device for searching according to speech based on artificial intelligence
CN106821337A (en)*2017-04-132017-06-13南京理工大学A kind of sound of snoring source title method for having a supervision
CN107545902A (en)*2017-07-142018-01-05清华大学A kind of article Material Identification method and device based on sound characteristic
CN108550375A (en)*2018-03-142018-09-18鲁东大学A kind of emotion identification method, device and computer equipment based on voice signal
CN110909637A (en)*2019-11-082020-03-24清华大学Outdoor mobile robot terrain recognition method based on visual-touch fusion
CN111399505A (en)*2020-03-132020-07-10浙江工业大学Mobile robot obstacle avoidance method based on neural network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
姜囡: "《语音信号识别技术与实践》", 31 January 2020, 东北大学出版社*

Cited By (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN114234061A (en)*2021-12-202022-03-25北京工业大学 A neural network-based intelligent discrimination method for water leakage sound of water supply pipelines operating under pressure
CN114544772A (en)*2022-04-262022-05-27华南农业大学 Duck egg crack detection device and method based on convolutional neural network and sound spectrum
CN114544772B (en)*2022-04-262022-08-05华南农业大学Device and method for detecting duck egg cracks based on convolutional neural network and voice frequency spectrum
CN115422965A (en)*2022-07-222022-12-02上汽大众汽车有限公司Vehicle-interior noise-based vehicle running road surface identification method and system
CN116698977A (en)*2023-08-072023-09-05山东科技大学Coal gangue recognition sensor and recognition method based on sound signals
CN116698977B (en)*2023-08-072023-10-31山东科技大学Coal gangue recognition sensor and recognition method based on sound signals

Similar Documents

PublicationPublication DateTitle
CN113514544A (en)Mobile robot pavement material identification method based on sound characteristics
CN113111902B (en)Pavement material identification method based on voice and image multi-mode collaborative learning
CN110399840B (en) A fast lawn semantic segmentation and boundary detection method
CN103218831B (en)A kind of video frequency motion target classifying identification method based on profile constraint
CN114998852A (en)Intelligent detection method for road pavement diseases based on deep learning
CN102682303B (en)Crowd exceptional event detection method based on LBP (Local Binary Pattern) weighted social force model
CN111598142B (en) An Outdoor Terrain Classification Method for Wheeled Mobile Robots
CN110119687A (en)Detection method based on the road surface slight crack defect that image procossing and convolutional neural networks combine
CN109035188A (en)A kind of intelligent image fusion method based on target signature driving
CN110400293B (en) A no-reference image quality assessment method based on deep forest classification
CN108681737B (en) A method for image feature extraction under complex illumination
CN101551863A (en)Method for extracting roads from remote sensing image based on non-sub-sampled contourlet transform
CN103218619A (en)Image aesthetics evaluating method
CN110111290B (en)Infrared and visible light image fusion method based on NSCT and structure tensor
CN104636755A (en)Face beauty evaluation method based on deep learning
CN103295221A (en)Water surface target motion detecting method simulating compound eye visual mechanism and polarization imaging
CN118781491B (en)Homeland utilization classification method based on image segmentation
CN112419258B (en)Robust environment sound identification method based on time-frequency segmentation and convolutional neural network
CN111724074A (en) A pavement lesion detection and early warning method and system based on deep learning
CN109086803A (en)A kind of haze visibility detection system and method based on deep learning and the personalized factor
CN109214331A (en)A kind of traffic haze visibility detecting method based on image spectrum
CN114966836A (en)Earthquake P wave pickup method based on SAIP-Unet network
CN115184998A (en)Rayleigh wave frequency dispersion curve automatic extraction method based on improved U-net neural network
CN119672555A (en) A remote sensing image object segmentation method based on hierarchical feature extraction and decoding
CN111881922B (en)Insulator image recognition method and system based on salient features

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication

Application publication date:20211019

RJ01Rejection of invention patent application after publication

[8]ページ先頭

©2009-2025 Movatter.jp