Movatterモバイル変換


[0]ホーム

URL:


CN120318569A - Ultrasonic image data classification method and system based on artificial intelligence - Google Patents

Ultrasonic image data classification method and system based on artificial intelligence

Info

Publication number
CN120318569A
CN120318569ACN202510384168.5ACN202510384168ACN120318569ACN 120318569 ACN120318569 ACN 120318569ACN 202510384168 ACN202510384168 ACN 202510384168ACN 120318569 ACN120318569 ACN 120318569A
Authority
CN
China
Prior art keywords
classification
liver
ultrasound image
treatment
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202510384168.5A
Other languages
Chinese (zh)
Inventor
施海群
朱敏嘉
沈国倩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Fifth People's Hospital Suzhou Occupational Disease Hospital Suzhou Occupational Disease And Chemical Poisoning Emergency Center Suzhou Liver Disease Research Institute
Original Assignee
Suzhou Fifth People's Hospital Suzhou Occupational Disease Hospital Suzhou Occupational Disease And Chemical Poisoning Emergency Center Suzhou Liver Disease Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Fifth People's Hospital Suzhou Occupational Disease Hospital Suzhou Occupational Disease And Chemical Poisoning Emergency Center Suzhou Liver Disease Research InstitutefiledCriticalSuzhou Fifth People's Hospital Suzhou Occupational Disease Hospital Suzhou Occupational Disease And Chemical Poisoning Emergency Center Suzhou Liver Disease Research Institute
Priority to CN202510384168.5ApriorityCriticalpatent/CN120318569A/en
Publication of CN120318569ApublicationCriticalpatent/CN120318569A/en
Pendinglegal-statusCriticalCurrent

Links

Landscapes

Abstract

The invention provides an ultrasonic image data classification method and system based on artificial intelligence, firstly acquiring a real-time ultrasonic scanning signal sequence containing time sequence change characteristics of tissue elastic parameters and hemodynamic parameters, generating a standardized ultrasonic image sequence after noise suppression and motion artifact compensation treatment and marking anatomical boundary coordinates, then executing multi-scale anatomical structure decomposition on an ultrasonic image to obtain local feature image sets of different tissue layers, inputting the local feature image sets into a cascading depth classification network, realizing cross-frame feature fusion and dynamic weight adjustment through a space-time feature alignment and multi-granularity attention allocation module, outputting abnormal region classification probability distribution and a spatial topological relation map, finally generating a multi-mode diagnosis report according to the abnormal region classification probability distribution and spatial topological relation map, and generating an interactive three-dimensional visual interface containing risk level labels and treatment suggestions after comparing the abnormal region classification probability distribution with historical cases, thereby improving ultrasonic image classification accuracy and diagnosis efficiency.

Description

Ultrasonic image data classification method and system based on artificial intelligence
Technical Field
The invention relates to the technical field of artificial intelligence, in particular to an ultrasonic image data classification method and system based on artificial intelligence.
Background
In the field of medical ultrasound imaging, accurate classification of ultrasound image data is critical for diagnosis and treatment of disease.
Most of the existing ultrasonic image classification methods are only based on static ultrasonic images for analysis, and neglect time sequence change characteristics of elastic parameters and hemodynamic parameters of tissues in a dynamic process, so that physiological and pathological information of the tissues cannot be comprehensively captured, and accuracy and reliability of diagnosis results are limited.
For the problems of noise and motion artifact commonly existing in an ultrasonic image, the traditional method has limited processing effect, and cannot effectively eliminate the influence of the traditional method on the image quality, so that the subsequent image analysis and feature extraction are interfered. In the aspect of feature extraction, the prior art only focuses on image features of a single layer, is difficult to deeply mine complex relations among different tissue layers, and cannot comprehensively display rich information contained in an ultrasonic image. On the classification model, the traditional classification network lacks an effective cross-frame feature fusion mechanism and dynamic weight adjustment capability, and is difficult to process complex space-time information in dynamic ultrasonic images, so that the classification of abnormal areas is not accurate enough. Moreover, the existing diagnostic report generation mode is single, usually only simply describes image features, lacks integration of multi-mode information, is rarely effectively compared with the historical cases, cannot provide visual and comprehensive diagnostic assistance for doctors, and is difficult to meet the requirements of high-efficiency and accurate diagnosis in clinical practice.
Disclosure of Invention
In view of the above-mentioned problems, in combination with the first aspect of the present invention, an embodiment of the present invention provides an ultrasound image data classification method based on artificial intelligence, the method including:
Acquiring a real-time ultrasonic scanning signal sequence of a target object, wherein the real-time ultrasonic scanning signal sequence comprises a plurality of frames of dynamic ultrasonic images, and time sequence change characteristics of an organization elastic parameter and a blood flow dynamic parameter are embedded in each frame of dynamic ultrasonic image;
performing noise suppression and motion artifact compensation processing on the real-time ultrasonic scanning signal sequence, generating a denoised standardized ultrasonic image sequence, and marking anatomical boundary coordinates of a region of interest in the standardized ultrasonic image sequence;
Performing multi-scale anatomical structure decomposition on each frame of standardized ultrasonic image based on the anatomical boundary coordinates, and generating local feature map sets corresponding to different tissue levels, wherein each local feature map in the local feature map sets comprises a three-dimensional matrix of an edge sharpness index, a texture consistency index and blood flow signal intensity distribution;
Inputting the local feature image set into a cascading type depth classification network, and carrying out cross-frame feature fusion and dynamic weight adjustment on the local feature image set through a nested space-time feature alignment module and a multi-granularity attention distribution module in the cascading type depth classification network to output classification probability distribution and a spatial topological relation map of an abnormal region in each frame of standardized ultrasonic image;
And generating a multi-mode diagnosis report according to the classification probability distribution and the spatial topological relation map, and performing similarity comparison on the multi-mode diagnosis report and the historical case characteristics in the medical image database to generate an interactive three-dimensional visual interface comprising a risk grade label and a treatment suggestion.
In yet another aspect, an embodiment of the present invention further provides an ultrasound image data classification system based on artificial intelligence, including a processor, a machine-readable storage medium, where the machine-readable storage medium is connected to the processor, and the machine-readable storage medium is used to store a program, an instruction, or a code, and the processor is used to execute the program, the instruction, or the code in the machine-readable storage medium, so as to implement the method described above.
Based on the above aspects, the embodiment of the application improves the accuracy and efficiency of ultrasonic diagnosis, effectively suppresses noise and compensates motion artifacts by acquiring and processing an ultrasonic scanning signal sequence in real time, ensures image quality, combines a multi-scale anatomical structure decomposition technology with comprehensive analysis of edge sharpness, texture consistency and blood flow signal intensity, deeply digs fine granularity characteristics in an ultrasonic image, and applies a cascade depth classification network to realize intelligent fusion and dynamic weight adjustment of cross-frame characteristics, accurately captures space-time variation of abnormal areas, and finally generates a multi-mode diagnosis report, thereby not only providing comprehensive classification probability and space topology information, but also outputting visual risk level and treatment suggestion by similarity comparison with historical cases, constructing a highly interactive three-dimensional visual interface, and greatly enhancing doctor's understanding and decision support of illness state.
Drawings
Fig. 1 is a schematic diagram of an execution flow of an artificial intelligence-based ultrasound image data classification method according to an embodiment of the present invention.
FIG. 2 is a schematic diagram of exemplary hardware and software components of an artificial intelligence based ultrasound image data classification system provided by an embodiment of the invention.
Detailed Description
The present invention will be described in detail with reference to the accompanying drawings, and fig. 1 is a schematic flow chart of an artificial intelligence-based ultrasound image data classification method according to an embodiment of the present invention, and the detailed description of the artificial intelligence-based ultrasound image data classification method will be described below.
Step S110, a real-time ultrasonic scanning signal sequence of a target object is obtained, wherein the real-time ultrasonic scanning signal sequence comprises a plurality of frames of dynamic ultrasonic images, and time sequence change characteristics of an organization elastic parameter and a blood flow dynamic parameter are embedded in each frame of dynamic ultrasonic image.
In this embodiment, taking a target object as an example of a patient suffering from liver disease, a doctor uses an ultrasonic diagnostic apparatus to scan a liver portion of the patient, and ultrasonic waves emitted by the ultrasonic apparatus propagate in liver tissue and reflect back, are received by the ultrasonic apparatus, and are converted into electrical signals, thereby forming a real-time ultrasonic scanning signal sequence. Wherein the ultrasound scanning signal sequence comprises a plurality of frames of dynamic ultrasound images. For example, in a first frame of images, the tissue elasticity parameters may reflect that the texture of a region of the liver is relatively stiff, possibly suggesting that there is fibrosis or stiffening of that region. The time sequence variation characteristic of the hemodynamic parameters is that the blood flow velocity and the blood flow in the blood vessels in the liver show regular variation along with the beating period of the heart. For example, in systole, the blood flow rate into the liver increases, and the blood flow increases, which is manifested as an increase in intravascular signal intensity in the image, and vice versa in diastole. Changes of the tissue elasticity parameters and the hemodynamic parameters are continuously recorded in each frame of images along with the time, and in the subsequent frames of images, the gradual decrease of the tissue elasticity around a certain suspected lesion area of the liver can be found, and the local abnormal fluctuation of the blood flow velocity appears, so that the changes have reference value for the subsequent judgment of the nature and the development trend of the lesion.
And step S120, carrying out noise suppression and motion artifact compensation processing on the real-time ultrasonic scanning signal sequence, generating a denoised standardized ultrasonic image sequence, and marking anatomical boundary coordinates of the region of interest in the standardized ultrasonic image sequence.
Taking a liver patient as an example, the original real-time ultrasonic scanning signal sequence has noise due to electronic noise of ultrasonic equipment, interference of surrounding environment and the like. First, each frame of ultrasonic scanning signal in the real-time ultrasonic scanning signal sequence is subjected to wavelet transformation decomposition. For example, for an ultrasonic scan signal containing the whole and part of the surrounding tissue of the liver, after wavelet transformation and decomposition, the high-frequency noise component may show some messy, weak and irregular signal fluctuation, and the low-frequency tissue signal component shows the rough outline of the main structure of the liver tissue, blood vessels and the like. And then, carrying out layer-by-layer filtering on the high-frequency noise component by adopting an adaptive threshold algorithm, wherein the set threshold value can be dynamically adjusted according to the characteristics of the signal in the process. For example, for regions where liver edges are relatively blurred, the threshold may be relaxed appropriately to preserve weak signals that may belong to liver tissue, while filtering is strict for regions of significant noise. After filtering, intermediate frequency signal components related to the movement of the vessel wall, such as the blood flow signal of the portal vein in the liver, are retained.
And then, carrying out inverse wavelet transformation reconstruction on the filtered ultrasonic scanning signals to obtain an ultrasonic image subjected to preliminary denoising. However, the images are subject to motion artifacts due to the possible slight breathing or body movement of the patient during the scan. The tissue displacement vector field between the ultrasonic scanning signals of the adjacent frames is calculated based on an optical flow method, and for the liver image, the moving direction and distance of the liver tissue between the adjacent frames are calculated. According to the tissue displacement vector field, motion track compensation is carried out on the preliminarily denoised ultrasonic image, for example, a certain characteristic point on the liver generates certain displacement between adjacent frames due to respiration of a patient, and the position of the characteristic point in the image is more accurate through compensation. And then, carrying out gray scale normalization operation on the ultrasonic image after the motion compensation, and mapping the pixel intensity to a preset standard dynamic range. Assuming that the preset standard dynamic range is 0-255, the gray level of the normal tissue in the liver may be adjusted to a specific range, such as 100-150, and the lesion area may have different gray level ranges.
And then carrying out edge-preserving smoothing treatment on the normalized ultrasonic image by adopting an anisotropic diffusion algorithm, wherein in the process, the algorithm can reduce the influence of noise on the premise of keeping sharp edges in the edge region of the liver image. And then the detail enhancement is carried out on the smooth ultrasonic image through a bilateral filter, for example, the boundary of the blood vessel in the liver is clearer, and the level of the blood flow signal in the blood vessel is clearer. Finally, region matching is carried out on the ultrasonic image with enhanced details according to a preset anatomical structure template, a standardized ultrasonic image sequence containing the boundary of the whole liver organ is cut out, and anatomical boundary coordinates of the liver, such as coordinate points of the upper edge, the lower edge, the boundary of the left and right leaves and the like of the liver are marked.
Step S130, performing multi-scale anatomical structure decomposition on each frame of standardized ultrasonic image based on the anatomical boundary coordinates, and generating local feature map sets corresponding to different tissue levels, wherein each local feature map in the local feature map sets comprises a three-dimensional matrix of edge sharpness indexes, texture consistency indexes and blood flow signal intensity distribution.
When processing a standardized ultrasound image of a liver patient, a rectangular region of interest containing the liver is truncated in each frame of the standardized ultrasound image according to previously marked anatomical boundary coordinates of the liver. For example, the rectangular region just covers a substantial portion of the liver and the major vessel branches. And then carrying out Gaussian pyramid decomposition on the rectangular region of interest to generate a multi-scale image group containing original resolution, half resolution and quarter resolution. For the original resolution image, the method can display the finest structure of liver tissue, such as the rough outline of tiny blood vessel branches and liver lobules, the half resolution image summarizes the structural characteristics of a larger area of the liver to a certain extent, the grasping of the whole structure is more beneficial, and the quarter resolution image can highlight macro-characteristics such as the whole shape of the liver and main blood vessel trend.
A multi-directional Gabor filter bank is applied to each scale image in the multi-scale image set. Taking the blood vessel in the liver as an example, the Gabor filter bank can extract a frequency domain response characteristic map matched with the branching direction of the blood vessel. In the original resolution image, the frequency domain features of the fine vessel branches may be accurately extracted, reflecting the texture direction of the vessel wall and the local variation of the blood flow signal. And inputting the frequency domain response feature graphs with different scales into a three-dimensional convolutional neural network, and aggregating edge gradient information under different resolutions through a trans-scale feature fusion layer. For example, at the liver edge, the edge gradient information in the original resolution image may be very detailed, and the edge gradient information in the quarter resolution image is more focused on the whole contour, so that the liver edge characteristics can be obtained more comprehensively and accurately after fusion.
And performing non-maximum suppression processing on the aggregated feature map, eliminating redundant edge response in the liver image and reserving maximum intensity edge points. For example, in a complex vascular network in the liver, some weak edges due to noise or artifacts are removed, leaving only the strongest edge points that truly represent important structures such as vessel boundaries. And constructing a space delaunay triangular grid based on the reserved edge points, and calculating curvature distribution and area occupation ratio indexes of each triangular patch of the triangular grid. For curved portions of the surface or internal structure of the liver, such as the fornix of the liver, the curvature distribution of the triangular patches may reflect the change in shape thereof, and the area ratio index may represent the relative importance of the different regions in the overall structure.
And finally, carrying out region division on the triangular meshes according to curvature distribution and area occupation ratio indexes to generate sub-mesh sets corresponding to different tissue types. In the liver, different sub-grids of liver parenchyma regions, vascular regions, etc. may be divided. And carrying out statistics on the space-time variation variance of the blood flow signal intensity in each sub-grid of the sub-grid set, and generating a local feature map set containing edge geometric features and hemodynamic features. For example, in a subgrid of a liver parenchymal region, the variance of the temporal-spatial variation of the blood flow signal intensity may be relatively small, whereas in a vascular subgrid near a lesion region, the variance of the temporal-spatial variation of the blood flow signal intensity may increase due to the influence of the lesion on the blood flow, these features being recorded in a three-dimensional matrix of edge sharpness index, texture uniformity index, and blood flow signal intensity distribution of a local feature map set.
Step S140, inputting the local feature map set into a cascade depth classification network, and performing cross-frame feature fusion and dynamic weight adjustment on the local feature map set through a space-time feature alignment module and a multi-granularity attention allocation module nested in the cascade depth classification network, so as to output classification probability distribution and a spatial topological relation map of an abnormal region in each frame of standardized ultrasonic image.
The local feature map set for the liver patient is input into a cascaded depth classification network. Firstly, a long-short time memory network of a space-time characteristic alignment module is entered to carry out time sequence state transfer. For example, in successive multi-frame liver ultrasound images, the lesion region may change in morphology, size, or blood flow characteristics over time. The long-short-term memory network can capture the rules of the changes and generate a time dimension feature vector containing the evolution rules of the focus morphology between continuous frames. For example, the lesion region appears circular in the first few frames of images and the boundary is relatively clear, becomes irregular in shape and blurred over time, and these changing features are accurately recorded in the time-dimensional feature vector.
And then inputting the time dimension feature vector into a deformable convolution layer to perform space deformation modeling. In ultrasound images of the liver, certain non-linear deformations of the liver tissue occur due to the effects of respiratory movements of the liver itself and heart beats. The deformable convolution layer can output a spatial deformation characteristic matrix matched with the nonlinear deformation of the liver tissue structure, and accurately reflects the actual morphological change of the liver in space.
And splicing the time dimension feature vector and the space deformation feature matrix to generate a space-time joint feature matrix, and inputting the space-time joint feature matrix into a multi-head self-attention mechanism of the multi-granularity attention distribution module. For different characteristic channels in the liver image, such as channels representing liver tissue texture, blood flow signals, edge characteristics and the like, the multi-head self-attention mechanism calculates global correlation weights among the different characteristic channels and outputs a global attention weight matrix. For example, when determining whether a liver lesion is malignant, the blood flow signal feature channel may have a higher correlation with the malignancy of the lesion, and then the weight corresponding to the blood flow signal feature channel in the global attention weight matrix may be higher.
And inputting the global attention weight matrix into a gating linear unit for feature channel screening, reserving a feature channel related to the current classification task (judging liver abnormal region), and generating a gating screening feature vector. For example, some characteristic channels associated with the interference of tissues around the liver may be screened, while channels associated with the internal structure and pathological features of the liver are emphasized. And then inputting the gating screening feature vector into a local window attention calculation layer, enhancing the spatial correlation of adjacent feature nodes, and generating a local window attention weight matrix. In liver images, there is a certain correlation in space between feature nodes between adjacent liver lobules or vessel branches, and the multi-granularity attention allocation module can enhance the correlation.
And weighting and fusing the global attention weight matrix and the local window attention weight matrix to generate a multi-granularity attention distribution map, and overlapping the multi-granularity attention distribution map to the space-time joint feature matrix through residual connection to generate an optimized classification feature vector. And finally, inputting the optimized classification feature vector into a fully-connected classification layer, and outputting classification probability distribution and a spatial topological relation map of abnormal regions (such as possible tumors, fibrosis regions and the like in the liver) in each frame of standardized ultrasonic image. The classification probability distribution represents the probability that each region of the liver is an abnormal region, and the spatial topological relation map accurately depicts the spatial relation between the abnormal region and surrounding normal tissues, blood vessels and other structures, such as whether the abnormal region is close to a large blood vessel, invades surrounding liver tissues and the like.
And step S150, generating a multi-mode diagnosis report according to the classification probability distribution and the spatial topological relation map, and comparing the similarity of the multi-mode diagnosis report with the characteristics of the historical cases in the medical image database to generate an interactive three-dimensional visual interface comprising a risk grade label and a treatment suggestion.
In detail, a multi-modal diagnostic report may be generated based on the classification probability distribution and the spatial topological relation map of the liver patient, mapping the classification probability distribution to spatial coordinates of the normalized ultrasound image sequence, generating an abnormal region overlay with translucent color coding. For example, in a liver image, a region with a high probability of abnormality is covered with a red translucent color, a region with a low probability is covered with a yellow translucent color, and a normal region maintains a primary color or represents a normal range with a green translucent color. And inputting the spatial topological relation map into a three-dimensional volume rendering engine, and constructing a three-dimensional volume rendering model for marking the vascular branches and the focus infiltration range. In a three-dimensional model of the liver, the trend of blood vessels in the liver and the infiltration of abnormal areas (such as tumors) into surrounding tissues can be clearly seen.
And carrying out spatial registration on the abnormal region covering layer and the three-dimensional rendering model to generate a visual basic frame fusing the two-dimensional probability distribution and the three-dimensional anatomical structure. And then extracting hemodynamic characteristic parameters in the classification probability distribution, searching historical case characteristics with similar hemodynamic characteristic parameters in a medical image database, and generating a case similarity comparison result. For liver patients, it is possible to find some historical liver disease cases that also have abnormal blood flow velocity and similar blood flow changes.
And (3) associating the case similarity comparison result with the visual basic framework to generate a side comparison view and a key difference parameter labeling layer. For example, in the side contrast view, information such as liver images, hemodynamic parameters and the like of the current patient and the historical case can be displayed at the same time, and the key difference parameter labeling layer can highlight differences between the current patient and the historical case in terms of liver size, lesion position, blood flow speed and the like. Based on the key difference parameter labeling layer matching with a preset treatment scheme flow chart, a logic association table of a risk level label and treatment suggestions is generated, and then the logic association table is embedded into a visual basic framework to generate an interactive three-dimensional visual interface comprising an interactive time axis control, a real-time ultrasonic playback function and a voice broadcasting engine. The doctor can check the liver ultrasonic images and analysis results at different time points through the time axis control, and confirm the details in the images again through the real-time ultrasonic playback function, and the voice broadcasting engine can broadcast the information such as the diagnosis results, the risk level, the treatment advice and the like, so that the doctor can conveniently operate in the diagnosis process and comprehensively judge the illness state of the patient.
Based on the steps, the embodiment of the application improves the accuracy and efficiency of ultrasonic diagnosis, effectively suppresses noise and compensates motion artifacts by acquiring and processing an ultrasonic scanning signal sequence in real time, ensures image quality, combines a multi-scale anatomical structure decomposition technology with comprehensive analysis of edge sharpness, texture consistency and blood flow signal intensity, deeply digs fine granularity characteristics in an ultrasonic image, realizes intelligent fusion and dynamic weight adjustment of cross-frame characteristics, accurately captures space-time variation of an abnormal region, and finally generates a multi-mode diagnosis report, thereby not only providing comprehensive classification probability and space topology information, but also outputting visual risk level and treatment suggestion by similarity comparison with historical cases, constructing a highly interactive three-dimensional visual interface, and greatly enhancing doctor understanding and decision support of the illness state.
In one possible implementation, step S120 includes:
Step S121, performing wavelet transform decomposition on each frame of ultrasonic scanning signal in the real-time ultrasonic scanning signal sequence, and separating high-frequency noise components and low-frequency tissue signal components.
Taking the previous ultrasound scan of a liver patient as an example, in this process, it is assumed that the wavelet transform uses a Daubechies wavelet basis function (e.g., db4 wavelet) that is effective in decomposing the signal into subbands of different frequencies. For a frame of liver ultrasonic scanning signal, after wavelet transformation and decomposition, high-frequency noise components are usually represented by signal parts with higher frequency, relatively lower energy and no obvious regularity, and the high-frequency noise may be derived from electronic elements of ultrasonic equipment, tiny interference of surrounding environment and the like. The low frequency tissue signal component contains the main structural information of liver tissue, such as the overall outline of liver, the general trend of larger blood vessels, etc., and has lower frequency and relatively concentrated energy.
Step S122, filtering the high-frequency noise component layer by adopting an adaptive threshold algorithm to obtain a filtered ultrasonic scanning signal, and reserving an intermediate frequency signal component related to the movement of the blood vessel wall.
For example, for the first layer of the high-frequency noise component of a liver ultrasound scanning signal of a certain frame, the algorithm analyzes the statistical parameters such as standard deviation, mean value and the like of the signal of the layer, if the standard deviation is 5 (assumed unit), the threshold value of the layer is 15 according to a certain algorithm rule (for example, the standard deviation is multiplied by a coefficient, and the coefficient is assumed to be 3). The signal portions above the threshold are considered to be potentially useful signals to be preserved, and those below the threshold are determined to be noise and filtered out. This adaptive thresholding can effectively remove noise while preserving intermediate frequency signal components associated with vessel wall motion throughout the layer-by-layer filtering process. For vessels in the liver, the frequency of the signals generated by the blood flow is in the intermediate frequency range, for example, the frequency is between 1 and 5MHz (assumption), and the intermediate frequency signals have important significance for the subsequent analysis of the motion state of the vessel wall, the blood flow speed and the like.
And step S123, performing inverse wavelet transformation reconstruction on the filtered ultrasonic scanning signals to generate an ultrasonic image subjected to preliminary denoising.
After the high-frequency noise component is filtered, the filtered ultrasonic scanning signals are subjected to inverse wavelet transformation reconstruction, and the operation is that the sub-band signals subjected to filtering treatment are recombined according to the inverse process of wavelet transformation, so that an ultrasonic image subjected to preliminary denoising is generated. In this frame of preliminary denoised liver ultrasound image, some blurred parts of the liver tissue become relatively sharp due to the removal of high frequency noise, the contours of the blood vessels are also more pronounced, but motion artifacts may be present in the image due to factors such as respiration of the patient during scanning, slight body movements, etc.
Step S124, calculating a tissue displacement vector field between adjacent frame ultrasonic scanning signals based on an optical flow method, performing motion track compensation on the primarily denoised ultrasonic image according to the tissue displacement vector field, performing gray scale normalization operation on the motion compensated ultrasonic image, and mapping pixel intensity to a preset standard dynamic range to obtain a normalized ultrasonic image.
To eliminate motion artifacts, a tissue displacement vector field between adjacent frames of ultrasound scan signals is calculated based on an optical flow method. The optical flow method determines the direction and speed of motion of an object by analyzing the brightness variation of pixels in adjacent frame images. For ultrasound images of a liver patient, assuming a time interval between two adjacent frames of images of 0.1 seconds (which is a typical time interval determined by the scan frame rate of the ultrasound device), the optical flow method calculates the displacement vector of the liver tissue within this 0.1 seconds. For example, if a specific point on the liver (assuming that the coordinates are (x 1, y 1)) is shifted by (x 2, y 2) from the position in the first frame image to the position in the second frame image, the calculated displacement vector is ((x 2-x 1), (y 2-y 1)). And performing motion track compensation on the primarily denoised ultrasonic image according to the tissue displacement vector field. If a suspected lesion area in the liver has obvious displacement in the image due to respiratory motion of the patient before the compensation, the position of the suspected lesion area in the image can more accurately reflect the real position of the suspected lesion area in the liver after the compensation through the motion trail.
And then performing gray scale normalization operation on the motion compensated ultrasonic image. Assuming that the preset standard dynamic range is 0-255 (which is a common image pixel intensity range), in the liver ultrasound image after motion compensation, the gray value range of all pixels in the image is counted first. For example, the gray scale value range of liver tissue may be 50-180 and the gray scale value range of blood vessels may be 30-100 (here, only assumed to be a general range). These raw gray values are then mapped to a standard dynamic range of 0-255 by means of a linear mapping. The aim of this is to make the images of different frames and the images of different patients comparable in gray scale, facilitating subsequent analysis and processing. And after the gray scale normalization operation, obtaining a normalized ultrasonic image.
And step S125, performing edge-preserving smoothing treatment on the normalized ultrasonic image by adopting an anisotropic diffusion algorithm, performing detail enhancement on the smoothed ultrasonic image by using a bilateral filter, performing region matching on the detail-enhanced ultrasonic image according to a preset anatomical structure template, and cutting out a standardized ultrasonic image sequence containing a complete organ boundary.
In this embodiment, when the anisotropic diffusion algorithm processes the liver ultrasound image, the direction and intensity of diffusion are adjusted according to the gradient information of the ultrasound image. At the liver edges, the algorithm limits diffusion perpendicular to the edge direction due to the large gray gradient of the edge, thus preserving the sharpness of the edge. For example, for a pixel point on the edge of the liver, the gray level difference value of the surrounding pixels is larger, and the algorithm can smooth along the edge direction preferentially in the diffusion process of the pixel point, so as to avoid blurring the edge. After being processed by an anisotropic diffusion algorithm, the detail enhancement is carried out on the ultrasonic image after the smoothing processing by a bilateral filter. The bilateral filter considers the space distance and gray scale difference of pixels, and can strengthen the detail characteristics of structures such as blood vessels, liver lobules and the like in the liver. For example, after the blood flow signal in the blood vessel is processed by the bilateral filter, the layering sense of the signal is stronger, and the boundary of the blood vessel wall is clearer.
The preset liver anatomical structure template contains information such as standard shape, size, and position of main vessel branches of the liver. And for the liver ultrasonic image with enhanced details, finding a liver region corresponding to the template in the image through a template matching algorithm. For example, the coordinate range of the upper edge of the liver in the image is defined as (y 1-y 2) in the template, and the region which accords with the coordinate range and has the liver tissue characteristics is found in the image through an algorithm. And finally cutting out a standardized ultrasonic image sequence containing the boundary of the complete liver organ, wherein each frame of image in the standardized ultrasonic image sequence is subjected to a series of processing, so that the standardized ultrasonic image sequence has lower noise, a clear structure and accurate liver region positioning, and high-quality image data is provided for subsequent analysis.
In one possible implementation, step S130 includes:
Step S131, in each frame of standardized ultrasonic image, a rectangular region of interest containing a target organ is intercepted according to the anatomical boundary coordinates.
In this embodiment, the anatomical boundary coordinates precisely define the location of the liver in the image, e.g., the upper border coordinate range of the liver is 100-150 rows of pixels from the top of the image down (assuming a total number of rows of the image of 500 rows), and the left and right border coordinate ranges are 200-300 columns of pixels and 400-500 columns of pixels from the left to the right of the image (assuming a total number of columns of the image of 600 columns), respectively. The rectangular region of interest defined by these coordinates encompasses a substantial portion of the liver and major vessel branches, so that subsequent analysis can be focused on the liver region, avoiding interference with surrounding tissue.
Step S132, performing gaussian pyramid decomposition on the rectangular region of interest, and generating a multi-scale image set including an original resolution, a half resolution, and a quarter resolution.
For the original resolution image, it maintains the finest structural information of the liver region, such as tiny branches of blood vessels in the liver, which may be as small as 1-2 pixels in diameter (assuming that one pixel in the image represents the actual length of 0.1 mm), and the general outline of the liver lobules is also clearly visible. The half resolution image is obtained by downsampling the original resolution image, and at this scale, the structural features of a larger region of the liver are more obvious, for example, features of a larger structure such as the interlobe of the liver are easier to grasp integrally, some tiny vessel branches can be combined into a thicker vessel representation, but the overall vessel trend is still clear. The quarter resolution image further summarizes the macroscopic features of the liver, the main vessel branches are more prominent, and the overall shape and general tissue structure distribution of the liver can be clearly seen.
And step S133, a multidirectional Gabor filter set is applied to each scale image in the multi-scale image set, and a frequency domain response characteristic diagram matched with the blood vessel branch direction is extracted.
For vessels in the liver, the Gabor filter bank captures certain directional and texture features because of the vessel. For example, assume that the directions of the Gabor filter group are set to four directions of 0 °, 45 °, 90 °, and 135 ° (the number and angle of directions can be adjusted according to actual demands). In the liver image of the original resolution, when the Gabor filter is filtered along an angle close to the blood vessel branching direction, a strong response can be obtained in the frequency domain. For smaller diameter vessel branches, such as intrahepatic arterioles (assuming a diameter of 2-3 pixels), the frequency domain response profile will exhibit a specific frequency and amplitude distribution in the Gabor filter direction coincident with the vessel direction, reflecting the vessel wall texture direction and local changes in the blood flow signal. In half resolution and quarter resolution images, the same principle is adopted, but due to the reduced resolution, the representation of the vessel branches is more generalized, and the frequency domain response characteristic diagram correspondingly reflects the comprehensive characteristics of a wider range of vessels.
Step S134, inputting frequency domain response feature graphs with different scales into a three-dimensional convolutional neural network, aggregating edge gradient information under different resolutions through a trans-scale feature fusion layer, performing non-maximum suppression processing on the aggregated feature graphs, eliminating redundant edge responses and reserving maximum intensity edge points, constructing a space Delaunay triangular grid based on the reserved edge points, and calculating curvature distribution and area occupation ratio indexes of each triangular patch of the triangular grid.
For the edges of the liver, the edge gradient information in the original resolution image is very detailed, e.g. at the boundary of the liver edge with the surrounding tissue, the gray value change of the pixels may have a large gradient change over several pixels, e.g. a sudden change from 100 (liver tissue gray value) to 50 (surrounding tissue gray value). The gradient information of the liver edge in the quarter resolution image is focused on the whole outline, and the change trend of the liver edge on a larger scale can be reflected although the detail is inferior to that of the original resolution image. And fusing the edge gradient information under different resolutions through a trans-scale feature fusion layer to obtain more comprehensive and accurate liver edge feature representation.
Further, in the liver image, there may be some redundant edge response due to the presence of edge information of various structures. For example, in the vicinity of complex vascular networks and hepatic lobular structures within the liver, some edge responses due to noise or weak structures do not truly represent edges of important structures. The non-maxima suppression process eliminates edge responses that are not local maxima by comparing the edge intensities of each pixel to its neighborhood. For the edge of the liver vessel, only the pixels truly representing the vessel boundary (with maximum edge intensity) will be preserved, and the other weaker edge responses will be suppressed. In this process, a comparison neighborhood, for example, a 3 x 3 pixel neighborhood (which may be adjusted according to the actual situation) may be set, in which the edge intensities are compared.
Further, for the surface or internal structure of the liver, these edge points constitute the vertices of a triangular mesh. The triangular mesh can be constructed to accurately reflect the shape change of the curved portion such as the hepatic fornix. Then, the curvature distribution and the area ratio index of each triangular patch of the triangular mesh are calculated. Assuming that a certain triangular patch on the liver is composed of three vertex coordinates (x 1, y1, z 1), (x 2, y2, z 2) and (x 3, y3, z 3) (here, three-dimensional image coordinates are assumed, and the z axis represents the depth direction), the curvature distribution is determined by calculating geometrical parameters such as the side length and angle of the triangle. For example, if the area of the triangular patch is relatively flat, the curvature value may be small, such as 0.1 (assuming a unit of curvature), and if the curvature value is large, such as 0.5, in the curved portion of the liver. The area ratio index is obtained by calculating the ratio of the area of each triangular patch to the total area of the whole triangular mesh, for example, the area of a certain triangular patch is 10 square pixels (assumed), and the total area of the whole triangular mesh is 100 square pixels, and the area ratio is 0.1.
And S135, carrying out region division on the triangular meshes according to the curvature distribution and the area ratio index, generating a sub-mesh set corresponding to different tissue types, and carrying out statistics on the space-time variation variance of the blood flow signal intensity in each sub-mesh of the sub-mesh set to generate a local feature map set containing edge geometric features and hemodynamic features.
In the liver, different sub-grids such as a hepatic parenchymal region and a vascular region can be divided according to these indices. For example, for areas with a smaller curvature and a larger area occupation may be divided into hepatic parenchyma subgrid, while for areas with a larger curvature and related to vessel trend, the area is divided into vessel subgrids. The space-time variance of the blood flow signal intensities is counted in each sub-grid of the sub-grid set. For a relatively slow and steady blood flow within the hepatic parenchymal subgrid, assuming that the mean value of the blood flow signal intensity is 80 (assuming intensity units) over a period of time (e.g., 5 seconds, assuming continuous scan time of the ultrasound device), the fluctuation range is small, and the space-time variance may be 5 (assuming variance units). In the vascular subgrid near the lesion area, the fluctuation of the blood flow signal intensity is larger due to the influence of the lesion on the blood flow, and the average value of the blood flow signal intensity is 100 in the same 5 seconds, but the fluctuation range is larger, and the space-time variation variance may be 20. From this statistics, a set of local feature maps is generated that includes edge geometry features and hemodynamic features.
In one possible implementation, step S140 includes:
step S141, inputting the local feature map set into a long-short time memory network of a space-time feature alignment module for time sequence state transfer, generating a time dimension feature vector containing a continuous inter-frame focus form evolution rule, inputting the time dimension feature vector into a deformable convolution layer for space deformation modeling, and outputting a space deformation feature matrix matched with the tissue structure nonlinear deformation.
In this embodiment, first, a local feature map set of a liver patient is input into a long short time memory network (LSTM) of a spatio-temporal feature alignment module for time-series state transfer. In the ultrasound image sequence of the liver, successive multi-frame images contain information about the change over time of the liver lesion, if present. It is assumed that the ultrasound image sequence comprises 10 frames of images (the number of frames is determined according to the duration of the actual scan and the frame rate of the device), each frame of images comprising previously generated local feature maps comprising information such as the edge geometry and hemodynamic features of the liver. The LSTM network gradually learns the form evolution rule of the focus among the continuous frames by carrying out time sequence state transfer on the local feature images corresponding to the 10 frames of images. For example, in the first frame of image, the suspected lesion area in the liver appears to be approximately circular, with a diameter of about 5 pixels (assuming that one pixel in the image represents an actual length of 0.1 mm), the shape of the lesion area gradually becomes elliptical as time goes by to the 5 th frame of image, the major axis direction goes along a certain blood vessel in the liver, the major axis length becomes 8 pixels, and the minor axis length becomes 4 pixels. The variation information of the focus in shape, size and relative position is encoded into a time dimension feature vector. The time dimension feature vector contains information of a plurality of dimensions, each dimension corresponds to different feature parameters, such as a shape change parameter, a position movement parameter, a relation change parameter with surrounding tissues and the like, and the specific numerical values of the parameters are obtained through calculation of an LSTM network according to the actual change condition of a lesion area in a liver image.
Then, the liver tissue structure may be non-linearly deformed due to physiological movements of the liver itself (e.g. respiratory movements, liver displacements and deformations caused by heart beats) and possibly the influence of lesions on surrounding tissues. The deformable convolution layer is capable of modeling such nonlinear deformation of liver tissue structures based on information in the time-dimensional feature vector. For example, in a region of the liver, which is normally a relatively flat portion of the liver, the liver parenchyma is depressed by compression in the region due to proximity to a growing lesion. The deformable convolution layer outputs a spatial deformation feature matrix matched with the nonlinear deformation of the tissue structure by analyzing relevant parameters in the time dimension feature vector. The element values in the spatial deformation feature matrix reflect the degree and direction of deformation of the liver tissue at different positions, for example, in the above-mentioned concave region, the corresponding matrix element may be expressed as a displacement of-2 pixels in the x-direction (assuming one direction of the image coordinate system) (representing 2 pixels shifted towards the negative x-axis direction), a displacement of-1 pixel in the y-direction, and a displacement of-0.5 pixel in the depth z-direction (if a three-dimensional image) (the displacement value is calculated by the deformable convolution layer according to the actual deformation condition).
Step S142, splicing the time dimension feature vector and the space deformation feature matrix to generate a space-time joint feature matrix, inputting the space-time joint feature matrix into a multi-head self-attention mechanism of a multi-granularity attention distribution module, calculating global correlation weights among different feature channels, and outputting a global attention weight matrix.
In this embodiment, the spatio-temporal joint feature matrix integrates the feature information of liver lesions in time and space. In the feature representation of the liver ultrasound image, there are a plurality of feature channels, such as channels representing texture features of liver tissue, blood flow signal feature channels, edge feature channels, etc. The multi-headed self-attention mechanism computes global correlation weights between different characteristic channels. The importance of different characteristic channels is different for judging abnormal regions of the liver. For example, in determining whether a liver lesion is malignant, the blood flow signal characteristic channel may have a high correlation with the malignancy of the lesion, as malignant lesions are often accompanied by abnormal blood flow supply. It is assumed that the correlation weight of the blood flow signal characteristic channel with the lesion judgment is calculated to be 0.6 (the weight value is calculated according to the data characteristic through a multi-head self-attention mechanism), and the correlation weight of the tissue texture characteristic channel is calculated to be 0.3, and the correlation weight of the edge characteristic channel is calculated to be 0.1. Through such a calculation, a global attention weight matrix is output, each element of which represents a global correlation weight of the corresponding feature channel.
Step S143, inputting the global attention weight matrix into a gating linear unit for feature channel screening, reserving a feature channel related to a current classification task, generating a gating screening feature vector, inputting the gating screening feature vector into a local window attention calculation layer, enhancing the spatial relevance of adjacent feature nodes, and generating a local window attention weight matrix.
In this embodiment, the gating linear unit retains a characteristic channel related to the current classification task (determining the abnormal liver region) according to the weight value in the global attention weight matrix. For example, for feature channels with weights below 0.2 (e.g., the edge feature channels described above), the gating linear unit may screen out the feature channels, while the higher-weight blood flow signal feature channels and tissue texture feature channels are emphasized. And generating a gating screening feature vector after screening. The gating screening feature vector mainly comprises feature information which is most valuable for judging abnormal liver areas.
Next, in the liver image, there is a certain correlation in space between feature nodes between adjacent liver lobules or vessel branches. The local window attention calculation layer enhances the spatial relevance of adjacent feature nodes by analyzing feature information in the gating screening feature vector. For example, for two adjacent liver leaflet regions within the liver, there is some spatial continuity and correlation of their tissue texture characteristics and blood flow signal characteristics. The local window attention calculating layer generates a local window attention weight matrix by calculating the association weight between the feature nodes corresponding to the two areas. The element values in the local window attention weighting matrix reflect the strength of spatial correlation between adjacent feature nodes, e.g. for two highly correlated liver lobular regions, the correlation weight between them may be 0.8 (where the weight values are calculated by the local window attention calculation layer based on the actual feature correlation).
Step S144, the global attention weight matrix and the local window attention weight matrix are combined in a weighted mode, a multi-granularity attention distribution map is generated, and the multi-granularity attention distribution map is overlapped to the space-time joint feature matrix through residual connection, so that an optimized classification feature vector is generated.
In this embodiment, the multi-granularity attention distribution map integrates the attention information of different granularities (global and local). For example, for a characteristic channel with a weight of 0.5 in the global attention weight matrix and a weight of 0.4 in the local window attention weight matrix, the final weight of the characteristic channel may be calculated according to a certain weighting rule, assuming 0.45 (the weighting rule here may be a simple linear weighting or other rule defined according to the network structure) in the multi-granularity attention distribution map. And superposing the multi-granularity attention distribution atlas to the space-time joint feature matrix through residual connection to generate an optimized classification feature vector. Residual connection is beneficial to retaining partial information in the original space-time joint feature matrix, avoiding information loss in the multi-layer network processing process, and the optimized classification feature vector integrates multiple aspects of attention information and original space-time features.
Step S145, inputting the optimized classification feature vector into a full-connection classification layer, and outputting classification probability distribution and a spatial topological relation map of an abnormal region in each frame of standardized ultrasonic image.
In the context of liver ultrasound images, a classification probability distribution and a spatial topological relation map of an abnormal region (such as a possible tumor, a fibrosis region and the like in the liver) in each frame of standardized ultrasound image are output. The classification probability distribution indicates the size of the probability that each region of the liver is an abnormal region, for example, the probability that a region in the liver is calculated as an abnormal region is 0.8, which means that the region is 80% likely to be abnormal based on a previous series of feature analysis and model calculation. The spatial topological relation map accurately depicts the spatial relation between the abnormal region and the surrounding normal tissue, blood vessels and other structures, for example, whether the abnormal region is close to a large blood vessel, whether the boundary between the abnormal region and the surrounding liver tissue is clear or fuzzy, whether the abnormal region invades surrounding liver lobules and the like. Such information is of great importance for accurate diagnosis of liver disease, assessment of lesion severity, and subsequent treatment planning.
In one possible implementation, step S150 includes:
And step S151, mapping the classification probability distribution to the space coordinates of the standardized ultrasonic image sequence, and generating an abnormal region covering layer with semitransparent color codes.
In this embodiment, for the diagnosis process of a liver patient, first, the classification probability distribution of the liver patient is mapped to the spatial coordinates of the standardized ultrasound image sequence, and an abnormal region cover layer with semitransparent color coding is generated. In the standardized ultrasound image sequence of the liver, each pixel has its corresponding spatial coordinates. The classification probability distribution indicates the size of the probability that each region of the liver is an abnormal region. For example, for a pixel point in a liver image, whose coordinates are (x, y) (assuming an image coordinate system), if the probability that the point is classified as an abnormal region is 0.8 (the probability value is calculated from the previous classification), the pixel point is coded and marked with a specific translucent color. For areas with a high probability of abnormality, such as areas with a probability of greater than 0.7, highly suspected lesions are represented by covering with a red translucent color, for areas with a probability of between 0.4 and 0.7, areas with a probability of possibly having a risk of lesions are represented by covering with a yellow translucent color, and for areas with a probability of less than 0.4, normal or low risk areas are represented by covering with a green translucent color. Thus, the whole liver image is covered with an abnormal region covering layer with color codes, and doctors can intuitively see the pathological change risk distribution conditions of different regions of the liver.
Step S152, inputting the space topological relation map into a three-dimensional rendering engine, and constructing a three-dimensional rendering model for marking the vascular branches and the focus infiltration range.
For example, a spatial topological relationship map of the liver contains spatial relationship information between various structures within the liver (e.g., blood vessels, liver lobules, diseased regions, etc.). The three-dimensional volume rendering engine builds a three-dimensional model of the liver from this information. In the three-dimensional model, blood vessel branches in the liver are marked clearly, and information such as trend, diameter and the like of portal veins and branches thereof can be accurately displayed. For lesion infiltration scope, if there is a tumor lesion in the liver, the model will accurately show the location, size, and infiltration relationship of the tumor to surrounding tissue within the liver. Assuming that the tumor in the liver is located in the right lobe of the liver, its center coordinates are (x 1, y1, z 1) (where the z axis represents the depth direction), and the diameter is 10 pixels (assuming that one pixel represents the actual length of 0.1 mm), the boundary of the tumor periphery and the surrounding liver tissue is represented in different colors or textures in the model to show the degree of infiltration thereof.
And step 153, performing spatial registration on the abnormal region covering layer and the three-dimensional rendering model to generate a visual basic frame fusing the two-dimensional probability distribution and the three-dimensional anatomical structure.
In this process, it is ensured that each pixel point in the abnormal region coverage layer is spatially exactly matched with a corresponding position in the three-dimensional volume rendering model. For example, the high risk areas of the red translucent markers in the abnormal region overlay are also accurately located at corresponding locations within the liver in the three-dimensional volume rendering model. Through the spatial registration, the generated visual basic frame fuses two-dimensional probability distribution information (abnormal region coverage layer) with three-dimensional anatomical structure information (three-dimensional rendering model), and a doctor can view two-dimensional risk distribution and three-dimensional structure relation of the liver in one view at the same time, so that the health condition of the liver can be more comprehensively known.
Step S154, extracting hemodynamic characteristic parameters in the classification probability distribution, searching historical case characteristics with similar hemodynamic characteristic parameters in a medical image database, and generating a case similarity comparison result.
In the classified probability distribution of the liver patient, the hemodynamic characteristic parameter includes information such as blood flow velocity, blood flow rate, blood flow direction, and the like of blood vessels in the liver. For example, the blood flow rate of a major blood vessel in the liver is 50 cm/s (this value is obtained from previous ultrasound image analysis) and the blood flow is 100 ml/min. In the medical image database, image data of a large number of historical cases of liver disease and their associated features are stored. By retrieving the hemodynamic characteristic parameters of the cases in the database, historical cases with similar hemodynamic characteristics to the current patient are found. Assuming that 3 historical cases are found, their hemodynamic characteristics differ from the current patient by within a certain allowable range (e.g., blood flow velocity differences are within + -10% and blood flow differences are within + -15%). And then comparing the historical cases with the current patient in detail, including comparison of the size of the liver, the position of the lesion, the type of the lesion and the like, and generating a case similarity comparison result.
Step S155, associating the case similarity comparison result with the visual basic frame to generate a side comparison view and a key difference parameter labeling layer, matching a preset treatment scheme flow chart based on the key difference parameter labeling layer to generate a logic association table of risk level labels and treatment suggestions, and embedding the logic association table into the visual basic frame to generate an interactive three-dimensional visual interface comprising an interactive time axis control, a real-time ultrasonic playback function and a voice broadcasting engine.
In the side contrast view, relevant information of the current patient and the history case is displayed at the same time. For example, a three-dimensional model of the liver of the current patient is displayed with an abnormal region overlay on one side, and information such as its hemodynamic characteristics, liver size, etc. is displayed, and the same type of information of the historical case is displayed on the other side. The key discrepancy parameter labeling layer highlights the important discrepancy between the current patient and the historical case. For example, if the current patient has a liver tumor size of 20 cubic millimeters and the tumor size in the historical case is 15 cubic millimeters, this size difference is explicitly noted in the key difference parameter labeling layer. And matching a preset treatment scheme flow chart based on the key difference parameter labeling layer, and generating a logic association table of the risk level label and the treatment suggestion. The preset treatment scheme flow chart comprises different treatment schemes and corresponding risk levels formulated according to different liver disease characteristics (such as lesion size, position, hemodynamic characteristics and the like). For example, if a liver tumor is less than 10 cubic millimeters in size and has normal hemodynamic characteristics, the risk level may be low, the corresponding treatment recommendation may be a periodic observation, and if the tumor is between 10-20 cubic millimeters in size, the risk level is medium risk, the treatment recommendation may be a medical treatment or a local intervention. And generating a logic association table of the risk level label and the treatment advice according to the difference between the current patient and the historical case and the preset rules.
Finally, the interactable timeline control allows a physician to view liver ultrasound images and related analysis results at different points in time. For example, the physician may go back to a previous ultrasound examination via the time axis to see the state of the liver at that time and the analysis results. The real-time ultrasound review function allows the physician to review the real-time course of the liver ultrasound scan again in order to more carefully view certain suspicious regions. The voice broadcasting engine can broadcast information such as diagnosis results, risk grades, treatment suggestions and the like, and a doctor can conveniently acquire the information in the operation process. For example, the voice broadcast can clearly say that "the right lobe of the liver of the current patient has suspected tumor lesions, the risk level is medium risk, and the treatment advice is drug treatment or local intervention treatment" and the like. The interactive three-dimensional visual interface provides a comprehensive and convenient diagnostic tool for doctors, and is beneficial to improving the diagnostic accuracy of liver diseases and the rationality of treatment decisions.
In one possible embodiment, the method further comprises:
step S210, inputting a training sample library with a multi-dimensional label into a data enhancement module, and executing random elastic deformation, local pixel shielding and multi-angle affine transformation to generate an expanded countermeasure training sample set.
In detail, the training sample library comprises a plurality of liver ultrasonic image samples, and each sample is provided with a multi-dimensional label which comprises information of health condition (normal or lesion type), hemodynamic characteristic parameters, tissue elasticity parameters and the like of the liver. For random elastic deformation operation, taking a liver region in a liver ultrasonic image as an example, the shape of the liver is changed according to a certain elastic deformation rule in the coordinate space of the image. Assuming a liver image of 500 x 600 pixels in size (where the pixel size is determined by the resolution of the ultrasound device imaging), during random elastic deformation, certain pixels at the liver edge may be displaced in the x-direction (horizontal) and y-direction (vertical). For example, a pixel point on the upper edge of the liver, originally having coordinates (200, 100), may become (203,98) after elastic deformation, and this deformation simulates the image deformation caused by respiration of the patient, body movement or physiological movement of the liver itself (such as micro displacement of the liver caused by heart beat) during actual ultrasonic scanning.
Local pixel occlusion is the random selection of pixels in small areas of the image for occlusion. For liver images, small areas inside the liver may be selected, for example, a region of 10 x 10 pixels in the right lobe of the liver (where the region size is determined according to a preset occlusion rule) for occlusion. This simulates the partial occlusion of tissue by other structures or local disturbances in the image acquisition process that may occur in an actual scan. The multi-angle affine transformation comprises operations such as rotation, scaling and translation on the image. Taking rotation as an example, the liver ultrasound image is rotated about the center of the image by an angle, such as 15 ° (this angle is a randomly determined affine transformation parameter). And expanding samples in the original training sample library through the random elastic deformation, the local pixel shielding and the multi-angle affine transformation operation to generate an expanded countermeasure training sample set. The number of samples in the challenge training sample set is significantly increased compared with the original training sample library, and the sample set contains more diversified sample forms, thereby being beneficial to improving the generalization capability of the subsequent network.
Step S220, inputting the expanded countermeasure training sample set into a focus loss function of a first training stage, and optimizing an inter-class balance weight parameter of a cascade deep classification network.
In this cascade of deep classification networks, there are different categories, such as liver normal, liver fibrosis, liver tumor etc. The focus loss function aims to solve the problem of unbalanced sample numbers of different types during training. Suppose there are 500 normal liver samples, 300 liver fibrosis samples, and 200 liver tumor samples in the augmented challenge training sample set. Due to the difference in the number of samples, the network may tend to better classify a larger number of normal samples, while ignoring a smaller number of fibrotic and tumor samples, under conventional loss functions. The focus loss function makes the network more concerned with the minority class samples which are difficult to classify by weighting and adjusting the losses of the different class samples. During training, a loss value of each sample is calculated according to the focus loss function, and the inter-class balance weight parameters of the cascade depth classification network are updated according to the loss values. For example, for liver tumor samples, the focus loss function increases the weight of the loss of the liver tumor samples because of the small number and difficult classification, so that the network pays more attention to the study of the characteristics of the liver tumor samples in the study process, and the balance weight parameters among the classes are adjusted to improve the classification accuracy of the samples of different classes.
Step S230, inputting the optimized balance weight parameters between classes into an countermeasure training strategy in a second training stage, and generating a countermeasure disturbance sample set through gradient symbol attack.
In the countermeasure training strategy, a countermeasure disturbance sample set is generated by a gradient sign attack method based on the optimized balance weight parameters among classes. For a cascade depth classification network, it is assumed that an input liver ultrasound image sample propagates forward through the network to obtain a prediction result. During a gradient sign attack, the gradient of the loss function with respect to the input samples is calculated from the prediction results and the real labels. Taking a liver ultrasound image sample as an example, the image data is a matrix of 500×600×3 (width×height×number of channels, where the number of channels is 3 representing a color image). The calculated gradient represents the rate of change of the loss function with the input image pixel value. The immunity is then generated from the sign (positive or negative) of the gradient, for example, at a certain pixel of the image, the value of that pixel is increased if the gradient is positive, and the value of that pixel is decreased if the gradient is negative. In this way, samples in the training sample set are perturbed, generating an anti-perturbation sample set. These challenge samples can increase the robustness of the network to challenge samples, i.e. increase the correct classification ability of the network in the face of input samples that may be maliciously modified or have disturbances.
Step S240, inputting the disturbance countermeasure sample set into a knowledge distillation framework of a third training stage, and compressing model complexity parameters of the student network by using a soft label transfer mechanism of the teacher network.
In the knowledge distillation framework, there is a teacher network and a student network. The teacher network is a pre-trained network with high accuracy and complexity, predicts the disturbance-resistant sample set, and the obtained prediction result exists in the form of soft labels. The soft tag contains not only probability information that the sample belongs to each category, but also more uncertainty information. For example, for an anti-disturbance sample of a liver ultrasound image, the teacher network predicts that it is a normal liver probability of 0.6, a liver fibrosis probability of 0.3, and a liver tumor probability of 0.1. A student network is a relatively simple, low model complexity network that attempts to learn useful information from the soft labels of the teacher network. Through the soft label transmission mechanism, the student network does not need to learn complex characteristic representation again, but obtains information such as relations among categories from soft labels of the teacher network, so that model complexity parameters of the student network are compressed on the premise of keeping certain accuracy. For example, a student network may have more convolution layers and neuron connections, and after knowledge distillation, some unnecessary convolution layers or neuron connections are removed, thereby reducing the complexity of the model and improving the training efficiency and generalization ability of the network.
And S250, inputting the compressed model complexity parameters into a verification set monitoring module, triggering an early-stopping mechanism according to the classification precision and the generalization performance, and rolling back to an optimal weight snapshot.
The validation set contains a portion of the liver ultrasound image samples and their labels independent of the training set. And applying the network corresponding to the compressed model complexity parameter to the verification set for testing. In this process, the classification accuracy of the network on the validation set is calculated, e.g., the ratio of the number of samples predicted to be correct to the total number of validation samples is calculated. Meanwhile, the generalization performance of the network is evaluated, the generalization performance can be measured by comparing the performance on the verification set with the performance on the training set, and if the performance on the verification set is not greatly different from the performance on the training set, the network is better in generalization capability. Assuming that the verification set has 200 samples, when the classification accuracy reaches 90% (the value is a performance index set according to actual requirements) and the generalization performance is good, the verification set monitoring module can consider that the current network has reached a better state. If the classification accuracy begins to drop or the generalization performance becomes worse in the subsequent training process, the early shutdown system will be triggered and the network will roll back to the weight snapshot when the optimal classification accuracy and generalization performance were reached before. The weight snapshot saves the weight parameters of the network in the optimal state, and by rolling back to the state, the problems of over fitting and the like of the network in the over training process can be avoided, and the performance of the network is ensured.
Step S260, inputting the optimal weight snapshot into a dynamic quantization encoder to generate a network weight parameter for deployment of 8-bit integer coding.
Step S270, inputting the network weight parameters for deployment into an online increment learning module, and updating the parameters of the full-connection layer through clinical feedback data to adapt to the imaging characteristic difference of the ultrasonic probe.
The dynamic quantization encoder carries out quantization processing on the weight parameters in the optimal weight snapshot. In the original network weight parameters, it may be represented by a high precision data type (e.g., 32-bit floating point number). The dynamic quantization encoder converts these weight parameters into an 8-bit integer code. For example, for a weight parameter representing a classification characteristic of liver lesions, the original value is 0.12345 (32-bit floating point number representation), and after being processed by a dynamic quantization encoder, the original value is converted into an 8-bit integer code value (the specific conversion process is performed according to a quantization algorithm). The network weight parameters for the deployment of the 8-bit integer codes generated in the way greatly reduce the storage requirement and the calculation amount of the network on the premise of keeping certain accuracy, so that the network is more suitable for being deployed in actual medical equipment (such as ultrasonic diagnostic equipment). The network weight parameter for deployment is input into an online increment learning module, and the full-connection layer parameter is updated through clinical feedback data to adapt to the imaging characteristic difference of the ultrasonic probe. In practical clinical applications, different ultrasound probes may have different imaging characteristics, such as differences in imaging resolution, imaging sharpness for different tissues, etc. The clinical feedback data comprises liver ultrasonic images obtained by imaging according to different ultrasonic probes in the actual use process and corresponding diagnosis results. The parameters of the full-connection layer are updated through the data, for example, if the display of the liver edge is not clear enough in a liver image obtained by imaging a certain ultrasonic probe, so that some deviation occurs in the classification process, the online increment learning module can adjust the weight parameters in the full-connection layer according to the clinical feedback data, so that the network can better adapt to the imaging characteristic differences of different ultrasonic probes, and the diagnosis accuracy is improved.
In one possible embodiment, the method further comprises:
Step S310, comparing the output confidence entropy value of the cascade deep classification network with a preset threshold value, triggering an uncertainty early warning event, inputting the uncertainty early warning event into an anti-network reconstruction module, calculating the reconstruction error distribution of the current ultrasonic image, performing conflict detection on the reconstruction error distribution and the classification probability distribution, positioning an artifact interference area and triggering a standby classification model switching instruction.
The confidence entropy reflects the degree of uncertainty of the cascaded deep classification network on the classification result. For example, in the classification of the ultrasound image of the liver, if the probability that a certain area of the liver is normal is 0.8, the probability that the area is lesion is 0.2, and the confidence entropy value can be calculated according to the calculation formula of entropy (- Σp (i) ×log (p (i)), where p (i) is the probability of each class. Assuming that the preset threshold is 0.5, when the calculated confidence entropy value is greater than the preset threshold, an uncertainty early warning event is triggered. This means that the classification result of the current liver ultrasound image by the network is not well defined, and there may be some interference factors or limitations of the network itself.
For liver ultrasound images, the image size is assumed to be 500 x 600 pixels, each pixel having its corresponding feature value. The reconstruction error distribution is obtained by generating an countermeasure network to attempt to reconstruct the original image, and calculating the difference between the reconstructed image and the original image. For example, in a 10×10 pixel sub-region of a liver region, the pixel value of the liver tissue in the original image averages 100 (assuming that the value represents a certain characteristic intensity of the liver tissue), and the pixel value of the corresponding region in the reconstructed image averages 110, so that there is a certain reconstruction error in the region. And carrying out conflict detection on the reconstruction error distribution and the classification probability distribution. If in areas where the classification probability is high (e.g. 0.8 is normal), the reconstruction error is large, which may imply that there is an artifact interference. By such collision detection, an artifact interference region can be located. For example, a region located to one 20×20 pixel in the liver image is an artifact interference region whose coordinate range is ((x 1, y 1) - (x 2, y 2)), and then a standby classification model switching instruction is triggered.
Step S320, inputting the artifact interference region coordinates and the model switching instruction into a self-healing log construction module, recording input data characteristics and output distribution deviation degree, generating a self-healing log, inputting the self-healing log into a graph neural network trainer, constructing a fault knowledge graph of an abnormal event association relationship, inputting the fault knowledge graph into a network architecture optimizer, and dynamically adjusting the number of attention heads and the convolution kernel size of a cascade type depth classification network.
For the input data features, features of the liver ultrasound image near the artifact interference region, such as hemodynamic feature parameters, tissue elasticity parameters, etc., are recorded. Assuming that the blood flow velocity is measured at 40 cm/s (which is extracted from the raw ultrasound image data) near the artifact interference region, the tissue elasticity parameter is a particular value. The output distribution deviation degree refers to the degree of difference between the classification result and the classification result under the condition of no interference in the presence of artifact interference. For example, the probability of a region of the liver being classified as normal is 0.9, the probability of a region of the liver being classified as a possible lesion in the presence of an artifact disturbance is 0.6, and the change in the probability is the degree of deviation of the output distribution. By recording this information, a self-healing log is generated.
The graph neural network trainer can analyze the data in the self-healing log to find out the association relation between different abnormal events (such as artifact interference at different positions, output distribution deviation degree at different degrees and the like). For example, it was found that when the artifact interference region is located at the edge of the liver and the blood flow velocity measurement is between 30-50 cm/s, this correlation is built into the fault knowledge graph, which often results in a shift of the classification result towards the lesion. The fault knowledge graph graphically represents the association relationships, the nodes possibly represent different abnormal events or data features, and the edges represent the association strength between the abnormal events or data features.
For cascaded deep classification networks, the number of attention headers and the convolution kernel size have a significant impact on the performance of the network. If the fault knowledge graph is displayed under certain specific ultrasonic image characteristics of the liver (such as specific hemodynamic characteristics and artifact interference modes), the classification effect of the network is poor, and the network architecture optimizer adjusts the attention head number according to the information. Assuming that the original attention header number is 8, it was found that reducing to 6 attention headers may improve the performance of the network in this case, based on analysis of the fault knowledge graph. For convolution kernel sizes, such as the original convolution kernel size of 3 x3 pixels, it is possible to adjust to 5 x 5 pixels to better capture features in the ultrasound image of the liver, particularly features near the artifact interference region.
And step S330, inputting the optimized network architecture parameters into a full-scale self-checking module, repairing numerical overflow nodes in the weight matrix, resetting internal state parameters of the optimizer, and synchronizing the repaired network architecture parameters and the self-healing rule base to the distributed edge nodes to maintain consistency of exception handling strategies among multiple devices.
Finally, in the weight matrix, there may be value overflow nodes due to previous adjustment or value overflow in the calculation process. For example, a certain weight value should be originally between-1 and 1, but becomes a very large value (e.g., 1000) due to calculation errors or data type limitations, which is a value overflow node. The full self-test module restores the value overflow nodes to a reasonable value range. Meanwhile, the internal state parameters of the optimizer are reset, so that the optimizer can be ensured to carry out subsequent training or adjustment in a correct state. And then synchronizing the repaired network architecture parameters and the self-healing rule base to the distributed edge nodes. In a multi-facility diagnostic system, distributed edge nodes may be distributed among different medical sites (e.g., different hospital departments or different regional medical centers). By synchronizing the repaired network architecture parameters with the self-healing rule base, each distributed edge node can keep a consistent exception handling strategy, so that the consistency of diagnosis results and handling modes among different devices is ensured when liver ultrasonic images are handled, and the reliability and accuracy of the whole diagnosis system are improved.
In a possible implementation manner, after the step S150, the method further includes:
Step S410, receiving an interaction instruction triggered by a user through a time axis control or a real-time ultrasonic playback function in the interactive three-dimensional visual interface, and analyzing a parameter adjustment request and focus region re-labeling coordinates in the interaction instruction.
For example, when a doctor looks at the visualization interface of a liver patient, the doctor returns to an ultrasonic image at a certain previous time point by using a time axis control, or looks back at a certain segment in the scanning process through a real-time ultrasonic playback function, which all generate interactive instructions. Therefore, the parameter adjustment request and the focus region re-labeling coordinates in the interaction instruction can be analyzed. Assuming that there may be some inaccuracy in the diagnosis before the doctor finds in the viewing process, the parameter adjustment request in the interactive instruction contains information of the confidence threshold offset in order to adjust the decision boundary of the classification probability distribution. For example, if the confidence threshold value, which is originally determined that a region of the liver is an abnormal region, is 0.7, the doctor adjusts the threshold value to 0.6 by the interactive instruction, that is, the confidence threshold value offset is-0.1. Meanwhile, if the doctor finds that the coordinates of the previously marked focus area are inaccurate, the focus area is marked again, and new coordinate information is also contained in the interactive instruction.
And step S420, correcting the judgment boundary of the classification probability distribution according to the confidence threshold offset in the parameter adjustment request, generating a corrected abnormal region coverage layer, carrying out topology consistency check on the focus region remark coordinate and the space topology relation map, and updating the blood vessel branch connection path in the three-dimensional rendering model.
In a liver ultrasound image, the classification probability distribution originally determines the probability that each region is an abnormal region. Taking an area of the right lobe of the liver as an example, the probability of being an abnormal area is calculated to be 0.65, and the abnormal area is determined to be a normal area when the confidence threshold is 0.7. But when the confidence threshold is adjusted to 0.6, the region is included in the outlier region. In this way, the whole liver image is re-determined, and a corrected abnormal region cover layer is generated. In this cover layer, the color coding may change according to a new determination result, for example, an originally green (indicating normal) region may change to yellow (indicating suspected abnormality). And meanwhile, carrying out topological consistency check on the focus region remark coordinates and the space topological relation map. The spatial topological relation map comprises the spatial relation among blood vessels, tissues and focuses in the liver. For the remarked focal region coordinates, it is checked whether it maintains a reasonable topological relation with other structures in the map. For example, if the relative position of the lesion area and the surrounding blood vessels conforms to the physiological structure, if the remarked lesion area is close to a certain blood vessel, but the area is far away from the certain blood vessel in the previous spatial topological relation map, further examination and adjustment are needed. And updating the blood vessel branch connection path in the three-dimensional volume rendering model according to the verification result. If the position of the focus area changes, the connection relation with the blood vessel branches may be affected, for example, the focus is originally considered to be not associated with a certain tiny blood vessel branch, and after re-labeling and verification, the connection path of the blood vessel branches needs to be updated in the three-dimensional rendering model to accurately reflect the structural relation inside the liver.
And S430, re-registering the corrected abnormal region coverage layer and the updated three-dimensional rendering model to generate a secondary verification visualization frame, and extracting a change gradient of the hemodynamic characteristic parameters in the secondary verification visualization frame.
In this process, it is ensured that each pixel point in the modified abnormal region coverage layer exactly matches a corresponding position in the updated three-dimensional volume rendering model. For example, a region marked as yellow (suspected abnormality) in the corrected abnormal region cover layer is also accurately located at a corresponding position in the liver in the three-dimensional volume rendering model. Through this re-registration, the generated secondary verification visualization framework integrates the modified two-dimensional probability distribution information and the updated three-dimensional anatomical structure information. And then extracting the change gradient of the hemodynamic characteristic parameters in the secondary verification visualization framework. In liver images, hemodynamic characteristic parameters include blood flow velocity, blood flow, and the like. Assuming that in the previous visualization frame the blood flow velocity of a certain major vessel of the liver was 50 cm/s, in the second verification visualization frame the blood flow velocity becomes 45 cm/s due to the re-labeling of the lesion area or other factors, the gradient of the change in blood flow velocity is-5 cm/s. Similar calculation is performed on other hemodynamic characteristic parameters such as blood flow, and the change gradient of the hemodynamic characteristic parameters in the whole secondary verification visualization framework is obtained.
Step S440, performing incremental search in the medical image database based on the hemodynamic characteristic parameter variation gradient, screening out a historical case subset matched with the corrected classification probability distribution, and recalculating risk level weight coefficients in case similarity comparison results.
The medical image database stores a great number of historical case image data of liver diseases and relevant characteristics thereof. Taking the blood flow velocity change gradient of-5 cm/s as an example, historical cases with similar hemodynamic characteristic parameter change conditions are searched in a database. Assuming 1000 historical cases exist in the database, and analyzing the hemodynamic characteristic parameters of each case to screen out the cases which are similar to the change gradient of the hemodynamic characteristic parameters in the current secondary verification visualization framework. For example, historical cases with blood flow velocity changes in the range of-4 to-6 cm/s are screened out to form a subset of historical cases that match the modified classification probability distribution. And then recalculating risk level weight coefficients in the case similarity comparison result. In the previous case similarity comparison, the risk level weight coefficient may be determined according to the liver size, lesion position and other factors. The risk level weight coefficient is re-evaluated after adding a new factor of the change gradient of the hemodynamic characteristic parameter. For example, for a historical case, the risk level weight coefficient originally calculated according to the liver size and lesion location is 0.6, and after considering the gradient of the change of the hemodynamic characteristic parameter, the risk level weight coefficient may be adjusted to be 0.7 or 0.5, and the specific value depends on the influence of the new factor on the overall risk assessment.
Step S450, adjusting treatment proposal priority order in the logic association table according to the recalculated risk level weight coefficient, generating a dynamically optimized treatment proposal flow chart, and superposing the dynamically optimized treatment proposal flow chart and the secondary verification visualization framework to generate a final diagnosis report confirmed by a doctor.
The logical association table originally determines different treatment recommendations and priorities thereof according to factors such as risk levels. For example, for liver lesions with moderate risk levels, the previous treatment recommendation may be to first take a medication and observe for a period of time, if the condition does not improve and take into account the intervention, this is a priority determined based on the previous risk level weighting coefficients. When the risk level weight coefficient changes, for example, the risk level becomes higher, the priority of the treatment advice may be adjusted to directly perform the interventional treatment, and then the drug treatment is assisted. In this way, the treatment advice prioritization in the logical association table is adjusted according to the recalculated risk level weight coefficients, resulting in a dynamically optimized treatment advice flow chart. The dynamically optimized treatment recommendation flow chart is overlaid with a secondary verification visualization framework to generate a final diagnostic report after physician confirmation. In the final diagnosis report, doctors can simultaneously see the optimized treatment advice and information such as liver images, structural relations, related characteristic parameters and the like in the secondary verification visualization framework, and the doctor can conveniently confirm the final diagnosis.
Step S460, inputting the correction parameters and the remark coordinates in the final diagnosis report confirmed by the doctor into an online increment learning module, updating the full-connection layer weight of the cascade deep classification network, and synchronizing the updated full-connection layer weight to a standby classification model of the distributed edge node to finish consistency calibration of the cross-equipment diagnosis strategy.
For example, correction parameters such as confidence threshold offset, hemodynamic characteristic parameter variation gradient, and remarked lesion area coordinates in the final diagnostic report are input to the online delta learning module. And the online incremental learning module updates the full-connection layer weight of the cascade deep classification network according to the new information. Assuming that a certain weight is originally 0.5 in the full connection layer, the weight may be adjusted to 0.6 or 0.4 according to the influence of the correction parameter during the update process. The updated full connection layer weights are synchronized to the standby classification model of the distributed edge node. In a multi-facility diagnostic system, distributed edge nodes are distributed in different places, such as different hospital departments or different regional medical centers. By synchronizing the updated weights, the same diagnosis strategy is adopted by each device when liver ultrasonic image diagnosis is carried out, consistency calibration of the cross-device diagnosis strategy is completed, and accuracy and reliability of the whole diagnosis system are improved.
In one possible embodiment, after the step of completing the cross-device diagnostic strategy consistency calibration, the method further comprises:
step S510, periodically collecting multi-stage ultrasound review data of the target object in a treatment stage, and performing noise suppression and motion artifact compensation processing of the real-time ultrasound scanning signal sequence on each stage of ultrasound review data to generate a standardized follow-up ultrasound image sequence.
For example, it is set that a liver ultrasound review is performed on a patient every two weeks, and the ultrasound data obtained by each review is one period of the multi-period ultrasound review data. For each period of ultrasound review data, a prior noise suppression and motion artifact compensation process for the real-time ultrasound scan signal sequence is performed to generate a normalized follow-up ultrasound image sequence.
When noise suppression and motion artifact compensation processing is performed, wavelet transformation decomposition is performed first for each frame of ultrasonic scanning signal in the first-period ultrasonic review data. Assuming that the size of the liver image corresponding to the ultrasonic scanning signal is 500×600 pixels (the pixel size is determined according to the imaging resolution of the ultrasonic device), the high-frequency noise component and the low-frequency tissue signal component are separated after the decomposition by wavelet transformation. The high frequency noise component presents some signal fluctuations in the high frequency band, relatively low energy and irregular, possibly originating from electronic components of the ultrasound device, minor disturbances of the surrounding environment, etc. The low frequency tissue signal component contains the rough outline information of the main structures such as liver tissue, blood vessels and the like.
The high frequency noise component is then filtered layer by layer using an adaptive thresholding algorithm. For example, for the first layer of the high-frequency noise component, statistical parameters such as standard deviation and mean value of the layer signal are analyzed, and if the standard deviation is 5 (assumed unit), the threshold value of the layer is 15 according to a certain algorithm rule (for example, the standard deviation is multiplied by a coefficient, and the coefficient is assumed to be 3). The signal portions above the threshold are considered to be potentially useful signals to be preserved, and those below the threshold are determined to be noise and filtered out. This adaptive thresholding can effectively remove noise while preserving intermediate frequency signal components associated with vessel wall motion throughout the layer-by-layer filtering process. For vessels in the liver, the frequency of the signals generated by the blood flow is in the intermediate frequency range, for example, the frequency is between 1 and 5MHz (assumption), and the intermediate frequency signals have important significance for the subsequent analysis of the motion state of the vessel wall, the blood flow speed and the like.
And then, carrying out inverse wavelet transformation reconstruction on the filtered ultrasonic scanning signals to generate an ultrasonic image subjected to preliminary denoising. But the images are subject to motion artifacts due to the possible slight breathing or body movement of the patient during the review scan. The tissue displacement vector field between the ultrasonic scanning signals of the adjacent frames is calculated based on an optical flow method, and for the liver image, the moving direction and distance of the liver tissue between the adjacent frames are calculated. Assuming that the time interval between two adjacent frames of images is 0.1 seconds (which is a typical time interval determined by the scanning frame rate of the ultrasonic device), the optical flow method calculates the displacement vector of the liver tissue within the 0.1 seconds. For example, if a specific point on the liver (assuming that the coordinates are (x 1, y 1)) is shifted by (x 2, y 2) from the position in the first frame image to the position in the second frame image, the calculated displacement vector is ((x 2-x 1), (y 2-y 1)). According to the tissue displacement vector field, motion track compensation is carried out on the primarily denoised ultrasonic image, for example, a suspected lesion region in the liver has obvious displacement in the image due to respiratory motion of a patient before the motion track compensation, and after the motion track compensation, the position of the region in the image can more accurately reflect the real position of the region in the liver.
And then, gray scale normalization operation is carried out on the ultrasonic image after the motion compensation, and the pixel intensity is mapped to a preset standard dynamic range. Assuming that the preset standard dynamic range is 0-255 (which is a common image pixel intensity range), in the liver ultrasound image after motion compensation, the gray value range of all pixels in the image is counted first. For example, the gray scale value range of liver tissue may be 50-180 and the gray scale value range of blood vessels may be 30-100 (here, only assumed to be a general range). These raw gray values are then mapped to a standard dynamic range of 0-255 by means of a linear mapping. The purpose of this is to make the images of different phases and the images of different patients comparable in gray scale, facilitating subsequent analysis and processing. And after the gray scale normalization operation, obtaining a normalized ultrasonic image.
And finally, carrying out edge-preserving smoothing treatment on the normalized ultrasonic image by adopting an anisotropic diffusion algorithm, carrying out detail enhancement on the smoothed ultrasonic image by a bilateral filter, carrying out region matching on the ultrasonic image with enhanced details according to a preset anatomical structure template, and cutting out a standardized follow-up ultrasonic image sequence containing the boundary of the complete liver organ.
Step S520, inputting the standardized follow-up ultrasonic image sequence into an updated cascade depth classification network, and outputting follow-up classification probability distribution and spatial topology evolution map of an abnormal region in each period of follow-up ultrasonic image.
In a cascaded depth classification network, standardized follow-up ultrasound images of each phase are analyzed. For example, for a period of liver follow-up ultrasound image, the probability that a region of the liver is an abnormal region is 0.3 (the probability value is calculated according to analysis of image features by the network), which is one data point in the follow-up classification probability distribution. Meanwhile, the spatial topological evolution map can accurately describe the spatial relation change of the abnormal region and surrounding structures such as normal tissues, blood vessels and the like in the follow-up process. For example, a vessel that was originally in the vicinity of the liver at the time of initial diagnosis may change in distance from the lesion region to the vessel during follow-up, and such a change in spatial relationship is recorded in the spatial topological evolution map.
And step S530, carrying out time series alignment on the follow-up classification probability distribution and the initial classification probability distribution, calculating the area shrinkage rate of the abnormal region and the blood flow signal intensity attenuation coefficient, and generating a treatment response quantification index set.
Assuming that the area of the lesion region in the liver in the initial classification probability distribution is calculated as 100 square pixels by image analysis (assuming that one pixel represents an actual length of 0.1mm so that the area is 1 square cm), in the follow-up classification probability distribution of a certain period, the area of the lesion region is calculated as 80 square pixels (i.e., 0.8 square cm), the abnormal region area shrinkage rate is (100-80)/100=0.2, i.e., 20%. For the blood flow signal intensity attenuation coefficient, for example, when the blood flow signal intensity of a certain blood vessel in a lesion region of the liver at the time of initial diagnosis is 80 (assuming intensity units), and at a certain stage in the follow-up process, the blood flow signal intensity of the blood vessel in the lesion region becomes 60, the blood flow signal intensity attenuation coefficient is (80-60)/80=0.25, that is, 25%. In this way, the area shrinkage rate and the blood flow signal intensity attenuation coefficient of the abnormal region in each period are calculated, and a treatment response quantization index set is generated.
Step S540, inputting the spatial topological evolution map into a three-dimensional difference rendering engine, extracting a blood vessel branch morphology change vector and a focus infiltration volume change matrix, and generating a three-dimensional treatment response dynamic model.
In the three-dimensional difference rendering engine, according to information in the spatial topological evolution map, analyzing morphological changes of blood vessel branches in the liver in the follow-up process. For example, a certain blood vessel branch has a diameter of 3 pixels (assuming that one pixel represents an actual length of 0.1 mm) and a length of 20 pixels (i.e., 2 mm) at the time of initial diagnosis, and the diameter becomes 2.5 pixels and the length becomes 18 pixels during follow-up, and a blood vessel branch morphology change vector is obtained by calculating these changes. For the lesion infiltration volume change matrix, it is assumed that the lesion infiltration volume is calculated to be 100 cubic pixels (i.e. 1 cubic centimeter) by three-dimensional analysis of a lesion area in the initial diagnosis, and different lesion infiltration volumes are calculated according to space topology evolution maps of different periods in the follow-up process, for example, 80 cubic pixels (i.e. 0.8 cubic centimeter) at a certain period, and the data are formed into the lesion infiltration volume change matrix. A three-dimensional treatment response dynamic model is generated through the data, and the three-dimensional treatment response dynamic model can intuitively display the three-dimensional structural change condition of the liver in the treatment process.
Step S550, fusing the treatment response quantification index set and the three-dimensional treatment response dynamic model, generating a multi-stage treatment effect comparison report, and performing prognosis pattern matching on the multi-stage treatment effect comparison report and the historical treatment cases in the medical image database.
The multi-stage treatment effect comparison report integrates quantified indexes (such as abnormal region area shrinkage rate and blood flow signal intensity attenuation coefficient) and a three-dimensional structure change model (a three-dimensional treatment response dynamic model). And then performing prognosis pattern matching on the multi-stage treatment effect comparison report and the historical treatment cases in the medical image database. The medical image database stores a great number of historical treatment cases of liver diseases and relevant treatment effect data. And (3) matching each index and model in the multi-stage treatment effect comparison report with the historical treatment cases to find out the historical treatment cases most similar to the current treatment conditions of the patient.
Step S560, screening the optimal historical treatment cases according to the prognosis pattern matching result, extracting the corresponding treatment scheme flow chart and key node monitoring parameters, and generating the personalized treatment monitoring billboard.
For example, by matching the prognosis pattern, it is found that a certain historical treatment case is most similar to the treatment condition of the current patient, and the treatment scheme flow chart of the historical treatment case includes information such as the type, dosage, treatment period of the drug treatment, and examination items required to be performed at different stages in the treatment process. Key node monitoring parameters may include expected ranges of changes in liver function indicators (e.g., glutamic pyruvic transaminase, glutamic oxaloacetic transaminase, etc.) at a certain stage of treatment, expected changes in size of diseased areas of the liver, etc. And extracting the information to generate the personalized treatment monitoring board.
Step 570, comparing the monitoring parameter threshold value in the personalized treatment monitoring billboard with the real-time ultrasonic review data stream in real time, and automatically adjusting the drug dosage parameter and the review time interval in the treatment proposal flow chart when the deviation pre-warning event is triggered, so as to generate the self-adaptive treatment plan update instruction.
For example, the area of a liver lesion area set at a certain stage of treatment in a personalized treatment monitoring board should be reduced to a certain range, and if the area of the liver lesion area is found to not reach the expected reduced range in a real-time ultrasonic review data stream, a deviation early warning event is triggered. At this time, according to preset rules, the drug dosage parameters in the treatment recommendation flow chart are automatically adjusted, for example, the dosage of a certain drug is increased from 10 mg to 15 mg every day, and the review time interval is adjusted, for example, the original one-time every two weeks is changed to one-time every week, so as to generate the adaptive treatment plan update instruction.
Step S580, the self-adaptive treatment plan updating instruction and the multi-stage treatment effect comparison report are logically associated, a treatment closed-loop feedback file is generated, key decision parameters in the treatment closed-loop feedback file are returned to the online incremental learning module, and the subsequent classification network weight updating direction is optimized.
The treatment closed loop feedback file records the logic association relation between the self-adaptive treatment plan updating instruction and the multi-stage treatment effect comparison report, and comprises information such as the basis for adjusting the dosage of the medicine and the rechecking time interval, the expected treatment effect and the like. And returning key decision parameters (such as medicine dosage adjustment quantity, review time interval adjustment quantity and the like) to an online increment learning module, and optimizing the updating direction of the weight of the subsequent classification network by the online increment learning module according to the parameters. For example, if the improvement of the diseased region of the liver is found to be better after the drug dosage is increased, the online incremental learning module adjusts the weights associated with the drug dosage and the treatment effect in the cascaded depth classification network so as to more accurately reflect the relationship in the follow-up diagnosis and treatment advice, thereby improving the accuracy and adaptability of the whole diagnosis and treatment system.
FIG. 2 illustrates a schematic diagram of exemplary hardware and software components of an artificial intelligence based ultrasound image data classification system 100 that may implement the concepts of the present application provided by some embodiments of the present application. For example, the processor 120 may be used on the artificial intelligence based ultrasound image data classification system 100 and to perform the functions of the present application.
The artificial intelligence based ultrasound image data classification system 100 may be a general purpose server or a special purpose server, both of which may be used to implement the artificial intelligence based ultrasound image data classification method of the present application. Although only one server is shown, the functionality described herein may be implemented in a distributed fashion across multiple similar platforms for convenience to balance processing loads.
For example, the artificial intelligence based ultrasound image data classification system 100 may include a network port 110 connected to a network, one or more processors 120 for executing program instructions, a communication bus 130, and a storage medium 140 of different forms, such as a disk, ROM, or RAM, or any combination thereof. Illustratively, the artificial intelligence based ultrasound image data classification system 100 may also include program instructions stored in ROM, RAM, or other types of non-transitory storage media, or any combination thereof. The method of the present application may be implemented in accordance with these program instructions. The artificial intelligence based ultrasound image data classification system 100 also includes an Input/Output (I/O) interface 150 between a computer and other Input/Output devices.
For ease of illustration, only one processor is depicted in the artificial intelligence based ultrasound image data classification system 100. It should be noted, however, that the artificial intelligence based ultrasound image data classification system 100 of the present application may also include multiple processors, and thus the steps performed by one processor described in the present application may also be performed jointly by multiple processors or separately. For example, if the processor of the artificial intelligence based ultrasound image data classification system 100 performs steps a and B, it should be understood that steps a and B may also be performed jointly by two different processors or separately in one processor. For example, the first processor performs step a, the second processor performs step B, or the first processor and the second processor together perform steps a and B.
In addition, the embodiment of the invention also provides a readable storage medium, wherein computer executable instructions are preset in the readable storage medium, and when a processor executes the computer executable instructions, the ultrasonic image data classification method based on artificial intelligence is realized.
It should be noted that in order to simplify the presentation of the disclosure and thereby aid in understanding one or more embodiments of the invention, various features are sometimes grouped together in a single embodiment, figure, or description thereof.

Claims (10)

Translated fromChinese
1.一种基于人工智能的超声图像数据分类方法,其特征在于,所述方法包括:1. An artificial intelligence-based ultrasound image data classification method, characterized in that the method comprises:获取目标对象的实时超声扫描信号序列,其中,所述实时超声扫描信号序列包含多帧动态超声图像,每帧动态超声图像中嵌入有组织弹性参数与血流动力学参数的时序变化特征;Acquire a real-time ultrasound scanning signal sequence of the target object, wherein the real-time ultrasound scanning signal sequence includes multiple frames of dynamic ultrasound images, and each frame of the dynamic ultrasound image is embedded with time-series variation characteristics of tissue elasticity parameters and hemodynamic parameters;对所述实时超声扫描信号序列进行噪声抑制与运动伪影补偿处理,生成去噪后的标准化超声图像序列,并在所述标准化超声图像序列中标记感兴趣区域的解剖边界坐标;Performing noise suppression and motion artifact compensation processing on the real-time ultrasound scanning signal sequence to generate a denoised standardized ultrasound image sequence, and marking the anatomical boundary coordinates of the region of interest in the standardized ultrasound image sequence;基于所述解剖边界坐标对每帧标准化超声图像执行多尺度解剖结构分解,生成与不同组织层次对应的局部特征图集合,所述局部特征图集合中的每个局部特征图包含边缘锐度指标、纹理一致性指标及血流信号强度分布的三维矩阵;Performing multi-scale anatomical structure decomposition on each frame of standardized ultrasound image based on the anatomical boundary coordinates to generate a set of local feature maps corresponding to different tissue levels, wherein each local feature map in the set of local feature maps includes an edge sharpness index, a texture consistency index, and a three-dimensional matrix of blood flow signal intensity distribution;将所述局部特征图集合输入级联式深度分类网络,通过所述级联式深度分类网络中嵌套的时空特征对齐模块与多粒度注意力分配模块,对所述局部特征图集合进行跨帧特征融合与动态权重调整,输出每帧标准化超声图像中异常区域的分类概率分布及空间拓扑关系图谱;Inputting the local feature map set into a cascaded deep classification network, performing cross-frame feature fusion and dynamic weight adjustment on the local feature map set through a spatiotemporal feature alignment module and a multi-granularity attention allocation module embedded in the cascaded deep classification network, and outputting a classification probability distribution and a spatial topological relationship map of abnormal areas in each frame of a standardized ultrasound image;根据所述分类概率分布与空间拓扑关系图谱生成多模态诊断报告,并将所述多模态诊断报告与医学影像数据库中的历史病例特征进行相似性比对,生成包含风险等级标签与治疗建议的可交互式三维可视化界面。A multimodal diagnosis report is generated based on the classification probability distribution and the spatial topological relationship map, and the multimodal diagnosis report is compared with the historical case characteristics in the medical imaging database for similarity to generate an interactive three-dimensional visualization interface including risk level labels and treatment recommendations.2.根据权利要求1所述的基于人工智能的超声图像数据分类方法,其特征在于,所述对所述实时超声扫描信号序列进行噪声抑制与运动伪影补偿处理,生成去噪后的标准化超声图像序列,包括:2. The ultrasound image data classification method based on artificial intelligence according to claim 1, characterized in that the noise suppression and motion artifact compensation processing of the real-time ultrasound scanning signal sequence to generate a denoised standardized ultrasound image sequence comprises:对所述实时超声扫描信号序列中的每一帧超声扫描信号进行小波变换分解,分离高频噪声分量与低频组织信号分量;Performing wavelet transform decomposition on each frame of ultrasound scanning signal in the real-time ultrasound scanning signal sequence to separate high-frequency noise components from low-frequency tissue signal components;采用自适应阈值算法对所述高频噪声分量进行逐层滤波,得到滤波后的超声扫描信号,保留与血管壁运动相关的中频信号分量;Adopting an adaptive threshold algorithm to filter the high-frequency noise component layer by layer to obtain a filtered ultrasonic scanning signal, and retaining an intermediate frequency signal component related to the movement of the blood vessel wall;对所述滤波后的超声扫描信号进行逆小波变换重构,生成初步去噪的超声图像;Performing inverse wavelet transform reconstruction on the filtered ultrasonic scanning signal to generate a preliminary denoised ultrasonic image;基于光流法计算相邻帧超声扫描信号之间的组织位移矢量场,并根据所述组织位移矢量场对所述初步去噪的超声图像进行运动轨迹补偿,对运动补偿后的超声图像执行灰度归一化操作,将像素强度映射至预设的标准动态范围,得到归一化后的超声图像;Calculating the tissue displacement vector field between adjacent frame ultrasound scanning signals based on the optical flow method, performing motion trajectory compensation on the preliminary denoised ultrasound image according to the tissue displacement vector field, performing grayscale normalization operation on the motion compensated ultrasound image, mapping pixel intensity to a preset standard dynamic range, and obtaining a normalized ultrasound image;采用各向异性扩散算法对所述归一化后的超声图像进行边缘保持平滑处理,并通过双边滤波器对平滑处理后的超声图像进行细节增强后,根据预设的解剖结构模板对细节增强后的超声图像进行区域匹配,裁剪出包含完整器官边界的标准化超声图像序列。The normalized ultrasound image is smoothed edge-preservingly by using an anisotropic diffusion algorithm, and the smoothed ultrasound image is enhanced in detail by using a bilateral filter. Then, the detail-enhanced ultrasound image is region-matched according to a preset anatomical structure template, and a standardized ultrasound image sequence containing complete organ boundaries is cropped.3.根据权利要求1所述的基于人工智能的超声图像数据分类方法,其特征在于,所述基于所述解剖边界坐标对每帧标准化超声图像执行多尺度解剖结构分解,生成与不同组织层次对应的局部特征图集合,包括:3. The ultrasound image data classification method based on artificial intelligence according to claim 1 is characterized in that the multi-scale anatomical structure decomposition is performed on each frame of the standardized ultrasound image based on the anatomical boundary coordinates to generate a set of local feature maps corresponding to different tissue levels, comprising:在每帧标准化超声图像中,根据所述解剖边界坐标截取包含目标器官的矩形感兴趣区域;In each frame of the standardized ultrasound image, a rectangular region of interest including the target organ is intercepted according to the anatomical boundary coordinates;对所述矩形感兴趣区域进行高斯金字塔分解,生成包含原始分辨率、二分之一分辨率及四分之一分辨率的多尺度图像组;Performing Gaussian pyramid decomposition on the rectangular region of interest to generate a multi-scale image group including original resolution, half resolution and quarter resolution;在所述多尺度图像组中每个尺度的图像中应用多方向Gabor滤波器组,提取与血管分支方向匹配的频域响应特征图;Applying a multi-directional Gabor filter group to each scale image in the multi-scale image group to extract a frequency domain response feature map matching the direction of the blood vessel branch;将不同尺度的频域响应特征图输入三维卷积神经网络,通过跨尺度特征融合层聚合不同分辨率下的边缘梯度信息,并对聚合后的特征图进行非极大值抑制处理,消除冗余边缘响应并保留最大强度边缘点后,基于保留的边缘点构建空间德劳内三角网格,计算所述三角网格的每个三角面片的曲率分布与面积占比指标;The frequency domain response feature maps of different scales are input into the three-dimensional convolutional neural network, the edge gradient information at different resolutions is aggregated through the cross-scale feature fusion layer, and the aggregated feature maps are subjected to non-maximum suppression processing. After eliminating redundant edge responses and retaining the maximum intensity edge points, a spatial Delaunay triangulated mesh is constructed based on the retained edge points, and the curvature distribution and area ratio index of each triangle face of the triangular mesh are calculated;根据所述曲率分布与面积占比指标对所述三角网格进行区域划分,生成与不同组织类型对应的子网格集合,并在所述子网格集合的每个子网格内统计血流信号强度的时空变化方差,生成包含边缘几何特征与血流动力学特征的局部特征图集合。The triangular mesh is divided into regions according to the curvature distribution and area ratio index to generate a set of sub-mesh sets corresponding to different tissue types, and the spatiotemporal variation variance of the blood flow signal intensity is statistically analyzed in each sub-mesh of the sub-mesh set to generate a set of local feature maps containing edge geometry features and hemodynamic features.4.根据权利要求1所述的基于人工智能的超声图像数据分类方法,其特征在于,所述将所述局部特征图集合输入级联式深度分类网络,通过所述级联式深度分类网络中嵌套的时空特征对齐模块与多粒度注意力分配模块,对所述局部特征图集合进行跨帧特征融合与动态权重调整,输出每帧标准化超声图像中异常区域的分类概率分布及空间拓扑关系图谱,包括:4. The method for classifying ultrasound image data based on artificial intelligence according to claim 1 is characterized in that the local feature map set is input into a cascaded deep classification network, and the local feature map set is subjected to cross-frame feature fusion and dynamic weight adjustment through a spatiotemporal feature alignment module and a multi-granularity attention allocation module nested in the cascaded deep classification network, and the classification probability distribution and spatial topological relationship map of the abnormal area in each frame of the standardized ultrasound image are output, including:将所述局部特征图集合输入时空特征对齐模块的长短时记忆网络进行时序状态传递,生成包含连续帧间病灶形态演变规律的时间维度特征向量,并将所述时间维度特征向量输入可变形卷积层进行空间形变建模,输出与组织结构非线性变形匹配的空间形变特征矩阵;The local feature map set is input into the long short-term memory network of the spatiotemporal feature alignment module for temporal state transfer, a time dimension feature vector containing the lesion morphological evolution law between consecutive frames is generated, and the time dimension feature vector is input into the deformable convolution layer for spatial deformation modeling, and a spatial deformation feature matrix matching the nonlinear deformation of the tissue structure is output;拼接所述时间维度特征向量与空间形变特征矩阵,生成时空联合特征矩阵,并将所述时空联合特征矩阵输入多粒度注意力分配模块的多头自注意力机制,计算不同特征通道间的全局相关性权重,输出全局注意力权重矩阵;splicing the time dimension feature vector and the spatial deformation feature matrix to generate a spatiotemporal joint feature matrix, and inputting the spatiotemporal joint feature matrix into the multi-head self-attention mechanism of the multi-granularity attention allocation module, calculating the global correlation weights between different feature channels, and outputting the global attention weight matrix;将所述全局注意力权重矩阵输入门控线性单元进行特征通道筛选,保留与当前分类任务相关的特征通道,生成门控筛选特征向量,并将所述门控筛选特征向量输入局部窗口注意力计算层,增强相邻特征节点的空间关联性,生成局部窗口注意力权重矩阵;Input the global attention weight matrix into the gated linear unit to perform feature channel screening, retain the feature channels related to the current classification task, generate a gated screening feature vector, and input the gated screening feature vector into the local window attention calculation layer to enhance the spatial correlation of adjacent feature nodes and generate a local window attention weight matrix;加权融合所述全局注意力权重矩阵与局部窗口注意力权重矩阵,生成多粒度注意力分配图谱,并将所述多粒度注意力分配图谱通过残差连接叠加至所述时空联合特征矩阵,生成优化后的分类特征向量;The global attention weight matrix and the local window attention weight matrix are weightedly fused to generate a multi-granularity attention allocation map, and the multi-granularity attention allocation map is superimposed on the spatiotemporal joint feature matrix through residual connection to generate an optimized classification feature vector;将所述优化后的分类特征向量输入全连接分类层,输出每帧标准化超声图像中异常区域的分类概率分布及空间拓扑关系图谱。The optimized classification feature vector is input into the fully connected classification layer, and the classification probability distribution and spatial topological relationship map of the abnormal area in each frame of the standardized ultrasound image are output.5.根据权利要求1所述的基于人工智能的超声图像数据分类方法,其特征在于,所述根据所述分类概率分布与空间拓扑关系图谱生成多模态诊断报告,并将所述多模态诊断报告与医学影像数据库中的历史病例特征进行相似性比对,生成包含风险等级标签与治疗建议的可交互式三维可视化界面,包括:5. The method for classifying ultrasound image data based on artificial intelligence according to claim 1 is characterized in that a multimodal diagnosis report is generated according to the classification probability distribution and the spatial topological relationship map, and the multimodal diagnosis report is compared with the historical case characteristics in the medical imaging database for similarity, and an interactive three-dimensional visualization interface containing risk level labels and treatment recommendations is generated, including:将所述分类概率分布映射至标准化超声图像序列的空间坐标,生成带有半透明颜色编码的异常区域覆盖层;Mapping the classification probability distribution to the spatial coordinates of a standardized ultrasound image sequence to generate an abnormal region overlay with semi-transparent color coding;将所述空间拓扑关系图谱输入三维体渲染引擎,构建标注血管分支与病灶浸润范围的三维体渲染模型;Inputting the spatial topological relationship map into a three-dimensional volume rendering engine to construct a three-dimensional volume rendering model with annotated blood vessel branches and lesion infiltration ranges;将所述异常区域覆盖层与三维体渲染模型进行空间配准,生成融合二维概率分布与三维解剖结构的可视化基础框架;Performing spatial registration of the abnormal area cover layer with the three-dimensional volume rendering model to generate a visualization basic framework integrating the two-dimensional probability distribution and the three-dimensional anatomical structure;提取所述分类概率分布中的血流动力学特征参数,在医学影像数据库中检索具有相似血流动力学特征参数的历史病例特征,生成病例相似性比对结果;Extracting the hemodynamic characteristic parameters in the classification probability distribution, retrieving historical case characteristics with similar hemodynamic characteristic parameters in the medical imaging database, and generating case similarity comparison results;将所述病例相似性比对结果与所述可视化基础框架进行关联,生成侧边对比视图及关键差异参数标注层,并基于所述关键差异参数标注层匹配预设治疗方案流程图,生成风险等级标签与治疗建议的逻辑关联表,后将所述逻辑关联表嵌入所述可视化基础框架,生成包含可交互时间轴控件、实时超声回放功能及语音播报引擎的可交互式三维可视化界面。The case similarity comparison result is associated with the visualization basic framework to generate a side comparison view and a key difference parameter annotation layer, and the preset treatment plan flow chart is matched based on the key difference parameter annotation layer to generate a logical association table of risk level labels and treatment recommendations. The logical association table is then embedded in the visualization basic framework to generate an interactive three-dimensional visualization interface including an interactive timeline control, a real-time ultrasound playback function and a voice broadcast engine.6.根据权利要求1所述的基于人工智能的超声图像数据分类方法,其特征在于,所述方法还包括:6. The method for classifying ultrasound image data based on artificial intelligence according to claim 1, characterized in that the method further comprises:将带有多维度标签的训练样本库输入数据增强模块,执行随机弹性形变、局部像素遮挡及多角度仿射变换,生成扩充后的对抗训练样本集;The training sample library with multi-dimensional labels is input into the data augmentation module, and random elastic deformation, local pixel occlusion and multi-angle affine transformation are performed to generate an expanded adversarial training sample set;将所述扩充后的对抗训练样本集输入第一训练阶段的焦点损失函数,优化级联式深度分类网络的类间平衡性权重参数;Inputting the expanded adversarial training sample set into the focal loss function of the first training stage to optimize the inter-class balance weight parameters of the cascaded deep classification network;将优化后的类间平衡性权重参数输入第二训练阶段的对抗训练策略,通过梯度符号攻击生成对抗扰动样本集;The optimized inter-class balance weight parameters are input into the adversarial training strategy of the second training phase, and an adversarial perturbation sample set is generated through gradient sign attack.将所述对抗扰动样本集输入第三训练阶段的知识蒸馏框架,利用教师网络的软标签传递机制压缩学生网络的模型复杂度参数;Inputting the adversarial perturbation sample set into the knowledge distillation framework of the third training stage, and utilizing the soft label transfer mechanism of the teacher network to compress the model complexity parameters of the student network;将压缩后的模型复杂度参数输入验证集监控模块,根据分类精度与泛化性能触发早停机制并回滚至最优权重快照;The compressed model complexity parameters are input into the validation set monitoring module, and the early stopping mechanism is triggered according to the classification accuracy and generalization performance, and the model is rolled back to the optimal weight snapshot.将所述最优权重快照输入动态量化编码器,生成8位整数编码的部署用网络权重参数;Inputting the optimal weight snapshot into a dynamic quantization encoder to generate an 8-bit integer-encoded network weight parameter for deployment;将所述部署用网络权重参数输入在线增量学习模块,通过临床反馈数据更新全连接层参数以适配超声探头成像特性差异。The network weight parameters for the deployment are input into an online incremental learning module, and the fully connected layer parameters are updated through clinical feedback data to adapt to the differences in ultrasound probe imaging characteristics.7.根据权利要求6所述的基于人工智能的超声图像数据分类方法,其特征在于,所述方法还包括:7. The method for classifying ultrasound image data based on artificial intelligence according to claim 6, characterized in that the method further comprises:将所述级联式深度分类网络的输出置信度熵值与预设阈值比较,触发不确定性预警事件,并将所述不确定性预警事件输入生成对抗网络重构模块,计算当前超声图像的重构误差分布,并将所述重构误差分布与分类概率分布进行冲突检测,定位伪影干扰区域并触发备用分类模型切换指令;Compare the output confidence entropy value of the cascaded deep classification network with a preset threshold value to trigger an uncertainty warning event, and input the uncertainty warning event into a generative adversarial network reconstruction module to calculate the reconstruction error distribution of the current ultrasound image, and perform conflict detection between the reconstruction error distribution and the classification probability distribution, locate the artifact interference area and trigger a backup classification model switching instruction;将所述伪影干扰区域坐标与模型切换指令输入自愈日志构建模块,记录输入数据特征及输出分布偏离度,生成自愈日志,将所述自愈日志输入图神经网络训练器,构建异常事件关联关系的故障知识图谱,并将所述故障知识图谱输入网络架构优化器,动态调整级联式深度分类网络的注意力头数量与卷积核尺寸;Input the coordinates of the artifact interference area and the model switching instruction into the self-healing log construction module, record the input data characteristics and the output distribution deviation, generate a self-healing log, input the self-healing log into the graph neural network trainer, build a fault knowledge graph of the abnormal event association relationship, and input the fault knowledge graph into the network architecture optimizer to dynamically adjust the number of attention heads and the size of the convolution kernel of the cascaded deep classification network;将优化后的网络架构参数输入全量自检模块,修复权重矩阵中的数值溢出节点并重置优化器内部状态参数后,将修复后的网络架构参数与自愈规则库同步至分布式边缘节点,维持多设备间的异常处理策略一致性。The optimized network architecture parameters are input into the full self-check module. After repairing the numerical overflow nodes in the weight matrix and resetting the internal state parameters of the optimizer, the repaired network architecture parameters and the self-healing rule base are synchronized to the distributed edge nodes to maintain the consistency of the exception handling strategy among multiple devices.8.根据权利要求1所述的基于人工智能的超声图像数据分类方法,其特征在于,所述生成包含风险等级标签与治疗建议的可交互式三维可视化界面之后,所述方法还包括:8. The method for classifying ultrasound image data based on artificial intelligence according to claim 1, characterized in that after generating an interactive three-dimensional visualization interface including risk level labels and treatment suggestions, the method further comprises:接收用户在所述可交互式三维可视化界面中通过时间轴控件或实时超声回放功能触发的交互指令,解析所述交互指令中的参数调整请求与病灶区域重标注坐标;Receiving an interactive instruction triggered by a user through a timeline control or a real-time ultrasound playback function in the interactive three-dimensional visualization interface, and parsing a parameter adjustment request and a lesion area re-marking coordinate in the interactive instruction;根据所述参数调整请求中的置信度阈值偏移量修正所述分类概率分布的判定边界,生成修正后的异常区域覆盖层,并将所述病灶区域重标注坐标与所述空间拓扑关系图谱进行拓扑一致性校验,更新所述三维体渲染模型中的血管分支连接路径;Correcting the decision boundary of the classification probability distribution according to the confidence threshold offset in the parameter adjustment request, generating a corrected abnormal area coverage layer, performing a topological consistency check on the re-labeled coordinates of the lesion area and the spatial topological relationship map, and updating the blood vessel branch connection path in the three-dimensional volume rendering model;将所述修正后的异常区域覆盖层与更新后的三维体渲染模型重新配准,生成二次验证可视化框架,并提取所述二次验证可视化框架中的血流动力学特征参数变化梯度;Re-registering the modified abnormal area cover layer with the updated three-dimensional volume rendering model to generate a secondary verification visualization framework, and extracting the change gradient of the hemodynamic characteristic parameters in the secondary verification visualization framework;基于所述血流动力学特征参数变化梯度在所述医学影像数据库中执行增量检索,筛选出与修正后分类概率分布匹配的历史病例子集,重新计算病例相似性比对结果中的风险等级权重系数;Performing incremental retrieval in the medical image database based on the change gradient of the hemodynamic characteristic parameter, screening out a subset of historical cases matching the modified classification probability distribution, and recalculating the risk level weight coefficient in the case similarity comparison result;根据重新计算的风险等级权重系数调整所述逻辑关联表中的治疗建议优先级排序,生成动态优化后的治疗建议流程图,并将所述动态优化后的治疗建议流程图与所述二次验证可视化框架叠加,生成医师确认后的最终诊断报告;Adjusting the priority ranking of treatment suggestions in the logic association table according to the recalculated risk level weight coefficient, generating a dynamically optimized treatment suggestion flow chart, and superimposing the dynamically optimized treatment suggestion flow chart with the secondary verification visualization framework to generate a final diagnosis report after confirmation by the physician;将所述医师确认后的最终诊断报告中的修正参数与重标注坐标输入在线增量学习模块,更新所述级联式深度分类网络的全连接层权重,并将更新后的全连接层权重同步至分布式边缘节点的备用分类模型,完成跨设备诊断策略一致性校准。The corrected parameters and re-labeled coordinates in the final diagnosis report confirmed by the physician are input into the online incremental learning module, the fully connected layer weights of the cascaded deep classification network are updated, and the updated fully connected layer weights are synchronized to the backup classification model of the distributed edge node to complete the cross-device diagnostic strategy consistency calibration.9.根据权利要求8所述的基于人工智能的超声图像数据分类方法,其特征在于,所述完成跨设备诊断策略一致性校准的步骤之后,所述方法还包括:9. The method for classifying ultrasound image data based on artificial intelligence according to claim 8, characterized in that after completing the step of calibrating the consistency of cross-device diagnostic strategies, the method further comprises:周期性采集所述目标对象在治疗阶段的多期超声复查数据,对每期超声复查数据执行所述实时超声扫描信号序列的噪声抑制与运动伪影补偿处理,生成标准化随访超声图像序列;Periodically collecting multiple periods of ultrasound review data of the target object during the treatment stage, performing noise suppression and motion artifact compensation processing of the real-time ultrasound scanning signal sequence on each period of ultrasound review data, and generating a standardized follow-up ultrasound image sequence;将所述标准化随访超声图像序列输入更新后的级联式深度分类网络,输出每期随访超声图像中异常区域的随访分类概率分布及空间拓扑演变图谱;Inputting the standardized follow-up ultrasound image sequence into the updated cascade deep classification network, and outputting the follow-up classification probability distribution and spatial topological evolution map of the abnormal area in each follow-up ultrasound image;将所述随访分类概率分布与初始分类概率分布进行时间序列对齐,计算异常区域面积收缩率与血流信号强度衰减系数,生成治疗响应量化指标集合;Aligning the follow-up classification probability distribution with the initial classification probability distribution in time series, calculating the abnormal area shrinkage rate and the blood flow signal intensity attenuation coefficient, and generating a set of quantitative indicators of treatment response;将所述空间拓扑演变图谱输入三维差异渲染引擎,提取血管分支形态变化向量与病灶浸润体积变化矩阵,生成三维治疗响应动态模型;Inputting the spatial topological evolution map into a three-dimensional difference rendering engine, extracting the morphological change vector of blood vessel branches and the change matrix of lesion infiltration volume, and generating a three-dimensional treatment response dynamic model;融合所述治疗响应量化指标集合与三维治疗响应动态模型,生成多期治疗效果对比报告,并将所述多期治疗效果对比报告与所述医学影像数据库中的历史治疗案例进行预后模式匹配;Fusion of the treatment response quantitative index set and the three-dimensional treatment response dynamic model to generate a multi-phase treatment effect comparison report, and prognostic pattern matching of the multi-phase treatment effect comparison report with historical treatment cases in the medical imaging database;根据预后模式匹配结果筛选最佳历史治疗案例,提取对应的治疗方案流程图与关键节点监测参数,生成个性化治疗监测看板;Filter the best historical treatment cases based on the prognostic pattern matching results, extract the corresponding treatment plan flowchart and key node monitoring parameters, and generate a personalized treatment monitoring dashboard;将所述个性化治疗监测看板中的监测参数阈值与实时超声复查数据流进行实时比对,触发偏离预警事件时,自动调整所述治疗建议流程图中的药物剂量参数与复查时间间隔,生成自适应治疗计划更新指令;The monitoring parameter thresholds in the personalized treatment monitoring dashboard are compared in real time with the real-time ultrasound review data stream. When a deviation warning event is triggered, the drug dosage parameters and review time intervals in the treatment recommendation flow chart are automatically adjusted to generate an adaptive treatment plan update instruction;将所述自适应治疗计划更新指令与多期治疗效果对比报告进行逻辑关联,生成治疗闭环反馈文件,并将所述治疗闭环反馈文件中的关键决策参数回传至所述在线增量学习模块,优化后续分类网络权重更新方向。The adaptive treatment plan update instruction is logically associated with the multi-phase treatment effect comparison report to generate a treatment closed-loop feedback file, and the key decision parameters in the treatment closed-loop feedback file are fed back to the online incremental learning module to optimize the subsequent classification network weight update direction.10.一种基于人工智能的超声图像数据分类系统,其特征在于,所述基于人工智能的超声图像数据分类系统包括处理器和存储器,所述存储器和所述处理器连接,所述存储器用于存储程序、指令或代码,所述处理器用于执行所述存储器中的程序、指令或代码,以实现上述权利要求1-9任意一项所述的基于人工智能的超声图像数据分类方法。10. An ultrasound image data classification system based on artificial intelligence, characterized in that the ultrasound image data classification system based on artificial intelligence includes a processor and a memory, the memory is connected to the processor, the memory is used to store programs, instructions or codes, and the processor is used to execute the programs, instructions or codes in the memory to implement the ultrasound image data classification method based on artificial intelligence described in any one of claims 1 to 9.
CN202510384168.5A2025-03-282025-03-28 Ultrasonic image data classification method and system based on artificial intelligencePendingCN120318569A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202510384168.5ACN120318569A (en)2025-03-282025-03-28 Ultrasonic image data classification method and system based on artificial intelligence

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202510384168.5ACN120318569A (en)2025-03-282025-03-28 Ultrasonic image data classification method and system based on artificial intelligence

Publications (1)

Publication NumberPublication Date
CN120318569Atrue CN120318569A (en)2025-07-15

Family

ID=96323369

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202510384168.5APendingCN120318569A (en)2025-03-282025-03-28 Ultrasonic image data classification method and system based on artificial intelligence

Country Status (1)

CountryLink
CN (1)CN120318569A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN120580526A (en)*2025-08-052025-09-02福建中医药大学Liver disease image recognition processing method, medium and equipment based on multi-mode fusion
CN120635604A (en)*2025-08-132025-09-12浙江大学Intelligent solid garbage sorting system based on image recognition

Cited By (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN120580526A (en)*2025-08-052025-09-02福建中医药大学Liver disease image recognition processing method, medium and equipment based on multi-mode fusion
CN120635604A (en)*2025-08-132025-09-12浙江大学Intelligent solid garbage sorting system based on image recognition

Similar Documents

PublicationPublication DateTitle
CN120318569A (en) Ultrasonic image data classification method and system based on artificial intelligence
GB2605391A (en)Medical Image Analysis Using Neural Networks
CN117636116A (en)Method for intelligently fusing CT image data with MRI data
CN119887773B (en) A medical image recognition and processing system and method based on multimodal image fusion
CN118154547A (en)Joint MRI image recognition method and system based on artificial intelligence
US20250232442A1 (en)Apparatus and methods for generating a three-dimensional (3d) model of cardiac anatomy via machine-learning
US20250245919A1 (en)Apparatus and method for generating a three-dimensional (3d) model of cardiac anatomy based on model uncertainty
US12399932B1 (en)Apparatus and methods for visualization within a three-dimensional model using neural networks
CN120340784A (en) A medical image automatic diagnosis method and system based on deep learning
Jalali et al.VGA‐Net: Vessel graph based attentional U‐Net for retinal vessel segmentation
US20250157628A1 (en)Apparatus and methods for synthetizing medical images
US12322104B2 (en)Apparatus and methods for generating a three-dimensional (3D) model of cardiac anatomy via machine-learning
Wang et al.Segmentation of intravascular ultrasound images based on convex–concave adjustment in extreme regions
CN112862787B (en)CTA image data processing method, device and storage medium
CN120318236B (en)Ultrasonic image intelligent analysis system based on deep learning
CN120495295B (en)Osteoporosis diagnosis method based on image recognition
CN120375108B (en)Automatic recognition system for abnormal pumping blood after cardiac intervention operation based on machine learning
CN119649158B (en) Image feature selection method for Spark model of schizophrenia medical records
CN120047943B (en)Macrophage identification method and system based on IV-OCT image
US20250157632A1 (en)Apparatus and methods for synthetizing medical images
Kobayashi et al.Learning global and local features of normal brain anatomy for unsupervised abnormality detection
US20250245829A1 (en)Apparatus and method for generating a three-dimensional (3d) model of patients organ
US20250209697A1 (en)Apparatus and method for generating a three-dimensional (3d) model with an overlay
Lin et al.Ultrasound image-guided algorithms for tracking liver motion
JiaPopulation-based models of shape, structure, and deformation in atrial fibrillation

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination

[8]ページ先頭

©2009-2025 Movatter.jp