Disclosure of Invention
In view of the above, the present invention is directed to a method, an apparatus and a terminal device for identifying a maxillofacial fracture based on a CBCT database, so as to at least partially solve the above problems. The invention is based on a convolutional neural network algorithm, carries out deep learning training on a CBCT data model of the maxillofacial fracture, tests set model verification, and learns artificial intelligence to assist in diagnosing frostbite and maxillofacial wound by means of human experience, thereby forming an intelligent diagnosis platform and improving the stability and reaction efficiency of disease diagnosis and treatment. The invention solves the efficiency problem of jaw face fracture identification, and assists doctors to improve diagnosis efficiency by automatically detecting the fracture position.
In a first aspect of the present invention, there is provided a method for identifying a maxillofacial fracture based on a CBCT database, the method comprising: decomposing the jaw face CBCT data to be identified into image block sequences of different anatomical regions according to an anatomical structure; inputting each image block in the image block sequence into a corresponding trained fracture judgment model respectively to obtain a fracture judgment result sequence of the image block; and determining that the fracture judgment result sequence meets fracture judgment conditions, and outputting corresponding image blocks and fracture positions.
Optionally, decomposing the maxillofacial CBCT data to be identified into image block sequences of different anatomical regions according to the anatomical structure includes: inputting the CBCT image sequence of the CBCT data into a trained bone structure positioning model; generating a detection rectangular frame sequence; positioning the fault range of each anatomical region according to the rectangular frames of different anatomical regions; and respectively extracting the image block sequence of each anatomical region layer by layer from the CBCT image sequence in the positioned fault range.
Optionally, the bone structure positioning model is a convolutional neural network model, the input of the convolutional neural network model is a CBCT image sequence of the CBCT data, and the output is a rectangular frame detection result corresponding to each image in the CBCT image sequence.
Optionally, the trained bone structure localization model is obtained through the following steps: marking CBCT data with maxillofacial fracture characteristics and marking the CBCT data according to anatomical structures to serve as a first training sample; dividing the first training sample into a training set and a validation set; after the parameters of the bone structure positioning model are preset, iterative training is carried out on the bone structure positioning model by adopting the training samples in the training set and a gradient descent algorithm; and determining the optimal parameters of the bone structure positioning model by adopting the training samples in the verification set.
Optionally, before the dividing the training samples into the training set and the verification set, the method further includes: and mapping the pixel gray value of the CBCT image sequence to a preset pixel gray value range.
Optionally, the fracture discrimination model is: one of EfficientNet 3, ResNet, DenseNet, and 3D-ResNet.
Optionally, the trained fracture discrimination model is obtained through the following steps: determining a model structure of the fracture discrimination model; decomposing the first training sample into image block sequences of different anatomical regions according to the anatomical structure, wherein the image block sequence of each anatomical region forms a partition training sample; the fracture discrimination model of each different anatomical region is trained by adopting the following steps: dividing the partition training samples corresponding to the anatomical region into a training set and a verification set; after the parameters of the fracture discrimination model are preset, iterative training is carried out on the fracture discrimination model by adopting training samples in the training set and a gradient descent algorithm; and determining the optimal parameters of the fracture discrimination model by adopting the training samples in the verification set.
Optionally, determining that the fracture judgment result sequence satisfies the fracture judgment condition, and outputting a corresponding image block and a corresponding fracture position, including: acquiring the number and the position relation of fracture judgment results in the fracture judgment result sequence, wherein the fracture judgment results are 'fractures'; and determining that the relationship between the number and the position accords with fracture characteristics, outputting an anatomical region with fracture judgment result as a fracture position, and outputting a corresponding image block.
In a second aspect of the present invention, there is also provided a CBCT database-based maxillofacial fracture identification apparatus, the apparatus comprising: the image input module is used for acquiring the CBCT data of the maxillofacial region to be identified; the bone structure positioning module is used for decomposing the jaw face CBCT data into image block sequences of different anatomical regions according to anatomical structures; the fracture judgment module is used for respectively inputting each image block in the image block sequence into a corresponding trained fracture judgment model to obtain a fracture judgment result sequence of the image block; and the identification output module is used for determining that the fracture judgment result sequence meets the fracture judgment condition and outputting a corresponding image block and a corresponding fracture position.
In a third aspect of the present invention, there is also provided a terminal device, including a memory, a processor and a computer program stored in the memory and operable on the processor, wherein the processor implements the steps of the recognition method for maxillofacial fractures based on a CBCT database.
In a fourth aspect of the present invention, there is also provided a computer-readable storage medium having stored therein instructions, which when run on a computer, cause the computer to execute the aforementioned recognition method of maxillofacial fractures based on a CBCT database.
Through the technical scheme provided by the invention, the following beneficial effects are achieved: the CBCT data base-based jaw face fracture artificial intelligence diagnosis method provided by the invention is based on a convolutional neural network algorithm, deep learning training is carried out on a jaw face fracture CBCT data model, test set model verification is carried out, artificial intelligence is learned by means of human experience to assist in diagnosis of frostbite and jaw face wound, an intelligent diagnosis platform is formed, and the stability and the reaction efficiency of disease diagnosis and treatment are improved. The system solves the limitation that the traditional diagnosis and treatment depends on professional doctors and specific diagnosis and treatment places, and makes the instant diagnosis and treatment possible on accident sites requiring quick response, accurate judgment and timely treatment, such as major traffic accidents, large-scale sports and the like.
Additional features and advantages of the invention will be set forth in the detailed description which follows.
Detailed Description
In addition, the embodiments of the present invention and the features of the embodiments may be combined with each other without conflict.
The following detailed description of embodiments of the invention refers to the accompanying drawings. It should be understood that the detailed description and specific examples, while indicating the present invention, are given by way of illustration and explanation only, not limitation.
Fig. 1 is a schematic flow chart of a recognition method for maxillofacial fractures based on a CBCT database according to an embodiment of the present invention, as shown in fig. 1. In a first aspect of the present invention, there is provided a method for identifying a maxillofacial fracture based on a CBCT database, the method comprising: decomposing the jaw face CBCT data to be identified into image block sequences of different anatomical regions according to an anatomical structure; inputting each image block in the image block sequence into a corresponding trained fracture judgment model respectively to obtain a fracture judgment result sequence of the image block; and determining that the fracture judgment result sequence meets fracture judgment conditions, and outputting corresponding image blocks and fracture positions.
Therefore, the maxillofacial CBCT data are decomposed into a plurality of structural members on anatomical structures, whether the fracture exists in the maxillofacial CBCT data is identified by identifying whether a single structural member is fractured, so that the whole maxillofacial CBCT data can be judged, and the fracture position can be identified. The embodiment of the invention decomposes and judges the CBCT data according to the regions by the trained image processing characteristics of machine learning, has the advantages of fast processing and small information loss, and simultaneously avoids the problem of low accuracy caused by the integral judgment of the CBCT data of the maxillofacial region. The algorithm of the embodiment mainly comprises two models: the bone structure positioning model and the fracture distinguishing model are mainly used for finding specific positions and contours of the maxilla and the mandible, extracting different key parts, and then inputting the extracted different parts into the fracture distinguishing model to identify the fracture type.
In an embodiment of the present invention, the decomposing the maxillofacial CBCT data to be identified into image block sequences of different anatomical regions according to the anatomical structure includes: inputting the CBCT image sequence of the CBCT data into a trained bone structure positioning model; generating a detection rectangular frame sequence; positioning the fault range of each anatomical region according to the rectangular frames of different anatomical regions; and respectively extracting the image block sequence of each anatomical region layer by layer from the CBCT image sequence in the positioned fault range. The CBCT image sequence is sequentially input into a trained detection model, a detection rectangular frame sequence is output, the fault ranges of 8 anatomical regions are positioned according to detected rectangular frames of different types, then the 8 anatomical regions are respectively extracted from the CBCT image sequence within the positioned fault ranges layer by layer, and as the cheekbone, the mandible, the mandibular angle, the ascending limb, the condylar process and the coracoid process have bilateral symmetry, 13 image block sequences are finally extracted.
In an embodiment provided by the present invention, the bone structure localization model is a convolutional neural network model, the input of the convolutional neural network model is a CBCT image sequence of the CBCT data, and the output is a rectangular frame detection result corresponding to each image in the CBCT image sequence. Specifically, the RetinaNet convolutional neural network structure can be adopted in the embodiment, the input of the model is a single CBCT image sequence, and the output is a corresponding rectangular frame detection result. The detection model RetinaNet may be replaced by other models, such as SSD, YOLO, FasterRCNN, and other target detection models.
In an embodiment of the present invention, the trained bone structure localization model is obtained by the following steps: marking CBCT data with maxillofacial fracture characteristics and marking the CBCT data according to anatomical structures to serve as a first training sample; dividing the first training sample into a training set and a validation set; after the parameters of the bone structure positioning model are preset, iterative training is carried out on the bone structure positioning model by adopting the training samples in the training set and a gradient descent algorithm; and determining the optimal parameters of the bone structure positioning model by adopting the training samples in the verification set. The method comprises the following specific steps:
and establishing a sample database. Various fracture subjects are recruited in the maxillofacial surgery outpatient clinic, project contents are informed, informed consent is given, and the two clinics are switched to shoot a large-visual-field CBCT film. The CBCT data of the patient is labeled. And obtaining the mask mark and the mark of the fracture type layer by layer. Exporting the DICOM files of all labels and the label table. This step can be further divided into the following three substeps:
1-1: the standard was included. Chinese 18-80 years old with a history of jaw face trauma fracture within half a month. The inclusion criteria were: 1) the age of the Han adults is 18-80 years old; 2) the history of jaw face trauma fracture is kept within half a month; 3) the maxillofacial region has no history of serious tumors; 4) no systemic bone metabolism disease; 5) no development deformity of maxillofacial region; 6) there was no history of radiotherapy and chemotherapy.
The types of fractures mainly include: left/right coracoid fracture, left/right ascending branch fracture, left/right mandibular angle fracture, left/right mandibular body fracture, left/right condylar fracture, chin fracture, alveolar fracture, maxillary fracture, left/right zygomatic bone fracture. Exclusion criteria were: 1) there are congenital facial asymmetries such as severe deviation of the jaw, deviation of nasal septum, and small ear deformities; 2) there is history of surgery on hard tissues of maxillofacial region; 3) old fracture or greenstick fracture; 4) the whole body is poor in condition and cannot tolerate referral or sit up. 5) Women with pregnancy period of 1-3 months.
1-2: an annotation is obtained. The content of the annotation mainly comprises mask annotation and fracture type annotation layer by layer. And (3) carrying out layer-by-layer labeling, importing the DICOM format of CBCT data of the subject into Mimics Medical 21.0 software for segmentation, and exporting the segmented zygomatic maxilla and mandible mask in the DICOM format. Observing a CBCT fault containing a fracture line, marking and exporting; the fracture type labeling divides the fracture type into 14 types, corresponding to each case, the fracture type of a positive result is labeled as '1', a negative result is labeled as '0', and all labeled DICOM files and labeling tables are exported.
1-3: and exporting the DICOM format file of the label and the original CBCT. The algorithm mainly comprises two models: the bone structure positioning model is mainly used for finding specific positions and contours of the maxilla and the mandible, extracting different key parts, and then inputting the extracted different parts into the discrimination model to recognize the fracture type.
Training samples are formed through the steps, and then the model is trained by the training samples. Taking 10% of training samples, namely 40 samples, as a verification set, and taking the rest as a training set; the model parameters adopt parameters pre-trained on a large natural image data set ImageNet, iterative training is carried out on the model by adopting a gradient descent algorithm SGD, and the optimal parameters of the model are determined according to mAP values on a verification set.
In one embodiment of the present invention, before the dividing the training samples into a training set and a validation set, the method further includes: and mapping the pixel gray value of the CBCT image sequence to a preset pixel gray value range. In an embodiment provided by the present invention, the mapping the pixel gray scale value of the CBCT image to a preset range includes: mapping pixel gray values of the CBCT image to [0, 255] according to a linear mode, wherein the mapping formula is as follows:
y=(x-xmin)/(xmax-xmin)*255
where y represents the mapped value, xminRepresenting the minimum pixel gray value, x, of CBCTmaxRepresenting the maximum pixel gray value of CBCT. In the embodiment, the gray value is compressed into 8 bits, so that the consistency of the training image in gray processing is realized.
In one embodiment of the present invention, the fracture discrimination model is: one of EfficientNet 3, ResNet, DenseNet, and 3D-ResNet. The models are all of a convolutional neural network structure, the input of each model is a single-area image block, and the output of each model is a corresponding binary discrimination result.
The fracture discrimination model in the previous embodiment also needs to be trained. Optionally, the trained fracture discrimination model is obtained through the following steps: determining a model structure of the fracture discrimination model; decomposing the first training sample into image block sequences of different anatomical regions according to the anatomical structure, wherein the image block sequence of each anatomical region forms a partition training sample; the fracture discrimination model of each different anatomical region is trained by adopting the following steps: dividing the partition training samples corresponding to the anatomical region into a training set and a verification set; after the parameters of the fracture discrimination model are preset, iterative training is carried out on the fracture discrimination model by adopting training samples in the training set and a gradient descent algorithm; and determining the optimal parameters of the fracture discrimination model by adopting the training samples in the verification set. Specifically, the fracture detection step comprises: carrying out deep learning discrimination model training according to the extracted region image block sequence and the corresponding label, and carrying out automatic identification on various fractures; wherein the model training of the fracture discrimination model comprises: taking 80% of the extracted region image block sequences as a training set and 20% as a verification set; the model parameters adopt parameters pre-trained on a large natural image data set ImageNet, a gradient descent algorithm Adam is adopted to carry out iterative training on the model, and the optimal parameters of the model are determined according to the F1 values on the verification set.
In one embodiment of the present invention, determining that the fracture determination result sequence satisfies fracture determination conditions and outputting corresponding image blocks and fracture positions includes: acquiring the number and the position relation of fracture judgment results in the fracture judgment result sequence, wherein the fracture judgment results are 'fractures'; and determining that the relationship between the number and the position accords with fracture characteristics, outputting an anatomical region with fracture judgment result as a fracture position, and outputting a corresponding image block. Specific fracture detection results are generated: and sequentially inputting the 13 region image block sequences into the trained discrimination model, and outputting to form a binary discrimination result sequence. For a bone structure with bilateral symmetry, the sequence of the discrimination result can be regarded as a left binary sequence and a right binary sequence (0 represents no fracture, and 1 represents fracture); the bone structure without left-right symmetry can be regarded as 1 binary sequence according to the judgment result, and finally 13 binary sequences can be obtained; for any binary sequence, the following rules are made according to the characteristics of fracture continuity and on the premise of ensuring high sensitivity: if the number of the continuous elements 1 in the binary sequence is more than 1, the type fracture identification result of the sample is positive, and a corresponding positive layer image is output, otherwise, the type fracture identification result is negative.
Further, the test group data were labeled by two hospitalizers for maxillofacial surgery as "gold standard" for fracture diagnosis. Comparing the output result of the test group data machine with the 'gold standard', and calculating the sensitivity and specificity of the fracture diagnosis of each part.
In one embodiment provided by the invention, a recognition device for the maxillofacial fracture based on the CBCT database is also provided. Fig. 2 is a schematic structural diagram of a recognition device for maxillofacial fracture based on CBCT database according to an embodiment of the present invention, as shown in fig. 2. The device comprises: the image input module is used for acquiring the CBCT data of the maxillofacial region to be identified; the bone structure positioning module is used for decomposing the jaw face CBCT data into image block sequences of different anatomical regions according to anatomical structures; the fracture judgment module is used for respectively inputting each image block in the image block sequence into a corresponding trained fracture judgment model to obtain a fracture judgment result sequence of the image block; and the identification output module is used for determining that the fracture judgment result sequence meets the fracture judgment condition and outputting a corresponding image block and a corresponding fracture position.
For the specific definition of the recognition device for the maxillofacial fracture based on the CBCT database, reference may be made to the above definition of the recognition method for the maxillofacial fracture based on the CBCT database, which is not described herein again. The various modules in the above-described apparatus may be implemented in whole or in part by software, hardware, and combinations thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In an embodiment of the present invention, there is also provided a terminal device, including a memory, a processor and a computer program stored in the memory and operable on the processor, wherein the processor executes the computer program to implement the steps of the recognition method for maxillofacial fracture based on CBCT database.
Fig. 3 is a schematic diagram of a terminal device according to an embodiment of the present invention, as shown in fig. 3. Theterminal device 10 may be a computing device such as a desktop computer, a notebook, a palm computer, and a cloud server.Terminal device 10 may include, but is not limited to, aprocessor 100, amemory 101. Those skilled in the art will appreciate that fig. 3 is merely an example of aterminal device 10 and does not constitute a limitation ofterminal device 10 and may include more or fewer components than shown, or some components may be combined, or different components, e.g., the terminal device may also include input-output devices, network access devices, buses, etc.
TheProcessor 100 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
Thememory 101 may be an internal storage unit of theterminal device 10, such as a hard disk or a memory of theterminal device 10. Thememory 101 may also be an external storage device of theterminal device 10, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like, which are provided on theterminal device 10. Further, thememory 101 may also include both an internal storage unit of theterminal device 10 and an external storage device. Thememory 101 is used for storing thecomputer program 102 and other programs and data required by theterminal device 10. Thememory 101 may also be used to temporarily store data that has been output or is to be output.
Fig. 4 is a diagram illustrating steps of the recognition method for maxillofacial fracture based on CBCT database according to an embodiment of the present invention, as shown in fig. 4. The embodiment of the invention provides a CBCT database-based maxillofacial fracture identification method and device aiming at the problems of complex identification and low accuracy rate of CBCT data fracture identification. The embodiment provided by the invention is applied to a medical image processing system.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). The memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.