Summary of the invention
The purpose of this invention is to provide a kind of can to medical image effectively compress and improve the medical image file transfer speed medical image region of interest automatically identification and scalable compression method; Come identification region of interest and classification automatically through image processing method based on artificial neural network; Thereby realize region of interest (ROI) and non-region-of-interest different stage are adopted the different compression method, reach the diagnosis and treatment efficient of raising tele-medicine and the dual purpose of diagnosis and treatment quality.
The technical scheme that the present invention adopted is:
Medical image region of interest based on artificial neural network is discerned and scalable compression method automatically, it is characterized in that:
Realize by following steps:
Step 1: to the medical digital image is that the DICOM image carries out pre-service; Comprise that eliminating ground unrest, window/horizontal adjustment and histogram equalization handles; And utilization is based on the edge detection algorithm of grid search and hough transform; Discern the tissue profile, confirm the boundary of inside of human body tissue and epidermis muscle;
Step 2: through using BPF. and wavelet filter algorithm, characteristics of image is strengthened in the tissue image thatstep 1 the is split computing that repeatedly superposes, and discerns region of interest; Extract the region of interest eigenwert then, and according to characteristic value information, the using artificial neural networks method is classified, confirm region of interest and corresponding rank;
Step 3: to the Network Transmission effect of requirement, region of interest and non-region of interest that step 2 is confirmed adopt the different compression mode according to different stage according to current network bandwidth condition and doctor.
Pre-service described in thestep 1 is meant the image information of the normal society of removing the image border, artificial filling, irrelevant noise data and tissue profile air background in addition.
Eigenwert described in the step 2 is density, form, texture, position and the range information of region of interest image.
In the described step 3; Based on the current network bandwidth condition comprise the broadband, wireless, the dialing network access and current network speed; The doctor comprises that to the requirement of network laser propagation effect number and compress mode that every width of cloth figure comprises region of interest are Fast Compression, suitably compression or precise compression, adopts the different compression mode to generate tiff image by the different stage of region of interest and non-region of interest;
Region of interest to step 3 is confirmed is used LZW coding lossless compress;
The whole figure that comprises region of interest and non-area-of-interest is adopted the JPEG lossy compression method of corresponding different compression factor.
The present invention has the following advantages:
(1) the present invention carries out noise reduction process to the medical image file; And the characteristic of the region of interest image at focus place strengthened, image file is significantly diminished, improved transfer rate; Also kept the effective necessary information that is used for diagnosis and treatment in the image, be convenient to the doctor and read.
(2) patient can be at the community medicine center, MEC or medical image center; Utilize the local facilities image data; And through wired network or wireless network transmissions data to the teleprocessing center, obtain the specialty analysis result, need not that hospital registers, lines up to the down town again.
(3) medical institutions need not buy expensive tele-medicine specialized equipment; Can utilize general calculation machine hardware and network; Can only need a notebook or a TD internet-of-things terminal that has wireless network card in outlying rural area; Just can utilize advanced network technology, obtain the diagnostic result of specialty.And the doctor who implements remote diagnosis need not to drop into the extra time as talk in real time with patient, many people repeat to read the sheet and the consultation of doctors; Through the automatic analysis of intelligent algorithm and the scheduling of network; Can alleviate doctor's workload greatly; Effectively reduce medical operating cost, and promoted the medical skill service level.
(4) existing health check-up has often limited the effect of its prophylactic treatment owing to lack the expert of accurate otherwise effective technique and association area.Present technique will help promoting the reduction of medical examination cost, popularize conventional gonosome inspection.Along with the raising of medical science remote Diagnosis Technology, annual routine physical examination will become the important means of prophylactic treatment, eliminate for a lot of major diseases to provide safeguard in early days, realize the advanced theory of prophylactic treatment, improve the masses' healthy living level.
Embodiment
Below in conjunction with embodiment the present invention is carried out detailed explanation.
As shown in Figure 1, the compression method of image file in the numerical dialing system medical image transmission course involved in the present invention, realize by following steps:
Step 1: pre-service:
Gather the medical digital image in client, promptly the DICOM image carries out pre-service to this image file, comprises the background elimination, window/level correction and histogram equalization and removal noise and irrelevant information.
Image background comprises because collimating apparatus is blocked the normal society at the X-ray film edge cause, artificial annotate zone outside information and the patient contour.Eliminating background and will help to strengthen visual quality for images, is that follow-up processing reduces workload simultaneously.
In order to ensure not wiping the relevant image error of diagnosis, adopt the dynamic thresholding method of analyzing based on image histogram, promptly on histogram, confirm two peak values, then the lowest part of frequency is made as between the twoT1
Window with a fixed size partly reaches scanning on every side in picture centre again, and the window that picture centre partly is higher than mean value should calculate mean value and variance in the exit window in tissue, and the frequency values of choosing one-sided 0.95 degree of belief then is made asT2
The minimum value of choosing in the two is come separating background as threshold value:Min{T1,T2.
This method is more safer than traditional image histogram analysis method; (region of interest comprises the tissue and the fat deposit of bulk in worst case; But its edge is very fuzzy) under, because the light intensity of fat deposit generally is lower than its hetero-organization, may also separate the adipose tissue that is lower than 10%; But consider that fat deposit often is not that pathology produces the position, so be safe.
Window/level correction is used to regulate picture contrast and display quality.Window is meant the distribution range of gray-scale value, and the window value reduces then to increase the contrast of image.Level value is window scope center at interval.Automatically find out histogrammic maximal value of entire image and minimum value through computing machine, suppress irrelevant gray-scale value, generate the default look-up table that is used for the image demonstration, thereby improve the display quality of image with window and level.
Histogram equalization is the gradation of image information distribution to be got even as far as possible, through analysis image intensity and light step, from the histogram calculation normalization cumulative histogram of image, thereby generates the image that image detail is obviously strengthened.
Hough transform (Hough Transform) algorithm is to utilize image overall Characteristics Detection objective contour, and edge pixel is connected to form a kind of common method of closed boundary, and its advantage is that the influence that be interrupted by noise and curve is less.In order to check the boundary of human body interior tissue and epidermis muscle, two kinds of methods can be arranged: grid search and curve vector method.The grid method marks boundary information sometimes in some cases by error, and is big but the shortcoming of curve vector method is a calculated amount.In order to satisfy system to response time and performance demands, adopted the search of comprehensive grid and based on the edge detection algorithm of hough transform, realize confirming rapidly and accurately the border of tissue and epidermis, generate and organize mask, thereby isolate the tissue image.
Step 2: identification region of interest:
The tissue image thatstep 1 is split carries out graphical analysis, judges region of interest (ROI).
BPF. (band-pass filters) algorithm carries out computing through the pixel grey scale with a field around the pending pixel with corresponding BPF. element and improves signal to noise ratio (S/N ratio) and strengthen image.What Fig. 2 showed is the BPF. that has Gaussian convolution operator (Gaussian kernels) formation of various criterion poor (standard deviation) by 4, is used to strengthen central tissue's signal.
The image border is the point of discontinuity of gray scale on the image or the place of gray scale acute variation, and in real image, because the existence of noise, rim detection becomes a difficult problem.Classical edge detection method such as Roberts operator, the Sobel operator, the Prewitt operator, Log operator and Canny operator all have weak point, under some concrete condition, can not detect the best edge of object.Through research, adopt the Edge-Detection Algorithm of using two-dimensional discrete wavelet conversion, at first to discrete wavelet transform coefficients
Wm1t(n1?+?s,?n2?+?s)
AndWm2T (n1+ s, n2+ s)
Do the enhancing conversion,, can do the nonlinear contrast degree to image and strengthen, when suppressing noise, higher edge precision is provided then to adjusted coefficient reconstruction.
With the result of the above two kinds of algorithm process computing that superposes, thereby region of interest (ROI, Region of Interest) characteristic is strengthened in identification; Again to the image of region of interest, extract density, form, texture, position, image intensity average and standard deviation that eigenwert comprises image, with information such as the number of epidermis distance, spicule and size.
Characteristic value information with the above region of interest is input to artificial neural network, confirms region of interest.Artificial neural network is exactly that interactional dynamic process of while between the neuron is passed through in anthropomorphic dummy's thinking, is a nonlinear dynamic system, and its characteristic is the distributed storage and the concurrent collaborative processing of information.The most ripe at present, use the most widely artificial neural network " backpropagation algorithm (Back Propagation; BP) "; Artificial neural network backpropagation algorithm (Back Propagation; BP) Algorithm design mainly comprises the several aspects of transition function between input layer, latent layer, output layer and each layer, sees Fig. 3._
1), the network number of plies.The BP network can comprise different latent layers, and is verified in theory: have latent layer of network that adds a linear output layer of deviation and at least one S type, can approach any rational function.Latent experience of counting layer by layer is chosen as: generally can adopt perceptron or adaptive network to solve for linear problem, and not adopt nonlinear network, because individual layer can not have been given play to the speciality of non-linear activation function; Nonlinear problem; Two-layer or the two-layer above latent layer of general employing; But the raising of error precision in fact also can obtain through the neuron number that increases in the latent layer; Its training effect is also observed more easily and is adjusted than increasing the number of plies, so generally speaking, should pay the utmost attention to the neuron number that increases in the latent layer.2), the node number of the node number of input layer and output layer.Input layer plays memory buffer, and it receives outside input data, so its node number depends on the dimension of input vectorm, corresponding input layer also has the neuron of different numbers.The node number of output layer depends on two aspects, output data type and the required size of data of expression the type.Because the output sample of prediction is the output vector of 2 dimensions is region of interest type (like lump or micro-calcification clusters) and (between 0 and 1,0 expression is not a pathology to rank, and 1 representes to make a definite diagnosis pathology; Rank is high more, and the possibility of pathology is big more), therefore, output layer has 2 neurons._
3), the node number of hidden neuron.The node number of hidden neuron is confirmed through different neuron numbers being trained contrast, some surpluses of suitable then increase.Rule of thumb, design with reference to following formula:
In the formula:nBe the number of hidden nodes;niBe input number of nodes;n0Be the output node number;aIt is the constant between 1~10.4), transition function.The BP network structure all is to adopt S type activation function at latent layer generally speaking, and output layer adopts linear activation function._ 5), training method chooses.Adopting the additional momentum method to make backpropagation reduce network is absorbed in low ebb on the error surface possibility helps to reduce the training time.Too big learning rate causes the instability learnt, too little value to cause the extremely long training time again.Adaptive learning speed has reached rational two-forty through under the prerequisite that guarantees stable training, can reduce the training time.6), the experience of learning rate is selected.Tend to choose less learning rate generally speaking to guarantee the stability of system, the scope of choosing of learning rate is recommended between 0.01~0.8.
7), anticipation error chooses.If the target output value of sample doesTj, error function thenEFor:
E=∑(Oj-Tj)2
Connecting weight is modified to by following formula:
ΔWij(n+1)=ηδjOj+αΔWij(n)
Wherein: Δ WIj(n)=∑ Δ pWIjBe sample total
Generally speaking, as a comparison, can train the network of two different expected error value simultaneously, confirm one of them network through the consideration of composite factor at last.In sum: the BP network topology structure doesm*n* 2 structure.The transport function of hidden neuron is a S type tanTansigBecause the output rank is normalized in the interval [ 0,1 ], the method for normalizing of employing is:
The neuronic transport function of output layer adoptsPurelinFunction.The training function of network adopts the weights and the threshold values function of the variable momentum BP algorithm correction neural network of learning rateTraingdx_
Use the backpropagation algorithm neural network, need not the mathematical model of the system that sets up, as long as there are enough training samples (being obtained by experimental data or emulated data) can confirm region of interest and corresponding rank, rank is high more, possibly be lesion region more.This method is not to judge according to the experience to problem, thereby has adaptation function, has overcome the deficiency of regression methods analysis small sample data.
Through overtesting, the Application of BP neural network has obtained effect preferably in the region of interest identification of breast X-ray image.Collected 84 mammary gland cases through the internet from the database of southern good fortune Flo-Rida-Low university; 50 examples have been used training network; 34 examples are as test set; The result shows that actual interest district (suspicious focus) and prediction region of interest difference do not have statistical significance, and the related coefficient of predicted value and actual value is 0.9162.
Step 3: generate images:
Comprise that based on the current network bandwidth condition network speed and network access are broadband, wireless, dialing; With the doctor to the requirement of network laser propagation effect be Fast Compression, the region of interest number that suitably comprises at most among compression or precise compression and the every width of cloth figure, adopt the different compression mode to generate tiff image by the different stage of region of interest and non-region of interest: the region of interest rank based on step 2 is confirmed selects the high region of interest of rank to use LZW (Lempel-Ziv & Welch) coding lossless compress; Whole figure to comprising region of interest and non-area-of-interest adopts the JPEG lossy compression method.The different stage of pressing region of interest and non-region of interest like this adopts different compression mode and compression factor to generate tiff image, reaches the requirement on both side to diagnostic imaging effect and remote transmission efficient.
TIFF (full name Tagged Image File Format) image, file extent TIF by name or TIFF are to use industry standard bitmap file form the most widely.GIFf format can be stored single raster image by any color depth, also becomes bit depth (like 32 bit images) de facto standards picture format, is widely used in image processing program.GIFf format is supported multiple compress modes such as RAW, RLE, LZW, JPEG, CCITT3 group and 4 groups.
Wherein, Compression method (LZW) is non-loss property (data of image does not reduce, and promptly information can not lost in processing procedure), with the abbreviation (Lempel of three inventor's names; Ziv; Welch) name, it is right that its principle is that the value with each byte all will be made into a character with the value of next byte, and for each character to setting a code.When a same character when occurring once again, just replace this character right, and then match with this code name and next character with code name.A key character of LZW coding principle is that code not only can replace a string data with value, also can replace the data of a string different value.In view data,, also can find a code name to replace these serial datas if there are the data of some different value often to repeat to occur.Compression is reversible with dispose procedure, and all information all keep.LZW lossless compress mode can produce 2:1 or better ratio of compression, thereby source document is subdued to over half.
The JPEG lossy compression method has been utilized the characteristic of people's visual angle system, uses to quantize and lossless compression-encoding the combine redundant information of removing the visual angle and the redundant information of data itself, can significantly reduce image file size in the maintenance high-quality while of image.Use this lossy compression method method, be generally the ratio of compression of 20:1,, then can reach 60:1 or higher ratio of compression for the 8bit grayscale image for coloured image.
The advantage of tiff format is to support the high resolving power color simultaneously, and it is divided into bulk to the different piece of piece image, or perhaps data block.For each bulk portion, all preserved a sign, wherein provide bulk to look like which type of information.Block advantage is to support the software package of tiff format only need preserve the current that part of image that is presented on the screen.And also be not kept on the hard disk ability graftabl when needing by the time in images displayed part on the screen.When very large high-definition picture of editor, this characteristic is just very important.It is pointed out that in recent years the continuous development along with the Image Compression standard, tiff format is also supported the new JPEG2000 standard of releasing, can be in a width of cloth figure compatible lossless compress and lossy compression method, and support encoding region of interest.Like this, just can use JPEG2000 region of interest is adopted lossless compression-encoding, the lossy compression method coding is adopted in other zones, can realize to exceed the compressibility about 30% than JPEG.
Owing to removed DICOM markup information, noise, background and other irrelevant informations; And the Applied Digital image processing techniques is to the computing that repeatedly superposes of the tissue image curve in the specific region; Change numerical information into visual information, naked eyes can't see at all before making information and minute lesion can clearly manifest comprehensively.In addition, regulate window/level automatically to the best visual effect, optimized peripheral image,, want the image transmitted data volume, improved the readability of image thereby significantly reduced like skin and muscle layer image based on the characteristic of the region of interest of discerning.
The original DICOM image (seeing accompanying drawing two) of one width of cloth 26.6M; The tiff image size that generates after treatment is merely 523K; The compressed image file size is merely 1/50th of source document, and image space is saved and reached 98.4%, and picture quality is read than being more suitable for the doctor before handling.Even the file of size in the backcountry that does not have optical fiber, also can carry out remote transmission through mobile phone or telephone modem easily like this, for carrying out Network Transmission and remote diagnosis very advantageous conditions is provided.
With mammary X-ray digitizing remote diagnosis system is example, and Telemedicine System is made up of several parts such as community clinic or health check-up service centre, cloud processing enter, hospital diagnosis workstations.After the patient of community film making; After the pre-service of X line image process, ROI reinforcement and the processed compressed; With other information of patient; Can upload or wired cloud processing enter that is networked to via cdma wireless together like electronic health record etc., patient information can pass through ordering, encrypt and stack suspicious lesions information, and the consultation of doctors of informing network hospital expert.The expert uses the hospital diagnosis workstation to insert the cloud processing enter, accomplishes that diagnosis is identified and report generates, and passes Community Service Center back, if desired, also can provide advice, services such as audio frequency and video conversation.
Be the image comparison before and after handling like Fig. 4-8.
Specific operation process comprises:
1. community medicine outpatient service/MEC: the patient carries out the film making of breast molybdenum target X-ray through reservation, and the digitized picture of generation uploads to the cloud computing processing enter automatically through after pre-service, optimizing reinforcement, compression and encryption.
2, cloud computing processing enter: after the image behind the compress-encrypt passes to the cloud processing enter; Can be according to the analysis in like manner of anatomical knowledge and different views; Use multiple classifying and analyzing method comprise the sorting technique of artificial neural network algorithm, rule-based (rule), based on the sorting technique of discrimination threshold (threshold); Use expert system to remove wrong report false positive data (False Positive), supply decision analysis with the testing result that optimization is provided.According to demand to the image of uploading sort, after the decryption processing, application of expert system is screened suspicious focus, and the diagnosis of informing network hospital expert*Simultaneously for each hospital database provides communication, inquiry service, and the monitoring management of responsible workflow.
3. hospital backstage service management system: use medical diagnosis workstation and case image data base etc. by hospital expert, the testing result of comparison expert system provides the last diagnostic suggestion.
4. cloud computing processing enter: the last diagnostic suggestion according to the result and the hospital of expert system uploads generates diagnosis report, and passes community medicine outpatient service/MEC back.
5. community medicine outpatient service/MEC: the diagnosis report that can print return; And on the image of diagnosis report mark suspicious lesions position and relevant information; Notify patient to come to obtain the result with modes such as SMS or Emails, or directly post the address that patient stays.
If do not find suspicious focus through second step, can be without the 3rd step, and directly return to client.