This repository was archived by the owner on Oct 18, 2022. It is now read-only.

wikke/Tianchi-Medical-LungTumorDetectPublic archive

NotificationsYou must be signed in to change notification settings
Fork152
Star428

天池医疗AI大赛[第一季]：肺部结节智能诊断 UNet/VGG/Inception/ResNet/DenseNet

428 stars 152 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
assets		assets
train_ipynbs		train_ipynbs
.gitignore		.gitignore
README.md		README.md
config.py		config.py
generators.py		generators.py
model_DenseNet.py		model_DenseNet.py
model_Inception.py		model_Inception.py
model_ResNet.py		model_ResNet.py
model_UNet.py		model_UNet.py
model_VGG.py		model_VGG.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train_classification.py		train_classification.py
train_segmentation.py		train_segmentation.py
visual_utils.py		visual_utils.py

Repository files navigation

阿里云天池医疗大赛·肺结节检测

Features

3D Segmentation & Classification with Keras
Finepreprocessing with scikit-image
Finevisualization for clarification
Modified UNet forsegmentation
Modified VGG/Inception/ResNet/DenseNet forclassification ensemble
Finehyperparameter tunning with both models and training process.

Code Hierarchy

- config.py # good practice to centralize hyper parameters- preprocess.py # Step 1, preprocess, store numpy/meta 'cache' at ./preprocess/- train_segmentation.py # Step 2, segmentation with UNet Model- model_UNet.py # UNet model definition- train_classificaion.py # Step 3, classificaiton with VGG/Inception/ResNet/DenseNet- model_VGG.py # VGG model definition- model_Inception.py # Inception model definition- model_ResNet.py # ResNet model definition- model_DenseNet.py # DenseNet model definition- generators.py # generator for segmentation & classificaiton models- visual_utils.py # 3D visual tools- dataset/ # dataset, changed in config.py- preprocess/ # 'cache' preprocessed numpy/meta data, changed in config.py- train_ipynbs # training process notebooks

Preprocess

useSimpleITK to read CT files, process, and store into cache with numpy arrays
process withscikit-image lib, try lots of parameters for best cutting
- binarized
- clear-board
- label
- regions
- closing
- dilation
collect all meta information(seriesuid, shape, file_path, origin, spacing, coordinates, cover_ratio, etc.) and store inONE cache file for fast training init.
see preprocessing in/train_ipynbs/preprocess.ipynb file

Distribution of the lung part takes on a whole CT.

Tumor size distribution

Segmentation

Asimplified and full UNet both tested.
dice_coef_loss as loss function.
Periodically evaluate model withlots of metrics, which helps a lot to understand the model.
30% of negative sample, which has no tumor, for generalization.
Due to memory limitation, 16 batch size used.

Classification

VGG

A simplified and full VGG model both tested. Use simplified VGG as baseline.

Pictures tells that:hyperparameter tunning really matters.

Inception

A simplified Inception-module based network, with each block has 4-5 different type of conv.
- 1*1*1depth-size seperable conv
- 1*1*1depth-size seperable conv, then 3*3*3 conv_bn_relu
- 1*1*1depth-size seperable conv, then 2 3*3*3 conv_bn_relu
- AveragePooling3D, then 1*1*1depth-size seperable conv
- (optional in config) 1*1*1depth-size seperable conv, and (5, 1, 1), (1, 5, 1), (1, 1, 5)spatial separable convolution
- Concatenate above.

ResNet

usebottleneck block instead ofbasic_block for implementation.
Abottleneckresidual block consists of:
- (1, 1, 1) conv_bn_relu
- (3, 3, 3) conv_bn_relu
- (1, 1, 1) conv_bn_relu
- (optional in config) kernel_size=(3, 3, 3), strides=(2, 2, 2) conv_bn_relu for compression.
- Add(not Concatenate) with input
LeaveRESNET_BLOCKS as config to tune

DenseNet

DenseNet draws tons of experience from origin paper.https://arxiv.org/abs/1608.06993
- 3 dense_block with 5 bn_relu_conv layers according to paper.
- transition_block after every dense_block, expcet the last one.
- Optional config forDenseNet-BC(paper called it):1*1*1 depth-size seperable conv, andtransition_block compression.

Fine Tunning & Experience Got

Learning rate:3e-5 works well for UNet,1e-4 works well for classification models.
Due to memory limitation, 16 batch size used.
Data Augumentation: shift, rotate, etc.
Visualization cannot be more important!!!
coord(x, y, z) accord to (width, height, depth), naughty bugs.
Put all config in one file save tons of time. Make everything clean and tidy
Disk read is bottle neck. Read fromSSD.
Different runs has different running log dirs, for better TensorBoard visualization. Make it like/train_logs/<model-name>-run-<hour>-<minute>.
Lots ofdebug options in config file.
4 times probability strengthened for tumors < 10mm, 3 for tumor > 10mm and < 30mm, keep for > 30mm. Give more focus on small tumors, like below.

About

天池医疗AI大赛[第一季]：肺部结节智能诊断 UNet/VGG/Inception/ResNet/DenseNet

Releases

No releases published

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!