WenmuZhou/DBNet.pytorch

A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization

License

Apache-2.0 license

Folders and files

base
config
data_loader
datasets
imgs/paper
models
post_processing
test
tools
trainer
utils
.gitattributes
.gitignore
LICENSE.md
README.MD
environment.yml
eval.sh
generate_lists.sh
multi_gpu_train.sh
predict.sh
requirement.txt
singlel_gpu_train.sh

Real-time Scene Text Detection with Differentiable Binarization

Note: some code is inherited from MhLiao/DB.

Walkthrough in Chinese

update

2020-06-07: Added grayscale-image training. When training on grayscale images, remove dataset.args.transforms.Normalize from the config.
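
The grayscale note above amounts to dropping one entry from the list of transforms. A minimal sketch, assuming the dataset config is loaded as a nested dict in which each transform is an entry with a `type` key; the example fragment and the helper name `remove_normalize` are illustrative assumptions, not code from this repository.

```python
import copy


def remove_normalize(dataset_cfg):
    """Return a copy of a dataset config without the Normalize transform.

    Assumes dataset_cfg looks like {'args': {'transforms': [{'type': ...}, ...]}}.
    The input dict is left untouched.
    """
    cfg = copy.deepcopy(dataset_cfg)
    transforms = cfg["args"]["transforms"]
    cfg["args"]["transforms"] = [t for t in transforms if t.get("type") != "Normalize"]
    return cfg


# Hypothetical config fragment, for illustration only.
example = {
    "args": {
        "transforms": [
            {"type": "ToTensor", "args": {}},
            {"type": "Normalize", "args": {"mean": [0.485, 0.456, 0.406],
                                           "std": [0.229, 0.224, 0.225]}},
        ]
    },
}

gray_cfg = remove_normalize(example)
print([t["type"] for t in gray_cfg["args"]["transforms"]])
```

Editing the YAML file by hand achieves the same thing; the sketch only shows which entry is expected to go.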

Install Using Conda

conda env create -f environment.yml
git clone https://github.com/WenmuZhou/DBNet.pytorch.git
cd DBNet.pytorch/

Install Manually

conda create -n dbnet python=3.6
conda activate dbnet
conda install ipython pip

# python dependencies
pip install -r requirement.txt

# install PyTorch with cuda-10.1
# Note that you can change the cudatoolkit version to the version you want.
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch

# clone repo
git clone https://github.com/WenmuZhou/DBNet.pytorch.git
cd DBNet.pytorch/

Requirements

pytorch 1.4+
torchvision 0.5+
gcc 4.9+
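
One way to check an environment against the minimum versions listed above is to compare version strings numerically. The sketch below is an assumption about how you might do that; `version_tuple` and `meets_minimum` are hypothetical helpers and are not part of this repository.

```python
import re


def version_tuple(version):
    """Turn a string such as '1.4.0+cu101' into (1, 4, 0).

    Anything after '+' (a local build tag) is ignored, and only the
    first three numeric fields are kept.
    """
    numbers = re.findall(r"\d+", version.split("+")[0])
    return tuple(int(n) for n in numbers[:3])


def meets_minimum(installed, minimum):
    """True if the installed version is at least the minimum version."""
    return version_tuple(installed) >= version_tuple(minimum)


print(meets_minimum("1.4.0+cu101", "1.4"))
print(meets_minimum("0.4.1", "0.5"))
```

In a real environment you would pass `torch.__version__` and `torchvision.__version__` as the installed versions.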

Download

TBD

Data Preparation

Training data: prepare a text file train.txt in the following format, using '\t' as the separator

./datasets/train/img/001.jpg	./datasets/train/gt/001.txt

Validation data: prepare a text file test.txt in the following format, using '\t' as the separator

./datasets/test/img/001.jpg	./datasets/test/gt/001.txt

Store images in the img folder
Store ground truth in the gt folder
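
The list files described above can also be produced with a few lines of Python. This is a sketch under one assumption: every image has a ground-truth file with the same stem (001.jpg pairs with 001.txt, as in the example lines). The helper names are hypothetical, and the repository's own generate_lists.sh may work differently.

```python
import os


def build_list_lines(img_dir, gt_dir, img_exts=(".jpg", ".jpeg", ".png")):
    """Pair each image with a ground-truth .txt file of the same stem.

    Returns 'image_path<TAB>gt_path' strings. Files that are not images
    and images without a matching label file are skipped.
    """
    lines = []
    for name in sorted(os.listdir(img_dir)):
        stem, ext = os.path.splitext(name)
        if ext.lower() not in img_exts:
            continue
        gt_path = os.path.join(gt_dir, stem + ".txt")
        if os.path.isfile(gt_path):
            lines.append(os.path.join(img_dir, name) + "\t" + gt_path)
    return lines


def write_list(lines, out_path):
    """Write one tab-separated pair per line."""
    with open(out_path, "w", encoding="utf-8") as f:
        f.write("\n".join(lines) + "\n")
```

For example, `write_list(build_list_lines("./datasets/train/img", "./datasets/train/gt"), "./datasets/train.txt")` would write the training list; the output location is an assumption, so use whatever path your config points to.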

The ground truth can be .txt files, with the following format:

x1, y1, x2, y2, x3, y3, x4, y4, annotation
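
A ground-truth line in this format can be parsed as follows. This is a minimal sketch, not the repository's loader: `parse_gt_line` is a hypothetical helper, and it assumes that everything after the eighth comma is the annotation, so that text containing commas survives.

```python
def parse_gt_line(line):
    """Split 'x1, y1, ..., x4, y4, annotation' into (points, text).

    Splitting at most 8 times keeps commas inside the annotation intact.
    A leading byte-order mark, if present, is dropped.
    """
    parts = line.strip().lstrip("\ufeff").split(",", 8)
    if len(parts) < 9:
        raise ValueError("expected 8 coordinates and an annotation: %r" % line)
    coords = [float(p) for p in parts[:8]]
    points = [(coords[i], coords[i + 1]) for i in range(0, 8, 2)]
    return points, parts[8].strip()


# Made-up sample line, for illustration only.
points, text = parse_gt_line("377, 117, 463, 117, 465, 130, 378, 130, Hello, world")
print(points, text)
```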

Train

Set dataset['train']['dataset']['data_path'] and dataset['validate']['dataset']['data_path'] in config/icdar2015_resnet18_fpn_DBhead_polyLR.yaml
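
If you prefer to set these two keys programmatically, the change looks like this once the YAML has been loaded into a dict. This is only a sketch: it follows the key path exactly as written above, `set_data_paths` is a hypothetical helper, and the list-file locations are placeholders. Whether `data_path` expects a single path or a list of paths is not stated here, so check the config file.

```python
import copy


def set_data_paths(config, train_list, val_list):
    """Return a copy of the config with both data_path entries replaced."""
    cfg = copy.deepcopy(config)
    cfg["dataset"]["train"]["dataset"]["data_path"] = train_list
    cfg["dataset"]["validate"]["dataset"]["data_path"] = val_list
    return cfg


# Skeleton of the relevant part of the config (other keys omitted).
base_cfg = {
    "dataset": {
        "train": {"dataset": {"data_path": None}},
        "validate": {"dataset": {"data_path": None}},
    }
}

# Placeholder paths: point these at your own train.txt and test.txt.
cfg = set_data_paths(base_cfg, "./datasets/train.txt", "./datasets/test.txt")
print(cfg["dataset"]["train"]["dataset"]["data_path"])
print(cfg["dataset"]["validate"]["dataset"]["data_path"])
```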

Single-GPU training

bash singlel_gpu_train.sh

Multi-GPU training

bash multi_gpu_train.sh

Test

eval.py is used to evaluate the model on the test dataset

Set model_path in eval.sh
Use the following script to test

bash eval.sh

Predict

predict.py can be used to run inference on all images in a folder

Set model_path, input_folder and output_folder in predict.sh
Use the following script to predict

bash predict.sh

You can change the model_path in the predict.sh file to your model location.

Tip: if the results are not good, try changing thre in predict.sh
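
This README does not say what thre controls. The sketch below assumes it is a minimum confidence score for keeping a detected box, which you should confirm in predict.py before relying on it. Under that assumption, lowering thre keeps more boxes (more detections, more false positives) and raising it keeps fewer. The boxes, scores and the `filter_boxes` helper are made up for illustration.

```python
def filter_boxes(boxes, scores, thre):
    """Keep the boxes whose confidence score is at least thre."""
    return [box for box, score in zip(boxes, scores) if score >= thre]


# Made-up detections: each box is four (x, y) corner points.
boxes = [
    [(10, 10), (60, 10), (60, 30), (10, 30)],
    [(15, 50), (90, 50), (90, 70), (15, 70)],
    [(5, 80), (40, 80), (40, 95), (5, 95)],
]
scores = [0.92, 0.55, 0.31]

for thre in (0.3, 0.5, 0.7):
    kept = filter_boxes(boxes, scores, thre)
    print("thre=%.1f keeps %d box(es)" % (thre, len(kept)))
```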

The project is still under development.

Performance

ICDAR 2015

Trained only on the ICDAR2015 dataset

Method	Image size (short side)	Learning rate	Precision (%)	Recall (%)	F-measure (%)	FPS
SynthText-Deform-ResNet-18 (paper)	736	0.007	86.8	78.4	82.3	48
ImageNet-resnet18-FPN-DBHead	736	1e-3	87.03	75.06	80.6	43
ImageNet-Deform-Resnet18-FPN-DBHead	736	1e-3	88.61	73.84	80.56	36
ImageNet-resnet50-FPN-DBHead	736	1e-3	88.06	77.14	82.24	27
ImageNet-resnest50-FPN-DBHead	736	1e-3	88.18	76.27	81.78	27
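
As a sanity check on the table, the F-measure column can be recomputed from precision and recall, assuming it is the usual harmonic mean F = 2PR / (P + R). Small differences from the reported values are expected, because the table's precision and recall are already rounded. The `f_measure` helper is written for this illustration only.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F-measure) copied from three table rows.
rows = {
    "ImageNet-resnet18-FPN-DBHead": (87.03, 75.06, 80.6),
    "ImageNet-resnet50-FPN-DBHead": (88.06, 77.14, 82.24),
    "ImageNet-resnest50-FPN-DBHead": (88.18, 76.27, 81.78),
}

for name, (p, r, reported) in rows.items():
    print(name, "recomputed:", round(f_measure(p, r), 2), "reported:", reported)
```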

examples

TBD

todo

multi-GPU training

reference

If this repository helps you, please star it. Thanks.
