Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.

License

NotificationsYou must be signed in to change notification settings

miendinh/VietnameseOCR

Repository files navigation

Apply Deep Learning ( CNN networks ) to train a model uses for recognizing Vietnamese characters, it works well with Latin characters.

Dataset in big image ( 10.000 samples, 2800 x 2800 pixel)

Requirements

python 3.6.5tensorflowPIL

Model Summary

LayerShapeKernelStridePadding
INPUT[28, 28, 1]
CONV1[3, 3, 32, 32][1, 1]SAME
POOL1
CONV2[3, 3, 32, 64][1, 1]SAME
POOL2
CONV3[3, 3, 64, 128][1, 1]SAME
POOL3
FC1
FC2[625, 190]

Results

Training.........Epoch: 38 cost = 0.312853018Epoch: 39 cost = 0.298816641Epoch: 40 cost = 0.293328794Evaluation------------------------------Test Accuracy: 0.974867469544

Training

Prepare dataset for training

git clone https://github.com/miendinh/VietnameseOCR.gitcd VietnameseOCR/data/train/charactersunzip dataset.zip

Let's train.

python train.py

Create you own dataset

Prepare fonts for generating text-image
  • You could add more fonts
cd VietnameseOCR/data/train/charactersunzip google.zipunzip win.zip
Create font list, then save it in fonts.list
source ./list.sh
Generate Text Image Dataset
python generate_data.py

Play with pretrained model

  • All pretrained weights of model is save to file vocr.brain
  • Let's test with random character in dataset
python predict.py

Further working

  • Character classification.
  • Dataset augmentation.
  • Improve accuracy.
  • Text location.
  • Text recognition.
  • Apply NLP for spell checking.

References

  1. STN-OCR: A single Neural Network for Text Detection and Text Recognition
  2. Automatic Dataset Augmentation
  3. VGG16 implementation in TensorFlow
  4. Vietnamese Dict (VietOCR3)

Author mien.hust [at] gmail [dot] com

About

Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp