Robust pdf loading for empty pages by@lolipopshock in#115
fix to issue#94 -- avoiding TesseractAgent.detect() inferring any sequence of digit as float by@K-for-Code in#95
Better layout comparison by@lolipopshock in#128
Better visualization functions by@lolipopshock in#129

Example Updates

Minor update to Deep Learning Parser example notebook by@Jim-Salmons in#56
Setinplace to True in sorting function by@yusanshi in#104
Add notebook for customizing LayoutParser Models with Label Studio Annotation by@lolipopshock in#124

New Contributors

@Jim-Salmons made their first contribution in#56
@yusanshi made their first contribution in#104
@K-for-Code made their first contribution in#95

Full Changelog:v0.3.2...v0.3.3

Contributors

Jim-Salmons, lolipopshock, and 2 other contributors

Assets2

0Join discussion

v0.3.2: Patch Release

23 Sep 17:36

lolipopshock

v0.3.2

29fb2fb

Compare

Choose a tag to compare

v0.3.2: Patch Release

Important fixes for multibackend layout model support:

Resolves the issues mentioned in#78 with other fixes to improve the multibackend layout model support#79
Better tests for different backends#79 for preventing future related issues

Assets2

v0.3.1: Patch Release

15 Sep 20:58

lolipopshock

v0.3.1

867b89e

Compare

Choose a tag to compare

v0.3.1: Patch Release

Fixes for automatically settinglabel_map inDetectron2LayoutModel#75
Remove unnecessary class annotations (that might breaks Python 3.6 users)#75

Assets2

v0.3.0: Multi-backend Support, Additional Models, Better Visualizations, and many more

13 Sep 19:44

lolipopshock

v0.3.0

73e3015

This commit was created on GitHub.com and signed with GitHub’sverified signature. The key has expired.

GPG key ID:4AEE18F83AFDEB23

Expired

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

v0.3.0: Multi-backend Support, Additional Models, Better Visualizations, and many more

We are excited to release LayoutParser v0.3.0, with a lot of exciting updates and functional improvements.

New Features

The biggest change in this version is that LayoutParser now supports multiple deep learning backends: Detectron2, effdet, and paddledetection. This allows for more flexible usage of thelayoutparser library, and makes it easier for implementing customized layout models in the future.#54 #67
Additionally, the newly addedAutoModel and improved model configuration parsing makes it easier load and use the layout detection models.#69
- e.g,model = lp.AutoLayoutModel("lp://efficientdet/PubLayNet").
To support this multi-backend framework, we implement the dynamic importing mechanism as well as better ways for installinglayoutparser and the needed dependencies (seeinstructions).#65 #68

And nowlayoutparser supports directly loading PDF files into aslayout objects:#71

importlayoutparseraslppdf_layout,pdf_images=lp.load_pdf("path/to/pdf",load_images=True)lp.draw_box(pdf_images[0],pdf_layout[0])

To support more flexible processing of the layout objects, a set of new toolkits are available:#72

importlayoutparseraslppage_layout=lp.load_pdf("tests/fixtures/io/example.pdf")[0]pdf_lines=lp.simple_line_detection(page_layout)

New Models

Add MFD model that can detect (display) equation regions within scientific documents#59

Assets2

Layout Parser v0.2.0: New features, models, and improvements!

12 Apr 19:59

lolipopshock

v0.2.0

b1dc7f7

Compare

Choose a tag to compare

Layout Parser v0.2.0: New features, models, and improvements!

Layout Parser v0.2.0 Release Notes

New Features

Support for loading and exporting the layout data injson andcsv , see#6
Add support forunion andintersect operations, see#20 and thedetailed explanation

Improvements

Functional improvements:
1. When loading Layout Parser official models,Detectron2LayoutModel can automatically detect the label_map, . For example,
```
model=lp.Detectron2LayoutModel("lp://HJDataset/faster_rcnn_R_50_FPN_3x/config")model.label_map# {1: 'Page Frame', ... }
```
2. Detectron2LayoutModel now supports theenforce_cpu flag that enforces using cpu even when CUDA devices are available.
3. Forvisualization.draw_box, it now supports ashow_element_type flag that shows the bbox category name on the top left corner of the layout objects.
Improve installation command and documentation, especially for installing Detectron2 on Windows platforms#25

New Models

Add the table bank detection models that can identify table regions

Fixes

Fix the incorrectlayout issue mentioned in#9 - Thanks to@remidbs.
Fix the some of the dependency issues mentioned in#11 and#13 by usingiopath instead offvcore. See#18, Thanks to@edisongustavo.

Assets2

New models and bug fixes

21 Dec 21:57

lolipopshock

v0.1.3

dee2594

Compare

Choose a tag to compare

New models and bug fixes

Improvements:

Supports lazy loading for the Detectron2 module. Now the dependency for Detectron2 will be requested only when you explicitly create aDetectron2LayoutModel object. This might be helpful for using the plainlayoutparser library without installing the Detectron2 module.

New models:

Incorporated a pre-trained model based on theNewspaperNavigator dataset:lp://NewspaperNavigator/faster_rcnn_R_50_FPN_3x/config

Fixes:

Corrected a bug in visualization that might overwrite original the image

Assets2

New models and improvements

30 Oct 05:45

lolipopshock

v0.1.2

8d1a6f0

Compare

Choose a tag to compare

New models and improvements

In this version, we released a new model for publaynet and made several improvements:

We released themask_rcnn_X_101_32x8d_FPN_3x model trained on thepublaynet dataset. Note: it's been trained on the full training set (while others are only trained on the validation set), and you could expect a 15% performance improvement based on this new model.
We improved the support for PIL images for both layout modeling and visualization
We improved the Default Language Settings for the Tesseract OCR model

Assets2

Model fixes and updates

16 Jul 23:15

lolipopshock

v0.1.1

e37d5d9

Compare

Choose a tag to compare

Model fixes and updates

Fixes

Fixed a bug that could cause errors in loading Prima Models

Updates

Update the prima MASK RCNN model with higher accuracy, and listed detailed evaluation reports.

Assets2

v0.1.0: The foundation version that covers four major functionalities

24 Jun 05:26

lolipopshock

v0.1.0

1f7c029

Compare

Choose a tag to compare

v0.1.0: The foundation version that covers four major functionalities

layoutparser now supports the following functionalities:

Coordinate system:
- Supports the 3 basic coordinate system and their geometric relationships
- Supports the TextBlook and Layout system for convenient coordinate and text processing
OCR System:
- Supports OCR based on Google Cloud Vision and Tesseract API.
Layout Modeling:
- Supports using pre-trained Deep Learning models for layout object detection using Detection2
Visualization:
- Supports highly-customizable presentation of the box coordinates and text in the detected layout

Assets2

Movatterモバイル変換

Releases: Layout-Parser/layout-parser

v0.3.4: Patch Release

Bug fixes

Contributors

Uh oh!

v0.3.3: Patch Release

Functional Updates

Example Updates

New Contributors

Contributors

Uh oh!

v0.3.2: Patch Release

Uh oh!

v0.3.1: Patch Release

Uh oh!

v0.3.0: Multi-backend Support, Additional Models, Better Visualizations, and many more

New Features

New Models

Uh oh!

Layout Parser v0.2.0: New features, models, and improvements!

Layout Parser v0.2.0 Release Notes

New Features

Improvements

New Models

Fixes

Uh oh!

New models and bug fixes

Uh oh!

New models and improvements

Uh oh!

Model fixes and updates

Fixes

Updates

Uh oh!

v0.1.0: The foundation version that covers four major functionalities

Uh oh!