oarriaga/pazPublic

NotificationsYou must be signed in to change notification settings
Fork108
Star671

Hierarchical perception library in Python for pose estimation, object detection, instance segmentation, keypoint estimation, face recognition, etc.

License

MIT license

671 stars 108 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 3,345 Commits
.github		.github
docs		docs
examples		examples
paz		paz
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Repository files navigation

(PAZ) Perception for Autonomous Systems

Hierarchical perception library in Python.

Selected examples:

PAZ is used in the following examples (links toreal-time demos and training scripts):

Probabilistic 2D keypoints	6D head-pose estimation	Object detection

Emotion classifier	2D keypoint estimation	Mask-RCNN (in-progress)

Semantic segmentation	Hand pose estimation	2D Human pose estimation

3D keypoint discovery	Hand closure detection	6D pose estimation

Implicit orientation	Attention (STNs)	Haar Cascade detector

Eigenfaces	Prototypical Networks	3D Human pose estimation

MAML

All models can be re-trained with your own data (except for Mask-RCNN, we are working on ithere).

Installation

PAZ has onlythree dependencies:Tensorflow2.0,OpenCV andNumPy.

To install PAZ with pypi run:

pip install pypaz --user

Documentation

Full documentation can be foundhttps://oarriaga.github.io/paz/.

Hierarchical APIs

PAZ can be used with three different API levels which are there to be helpful for the user's specific application.

High-level

Easy out-of-the-box prediction. For example, for detecting objects we can call the following pipeline:

frompaz.applicationsimportSSD512COCOdetect=SSD512COCO()# apply directly to an image (numpy-array)inferences=detect(image)

There are multiple high-level functions a.k.a.pipelines already implemented in PAZhere. Those functions are build using our mid-level API described now below.

Mid-level

While the high-level API is useful for quick applications, it might not be flexible enough for your specific purpose. Therefore, in PAZ we can build high-level functions using our a mid-level API.

Mid-level: Sequential

If your function is sequential you can construct a sequential function usingSequentialProcessor. In the example below we create a data-augmentation pipeline:

frompaz.abstractimportSequentialProcessorfrompazimportprocessorsaspraugment=SequentialProcessor()augment.add(pr.RandomContrast())augment.add(pr.RandomBrightness())augment.add(pr.RandomSaturation())augment.add(pr.RandomHue())# you can now use this now as a normal functionimage=augment(image)

You can also addany function not only those found inprocessors. For example we can pass a numpy function to our original data-augmentation pipeline:

augment.add(np.mean)

There are multiple functions a.k.a.Processors already implemented in PAZhere.

Using these processors we can build more complex pipelines e.g.data augmentation for object detection:pr.AugmentDetection

Mid-level: Explicit

Non-sequential pipelines can be also build by abstractingProcessor. In the example below we build a emotion classifier fromscratch using our high-level and mid-level functions.

frompaz.applicationsimportHaarCascadeFrontalFace,MiniXceptionFERimportpaz.processorsasprclassEmotionDetector(pr.Processor):def__init__(self):super(EmotionDetector,self).__init__()self.detect=HaarCascadeFrontalFace(draw=False)self.crop=pr.CropBoxes2D()self.classify=MiniXceptionFER()self.draw=pr.DrawBoxes2D(self.classify.class_names)defcall(self,image):boxes2D=self.detect(image)['boxes2D']cropped_images=self.crop(image,boxes2D)forcropped_image,box2Dinzip(cropped_images,boxes2D):box2D.class_name=self.classify(cropped_image)['class_name']returnself.draw(image,boxes2D)detect=EmotionDetector()# you can now apply it to an image (numpy array)predictions=detect(image)

Processors allow us to easily compose, compress and extract away parameters of functions. However, most processors are build using our low-level API (backend) shown next.

Low-level

Mid-level processors are mostly built from small backend functions found in:boxes,cameras,images,keypoints andquaternions.

These functions can found inpaz.backend:

frompaz.backendimportboxes,camera,image,keypoints,quaternion

For example, you can use them in your scripts to load or show images:

frompaz.backend.imageimportload_image,show_imageimage=load_image('my_image.png')show_image(image)

Additional functionality

PAZ hasbuilt-in messages e.g.Pose6D for an easier data exchange with other frameworks such asROS.
There are customcallbacks e.g. MAP evaluation for object detectors while training.
PAZ comes withdata loaders for the multiple datasets:OpenImages,VOC,YCB-Video,FAT,FERPlus,FER2013,CityScapes.
We have an automaticbatch creation and dispatching wrappers for an easy connection between youpipelines and tensorflow generators. Please look at thetutorials for more information.

Models

The following models are implemented in PAZ and they can be trained with your own data:

Task (link to implementation)	Model (link to paper)
Object detection	SSD-300
Object detection	SSD-512
Probabilistic keypoint est.	Gaussian Mixture CNN
Detection and Segmentation	MaskRCNN (in progress)
Keypoint estimation	HRNet
Semantic segmentation	U-NET
6D Pose estimation	Pix2Pose
Implicit orientation	AutoEncoder
Emotion classification	MiniXception
Discovery of Keypoints	KeypointNet
Keypoint estimation	KeypointNet2D
Attention	Spatial Transformers
Object detection	HaarCascades
2D Human pose estimation	HigherHRNet
3D Human pose estimation	Simple Baseline
Hand pose estimation	DetNet
Hand closure classification	IKNet
Hand detection	SSD512
Few-shot classification	Prototypical Networks
Few-shot classification	Model Agnostic Meta Learning (MAML)

Motivation

Even though there are multiple high-level computer vision libraries in different deep learning frameworks, I felt there was not a consolidated deep learning library for robot-perception in my framework of choice (Keras).

As a final remark, I would like to mention, that I feel that we might tend to forget the great effort and emotional status behind every (open-source) project.I feel it's easy to blurry a company name with the individuals behind their work, and we forget that there is someone feeling our criticism and our praise.Therefore, whatever good code you can find here, is all dedicated to the software-engineers and contributors of open-source projects like Pytorch, Tensorflow and Keras.You put your craft out there for all of us to use and appreciate, and we ought first to give you our thankful consideration.

Why the namePAZ?

The name PAZ satisfies it's theoretical definition by having it as an acronym forPerception for Autonomous Systems where the letter S is replaced for Z in order to indicate that for "System" we mean almost anything i.e. Z being a classical algebraic variable to indicate an unknown element.

Tests and coverage

Continuous integration is managed troughgithub actions usingpytest.You can then check for the tests by running:

pytest tests

Test coverage can be checked usingcoverage.You can install coverage by calling:pip install coverage --userYou can then check for the test coverage by running:

coverage run -m pytest tests/coverage report -m

Citation

If you use PAZ please consider citating it. You can also find our paper herehttps://arxiv.org/abs/2010.14541.

@misc{arriaga2020perception,title={Perception for Autonomous Systems (PAZ)},author={Octavio Arriaga and Matias Valdenegro-Toro and Mohandass Muthuraja and Sushma Devaramani and Frank Kirchner},year={2020},eprint={2010.14541},archivePrefix={arXiv},primaryClass={cs.CV}}

Funding

PAZ is currently developed in theRobotics Group of theUniversity of Bremen, together with theRobotics Innovation Center of theGerman Research Center for Artificial Intelligence (DFKI) inBremen.PAZ has been funded by the German Federal Ministry for Economic Affairs and Energy and theGerman Aerospace Center (DLR).PAZ been used and/or developed in the projectsTransFIT andKiMMI-SF.