ivanDonadello/Visual-Relationship-Detection-LTN


This repository contains the dataset, the source code and the models for the detection of visual relationships with Logic Tensor Networks.

Introduction

Semantic Image Interpretation is the task of extracting a structured semantic description from images. This requires the detection of visual relationships: triples (subject, relation, object) describing a semantic relation between the bounding box of a subject and the bounding box of an object. Here, we perform the detection of visual relationships by using Logic Tensor Networks (LTNs), a novel Statistical Relational Learning framework that exploits both the similarities with other seen relationships and background knowledge, expressed with logical constraints between subjects, relations and objects. The experiments are conducted on the Visual Relationship Dataset (VRD).
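To make the triple representation concrete, here is a minimal sketch in plain Python (the names BBox, subj, obj and triple are our own illustration, not the repository's API):

from collections import namedtuple

# A labelled bounding box: corner coordinates plus the detected class.
BBox = namedtuple("BBox", ["x1", "y1", "x2", "y2", "label"])

# A visual relationship is a (subject, relation, object) triple over two
# bounding boxes, e.g. "person wears shirt".
subj = BBox(48, 32, 190, 300, "person")
obj = BBox(60, 210, 170, 260, "shirt")
triple = (subj, "wear", obj)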

A detailed description of the work is provided in our paper Compensating Supervision Incompleteness with Prior Knowledge in Semantic Image Interpretation, published at IJCNN 2019:

@inproceedings{donadello2019compensating,
  author    = {Ivan Donadello and Luciano Serafini},
  title     = {Compensating Supervision Incompleteness with Prior Knowledge in Semantic Image Interpretation},
  booktitle = {{IJCNN}},
  pages     = {1--8},
  publisher = {{IEEE}},
  year      = {2019}
}

A video showing a demo of the system is available here.

Using the Source Code

  • The data folder contains the LTN encoding of the VRD training and test sets, the ontology that defines the logical constraints (an illustrative constraint is sketched after this list) and the images of the VRD test set. Images and their annotations can be downloaded from https://cs.stanford.edu/people/ranjaykrishna/vrd/.
  • The models folder contains the trained grounded theories of the experiments.
  • The Visual-Relationship-Detection-master folder contains the object detector model and the evaluation code provided in https://github.com/Prof-Lu-Cewu/Visual-Relationship-Detection for the evaluation of the phrase, relationship and predicate detection tasks on the VRD.
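As an illustrative example of such background knowledge (our own formula, not necessarily one from the repository's ontology), a constraint could state that only persons can wear things:

forall x, y : wear(x, y) -> person(x)

LTNs treat such constraints as soft: groundings that violate them lower the satisfiability being maximized during training, which is how the background knowledge compensates for missing annotations.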

Requirements

We train and test the grounded theories with the following software configuration; more recent versions of these libraries may also work. A sketch of installing the pinned versions follows the list:

  • Ubuntu 14.04;
  • Matlab R2014a;
  • Python 2.7.6;
  • TensorFlow 0.11.0;
  • Numpy 1.13.1;
  • Scikit-learn 0.18.1;
  • Matplotlib 1.5.1.
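For reference, installing the pinned Python dependencies could look as follows (a sketch: TensorFlow 0.11.0 was distributed as platform-specific wheels rather than through PyPI, so its URL depends on your platform):

$ pip install numpy==1.13.1 scikit-learn==0.18.1 matplotlib==1.5.1
$ pip install <URL of the TensorFlow 0.11.0 wheel for your platform>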

Training a grounded theory

To train a grounded theory, run the following command:

$ python train.py
  • The trained grounded theories are saved in the models folder in the files KB_nc_2500.ckpt (no constraints) and KB_wc_2500.ckpt (with constraints). The number in the filename (2500) is a parameter in the code that sets the number of training iterations.
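The checkpoints are ordinary TensorFlow files. A minimal sketch of how such .ckpt files are produced with TensorFlow 0.11's Saver API (illustrative only, not the repository's actual training code):

import tensorflow as tf

# Stand-in for the LTN parameters that train.py actually learns.
weights = tf.Variable(tf.zeros([1]), name="ltn_parameters")

NUM_ITERATIONS = 2500  # the iteration count that appears in the checkpoint name

saver = tf.train.Saver()
with tf.Session() as sess:
    sess.run(tf.initialize_all_variables())  # TF 0.11 API
    # ... the training loop would run for NUM_ITERATIONS steps here ...
    # the models folder must already exist
    saver.save(sess, "models/KB_wc_%d.ckpt" % NUM_ITERATIONS)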

Evaluating the grounded theories

To run the evaluation, use the following commands:

$ python predicate_detection.py
$ python relationship_phrase_detection.py

Then, launch Matlab, move into the Visual-Relationship-Detection-master folder and execute the scripts predicate_detection_LTN.m and relationship_phrase_detection_LTN.m to see the results.
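The whole evaluation can also be scripted in one pass. A sketch, assuming matlab is on your PATH (the batch-mode flags are standard Matlab CLI; the script names are the ones above):

$ python predicate_detection.py
$ python relationship_phrase_detection.py
$ cd Visual-Relationship-Detection-master
$ matlab -nodesktop -nosplash -r "predicate_detection_LTN; relationship_phrase_detection_LTN; exit"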
