arunimasundar/Supervised-Learning-of-ProceduresPublic

NotificationsYou must be signed in to change notification settings
Fork1
Star0

License

Apache-2.0 license

0 stars 1 fork Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Action		Action
Pose		Pose
Tracking		Tracking
chunks		chunks
converted		converted
dataset		dataset
ocr frames		ocr frames
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
academic_main.py		academic_main.py
academic_video_output.txt		academic_video_output.txt
action_output_befspeech.txt		action_output_befspeech.txt
comp.py		comp.py
demo_output.txt		demo_output.txt
exercise_main.py		exercise_main.py
exercise_video_output.txt		exercise_video_output.txt
extract.py		extract.py
hk_sq_pu_jj_merge.mp4		hk_sq_pu_jj_merge.mp4
main.py		main.py
ocr_notremoved.txt		ocr_notremoved.txt
origin_data.txt		origin_data.txt
output.txt		output.txt
speechtotext.py		speechtotext.py
speechtotext.txt		speechtotext.txt
sumvid.mp4		sumvid.mp4
utils.py		utils.py

Repository files navigation

Supervised Learning of Procedures from Tutorial Videos

Introduction

This system produces a set of comprehensible, step-by-step instructions from video input. The videos can be of two categories: Academic Videos and Exercise Videos. To enable extracting procedures from these videos, Text Recognition, Speech Recognition and Action Recognition are used.

Dependencies

python >= 3.5
Opencv >= 3.4.1
scikit-learn==0.22.2
tensorflow & keras
numpy & scipy
pathlib
pytesseract
SpeechRecognition

Usage

Download the graph models fromhere, and place it under the Pose folder.
python main.py e videoname.mp4, willperform action and speech recognition and write the actions and speech recognized from the video toexercise_video_output.txt.
python main.py e webcam, willperform action recognition in realtime.
python main.py a videoname.mp4, willperform text and speech recognition and write the text and speech recognized from the video toacademic_video_output.txt.

Training with own dataset

prepare data(actions) by runningmain.py, remember touncomment the code of data collecting, the origin data will be saved as a.txt.
transforming the.txt to.csv, you can use EXCEL to do this.
do the training with thetrain.py inAction/training/, remember tochange the action_enum and output-layer of model.

Test result

Check exercise_video_output.txt for exercise videos and academic_video_output.txt for academic videos.

Acknowledge

Thanks to the following works:

Online-Realtime-Action-Recognition-based-on-OpenPose

About

No description, website, or topics provided.

Releases

No releases published

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Supervised Learning of Procedures from Tutorial Videos

Introduction

Dependencies

Usage

Training with own dataset

Test result

Acknowledge

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

Movatterモバイル変換

License

arunimasundar/Supervised-Learning-of-Procedures

Folders and files

Latest commit

History

Repository files navigation

Supervised Learning of Procedures from Tutorial Videos

Introduction

Dependencies

Usage

Training with own dataset

Test result

Acknowledge

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

Packages