alipsgh/tornadoPublic

NotificationsYou must be signed in to change notification settings
Fork30
Star130

The Tornado 🌪️ framework, designed and implemented for adaptive online learning and data stream mining in Python.

License

MIT license

130 stars 30 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
archiver		archiver
classifier		classifier
data_streams		data_streams
data_structures		data_structures
dictionary		dictionary
drift_detection		drift_detection
evaluators		evaluators
filters		filters
graphic		graphic
img		img
plotter		plotter
streams		streams
tasks		tasks
tutorial_img		tutorial_img
.DS_Store		.DS_Store
LICENSE.txt		LICENSE.txt
README.md		README.md
_config.yml		_config.yml
github_generate_stream.py		github_generate_stream.py
github_prequential_multi_test.py		github_prequential_multi_test.py
github_prequential_test.py		github_prequential_test.py

Repository files navigation

The Tornado Framework

Tornado is a framework for data stream mining in Python. The framework includes various incremental/online learning algorithms as well as concept drift detection methods.

You must have Python 3.5 or above (either 32-bit or 64-bit) on your system to run the framework without any error. Note that thenumpy,scipy,matplotlib, andpympler packages are used in the Tornado implementations. You may use thepip command in order to install these packages, for example:

pip install numpy

Although you can use an installer fromhttps://www.python.org/downloads/ to install Python on your system, I highly recommendAnaconda, one of the Python distributions, since it includes thenumpy,scipy, andmathplotlib packages by default. You may download one of the Anaconda's installers fromhttps://www.anaconda.com/download/. Please note that, you still need to install thepympler package for Anaconda. For that, run the following command in a command prompt or a terminal:

conda install -c conda-forge pympler

Once you have all the packages installed, you may run the framework.

Three sample codes are prepared to show how you can use the framework. Those files are:

github_prequential_test.py - This file lets you evaluate an adaptive algorithm, i.e. a pair of a learner and a drift detector, prequentially. In this example, Naive Bayes is the learner and Fast Hoeffding Drift Detection Method (FHDDM) is the detector. You find lists of incremental learners intornado/classifier/ and drift detectors intornado/drift_detection/. The outputs in the created project directory are similar to:

github_prequential_multi_test.py - This file lets you run multiple adaptive algorithms together against a data stream. While algorithms are learning from instances of a data stream, the framework tells you which adaptive algorithm is optimal by consideringclassification,adaptation, andresource consumption measures. The outputs in the created project directory are similar to:

github_generate_stream.py - The file helps you use the Tornado framework for generating synthetic data streams containing concept drifts. You find a list of stream generators intornado/streams/generators/.

Citation

Please kindly cite the following papers, or thesis, if you plan to use Tornado or any of its components:

Pesaranghader, Ali. "A Reservoir of Adaptive Algorithms for Online Learning from Evolving Data Streams", Ph.D. Dissertation, Université d'Ottawa/University of Ottawa, 2018.
DOI:http://dx.doi.org/10.20381/ruor-22444
Pesaranghader, Ali, et al. "Reservoir of Diverse Adaptive Learners and Stacking Fast Hoeffding Drift Detection Methods for Evolving Data Streams",Machine Learning Journal, 2018.
Pre-print available at:https://arxiv.org/abs/1709.02457, DOI:https://doi.org/10.1007/s10994-018-5719-z
Pesaranghader, Ali, et al. "A framework for classification in data streams using multi-strategy learning",International Conference on Discovery Science, 2016.
Pre-print available at:http://iwera.ir/~ali/papers/ds2016.pdf, DOI:https://doi.org/10.1007/978-3-319-46307-0_22

About

The Tornado 🌪️ framework, designed and implemented for adaptive online learning and data stream mining in Python.

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

The Tornado Framework

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors2

Languages

Movatterモバイル変換

License

alipsgh/tornado

Folders and files

Latest commit

History

Repository files navigation

The Tornado Framework

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors2

Languages

Packages