Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

A generic motif discovery algorithm for sequential data

License

NotificationsYou must be signed in to change notification settings

kljensen/Gemoda

Repository files navigation

This is the source code implementing the Gemoda algorithm described in the following paper

A generic motif discovery algorithm for sequential data.
Jensen KL, Styczynski MP, Rigoutsos I, Stephanopoulos GN.
Bioinformatics. 2006 Jan 1;22(1):21-8. Epub 2005 Oct 27.

MOTIVATION: Motif discovery in sequential data is a problem of great interest and with many applications. However, previous methods have been unable to combine exhaustive search with complex motif representations and are each typically only applicable to a certain class of problems.

RESULTS: Here we present a generic motif discovery algorithm (Gemoda) for sequential data. Gemoda can be applied to any dataset with a sequential character, including both categorical and real-valued data. As we show, Gemoda deterministically discovers motifs that are maximal in composition and length. As well, the algorithm allows any choice of similarity metric for finding motifs. Finally, Gemoda's output motifs are representation-agnostic: they can be represented using regular expressions, position weight matrices or any number of other models for any type of sequential data. We demonstrate a number of applications of the algorithm, including the discovery of motifs in amino acids sequences, a new solution to the (l,d)-motif problem in DNA sequences and the discovery of conserved protein substructures

Also see thispresentation about Gemoda on Slideshare.

Use the following sequence of commands to install the Gemoda implementation provided by this software:

./configure
make
make install

Note that the software requires theGNU Scientific Library.

To familiarize you with the software, we included 3 examples in the examples/ subdirectory. The README files in these folders details how to run the software.

Kyle
kljensen@gmail.com

About

A generic motif discovery algorithm for sequential data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp