Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Deepred-Mt is a novel method based on deep neural networks to predict C-to-U editing sites in angiosperm mitochondrial RNA.

License

NotificationsYou must be signed in to change notification settings

aedera/deepredmt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

Deepred-Mt: Deep Representation Learning for Predicting C-to-U RNA Editing in Plant Mitochondria

This repository contains the official implementation of Deepred-Mt, along withinstructions for reproducing results presented in"Deepred-Mt: Deep Representation Learning for Predicting C-to-U RNA Editing in Plant Mitochondria",by A. A. Edera, I. Small, D. H. Milone, andM. V. Sanchez-Puerta.Download PDF.

In land plants, the editosome is a highly sophisticated molecular machine ableto convert post-transcriptionally cytidines into uridines (C-to-U) at highlyspecific RNA positions called editing sites. This RNA editing seems to be partially governed bycis elements, which still remain recalcitrant to characterization.

Deepred-Mt is a novel neural network able to predict C-to-U editing sites inangiosperm mitochondria. Given an RNA sequence, consisting of a centralcytidine flanked by 20 nucleotides on each side, Deepred-Mt scores howprobable its editing is.

Convolution

The score is computed from complexcis elements or motifs automaticallyextracted from the flanking bases by a multi-layer convolutional neuralnetwork, whose full architecture is schematically shown below.

Deepred-Mt

Submit RNA sequences for predictions

To submit RNA/DNA sequences for predicting their C-to-U editing sites withDeepred-Mt, use the following link:

Submit sequences

Note 1: To be able to submit, you must be logged in with a Google Account(e.g.,Gmail).

Note 2: If difficulties are experienced when submitting sequences, try touseGoogle Chrome as the web browser.

If you encounter problems when submitting sequences please reportan issue.

Installation

To install Deepred-Mt on your computer, the following dependencies must beinstalled:

First, create and activate a new Conda environment

conda create -n deepredmt python=3.7conda activate deepredmt

Next, install Deepred-Mt from the sources

pip install -U"deepredmt @ git+https://github.com/aedera/deepredmt.git"

Usage

Command line

Once installed, Deepred-Mt can be executed on the command line to predictC-to-U editing sites from a desired FASTAfile.Hereis an example FASTA file calledseqs.fas:

deepredmt seqs.fas

This command extracts cytidines from the FASTA file to make predictions basedon their surrounding nucleotides.

Demo notebooks

The following notebooks reproduce experiments in the article.

DescriptionNotebook
Use Deepred-Mt on the command line to predict C-to-U editing sites from a given FASTA file
Compare the predictive performance of Deepred-Mt and state-of-the art methods for predicting editing sites
Train Deepred-Mt from scratch

Data

The experiments reported in the manuscript used three datasets built fromtheseFASTA files, extracted from nucleotide sequences encoding mitochondrial proteins from 21 plant species. In these files, 'E' nucleotides indicateC-to-U editing sites identified by using published RNAseq data, obtained from theEuropean Nucleotide Archive.

DatasetDescription
Training data41-bp nucleotide windows whose center positions are either unedited (C) or edited (E) cytidines. Nucleotide windows are labeled according to both the nucleotide in their central positions (0/C, 1/E) and their corresponding editing extents (a value ranging from 0 to 1)
Task-related sequencesSequences used for the augmentation strategy proposed in the article. These sequences are 41-bp nucleotide windows whose center positions are thymidines homologous to one of the editing sites in the training data
Control dataControl data containing fake editing signal "GGCG" within the downstream regions of nucleotide windows that are labeled as 1 (edited)

More information on the data format is providedhere.

Results

In our experiments, Deepred-Mt was compared to two state-of-the-art methods for predicting editingsites: PREP-Mt and PREPACT. The following figure shows precision-recall curvesobtained from the predictions of each method. Deepred-Mt achieves the highestF1 scores and the best areas under the curves (AUPRC) for two predictivescenarios: one excluding synonymous sites (dashed lines) and other includingthem (solid lines).

Deepred-Mt performance

MethodExcludedIncluded
AUPRCF1AUPRCF1
PREPACT0.910.890.790.82
PREP-Mt0.880.910.760.84
Deepred-Mt0.960.920.910.86

Contributing

Contributions from anyone are welcome. You can start by adding a new entryhere.

License

Deepred-Mt is licensed under the MIT license. SeeLICENSE for more details.


[8]ページ先頭

©2009-2025 Movatter.jp