Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Tool for validation of methylation motifs detected with SMRT sequencing

NotificationsYou must be signed in to change notification settings

fanglab/MeMoRe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

82 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MeMoRe stand for Methylation Motif Refiner. This is a tool designed to validate and, if necessary, refine methylation motifs detected from Bacteria and Archaea with SMRT and ONT sequencing.

Authors' notes

We are actively developingMeMoRe to facilitate usage and broaden features. All feedback is more than welcome. You can reach us on twitter (@iamfanggang and@AlanTourancheau) or directly through theGitHub issues system.

Content

Usage

To runMeMoRe on your own dataset or use our testing dataset please follow:https://fanglab-tools.shinyapps.io/MeMoRe/.MeMoRe is designed to be user friendly and therefore is distributed as aShiny application throughshinyapps.io hosting solution, no installation needed.

Tool showcase

To showcase the toolbox applications and facilitate an understanding of the methods, we build-in example datasets for SMRT and ONT analyses inMeMoRe app. Typical analysis results for both SMRT and ONT datasets are presented below.

Principle

In Bacteria and Archaea, DNA methylation events (6mA, 4mC, and 5mC) are motif-driven, meaning that nearly all occurrences of the same sequence motif(s) will be modified. This property can be used to refine the motifs discovered fromSMRTPortal/SMRTLink Base Modification Analysis ornanodisco pipelines.

For each methylation motifde novo discovered, we identify all occurrences in the provided reference genome, and we aggregate the methylation signal to provide a simple visual representation for motif sequence validation. The same procedure is repeated for all related motifs with one substitution to confirm that the methylation is precisely represented by a motif of interest. For example, considering GATCde novo discovered, we also extract the methylation signal for:

  • 1st base substitution: AATC, CATC, TATC.
  • 2nd base substitution: GCTC, GGTC, GTTC.
  • 3rd base substitution: GAAC, GACC, GAGC.
  • 4th base substitution: GATA, GATG, GATT.

Analysis of SMRT results

In SMRT sequencing, DNA methylation affect the kinetics of the polymerases during real-time DNA synthesis. The changes of polymerase's kinetics are observed through the Inter-Pulse Duration (IPD) metric which are compared to prediction from anin silico model at each genomic position. The resulting metric is called the IPD ratio (IPD native/IPDin silico). For 6mA and 4mC DNA modification, the IPD ratio increase on top of the methylated positions while an IPD ratio of 1 means no kinetic change. It is worth noting that 5mC do not typically produce detectable signal and cannot be reliably found from SMRT data.

The following figure showcases a typicalMeMoRe results for GTAT6mAC methylation motif in aC. perfringens strain. It shows high values only for the IPD ratios and for the Score from the motif of interest (i.e. GTATAC), while the related motifs (with one substitution) have metrics at background levels (~1 for the IPD ratio and ~0 for the Score).Output SMRTFigure 1: MeMoRe results for SMRT dataset of C. perfringens's GTAT6mAC methylation motif. Three metrics are visualized: top. IPD ratio distribution, middle. Score distribution, bottom. Coverage distribution.

Analysis of ONT results

In ONT sequencing, DNA methylation affect the electric current measured while the DNA molecules transfers through the nanopores. Usingnanodisco, current differences between the native and the Whole Genome Amplified samples are computed at each genomic position and this metric represent the methylation signal for ONT dataset. The further from 0 the current difference are, the more likely the genomic is modified. Contrary to SMRT sequencing, the signal is broadly distributed and not restricted to the modified base, meaning that signal for multiple genomic positions needs to be monitored.

The following figure showcase a typicalMeMoRe results for GTAT6mAC methylation motif in aC. perfringens strain. It shows disturbed current differences only from the motif of interest (i.e. GTATAC), while the related motifs (with one substitution) have current difference at background levels (distribution centered around zero). This characteristic is described in details inTourancheau et al., 2021.Output ONTFigure 2: MeMoRe results for ONT dataset of C. perfringens's GTAT6mAC methylation motif. Two metrics are visualized: top. Current differences distribution, bottom. Methylation motif score

Documentation

For a comprehensive description ofMeMoRe including a detailed tutorial, please consult thecomplete documentation.

Citation

A preprint is in preparation. Meanwhile, please cite the GitHub repository:https://github.com/fanglab/MeMoRe.

About

Tool for validation of methylation motifs detected with SMRT sequencing

Resources

Stars

Watchers

Forks

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp