Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Barcoded Molecular Families

License

NotificationsYou must be signed in to change notification settings

ARUP-NGS/BMFtools

Repository files navigation

#BMFtools###SummaryBMFtools (BarcodedMolecularFamilies tools) is a suite of tools for barcoded reads which takes advantage of PCR redundancy for error reduction/elimination. The core functionality consists ofmolecular demultiplexing at fastq stage, producing a unique observation for each sequenced founded template molecule. Accessory tools provide postprocessing, filtering, quality control, and summary statistics

===================

Requirements:gcc4.9+samtools 1.2+

git clone https://github.com/ARUP-NGS/BMFtools --recursivecd BMFtoolsmake

Use

bmftools
bmftools<--help/-h>
bmftools<subcommand><-h>

Tools

NameUse
bmftools capPostprocess a tagged BAM for BMF-agnostic tools.
bmftools depthCalculates depth of coverage over a set of bed intervals.
bmftools collapseCollapse initial fastq records by barcode
bmftools errCalculate error rates based on cycle, base call, and quality score.
bmftools famstatsCalculate family size statistics for a bam alignment file.
bmftools filterFilter or split a bam file by a set of filters.
bmftools markAdd tags for rsq.
bmftools stackA maximally-permissive variant caller using molecular barcode metadata analogous to samtools mpileup.
bmftools rsqRescue reads with using positional inference to collapse to unique observations in spite of errors in the barcode sequence.
bmftools sortSort for bam rescue
bmftools targetCalculates on-target rate.
bmftools vetCurate variant calls from another variant caller (.bcf) and a bam alignment.

These tools are divided into four categories:

  1. Core functionality
  2. Manipulation
  3. Analysis

Core Functionality

####bmftools collapsebmftools collapse combines reads sharing barcodes into single observations respectively.

First, the barcodes are added to the comment fields of the fastqs and split the records into subsets based on the first characters in the barcode.Then, reads with exactly-matching barcode are collapsed, with a meta-analysis performed on each base call.

bmftools collapse inline collapses templates where both strands were sequenced, whereas collapse secondary lacks strand information.

####bmftools rsqbmftools rsq uses positional information to collapse reads sharing alignment signatures with close barcodesunder the assumption that they came from the same original founding molecule but with errors in reading thebarcode.

Manipulation

####bmftools capCaps quality scores using barcode metadata to facilitate working with barcode-agnostic tools.

####bmftools filterFilters or splits a bam file based on a set of filters. These can be inverted with -v (analogous to grep).

Filters:

  • Fail reads with insufficient mapping quality.

  • Fail reads with insufficient family size.

  • Fail read pairs by aligned fraction.

  • Fail reads outside of a bed region.

  • Fail reads without all bits in given parameter in the sam flag field.

  • Fail reads with any bits in given parameter in the sam flag field.

####bmftools vetCurates SNV calls from a tumor/normal variant call file using barcode metadata from the bams used to produce the variant call file.

Analysis

####bmftools depthCalculates depth of coverage across a bed file using barcode metadata.

####bmftools targetCalculates on-target fraction for bed file using barcode metadata.

####bmftools errCalculates error rates by a variety of parameters.Additionally, pre-computes the quality score recalibration for the optional collapse recalibration step.

####bmftools famstatsCalculates summary statistics related to family size and demultiplexing.

####bmftools stackA maximally-permissive variant caller using molecular barcode metadata analogous to samtools mpileup.

BMF Tags

TagContentFormat
DRWhether the read was sequenced from both strands. Only valid for inline chemistry.Integer [0, 1]
FANumber of reads in Family which Agreed with final sequence at each baseuint32_t array
FMSize of family (number of reads sharing barcode.), e.g., "Family Members"Integer
FPRead Passes Filter related to barcoding. Determines QC fail flag in bmftools mark (without -q).Integer [0, 1]
NFMean number of differences between reads and consensus per read in familyFloat
PVPhred Values for a base call after meta-analysisuint32_t array
RVNumber of reversed reads in consensus. Only for inline chemistry.Integer

[8]ページ先頭

©2009-2025 Movatter.jp