COMBINE-lab/salmonPublic

NotificationsYou must be signed in to change notification settings
Fork168
Star812

🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment

License

GPL-3.0 license

812 stars 168 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 1,907 Commits
.drone		.drone
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
cmake		cmake
doc		doc
docker		docker
include		include
scripts		scripts
src		src
tests		tests
.clang-format		.clang-format
.cmakelintrc		.cmakelintrc
.drone.yml		.drone.yml
.drone.yml.sig		.drone.yml.sig
.gitignore		.gitignore
.lgtm.yml		.lgtm.yml
.travis.yml		.travis.yml
CMakeLists.txt		CMakeLists.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
current_version.txt		current_version.txt
sample_data.tgz		sample_data.tgz

Repository files navigation

Try out the newalevin-fry framework for single-cell analysis; tutorials can be foundhere!

Help guide the development of Salmon,take our survey

What is Salmon?

Salmon is awicked-fast program to produce a highly-accurate, transcript-level quantification estimates fromRNA-seq data. Salmon achieves its accuracy and speed via a number of different innovations, including theuse ofselective-alignment (accurate but fast-to-compute proxies for traditional read alignments), andmassively-parallel stochastic collapsed variational inference. The result is a versatile tool that fits nicelyinto many different pipelines. For example, you can choose to make use of ourselective-alignment algorithm by providing Salmon with raw sequencing reads, or, if it is more convenient, you can provide Salmon with regular alignments (e.g. anunsorted BAM file with alignments to the transcriptome produced with your favorite aligner), and it will use the samewicked-fast, state-of-the-art inference algorithm to estimate transcript-level abundances for your experiment.

Give salmon a try! You can find the latest binary releaseshere.

The current version number of the master branch of Salmon can be foundhere

Documentation

The documentation for Salmon is available onReadTheDocs, check it outhere.

Salmon is, and will continue to be,freely and actively supported on a best-effort basis.If you need industrial-grade technical support, please consider the options atoceangenomics.com/contact.

Decoy sequences in transcriptomes

tl;dr: fast is good but fast and accurate is better!Alignment and mapping methodology influence transcript abundance estimation, and accounting for theaccounting for fragments of unexpected origin can improve transcript quantification. To this end, salmon provides the ability to index both the transcriptome as well as decoy seuqence that can be considered during mapping and quantification. The decoy sequence accounts for reads that might otherwise be (spuriously) attributed to some annotated transcript. Thistutorial provides a step-by-step guide on how to efficiently index the reference transcriptome and genome to produce a decoy-aware index. Specifically, there are 3 possible ways in which the salmon index can be created:

cDNA-only index : salmon_index -https://combine-lab.github.io/salmon/getting_started/. This method will result in the smallest index and require the least resources to build, but will be the most prone to possible spurious alignments.
SA mashmap index: salmon_partial_sa_index - (regions of genome that have high sequence similarity to the transcriptome) - Details can be found inthis README and usingthis script. While running mashmap can require considerable resources, the resulting decoy files are fairly small. This will result in an index bigger than the cDNA-only index, but still mucch smaller than the full genome index below. It will confer many, though not all, of the benefits of using the entire genome as a decoy sequence.
SAF genome index: salmon_sa_index - (the full genome is used as decoy) - The tutorial for creating such an index can be foundhere. This will result in the largest index, but likely does the best job in avoiding spurious alignments to annotated transcripts.

Facing problems with Indexing?, Check if anyone else already had this problem in the issues section or fill the index generationrequest form

NOTE:

If you are generating an index to be used for single-cell or single-nucleus quantification withalevin-fry, then we recommend you consider building a spliced+intron (splici) reference. This serves much of the purpose of a decoy-aware index when quantifying with alevin-fry, while also providing the capability to attribute splicing status to mapped fragments. More details about thesplici reference and the Unspliced/Spliced/Ambiguous quantification mode it enables can be foundhere.

Chat live about Salmon

You can chat with the Salmon developers and other users via Gitter (Note: Gitter is much less frequently monitored than GitHub, so if you have an important problem or question, please consider opening an issue here on GitHub)!

About

🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment

combine-lab.github.io/salmon

Releases46

Salmon v1.10.1 Latest

Mar 12, 2023

+ 45 releases

Packages

No packages published

Contributors40

+ 26 contributors

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

What is Salmon?

Documentation

Decoy sequences in transcriptomes

NOTE:

Chat live about Salmon

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases46

Packages

Uh oh!

Contributors40

Uh oh!

Languages

Movatterモバイル変換

License

COMBINE-lab/salmon

Folders and files

Latest commit

History

Repository files navigation

What is Salmon?

Documentation

Decoy sequences in transcriptomes

NOTE:

Chat live about Salmon

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases46

Packages0

Uh oh!

Contributors40

Uh oh!

Languages

Packages