Movatterモバイル変換

malonge/seqtkPublic

forked fromlh3/seqtk

NotificationsYou must be signed in to change notification settings
Fork2
Star1

Toolkit for processing sequences in FASTA/Q formats

License

MIT license

1 star 315 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 121 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
khash.h		khash.h
kseq.h		kseq.h
seqtk.c		seqtk.c

Repository files navigation

Introduction

malonge -- Added "-D" to "seqtk seq" to remove reads with duplicated headers

malonge -- Added "-B" to "seqtk seq" to filter reads by the average Q value

Seqtk is a fast and lightweight tool for processing sequences in the FASTA orFASTQ format. It seamlessly parses both FASTA and FASTQ files which can also beoptionally compressed by gzip. To installseqtk,

git clone https://github.com/lh3/seqtk.git;cd seqtk; make

The only library dependency is zlib.

Seqtk Examples

Convert FASTQ to FASTA:
```
  seqtk seq -a in.fq.gz > out.fa
```
Convert ILLUMINA 1.3+ FASTQ to FASTA and mask bases with quality lower than 20 to lowercases (the 1st command line) or toN (the 2nd):
```
  seqtk seq -aQ64 -q20 in.fq > out.fa  seqtk seq -aQ64 -q20 -n N in.fq > out.fa
```
Fold long FASTA/Q lines and remove FASTA/Q comments:
```
  seqtk seq -Cl60 in.fa > out.fa
```
Convert multi-line FASTQ to 4-line FASTQ:
```
  seqtk seq -l0 in.fq > out.fq
```
Reverse complement FASTA/Q:
```
  seqtk seq -r in.fq > out.fq
```
Extract sequences with names in filename.lst, one sequence name per line:
```
  seqtk subseq in.fq name.lst > out.fq
```
Extract sequences in regions contained in filereg.bed:
```
  seqtk subseq in.fa reg.bed > out.fa
```
Mask regions inreg.bed to lowercases:
```
  seqtk seq -M reg.bed in.fa > out.fa
```
Subsample 10000 read pairs from two large paired FASTQ files (remember to use the same random seed to keep pairing):
```
  seqtk sample -s100 read1.fq 10000 > sub1.fq  seqtk sample -s100 read2.fq 10000 > sub2.fq
```
Trim low-quality bases from both ends using the Phred algorithm:
```
  seqtk trimfq in.fq > out.fq
```
Trim 5bp from the left end of each read and 10bp from the right end:
```
  seqtk trimfq -b 5 -e 10 in.fa > out.fa
```

About

Toolkit for processing sequences in FASTA/Q formats

Releases

4tags

Packages

No packages published

Languages

C99.7%
Makefile0.3%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Introduction

Seqtk Examples

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

Movatterモバイル変換

License

malonge/seqtk

Folders and files

Latest commit

History

Repository files navigation

Introduction

Seqtk Examples

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

Packages