Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Extract 3'UTR, 5'UTR, CDS, Promoter, Genes, Introns, Exons from GTF files

License

NotificationsYou must be signed in to change notification settings

saketkc/gencode_regions

Repository files navigation

Extract 3'UTR, 5'UTR, CDS, Promoter, Genes from GTF files.

Data

If you only care about the final output, they are hosted build and GTF version wise onriboraptor.

Using Python

Dependencies

We recommend setting up aconda environment withPython>=3 andPython<=3.7 withgffutilsv0.9 andpybedtools:

conda create --name gencode_env python=3.7conda activate gencode_envconda install -c bioconda gffutils=0.9 pybedtools

Notebooks

The corresponding output gzipped beds are in thedata directory.

Using R

Dependencies

Run

./create_regions_from_gencode.R <path_to_GFF/GTF> <path_to_output_dir>

Will createexons.bed, 3UTR.bed, 5UTR.bed, genes.bed, cds.bed in<output_dir>

Example

  • Download GFF/GTF(GRCh37, v25, comprehensive, CHR) from gencodegenes.org:
   wget ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_25/gencode.v25.annotation.gff3.gz \   && gunzip gencode.v25.annotation.gff3.gz
  • Create regions:
./create_regions_from_gencode.R gencode.v25.annotation.gff3 /path/to/GRCh37/annotation

First exons, Last exons

We useGenePred format to make the process a bit simple.

  • DownloadgtfToGenePred

  • Convert gtf to GenePred:

    gtfToGenePred gencode.v25.annotation.gtf gencode.v25.annotation.genepred
  • Extractfirst exons:

    python genepred_to_bed.py --first_exon gencode.v25.annotation.genepred
  • Extractlast exons:

    python genepred_to_bed.py --last_exon gencode.v25.annotation.genepred

Confused about exons and UTRs?

This should be helpful:

img

Source: Wikipedia

or probably this:

img

Source: Biostar

About

Extract 3'UTR, 5'UTR, CDS, Promoter, Genes, Introns, Exons from GTF files

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp