Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

RNA-seq data analysis practical

NotificationsYou must be signed in to change notification settings

babasaraki/TeachingMaterial

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This tutorial will illustrate how to use standalone tools, together with R and Bioconductor for the analysis of RNA-seq data. Keep in mind that this is a rapidly evolving field and that this document is not intended as a review of the many tools available to perform each step; instead, we will cover one of the many existing workflows to analyse this type of data.

We will be working with a subset of a publicly available dataset fromDrosophila melanogaster, which is available both in the Short Read archive (SRP001537 - raw data) and in Bioconductor (pasilla package - processed data). For more information about this dataset please refer to the original publication (Brooks et al. 2010).

The tools and R packages that we will be using during the practical are listed below (seeSoftware requirements) and the necessary data files can be foundhere. After dowloading and uncompressing thetar.gz file, you should have the following directory structure in your computer:

RNAseq|-- reference               # reference info (e.g. genome sequence and annotation)`-- data    |-- raw                 # raw data: fastq files    |-- mapped              # mapped data: BAM files    `-- demultiplexing      # extra fastq files for the demultiplexing section

This work is licensed under aCreative Commons Attribution-ShareAlike 3.0 Unported License. This means that you are able to copy, share and modify the work, as long as the result is distributed under the same license.

Table of contents

  1. Dealing with raw data
    1. The FASTQ format
    2. Quality assessment (QA)
    3. Filtering FASTQ files
    4. De-multiplexing samples
    5. Aligning reads to the genome
  2. Dealing with aligned data
    1. The SAM/BAM format
    2. Visualising aligned reads
    3. Filtering BAM files
    4. Gene-centric analyses:
      1. Counting reads overlapping annotated genes
        • With htseq-count
        • With R
        • Alternative approaches
      2. Normalising counts
        • With RPKMs
        • With DESeq2
      3. Differential gene expression
    5. Exon-centric analyses:
      1. Differential exon usage
    6. Transcript-centric analyses:
      1. Identification, annotation and visualisation of splicing switch events

Software requirements

Note: depending on the topics covered in the course some of these tools might not be used.

Other resources

Course data

Getting started in R and UNIX

More on RNA-seq and NGS

Aknowledgments

This tutorial has been inspired on material developed by Ângela Gonçalves, Nicolas Delhomme, Simon Anders and Martin Morgan, who I would like to thank and acknowledge. Special thanks must go to Ângela Gonçalves and Mitra Barzine, with whom I have been teaching, and to Gabriella Rustici, for always finding a way to organise a new course.

About

RNA-seq data analysis practical

Resources

Stars

Watchers

Forks

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp