Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

This R script is used to analyze microarray data acquired by an Agilent SureScan Microarray Scanner.

License

NotificationsYou must be signed in to change notification settings

SamThilmany/microarray-analysis_treatment-kinetics

Repository files navigation

This R script is used to analyze microarray data acquired by an Agilent SureScan Microarray Scanner. Internally the packagelimma by Gordon K. Smythet al.[1] is used to read and analyze the data.

Aim of the Experiment

This script is used to analyze the raw readouts of two biological experiments. In both experiments, the cell culture media were supplemented with three different pharmaceuticals, resulting in four samples per time point; three treatments, and one negative control. The data should show whether treatment with different pharmaceuticals affects gene expression and, if so, how this influence changes over time. Furthermore, we are interested in the specific genes that changed over time and how these genes relate to each treatment,i.e., whether a particular gene is differentially expressed for only one treatment or multiple treatments and whether a gene that is differentially expressed for multiple treatments is always up- or down-regulated.

Raw Data Generation

Cells were harvested at different time points, and RNA was isolated using a modified protocol of Qiagen's RNeasy Protect Cell Mini Kit Protocol.[2] Further sample preparation and microarray-based gene expression analysis was performed according to Agilent's protocol.[3] The microarray used was a SurePrint G3 Human Gene Expression v3 8x60K (P/N G4851C). Each glass slide carries eight high-definition 60K arrays containing cDNA for 26,803 unique Entrez genes and 30,606 unique lncRNAs, and 3000 replicates.[4]

Raw Data Analysis

The Agilent SureScan generates a QC Report and a.txt file with various data and metadata for each sample. Eight samples could be analyzed per glass slide; the files of these eight samples were stored in a folder named after the serial number of the glass slide. (Seedata/*)

After importing the raw data vialimma'sread.maimages function[5], the data was annotated with the ensembl gene IDs. Background correction was done via thebackgroundCorrect function fromlimma using thenormexp method.[6] After background correction, the data were filtered; here, the data that could not be annotated with data from ensembl and whose expression level was not significantly above the background were excluded. The remaining data were normalized with thelimma functionnormalizeBetweenArrays using thequantile method to ensure a similar distribution of expression levels between the different arrays.[7]

A polynomial trend was allowed for the baseline and the individual treatments to examine the effect of each treatment over the treatment period. These polynomial trends were used to create a design matrix fitted to the data using thelmFit function fromlimma.[9] Professor Gordon K. Smyth of the Walter and Eliza Hall Institute of Medical Research in Melbourne, creator of thelimma package, advised this procedure.[8]

The fitted data were then statistically analyzed using empirical Bayes statistics for differential expression. TheeBayes function fromlimma was used for this purpose.[10]

The generated data were filtered according to different criteria (logFC, p-value, and p-value with logFC). For all criteria, lists of genes that were differentially expressed in multiple or all treatments were generated; genes that were differentially expressed in only one treatment were also flagged.

Want to know more?

For more precise information on the evaluation, you may look at the code and refer to the individual functions in the manual oflimma[1] or the respective R package.

About this Project

This project is part of Teresa Hardy's Master's thesis, which was conducted under the supervision of Sam Thilmany at the Federal Institute for Drugs and Medical Devices, Bonn, Germany.

The concept for the data analysis was a joint effort of Teresa Hardy and Sam Thilmany; Sam Thilmany did the programming.

About

This R script is used to analyze microarray data acquired by an Agilent SureScan Microarray Scanner.

Topics

Resources

License

Stars

Watchers

Forks

Languages


[8]ページ先頭

©2009-2025 Movatter.jp