SamThilmany/microarray-analysis_treatment-kineticsPublic

NotificationsYou must be signed in to change notification settings
Fork0
Star1

This R script is used to analyze microarray data acquired by an Agilent SureScan Microarray Scanner.

License

MIT license

1 star 0 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SpotTypes.tsv		SpotTypes.tsv
Targets.tsv		Targets.tsv
_analysis-script.R		_analysis-script.R
_annotation-file-generator.R		_annotation-file-generator.R
run-analysis.R		run-analysis.R

Repository files navigation

Microarray Analysis of Treatment Kinetics

This R script is used to analyze microarray data acquired by an Agilent SureScan Microarray Scanner. Internally the packagelimma by Gordon K. Smythet al.[1] is used to read and analyze the data.

Aim of the Experiment

This script is used to analyze the raw readouts of two biological experiments. In both experiments, the cell culture media were supplemented with three different pharmaceuticals, resulting in four samples per time point; three treatments, and one negative control. The data should show whether treatment with different pharmaceuticals affects gene expression and, if so, how this influence changes over time. Furthermore, we are interested in the specific genes that changed over time and how these genes relate to each treatment,i.e., whether a particular gene is differentially expressed for only one treatment or multiple treatments and whether a gene that is differentially expressed for multiple treatments is always up- or down-regulated.

Raw Data Generation

Cells were harvested at different time points, and RNA was isolated using a modified protocol of Qiagen's RNeasy Protect Cell Mini Kit Protocol.[2] Further sample preparation and microarray-based gene expression analysis was performed according to Agilent's protocol.[3] The microarray used was a SurePrint G3 Human Gene Expression v3 8x60K (P/N G4851C). Each glass slide carries eight high-definition 60K arrays containing cDNA for 26,803 unique Entrez genes and 30,606 unique lncRNAs, and 3000 replicates.[4]

Raw Data Analysis

The Agilent SureScan generates a QC Report and a.txt file with various data and metadata for each sample. Eight samples could be analyzed per glass slide; the files of these eight samples were stored in a folder named after the serial number of the glass slide. (Seedata/*)

After importing the raw data vialimma'sread.maimages function[5], the data was annotated with the ensembl gene IDs. Background correction was done via thebackgroundCorrect function fromlimma using thenormexp method.[6] After background correction, the data were filtered; here, the data that could not be annotated with data from ensembl and whose expression level was not significantly above the background were excluded. The remaining data were normalized with thelimma functionnormalizeBetweenArrays using thequantile method to ensure a similar distribution of expression levels between the different arrays.[7]

A polynomial trend was allowed for the baseline and the individual treatments to examine the effect of each treatment over the treatment period. These polynomial trends were used to create a design matrix fitted to the data using thelmFit function fromlimma.[9] Professor Gordon K. Smyth of the Walter and Eliza Hall Institute of Medical Research in Melbourne, creator of thelimma package, advised this procedure.[8]

The fitted data were then statistically analyzed using empirical Bayes statistics for differential expression. TheeBayes function fromlimma was used for this purpose.[10]

The generated data were filtered according to different criteria (logFC, p-value, and p-value with logFC). For all criteria, lists of genes that were differentially expressed in multiple or all treatments were generated; genes that were differentially expressed in only one treatment were also flagged.

Want to know more?

For more precise information on the evaluation, you may look at the code and refer to the individual functions in the manual oflimma[1] or the respective R package.

About this Project

This project is part of Teresa Hardy's Master's thesis, which was conducted under the supervision of Sam Thilmany at the Federal Institute for Drugs and Medical Devices, Bonn, Germany.

The concept for the data analysis was a joint effort of Teresa Hardy and Sam Thilmany; Sam Thilmany did the programming.

About

This R script is used to analyze microarray data acquired by an Agilent SureScan Microarray Scanner.

Releases1

v1.0.0 Latest

Jan 13, 2023

Languages

R100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Microarray Analysis of Treatment Kinetics

Aim of the Experiment

Raw Data Generation

Raw Data Analysis

Want to know more?

About this Project

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases1

Languages

Movatterモバイル変換

License

SamThilmany/microarray-analysis_treatment-kinetics

Folders and files

Latest commit

History

Repository files navigation

Microarray Analysis of Treatment Kinetics

Aim of the Experiment

Raw Data Generation

Raw Data Analysis

Want to know more?

About this Project

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases1

Languages