Why, When and How to Adjust Your P Values?
- PMID:30124010
- PMCID: PMC6099145
- DOI: 10.22074/cellj.2019.5992
Why, When and How to Adjust Your P Values?
Abstract
Currently, numerous papers are published reporting analysis of biological data at different omics levels by making statistical inferences. Of note, many studies, as those published in this Journal, report association of gene(s) at the genomic and transcriptomic levels by undertaking appropriate statistical tests. For instance, genotype, allele or haplotype frequencies at the genomic level or normalized expression levels at the transcriptomic level are compared between the case and control groups using the Chi-square/Fisher's exact test or independent (i.e. two-sampled) t-test respectively, with this culminating into a single numeric, namely the P value (or the degree of the false positive rate), which is used to make or break the outcome of the association test. This approach has flaws but nevertheless remains a standard and convenient approach in association studies. However, what becomes a critical issue is that the same cut-off is used when 'multiple' tests are undertaken on the same case-control (or any pairwise) comparison. Here, in brevity, we present what the P value represents, and why and when it should be adjusted. We also show, with worked examples, how to adjust P values for multiple testing in the R environment for statistical computing (http://www.R-project.org).
Keywords: Bias; Gene Expression Profiling; Genetic Variation; Research Design; Statistical Data Analyses.
Copyright© by Royan Institute. All rights reserved.
Conflict of interest statement
There is no conflict of interest in this study.
Figures

Similar articles
- Response to letter to the editor from Dr Rahman Shiri: The challenging topic of suicide across occupational groups.Niedhammer I, Milner A, Witt K, Klingelschmidt J, Khireddine-Medouni I, Alexopoulos EC, Toivanen S, Chastang JF, LaMontagne AD.Niedhammer I, et al.Scand J Work Environ Health. 2018 Jan 1;44(1):108-110. doi: 10.5271/sjweh.3698. Epub 2017 Dec 8.Scand J Work Environ Health. 2018.PMID:29218357
- FSHB -211 G>T is a major genetic modulator of reproductive physiology and health in childbearing age women.Rull K, Grigorova M, Ehrenberg A, Vaas P, Sekavin A, Nõmmemees D, Adler M, Hanson E, Juhanson P, Laan M.Rull K, et al.Hum Reprod. 2018 May 1;33(5):954-966. doi: 10.1093/humrep/dey057.Hum Reprod. 2018.PMID:29617818
- Statistical evaluation of SAGE libraries: consequences for experimental design.Ruijter JM, Van Kampen AH, Baas F.Ruijter JM, et al.Physiol Genomics. 2002 Oct 29;11(2):37-44. doi: 10.1152/physiolgenomics.00042.2002.Physiol Genomics. 2002.PMID:12407185Review.
- Scientific basis of the OCRA method for risk assessment of biomechanical overload of upper limb, as preferred method in ISO standards on biomechanical risk factors.Colombini D, Occhipinti E.Colombini D, et al.Scand J Work Environ Health. 2018 Jul 1;44(4):436-438. doi: 10.5271/sjweh.3746.Scand J Work Environ Health. 2018.PMID:29961081
- Methodological quality in pharmacogenetic studies with binary assessment of treatment response: a review.Cobos A, Sánchez P, Aguado J, Carrasco JL.Cobos A, et al.Pharmacogenet Genomics. 2011 May;21(5):243-50. doi: 10.1097/FPC.0b013e32834300fb.Pharmacogenet Genomics. 2011.PMID:21301379Review.
Cited by
- Twenty-four-hour time-use composition and cognitive function in older adults: Cross-sectional findings of the ACTIVate study.Mellow ML, Dumuid D, Wade AT, Stanford T, Olds TS, Karayanidis F, Hunter M, Keage HAD, Dorrian J, Goldsworthy MR, Smith AE.Mellow ML, et al.Front Hum Neurosci. 2022 Nov 24;16:1051793. doi: 10.3389/fnhum.2022.1051793. eCollection 2022.Front Hum Neurosci. 2022.PMID:36504624Free PMC article.
- Transcriptomic Effects of Acute Ultraviolet Radiation Exposure on TwoSyntrichia Mosses.Ekwealor JTB, Mishler BD.Ekwealor JTB, et al.Front Plant Sci. 2021 Oct 28;12:752913. doi: 10.3389/fpls.2021.752913. eCollection 2021.Front Plant Sci. 2021.PMID:34777431Free PMC article.
- Identification of shared genetic architecture between non-alcoholic fatty liver disease and type 2 diabetes: A genome-wide analysis.Tan Y, He Q, Chan KHK.Tan Y, et al.Front Endocrinol (Lausanne). 2023 Mar 22;14:1050049. doi: 10.3389/fendo.2023.1050049. eCollection 2023.Front Endocrinol (Lausanne). 2023.PMID:37033223Free PMC article.
- Distal Upper Extremity Arterial Calcification as a Predictor for Subclinical Coronary Artery Disease by Coronary Artery Calcium Scoring.Iskandarova A, Rao SJ, Yohe GJ, Shah AB, Giladi AM.Iskandarova A, et al.Plast Reconstr Surg Glob Open. 2024 Apr 24;12(4):e5768. doi: 10.1097/GOX.0000000000005768. eCollection 2024 Apr.Plast Reconstr Surg Glob Open. 2024.PMID:38660336Free PMC article.
- RNA-seq data science: From raw data to effective interpretation.Deshpande D, Chhugani K, Chang Y, Karlsberg A, Loeffler C, Zhang J, Muszyńska A, Munteanu V, Yang H, Rotman J, Tao L, Balliu B, Tseng E, Eskin E, Zhao F, Mohammadi P, P Łabaj P, Mangul S.Deshpande D, et al.Front Genet. 2023 Mar 13;14:997383. doi: 10.3389/fgene.2023.997383. eCollection 2023.Front Genet. 2023.PMID:36999049Free PMC article.Review.
References
- Fisher RA. Tests of significance in harmonic analysis. Proc R Soc A Math Phys Eng Sci. 1929;125(796):594–599.
- Balding DJ. A tutorial on statistical methods for population association studies. Nat Rev Genet. 2006;7(10):781–791. - PubMed
- Cohen J. The earth is round (p<.05): rejoinder. Am Psychol. 1995;50(12):1103–1103.
- Lee JK. Statistical bioinformatics. 1st ed. New Jersey: John Wiley & Sons Inc; 2010.
LinkOut - more resources
Full Text Sources