Movatterモバイル変換


[0]ホーム

URL:


Skip to contents

heplots

Visualizing Hypothesis Tests in Multivariate Linear Models

Version 1.8.0; documentation built forpkgdown 2025-12-01

Description

Theheplots package provides functions for visualizing hypothesis tests in multivariate linear models (“MLM” = {MANOVA, multivariate multiple regression, MANCOVA, and repeated measures designs}). It also provides other tools for analysis and graphical display of MLMs.

HE plots represent sums-of-squares-and-products matrices for linear hypotheses (H) and for error (E) using ellipses (in two dimensions), ellipsoids (in three dimensions), or by line segments in one dimension. For the theory and applications, see:

If you use this work in teaching or research, please cite it as given bycitation("heplots") or seeCitation.

Other topics now addressed here include:

  • robust MLMs, using iteratively re-weighted least squared to down-weight observations with large multivariate residuals,robmlm().
  • Mahalanobis() calculates classical androbust Mahalanobis squared distances using MCD and MVE estimators of center and covariance.
  • visualizing tests for equality of covariance matrices in MLMs (Box’s M test),boxM() andplot.boxM(). Also:bartlettTests() andLeveneTests() for homogeneity of variance for each response in a MLM.
  • χ2\chi^2 Q-Q plots for MLMs (cqplot()) to detect outliers and assess multivariate normality of residuals.
  • bivariate coefficient plots showing elliptical confidence regions (coefplot()).

In this respect, theheplots package now aims to provide a wide range of tools for analyzing and visualizing multivariate response linear models, together with other packages:

candisc logo

  • The relatedcandisc package provides HE plots incanonical discriminant space, the space of linear combinations of the responses that show the maximum possible effects and for canonical correlation in multivariate regression designs. See thepackage documentation for details.

mvinfluence logo

Several tutorial vignettes are also included. Seevignette(package="heplots").

Installation

CRAN versioninstall.packages("heplots")
R-universeinstall.packages("heplots", repos = c('https://friendly.r-universe.dev')
Development versionremotes::install_github("friendly/heplots")

HE plot functions

The graphical functions contained here all display multivariate model effects in variable (data) space, for one or more response variables (or contrasts among response variables in repeated measures designs).

  • heplot() constructs two-dimensional HE plots for model terms and linear hypotheses for pairs of response variables in multivariate linear models.

  • heplot3d() constructs analogous 3D plots for triples of response variables.

  • Thepairs method,pairs.mlm() constructs a scatterplot matrix of pairwise HE plots.

  • heplot1d() constructs 1-dimensional analogs of HE plots for model terms and linear hypotheses for single response variables.

Other functions

  • glance.mlm() extendsbroom::glance.lm() to multivariate response models, giving a one-line statistical summary for each response variable.uniStats() does something similar, but formatted more like a ANOVA table.

  • boxM() Calculates Box’sM test for homogeneity of covariance matrices in a MANOVA design. Aplot method displays a visual representation of the components of the test. Associated with this,bartletTests() andlevineTests() give the univariate tests of homogeneity of variance for each response measure in a MLM.

  • covEllipses() draw covariance (data) ellipses for one or more group, optionally including the ellipse for the pooled within-group covariance.

  • coefplot() for an MLM object draws bivariate confidence ellipses.

Repeated measure designs

For repeated measure designs, between-subject effects and within-subject effects must be plotted separately, because the error terms (E matrices) differ. For terms involving within-subject effects, these functions carry out a linear transformation of the matrixY of responses to a matrixY M, whereM is the model matrix for a term in the intra-subject design and produce plots of theH andE matrices in this transformed space. The vignette"repeated" describes these graphical methods for repeated measures designs. (This paperHE plots for repeated measures designs is now provided as a PDF vignette.)

Datasets

The package also provides a large collection of data sets illustrating a variety of multivariate linear models of the types listed above, together with graphical displays. The table below classifies these with method tags. Their names are linked to their documentation with graphical output on thepkgdown website, [http://friendly.github.io/heplots].

datasetrowscolstitletags
AddHealth43443Adolescent Health DataMANOVA ordered
Adopted626Adopted ChildrenMMRA repeated
Bees2466Captive and maltreated beesMANOVA
Diabetes1456Diabetes DatasetMANOVA
dogfood163Dogfood PreferencesMANOVA contrasts candisc
FootHead907Head measurements of football playersMANOVA contrasts
Headache986Treatment of Headache Sufferers for Sensitivity to NoiseMANOVA repeated
Hernior329Recovery from Elective HerniorrhaphyMMRA candisc
Iwasaki_Big_Five2037Personality Traits of Cultural GroupsMANOVA
mathscore123Math scores for basic math and word problemsMANOVA
MockJury11417Effects Of Physical Attractiveness Upon Mock Jury DecisionsMANOVA candisc
NeuroCog24210Neurocognitive Measures in Psychiatric GroupsMANOVA candisc
NLSY2436National Longitudinal Survey of Youth DataMMRA
oral565Effect of Delay in Oral Practice in Second Language LearningMANOVA
Oslo33214Oslo Transect Subset DataMANOVA candisc
Overdose177Overdose of AmitriptylineMMRA cancor
Parenting604Father Parenting CompetenceMANOVA contrasts
peng3338Size measurements for adult foraging penguins near Palmer StationMANOVA
Plastic205Plastic Film DataMANOVA
Pottery24812Chemical Analysis of Romano-British PotteryMANOVA candisc
Probe115Response Speed in a Probe ExperimentMANOVA repeated
RatWeight276Weight Gain in Rats Exposed to Thiouracil and ThyroxinMANOVA repeated
ReactTime106Reaction Time Datarepeated
Rohwer6910Rohwer Data SetMMRA MANCOVA
RootStock485Growth of Apple Trees from Different Root StocksMANOVA contrasts
Sake3010Taste Ratings of Japanese Rice Wine (Sake)MMRA
schooldata708School DataMMRA robust
Skulls1505Egyptian SkullsMANOVA contrasts
SocGrades4010Grades in a Sociology CourseMANOVA candisc
SocialCog1395Social Cognitive Measures in Psychiatric GroupsMANOVA candisc
TIPI179916Data on the Ten Item Personality InventoryMANOVA candisc
VocabGrowth644Vocabulary growth datarepeated
WeightLoss347Weight Loss Datarepeated

Examples

This example illustrates HE plots using the classiciris data set. How do the means of the flower variables differ bySpecies? This dataset was the impetus for R. A. Fisher (1936) to propose a method of discriminant analysis using data collected by Edgar Anderson (1928). Though some may rightly deprecate Fisher for being a supporter of eugenics, Anderson’siris dataset should not be blamed.

A basic HE plot shows theH andE ellipses for the first two response variables (here:Sepal.Length andSepal.Width). The multivariate test is significant (by Roy’s test)iff theH ellipse projectsanywhere outside theE ellipse.

The positions of the group means show how they differ on the two response variables shown, and provide an interpretation of the orientation of theH ellipse: it is long in the directions of differences among the means.

iris.mod<-lm(cbind(Sepal.Length,Sepal.Width,Petal.Length,Petal.Width)~Species, data=iris)heplot(iris.mod)
HE plot of sepal length and Sepal width for the iris data

HE plot of sepal length and Sepal width for the iris data

Contrasts

Contrasts or other linear hypotheses can be shown as well, and the ellipses look better if they are filled. We create contrasts to test the differences betweenversacolor andvirginca and also betweensetosa and the average of the other two. Each 1 df contrast plots as degenerate 1D ellipse– a line.

Because these contrasts are orthogonal, they add to the total 2 df effect ofSpecies. Note how the first contrast, labeledV:V, distinguishes the means ofversicolor fromvirginica; the second contrast,S:VV distinguishessetosa from the other two.

par(mar=c(4,4,1,1)+.1)contrasts(iris$Species)<-matrix(c(0,-1,1,2,-1,-1), nrow=3, ncol=2)contrasts(iris$Species)#>            [,1] [,2]#> setosa        0    2#> versicolor   -1   -1#> virginica     1   -1iris.mod<-lm(cbind(Sepal.Length,Sepal.Width,Petal.Length,Petal.Width)~Species, data=iris)hyp<-list("V:V"="Species1","S:VV"="Species2")heplot(iris.mod, hypotheses=hyp,       fill=TRUE, fill.alpha=0.1)
HE plot of sepal length and Sepal width for the iris data, showing lines reflecting two contrasts among iris species.

HE plot of sepal length and Sepal width for the iris data, showing lines reflecting two contrasts among iris species.

All pairwise HE plots

All pairwise HE plots are produced using thepairs() method for MLM objects.In the plot, note how the means of most pairs of variables are very highly correlated, in the order Setosa < Versicolor < Virginica, but this pattern doesn’t hold for relations withSepal.Width.

pairs(iris.mod, hypotheses=hyp, hyp.labels=FALSE,      fill=TRUE, fill.alpha=0.1)
Scatterplot matrix of pairwise HE plots for the iris data.

Scatterplot matrix of pairwise HE plots for the iris data.

Canonical discriminant view

For more than two response variables, a multivariate effect can be viewed more simply by projecting the data into canonical space — the linear combinations of the responses which show the greatest differences among the group means relative to within-group scatter. The computations are performed with thecandisc package, which has anheplot.candisc() method.

library(candisc)iris.can<-candisc(iris.mod)|>print()#>#> Canonical Discriminant Analysis for Species:#>#>    CanRsq Eigenvalue Difference  Percent Cumulative#> 1 0.96987   32.19193     31.907 99.12126     99.121#> 2 0.22203    0.28539     31.907  0.87874    100.000#>#> Test of H0: The canonical correlations in the#> current row and all that follow are zero#>#>   LR test stat approx F numDF denDF   Pr(> F)#> 1      0.02344  199.145     8   288 < 2.2e-16 ***#> 2      0.77797   13.794     3   145 5.794e-08 ***#> ---#> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The HE plot in canonical space shows that the differences among species are nearly entirely one-dimensional. The weights for the variables on the first dimension show howSepal.Width differs from the other size variables.

# HE plot in canonical spaceheplot(iris.can, var.pos=1, scale=40)
Canonical HE plot for the iris data

Canonical HE plot for the iris data

Covariance ellipses

MANOVA relies on the assumption that within-group covariance matrices are all equal. It is useful to visualize these in the space of some of the predictors.covEllipses() provides this both for classical and robust (method="mve") estimates. The figure below shows these for the three Iris species and the pooled covariance matrix, which is the same as theE matrix used in MANOVA tests.

covEllipses(iris[,1:4],iris$Species)covEllipses(iris[,1:4],iris$Species,            fill=TRUE, method="mve", add=TRUE, labels="")
Covariance ellipses for the iris data, showing the classical and robust estimates.

Covariance ellipses for the iris data, showing the classical and robust estimates.

References

Anderson, E. (1928). The Problem of Species in the Northern Blue Flags, Iris versicolor L. and Iris virginica L.Annals of the Missouri Botanical Garden,13, 241–313.

Fisher, R. A. (1936). The Use of Multiple Measurements in Taxonomic Problems.Annals of Eugenics,8, 379–388.

Friendly, M. (2006).Data Ellipses, HE Plots and Reduced-Rank Displays for Multivariate Linear Models: SAS Software and Examples.Journal of Statistical Software,17, 1-42.

Friendly, M. (2007).HE plots for Multivariate General Linear Models.Journal of Computational and Graphical Statistics,16(2) 421-444.DOI.

Fox, J., Friendly, M. & Monette, G. (2009).Visualizing hypothesis tests in multivariate linear models: The heplots package for RComputational Statistics,24, 233-246.

Friendly, M. (2010).HE plots for repeated measures designs.Journal of Statistical Software,37, 1–37.

Friendly, M.; Monette, G. & Fox, J. (2013).Elliptical Insights: Understanding Statistical Methods Through Elliptical GeometryStatistical Science,28, 1-39.

Friendly, M. & Sigal, M. (2017).Graphical Methods for Multivariate Linear Models in Psychological Research: An R Tutorial.The Quantitative Methods for Psychology,13, 20-45.

Friendly, M. & Sigal, M. (2018):Visualizing Tests for Equality of Covariance Matrices,The American Statistician,DOI

Links

License

  • GPL (>= 2)

Citation

Developers

Dev status

  • Lifecycle: stable
  • CRAN_Status_Badge
  • R_Universe
  • Last Commit
  • Downloads
  • DOI
  • Docs

[8]ページ先頭

©2009-2025 Movatter.jp