INTERPRETING MANOVA PARAMETER ESTIMATES

David P. Nichols

From SPSS Keywords, February 1993

MANOVA is only one of a number of SPSS procedures in which categorical independent variables (factors) are handled automatically by the procedure via creation of sets of contrast variables. Because what a procedure needs in terms of raw variable codings are not always the same as the resulting contrasts these raw variable codings produce, the design information produced sometimes confuses users when they attempt to interpret the parameter estimates. The purpose of this article is to clarify the default procedure by which MANOVA estimates parameters.

Let's consider an example in which we have two factors, A with two levels, and B with three levels. The following commands submitted to MANOVA request a full factorial analysis of the 2 by 3 design:

MANOVA Y BY A(1,2) B(1,3) /PRINT=DESIGN(ONEWAY OVERALL) PARAM /OMEANS TABLES(CONSTANT, A, B, A BY B) /DESIGN=CONSTANT, A, B, A BY B.

The PRINT subcommand requests MANOVA to provide us with the ONEWAY and OVERALL DESIGN or basis matrices, as well as the parameter estimates. The OMEANS subcommand requests the grand mean, A and B marginal means, and the mean for each cell. The DESIGN subcommand explicitly requests that the test of significance and parameter estimate for the CONSTANT be included on our printout. While these are not printed by default (except with SPSS/PC+), they are included in the model unless NOCONSTANT is requested on the METHOD subcommand.

The omission of any CONTRAST subcommands means that MANOVA will fit the default DEVIATION contrasts for both factors. These contrasts compare all but one of the levels of a factor against the remaining categories. This is equivalent to comparing each level to the mean of all categories. One category is left out (the reference category) because to include it would produce a redundancy in the set of contrasts (recall that with K levels to a factor, there are K-1 degrees of freedom for fitting contrasts). By default, the last category is the reference category.

The first thing that MANOVA will print out for us are the tables of "Combined Observed Means" for the CONSTANT, A, B and A BY B, as requested on the OMEANS subcommand. These appear as:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Combined Observed Grand Means Variable .. Y1 GMEAN WGT. 5.94737 UNWGT. 6.00000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Combined Observed Means for A Variable .. Y1 A 1 WGT. 5.10000 UNWGT. 5.00000 2 WGT. 6.88889 UNWGT. 7.00000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Combined Observed Means for B Variable .. Y1 B 1 WGT. 6.66667 UNWGT. 7.00000 2 WGT. 5.50000 UNWGT. 5.50000 3 WGT. 5.71429 UNWGT. 5.50000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Combined Observed Means for A BY B Variable .. Y1 A 1 2 B 1 WGT. 6.00000 8.00000 UNWGT. 6.00000 8.00000 2 WGT. 5.00000 6.00000 UNWGT. 5.00000 6.00000 3 WGT. 4.00000 7.00000 UNWGT. 4.00000 7.00000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Note that there are two sets of means here, weighted and unweighted. These will always be the same for terms representing individual cell means (the A BY B term here) and will also be the same for marginal and grand means if there are the same number of observations in each cell of the design. If the design has different numbers of observations in the cells, the weighted and unweighted versions of all marginals and the grand mean will generally not be the same. The weighted means weight each cell by the number of observations in it, while the unweighted means are simple means of cell means. Our focus here will be on the unweighted means, since these are what MANOVA uses in parameter estimation.

To illustrate how these are calculated, we can see that the first level marginal mean of A is given by:

_ __ __ __ A = (AB + AB + AB ) / 3 = ( 6 + 5 + 4 ) / 3 = 5 . 1 11 12 13 _ The second level mean A = ( 8 + 6 + 7 ) / 3 = 7 . The B means are: 2 _ __ __ B = (AB + AB ) / 2 = ( 6 + 8 ) / 2 = 7 , 1 11 21 _ _ B = ( 5 + 6 ) / 2 = 5.5 and B = ( 4 + 7 ) / 2 = 5.5 . 2 3

The grand mean is simply the unweighted average of the six cell means, which is the same as the unweighted average of each set of marginal means:

__ __ __ __ __ __ __ AB = ( AB + AB + AB + AB + AB + AB ) / 6 = 11 12 13 21 22 23 _ _ _ _ _ ( A + A ) / 2 = ( B + B + B ) / 3 = 6 . 1 2 1 2 3

Now let us see how MANOVA goes about comparing these means. Take the case of the two level factor A. The contrast can be written in two equivalent ways:

_ _ 1 _ _ CONTRAST A = A - A = - ( A - A ) 1 1 2 1 2

While the first is the more intuitive, as it shows that the contrast simply compares the first level of A to the overall A mean, the second expression is easier to use computationally, as it involves only A marginal means. If we express this contrast as a row vector of contrast coefficients, it becomes simply [ 1/2 -1/2 ]. If we add on an expression for a constant term, we have a full contrast matrix for factor A:

_ _ _1_ | 1 1 | 2 |_ 1 -1 _|

If we were to explicitly request DEVIATION contrasts via input of a SPECIAL contrast matrix, this is what we would specify (actually, MANOVA does not allow fractions or factoring out of a term of a contrast matrix, but here we are using the form that is easiest to see in terms of the requested contrasts). Experienced MANOVA users might recall that the CONSTANT row is often specified as all ones, rather than the .5 or 1/2 used here. MANOVA internally converts this to an average, so that the mean of the means is estimated as opposed to the sum of the means.

After showing us the means, MANOVA next prints out the ONEWAY design or basis matrix for each factor. For A, this appears as:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - One-way Basis for factor A 1 2 1 1.000 1.000 2 1.000 -1.000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Many MANOVA users mistake this matrix for the contrast matrix for factor A, which it is not. It is, however, related to the contrast matrix in the following manner: If the contrast matrix is C, and the basis matrix is X,

-1 C = ( X ' X ) X ' .

This relationship results from the well known expression for least squares estimates of parameters in a linear model:

^ -1 B = ( X ' X ) X ' Y .

There is also a way to calculate X from C:

-1 X = C ' ( C C ' ) .

Since users are generally interested in being certain that they know what contrasts are being fitted, interest generally revolves around C. However, MANOVA does not have to form C explicitly in order to produce parameter estimates and significance tests. The optional CHOLESKY method decomposes X ' X and the default QR method decomposes X directly. C is never explicitly formed unless input as a SPECIAL contrast matrix by the user. This is why there is no option to print out the actual contrast matrix.

The above equations for C and X can be used in the SPSS MATRIX procedure (not available for SPSS/PC+) to ensure that the desired matrix has been specified. In most cases there are also simpler ways to confirm the specifications. If a contrast matrix is partitioned into a row for the constant and the remaining rows for the factor effect, and the columns of the basis matrix are similarly partitioned, the following features can be observed: The effect row(s) of an orthonormal contrast matrix are the same as the transpose of the effect column(s) of the corresponding basis matrix. Also, two important types of nonorthogonal contrasts, DEVIATION and SIMPLE, are mirror images of each other. That is, the basis matrix for DEVIATION contrasts is the transpose of the contrast matrix for SIMPLE contrasts, and vice versa.

This can be seen by comparing the basis matrix for A with a SIMPLE contrast matrix for a two level factor. Recall that SIMPLE contrasts compare each level of a factor except a reference category (or baseline) to that reference category. A SIMPLE contrast matrix for a two level factor would thus look like:

_ _ | .5 .5 | |_ 1 -1 _|

As is easily seen, the nonconstant row here is the transpose of the nonconstant column in the basis matrix for A. The same thing is true of the basis and contrast matrices for factor B. The basis matrix given below

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - One-way Basis for factor B 1 2 3 1 1.000 1.000 .000 2 1.000 .000 1.000 3 1.000 -1.000 -1.000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

has effect columns that when transposed match SIMPLE contrasts [1 0 -1] and [0 1 -1]. The contrasts specified by the B basis matrix are:

_ _ _ _ _ CONTRAST B = B - B = ( 2B - B - B ) / 3, and 1 1 1 2 3 _ _ _ _ _ CONTRAST B = B - B = ( - B + 2B - B ) / 3, 2 2 1 2 3

which can also be expressed in a contrast matrix as:

_ _ 1 | 1 1 1 | - | 2 -1 -1 | 3 |_-1 2 -1_|

Once the ONEWAY basis matrices have been specified, the overall basis matrix is constructed from Kronecker products of the column partitions of the ONEWAY matrices:

BASIS AB = [ A x B | A x B | A x B | A x B ] 1 1 2 1 1 2 2 2

where Ai and Bi represent the ith column partitions of the A and B basis matrices. The result is the overall basis matrix printed by MANOVA:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Between-Subjects Design Matrix (Reduced Model) 1-A 2-B Factor PARAMETER 1 2 1 2 3 4 5 6 1 1 1.00000 1.00000 1.00000 .00000 1.00000 .00000 1 2 1.00000 1.00000 .00000 1.00000 .00000 1.00000 1 3 1.00000 1.00000 -1.00000 -1.00000 -1.00000 -1.00000 2 1 1.00000 -1.00000 1.00000 .00000 -1.00000 .00000 2 2 1.00000 -1.00000 .00000 1.00000 .00000 -1.00000 2 3 1.00000 -1.00000 -1.00000 -1.00000 1.00000 1.00000 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

The resulting overall contrast matrix could be derived from the formula given earlier, or constructed from Kronecker products of the row partitions of the A and B contrast matrices (in a manner similar to that shown for the overall basis construction). The resulting overall contrast matrix is:

_ _ | 1 1 1 1 1 1 | 1 | 1 1 1 -1 -1 -1 | - | 2 -1 -1 2 -1 -1 | 6 | -1 2 -1 -1 2 -1 | | 2 -1 -1 -2 1 1 | |_-1 2 -1 1 -2 1_|

Factoring out the divisor of 6 which applies to all terms allows us to see the nature of the contrasts more easily. The CONSTANT term is simply the unweighted grand mean:

__ __ __ __ __ __ _ _ CONSTANT = ( AB + AB + AB + AB + AB + AB ) / 6 = ( A + A ) / 2 = 11 12 13 21 22 23 1 2 _ _ _ ( B + B + B ) / 3 = 6 1 2 3

The A contrast measures the deviation of level 1 of A from the grand mean:

__ __ __ __ __ __ A = ( AB + AB + AB - AB - AB - AB ) / 6 = 1 11 12 13 21 22 23 _ _ _ __ ( A - A ) / 2 = A - AB = -1 1 2 1

The B contrasts measure the deviations of levels 1 and 2 of B from the grand mean:

__ __ __ __ __ __ B = ( 2AB - AB - AB + 2AB - AB - AB ) / 6 = 1 11 12 13 21 22 23 _ _ _ _ __ ( 2B - B - B ) / 3 = ( B - AB ) = 1 1 2 3 1 __ __ __ __ __ __ B = (- AB + 2AB - AB - AB + 2AB - AB ) / 6 = 2 11 12 13 21 22 23 _ _ _ _ __ (- B + 2B - B ) / 3 = B - AB = -.5 1 2 3 2

The interaction contrasts are specified by the main effect contrasts. The first contrast tests whether the deviation of level 1 of factor A is the same at level 1 of factor B as at levels 2 and 3 of factor B, which is equivalent to testing whether the deviation of level 1 of factor B is the same at both levels of factor A. The last contrast tests whether the deviation of level 1 of factor A is the same at level 2 of B as at levels 1 and 3, which is equivalent to testing whether the deviation of level 2 of factor B is the same at both levels of factor A. In comparing these formulas with the output given below, note that the parameter numbering in MANOVA is hard coded and cannot be changed by the user.

__ __ __ __ __ __ AB = ( 2AB - AB - AB - 2AB + AB + AB ) / 6 = 1 11 12 13 21 22 23 __ _ __ _ __ _ [ 2 ( AB - B ) - ( AB - B ) - ( AB - B ) ] / 3 = 11 1 12 2 13 3 __ _ __ _ [ ( AB - A ) - ( AB - A ) ] / 2 = 0 , 11 1 21 2 __ __ __ __ __ __ AB = (- AB + 2AB - AB + AB - 2AB + AB ) / 6 = 2 11 12 13 21 22 23 __ _ __ _ __ _ [ 2 ( AB - B ) - ( AB - B ) - ( AB - B ) ] / 3 = 12 2 11 1 13 3 __ _ __ _ [ ( AB - A ) - ( AB - A ) ] / 2 = .5 . 12 1 22 2

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Estimates for Y1 --- Individual univariate .9500 confidence intervals CONSTANT Parameter Coeff. Std. Err. t-Value Sig. t Lower -95% CL- Upper 1 6.00000000 .49786 12.05161 .00000 4.92444 7.07556 A Parameter Coeff. Std. Err. t-Value Sig. t Lower -95% CL- Upper 2 -1.0000000 .49786 -2.00860 .06582 -2.07556 .07556 B Parameter Coeff. Std. Err. t-Value Sig. t Lower -95% CL- Upper 3 1.00000000 .72575 1.37789 .19149 -.56788 2.56788 4 -.50000000 .70408 -.71015 .49016 -2.02107 1.02107 A BY B Parameter Coeff. Std. Err. t-Value Sig. t Lower -95% CL- Upper 5 .000000000 .72575 .00000 1.00000 -1.56788 1.56788 6 .500000000 .70408 .71015 .49016 -1.02107 2.02107 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Now that you have seen how MANOVA handles default contrasts, you might want to apply the formulas listed earlier that relate the contrast and basis matrices to other types of contrasts you use in your own analyses. This is only one of a number of topics on which the SPSS MATRIX procedure can be an invaluable aid in understanding the functioning of MANOVA and other SPSS procedures.

Appendix: Data

A B Y1 1 1 8 1 1 5 1 1 7 1 1 4 1 2 7 1 2 2 1 2 6 1 3 4 1 3 1 1 3 7 2 1 7 2 1 9 2 2 4 2 2 6 2 2 8 2 3 7 2 3 9 2 3 6 2 3 6