Chapter Contents
Chapter Contents
Previous
Previous
Next
Next
Introduction to Structural Equations with Latent Variables

A Combined Measurement-Structural Model with Reciprocal Influence and Correlated Residuals

To illustrate a more complex model, this example uses some well-known data from Haller and Butterworth (1960). Various models and analyses of these data are given by Duncan, Haller, and Portes (1968), J\ddot{o}reskog and S\ddot{o}rbom (1988), and Loehlin (1987).

The study is concerned with the career aspirations of high-school students and how these aspirations are affected by close friends. The data are collected from 442 seventeen-year-old boys in Michigan. There are 329 boys in the sample who named another boy in the sample as a best friend. The observations to be analyzed consist of the data from these 329 boys paired with the data from their best friends.

The method of data collection introduces two statistical problems. First, restricting the analysis to boys whose best friends are in the original sample causes the reduced sample to be biased. Second, since the data from a given boy may appear in two or more observations, the observations are not independent. Therefore, any statistical conclusions should be considered tentative. It is difficult to accurately assess the effects of the dependence of the observations on the analysis, but it could be argued on intuitive grounds that since each observation has data from two boys and since it seems likely that many of the boys will appear in the data set at least twice, the effective sample size may be as small as half of the reported 329 observations.

The correlation matrix is taken from J\ddot{o}reskog and S\ddot{o}rbom (1988).

   title 'Peer Influences on Aspiration: Haller & Butterworth (1960)';
   data aspire(type=corr);
      _type_='corr';
      input _name_ $ riq rpa rses roa rea fiq fpa fses foa fea;
      label riq='Respondent: Intelligence'
            rpa='Respondent: Parental Aspiration'
            rses='Respondent: Family SES'
            roa='Respondent: Occupational Aspiration'
            rea='Respondent: Educational Aspiration'
            fiq='Friend: Intelligence'
            fpa='Friend: Parental Aspiration'
            fses='Friend: Family SES'
            foa='Friend: Occupational Aspiration'
            fea='Friend: Educational Aspiration';
      datalines;
   riq   1.      .      .      .      .      .       .      .      .      .
   rpa   .1839  1.      .      .      .      .       .      .      .      .
   rses  .2220  .0489  1.      .      .      .       .      .      .      .
   roa   .4105  .2137  .3240  1.      .      .       .      .      .      .
   rea   .4043  .2742  .4047  .6247  1.      .       .      .      .      .
   fiq   .3355  .0782  .2302  .2995  .2863  1.       .      .      .      .
   fpa   .1021  .1147  .0931  .0760  .0702  .2087   1.      .      .      .
   fses  .1861  .0186  .2707  .2930  .2407  .2950  -.0438  1.      .      .
   foa   .2598  .0839  .2786  .4216  .3275  .5007   .1988  .3607  1.      .
   fea   .2903  .1124  .3054  .3269  .3669  .5191   .2784  .4105  .6404  1.
   ;

The model analyzed by J\ddot{o}reskog and S\ddot{o}rbom (1988) is displayed in the following path diagram:

icaf4.gif (5830 bytes)

Figure 14.17: Path Diagram: Career Aspiration, J\ddot{o}reskog and S\ddot{o}rbom

Two latent variables, f_ramb and f_famb, represent the respondent's level of ambition and his best friend's level of ambition, respectively. The model states that the respondent's ambition is determined by his intelligence and socioeconomic status, his perception of his parents' aspiration for him, and his friend's socioeconomic status and ambition. It is assumed that his friend's intelligence and socioeconomic status affect the respondent's ambition only indirectly through his friend's ambition. Ambition is indexed by the manifest variables of occupational and educational aspiration, which are assumed to have uncorrelated residuals. The path coefficient from ambition to occupational aspiration is set to 1.0 to determine the scale of the ambition latent variable.

This model can be analyzed with PROC CALIS using the LINEQS statement as follows, where the names of the parameters correspond to those used by J\ddot{o}reskog and S\ddot{o}rbom (1988). Since this TYPE=CORR data set does not contain an observation with _TYPE_='N' giving the sample size, it is necessary to specify the degrees of freedom (sample size minus one) with the EDF= option in the PROC CALIS statement.

   title2 'Joreskog-Sorbom (1988) analysis 1';
   proc calis data=aspire edf=328;
      lineqs    /* measurement model for aspiration */
             rea=lambda2 f_ramb + e_rea,
             roa=f_ramb + e_roa,
             fea=lambda3 f_famb + e_fea,
             foa=f_famb + e_foa,
                /* structural model of influences */
             f_ramb=gam1 rpa + gam2 riq + gam3 rses +
                gam4 fses + beta1 f_famb + d_ramb,
             f_famb=gam8 fpa + gam7 fiq + gam6 fses +
                gam5 rses + beta2 f_ramb + d_famb;
      std d_ramb=psi11,
          d_famb=psi22,
          e_rea e_roa e_fea e_foa=theta:;
      cov d_ramb d_famb=psi12,
          rpa riq rses fpa fiq fses=cov:;
   run;

Specify a name followed by a colon to represent a list of names formed by appending numbers to the specified name. For example, in the COV statement, the line

   rpa riq rses fpa fiq fses=cov:;

is equivalent to

   rpa riq rses fpa fiq fses=cov1-cov15;

The results from this analysis are as follows.

Peer Influences on Aspiration: Haller & Butterworth (1960)
Joreskog-Sorbom (1988) analysis 1

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

Fit Function 0.0814
Goodness of Fit Index (GFI) 0.9844
GFI Adjusted for Degrees of Freedom (AGFI) 0.9428
Root Mean Square Residual (RMR) 0.0202
Parsimonious GFI (Mulaik, 1989) 0.3281
Chi-Square 26.6972
Chi-Square DF 15
Pr > Chi-Square 0.0313
Independence Model Chi-Square 872.00
Independence Model Chi-Square DF 45
RMSEA Estimate 0.0488
RMSEA 90% Lower Confidence Limit 0.0145
RMSEA 90% Upper Confidence Limit 0.0783
ECVI Estimate 0.2959
ECVI 90% Lower Confidence Limit 0.2823
ECVI 90% Upper Confidence Limit 0.3721
Probability of Close Fit 0.4876
Bentler's Comparative Fit Index 0.9859
Normal Theory Reweighted LS Chi-Square 26.0113
Akaike's Information Criterion -3.3028
Bozdogan's (1987) CAIC -75.2437
Schwarz's Bayesian Criterion -60.2437
McDonald's (1989) Centrality 0.9824
Bentler & Bonett's (1980) Non-normed Index 0.9576
Bentler & Bonett's (1980) NFI 0.9694
James, Mulaik, & Brett (1982) Parsimonious NFI 0.3231
Z-Test of Wilson & Hilferty (1931) 1.8625
Bollen (1986) Normed Index Rho1 0.9082
Bollen (1988) Non-normed Index Delta2 0.9864
Hoelter's (1983) Critical N 309

Figure 14.18: Career Aspiration Data: J&S Analysis 1

J\ddot{o}reskog and S\ddot{o}rbom (1988) present more detailed results from a second analysis in which two constraints are imposed:

This analysis can be performed by changing the names beta1 and beta2 to beta and omitting the line from the COV statement for psi12:

   title2 'Joreskog-Sorbom (1988) analysis 2';
   proc calis data=aspire edf=328;
      lineqs    /* measurement model for aspiration */
             rea=lambda2 f_ramb + e_rea,
             roa=f_ramb + e_roa,
             fea=lambda3 f_famb + e_fea,
             foa=f_famb + e_foa,
                /* structural model of influences */
             f_ramb=gam1 rpa + gam2 riq + gam3 rses +
                gam4 fses + beta f_famb + d_ramb,

             f_famb=gam8 fpa + gam7 fiq + gam6 fses +
                gam5 rses + beta f_ramb + d_famb;
      std d_ramb=psi11,
          d_famb=psi22,
          e_rea e_roa e_fea e_foa=theta:;
      cov rpa riq rses fpa fiq fses=cov:;
   run;

The results are displayed in Figure 14.19.

Peer Influences on Aspiration: Haller & Butterworth (1960)
Joreskog-Sorbom (1988) analysis 2

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

Fit Function 0.0820
Goodness of Fit Index (GFI) 0.9843
GFI Adjusted for Degrees of Freedom (AGFI) 0.9492
Root Mean Square Residual (RMR) 0.0203
Parsimonious GFI (Mulaik, 1989) 0.3718
Chi-Square 26.8987
Chi-Square DF 17
Pr > Chi-Square 0.0596
Independence Model Chi-Square 872.00
Independence Model Chi-Square DF 45
RMSEA Estimate 0.0421
RMSEA 90% Lower Confidence Limit .
RMSEA 90% Upper Confidence Limit 0.0710
ECVI Estimate 0.2839
ECVI 90% Lower Confidence Limit .
ECVI 90% Upper Confidence Limit 0.3592
Probability of Close Fit 0.6367
Bentler's Comparative Fit Index 0.9880
Normal Theory Reweighted LS Chi-Square 26.1595
Akaike's Information Criterion -7.1013
Bozdogan's (1987) CAIC -88.6343
Schwarz's Bayesian Criterion -71.6343
McDonald's (1989) Centrality 0.9851
Bentler & Bonett's (1980) Non-normed Index 0.9683
Bentler & Bonett's (1980) NFI 0.9692
James, Mulaik, & Brett (1982) Parsimonious NFI 0.3661
Z-Test of Wilson & Hilferty (1931) 1.5599
Bollen (1986) Normed Index Rho1 0.9183
Bollen (1988) Non-normed Index Delta2 0.9884
Hoelter's (1983) Critical N 338

Figure 14.19: Career Aspiration Data: J&S Analysis 2

Peer Influences on Aspiration: Haller & Butterworth (1960)
Joreskog-Sorbom (1988) analysis 2

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

roa = 1.0000   f_ramb + 1.0000   e_roa
rea = 1.0610 * f_ramb + 1.0000   e_rea
Std Err   0.0892   lambda2        
t Value   11.8923            
foa = 1.0000   f_famb + 1.0000   e_foa
fea = 1.0736 * f_famb + 1.0000   e_fea
Std Err   0.0806   lambda3        
t Value   13.3150            

 


Peer Influences on Aspiration: Haller & Butterworth (1960)
Joreskog-Sorbom (1988) analysis 2

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

f_ramb = 0.1801 * f_famb + 0.2540 * riq + 0.1637 * rpa + 0.2211 * rses + 0.0773 * fses + 1.0000   d_ramb
Std Err   0.0391   beta   0.0419   gam2   0.0387   gam1   0.0419   gam3   0.0415   gam4        
t Value   4.6031       6.0673       4.2274       5.2822       1.8626            
f_famb = 0.1801 * f_ramb + 0.0684 * rses + 0.3306 * fiq + 0.1520 * fpa + 0.2184 * fses + 1.0000   d_famb
Std Err   0.0391   beta   0.0387   gam5   0.0412   gam7   0.0364   gam8   0.0395   gam6        
t Value   4.6031       1.7681       8.0331       4.1817       5.5320            


Peer Influences on Aspiration: Haller & Butterworth (1960)
Joreskog-Sorbom (1988) analysis 2

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

Variances of Exogenous Variables
Variable Parameter Estimate Standard
Error
t Value
riq   1.00000    
rpa   1.00000    
rses   1.00000    
fiq   1.00000    
fpa   1.00000    
fses   1.00000    
e_rea theta1 0.33764 0.05178 6.52
e_roa theta2 0.41205 0.05103 8.07
e_fea theta3 0.31337 0.04574 6.85
e_foa theta4 0.40381 0.04608 8.76
d_ramb psi11 0.28113 0.04640 6.06
d_famb psi22 0.22924 0.03889 5.89

Covariances Among Exogenous Variables
Var1 Var2 Parameter Estimate Standard
Error
t Value
riq rpa cov1 0.18390 0.05246 3.51
riq rses cov3 0.22200 0.05110 4.34
rpa rses cov2 0.04890 0.05493 0.89
riq fiq cov8 0.33550 0.04641 7.23
rpa fiq cov7 0.07820 0.05455 1.43
rses fiq cov9 0.23020 0.05074 4.54
riq fpa cov5 0.10210 0.05415 1.89
rpa fpa cov4 0.11470 0.05412 2.12
rses fpa cov6 0.09310 0.05438 1.71
fiq fpa cov10 0.20870 0.05163 4.04
riq fses cov12 0.18610 0.05209 3.57
rpa fses cov11 0.01860 0.05510 0.34
rses fses cov13 0.27070 0.04930 5.49
fiq fses cov15 0.29500 0.04824 6.12
fpa fses cov14 -0.04380 0.05476 -0.80


The difference between the chi-square values for the two preceding models is 26.8987 - 26.6972= 0.2015 with 2 degrees of freedom, which is far from significant. However, the chi-square test of the restricted model (analysis 2) against the alternative of a completely unrestricted covariance matrix yields a p-value of 0.0596, which indicates that the model may not be entirely satisfactory (p-values from these data are probably too small because of the dependence of the observations).

Loehlin (1987) points out that the models considered are unrealistic in at least two aspects. First, the variables of parental aspiration, intelligence, and socioeconomic status are assumed to be measured without error. Loehlin adds uncorrelated measurement errors to the model and assumes, for illustrative purposes, that the reliabilities of these variables are known to be 0.7, 0.8, and 0.9, respectively. In practice, these reliabilities would need to be obtained from a separate study of the same or a very similar population. If these constraints are omitted, the model is not identified. However, constraining parameters to a constant in an analysis of a correlation matrix may make the chi-square goodness-of-fit test inaccurate, so there is more reason to be skeptical of the p-values. Second, the error terms for the respondent's aspiration are assumed to be uncorrelated with the corresponding terms for his friend. Loehlin introduces a correlation between the two educational aspiration error terms and between the two occupational aspiration error terms. These additions produce the following path diagram for Loehlin's model 1.

icaf5.gif (8703 bytes)

Figure 14.20: Path Diagram: Carrer Aspiration -- Loehlin

The statements for fitting this model are as follows:

   title2 'Loehlin (1987) analysis: Model 1';
   proc calis data=aspire edf=328;
      lineqs    /* measurement model for aspiration */
             rea=lambda2 f_ramb + e_rea,
             roa=f_ramb + e_roa,
             fea=lambda3 f_famb + e_fea,
             foa=f_famb + e_foa,
             /* measurement model for intelligence and environment */
             rpa=.837 f_rpa + e_rpa,
             riq=.894 f_riq + e_riq,
             rses=.949 f_rses + e_rses,
             fpa=.837 f_fpa + e_fpa,
             fiq=.894 f_fiq + e_fiq,
             fses=.949 f_fses + e_fses,
                /* structural model of influences */
             f_ramb=gam1 f_rpa + gam2 f_riq + gam3 f_rses +
                gam4 f_fses + bet1 f_famb + d_ramb,
             f_famb=gam8 f_fpa + gam7 f_fiq + gam6 f_fses +
                gam5 f_rses + bet2 f_ramb + d_famb;
      std d_ramb=psi11,
          d_famb=psi22,
          f_rpa f_riq f_rses f_fpa f_fiq f_fses=1,
          e_rea e_roa e_fea e_foa=theta:,
          e_rpa e_riq e_rses e_fpa e_fiq e_fses=err:;
      cov d_ramb d_famb=psi12,
          e_rea e_fea=covea,
          e_roa e_foa=covoa,
          f_rpa f_riq f_rses f_fpa f_fiq f_fses=cov:;
   run;

The results are displayed in Figure 14.21.

Peer Influences on Aspiration: Haller & Butterworth (1960)
Loehlin (1987) analysis: Model 1

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

Fit Function 0.0366
Goodness of Fit Index (GFI) 0.9927
GFI Adjusted for Degrees of Freedom (AGFI) 0.9692
Root Mean Square Residual (RMR) 0.0149
Parsimonious GFI (Mulaik, 1989) 0.2868
Chi-Square 12.0132
Chi-Square DF 13
Pr > Chi-Square 0.5266
Independence Model Chi-Square 872.00
Independence Model Chi-Square DF 45
RMSEA Estimate 0.0000
RMSEA 90% Lower Confidence Limit .
RMSEA 90% Upper Confidence Limit 0.0512
ECVI Estimate 0.3016
ECVI 90% Lower Confidence Limit .
ECVI 90% Upper Confidence Limit 0.3392
Probability of Close Fit 0.9435
Bentler's Comparative Fit Index 1.0000
Normal Theory Reweighted LS Chi-Square 12.0168
Akaike's Information Criterion -13.9868
Bozdogan's (1987) CAIC -76.3356
Schwarz's Bayesian Criterion -63.3356
McDonald's (1989) Centrality 1.0015
Bentler & Bonett's (1980) Non-normed Index 1.0041
Bentler & Bonett's (1980) NFI 0.9862
James, Mulaik, & Brett (1982) Parsimonious NFI 0.2849
Z-Test of Wilson & Hilferty (1931) -0.0679
Bollen (1986) Normed Index Rho1 0.9523
Bollen (1988) Non-normed Index Delta2 1.0011
Hoelter's (1983) Critical N 612

Figure 14.21: Career Aspiration Data: Loehlin Model 1

Peer Influences on Aspiration: Haller & Butterworth (1960)
Loehlin (1987) analysis: Model 1

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

riq = 0.8940   f_riq + 1.0000   e_riq
rpa = 0.8370   f_rpa + 1.0000   e_rpa
rses = 0.9490   f_rses + 1.0000   e_rses
roa = 1.0000   f_ramb + 1.0000   e_roa
rea = 1.0840 * f_ramb + 1.0000   e_rea
Std Err   0.0942   lambda2        
t Value   11.5105            
fiq = 0.8940   f_fiq + 1.0000   e_fiq
fpa = 0.8370   f_fpa + 1.0000   e_fpa
fses = 0.9490   f_fses + 1.0000   e_fses
foa = 1.0000   f_famb + 1.0000   e_foa
fea = 1.1163 * f_famb + 1.0000   e_fea
Std Err   0.0863   lambda3        
t Value   12.9394            

 


Peer Influences on Aspiration: Haller & Butterworth (1960)
Loehlin (1987) analysis: Model 1

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

f_ramb = 0.1190 * f_famb + 0.1837 * f_rpa + 0.2800 * f_riq + 0.2262 * f_rses + 0.0870 * f_fses + 1.0000   d_ramb
Std Err   0.1140   bet1   0.0504   gam1   0.0614   gam2   0.0522   gam3   0.0548   gam4        
t Value   1.0440       3.6420       4.5618       4.3300       1.5884            
f_famb = 0.1302 * f_ramb + 0.0633 * f_rses + 0.1688 * f_fpa + 0.3539 * f_fiq + 0.2154 * f_fses + 1.0000   d_famb
Std Err   0.1207   bet2   0.0522   gam5   0.0493   gam8   0.0674   gam7   0.0512   gam6        
t Value   1.0792       1.2124       3.4205       5.2497       4.2060            


Peer Influences on Aspiration: Haller & Butterworth (1960)
Loehlin (1987) analysis: Model 1

The CALIS Procedure
Covariance Structure Analysis: Maximum Likelihood Estimation

Variances of Exogenous Variables
Variable Parameter Estimate Standard
Error
t Value
f_rpa