• Welcome to MEDVERSATION®
  • Log InREGISTER
  • SITE HELP
  • MEDVERSATION® is brought to you by Centocor Ortho Biotech Inc.
Introduction to Statistics
Outline Show

Testing Signifigance and Strength of Relationship in Simple Regression

Although you can do a test of significance on the Pearson correlation (see  p. 58) to determine if there is a relationship between the IVs and DVs, this significance testing commonly comes about in a different way. Frequently, the computer printout will list a table of numbers with headings like those in Fig.1175. The coefficients are the intercept (constant), which equals 98.0 and can be identified on the graph. Similarly, the second line of the table is the slope of the regression line, which equals 1.14. The computer also calculates the standard error (SE) of these estimates, using complicated formulas. The t test is the coefficient divided by its SE, with n–2 degrees of freedom (where n is the sample size), and the significance level follows.

Figure 1171 – Figure 6-1: Relationship between blood sugar and skinfold.

VIEW LARGER IMAGE

5343

Usually, the computer also prints out an “ANOVA table.” But, you say, “I thought we were doing regression, not ANOVA.” We have already drawn the parallel between regression and ANOVA in the previous section. These Sums of Squares end up in the ANOVA table as shown in Fig.1177. If you take the square root of the F ratio, it equals exactly the t value calculated earlier (as it should because it is testing the same relationship).

Figure 1175 – Table 6-1: Output from Regression Analysis


5343

Figure 1177 – Table 6-2: ANOVA Table from Regression Analysis


5343

Finally, the strength of relationship could then be expressed as the ratio of Sum of Squares (SS) (regression) to [SS (regression) + SS (residual)], expressing the proportion of variance accounted for by the IV. In fact, the square root usually is used and is called a Pearson correlation coefficient.

Figure 1448 –


5343

So, in the present example, the correlation is:

Figure 1451 –


5343

We also could have tested significance of the relationship directly by looking up significance levels for different values of the correlation coefficient and different sample sizes. This is, of course, unnecessary at this point.

We can interpret all this graphically by referring back to Fig.1171. In general, the individual data points constitute an ellipse around the fitted line. The correlation coefficient is related to the length and width of the ellipse. A higher correlation is associated with a thinner ellipse and better agreement between actual and predicted values.5343 

Content on this page was last changed on March 19, 2009.

© 2002 BC Decker Inc. Show Disclaimer

References:

5343.  Norman GR, Streiner DL. PDQ Statistics . 3rd ed. Hamilton, Ontario: BC Decker Inc.; 2003.

Next Page: Two or More Independent Variables: Multiple Regression »

Last Complete Site Update On: August 16, 2010