Lab 3:  Regression and Correlation



OBJECTIVES:    This lab is designed to show you how to analyze the relationship between two variables using correlation and regression.  Further investigation into the regression portion of the lab will involve comparing the effects of a linear fit vs. a quadratic fit, as well as the effects of transforming the response variable.

DIRECTIONS:    Follow the instructions below, answering all questions. Your answers for each of the questions, including output and any plots, should be summarized in the form of a brief report (Word), to be handed in to the instructor before 1:00pm Friday Oct. 5.
__________________________________________________________________________________________________________
__________________________________________________________________________________________________________

Because there will be quite a few plots generated here, you may want to title each of your plots appropriately, taking into account what type of fit ( linear or quadratic) you're using, whether the data is transformed or not, etc.!).
 

1.)    Correlation . . .


a.)    Download the "energy.mtw" Minitab data worksheet from the web.  (On the Stat 280 home page, under the "Lab Assignments" section!! ;)  ).

b.)    Produce a scatter plot of Energy Consumption vs. Machine Settings ("Graph-> Character Graphs-> Scatter Plot") and comment on the relationship between the two variables.  Can you detect any strong correlation between the two variables?

c.)    Now, determine the actual correlation between Energy Consumption and Machine Settings.
        (Hint:    Minitab can do this very quickly if you take a look under Stat/Basic Statistics/Correlation . . .)


__________________________________________________________________________________________________________
__________________________________________________________________________________________________________
 

2.)    Regression . . .  A Linear Fit . . .
 


a.)    We will now use regression to analyze how the response variable "Energy Consumption" changes as the explanatory variable, "Machine Setting" changes.

b.)    Plot the simple linear regression line for this data.  ("Stat->Regression->Fitted Line Plot").  Before doing the regression, be sure to store the resulting residuals, fits, and coefficients of this first fit in your worksheet.  (Hint:    Try the "Storage" option!)
 

c.)    Note the results in the session window, as well as the least squares line fitted to the energy data.


d.)    Generate a plot of the residuals vs. the explanatory variable (Machine Settings) for this data.  (Hint:  Look under Stat/Regression/Regression, and investigate the "Graphs" options - choose the appropriate plot accordingly . . .).


__________________________________________________________________________________________________________
__________________________________________________________________________________________________________
 

3.)     Regression . . .  A Quadratic Fit  . . .

a.)    Now, apply a quadratic fit to this data, (See question 2.b.) above if you don't recall how to get to this and check the option "quadratic" instead of "linear" this time) and again perform a regression.  Again, be sure to store the resulting residuals, fits, and coefficients of this second fit in your worksheet before doing the regression.  Also, be sure you have your all results from the linear fit above for comparison!
 

b.)    Note the results in the session window, as well as the quadratic fit of the energy data.


c.)    Now, apply a log (base 10) transformation to the response variable (Energy Consumption).
           (Hint:    This can easily be done if you again look under Stat/Regression/Fitted Line Plot, and look into the "Options" feature!  You don't have to display the data on a log scale , although feel free to see how the plot looks on that scale if you'd like).