OBJECTIVES: This lab is designed to introduce you to Minitab and show you how to calculate basic descriptive statistics & generate some statistical plots.
DIRECTIONS: Follow the instructions below,
answering all questions. Your answers should be in the form of a brief
report (Word), to be handed in to the instructor before you leave.
__________________________________________________________________________________________________________
__________________________________________________________________________________________________________
1.)
Data Description:
In 1798, H. Cavendish set out to determine the density of the earth relative to that of water, using a torsion balance. He conducted 29 very precise experiments assessing this relative density. A working assumption to begin with is that the values of his measurements were taken from a normal distribution centered around the true relative density. The data from all 29 experiments are given below (from Stigler, S., "Do Robust Estimators Work with Real Data?", Annals of Statistics, 5, 1977).
5.50 5.55 5.57 5.42 5.30 5.34
5.61 5.36 5.53 5.79 5.47 5.75
4.88 5.29 5.62 5.10 5.63 5.68
5.07 5.58 5.29 5.27 5.34 5.85
5.26 5.65 5.44 5.39 5.46
a.) Enter the above data in column one & name it CAVEND.
b.) Construct a boxplot of the data.
c.) Construct a histogram with a suitable binwidth.
d.) Based on the two plots above, does the
data seem symmetric or skewed? Also , comment on the modality.
Are there outliners in the data?
e.) Guesstimate the sample mean & standard deviation from the histogram.
f.) Calculate & record the basic summary statistics, such as the mean, standard deviation, quartiles, minimum, and maximum.
g.) Find the proportion of the observations
within plus or minus one standard deviation of the mean.
Repeat for plus or minus two standard
deviations. Compare with the "standard deviation rule of thumb" from
the text.
__________________________________________________________________________________________________________
__________________________________________________________________________________________________________
2.)
a.) Download cervical data.
b.) Perform the same analyses (parts b. - g. above) on the second column of actual data ("Fluor2").
c.) Apply both a logarithmic and fourth-root
transformation to the cervical cancer data,
(again, just the second column) and repeat
parts b.-d. from number 1.
(The 4th root transformation can be
performed using the calculator feature, with the
expression: "[column name]**0.25").
d.) How did the transformations affect the
histograms?