3.2 trees
data
trees
contains data on the volume, height and girth (diameter) of 31 felled black cherry trees; girth is measured four feet six inches above ground. The problem is to find a simple linear model for predicting volume from height and girth.
pairs(trees, panel = panel.smooth)
pairs(log(trees), panel = panel.smooth)
coplot
generates conditioning plots, in which the relationship between two variables is displayed conditional on subsets of values of other variables. This is useful to see if the
relationship is stable over the range of other variables. To assess this for the relationship
of log volume and log girth, conditional on height:
attach(trees)
coplot(log(Volume) ~ log(Girth) | Height, panel = panel.smooth)
Try this on the orginal scale also. For an initial fit, we take a linear model and assess model fit using diagnostic plots:
glm(Volume ~ Girth + Height)
fit <-summary(fit)
plot.glm.diag(fit)
What do you make of the fit?
To assess the possibility of transformation:
boxcox(fit)
Both \(\lambda = 1\) and \(\lambda = 0\) lie outside the confidence interval, though the latter is better supported.
One possibility is to take \(\lambda = 1/3\), corresponding to response \(\texttt{Volume}^{1/3}\). What
transformations for Girth
and Height
are then needed for dimensional compatibility?
Fit this model, give interpretations of the parameter estimates, and discuss its suitability.
An alternative is to suppose that a tree is conical in shape, in which case \[\texttt{Volume} \propto \texttt{Height} \times \texttt{Girth}^2.\] Equivalently, we fit
glm(log(Volume) ~ log(Girth) + log(Height))
fit <-summary(fit)
plot.glm.diag(fit)
Are the parameter estimates consistent with this model? Does it fit adequately? What advantage has it over the others for prediction of future volumes?
(Chapter 8; Atkinson, 1985, p. 63)