Figure 1: Each point shows an individual RIL’s mean pollinator visits on a log(visits + 0.2) scale. Flower colors means and 95% confidence intervals are plotted as thick black bars, with a dashed line connecting the group means for comparison.
Here’s a brief refresher of our previously introduced standard summaries of associations between a categorical explanatory variable and a continuous response.
Because we are calculating statistics on transformed data, we should summarise the transformed data. Later in this section we will learn how to back-transform.
Estimating summaries of each group:
We can summarise within group means, and variances (or even 95% CI if we want etc) as we saw in the previous chapter. (Some of) these high-level summaries are presented in Figure 1 and calculated below:
We would also like to summarise the data jointly, including the variance, the difference in groups means, and a standardized summary of this difference:
The pooled variance: To both estimate the effect size as Cohen’s D, and estimate uncertainty we need to calculate the variance. But we have two groups, so we need something like “the average variance within each group.” The pooled variance, \(s^2_p\) – the variance in each group weighted by their degrees of freedom and divided by the total degrees of freedom is this average (see margin for hand calculation).
The difference in means: To find this simply subtract one from the other: \(\text{mean}\_\text{diff}= 0.366 - (-0.455)= 0.811\).
Cohen’s D as the difference in means weighted by the pooled standard deviation. Cohen’s D \(=\frac{\text{mean diff}}{s_p}\)\(=\frac{0.811}{\sqrt{0.693}}\)\(= 0.974\). This is a large effect size!!!
We can calculate these global summaries from summaries of each petal color (above):