Extending the Four Basic Designs

Prof Randi Garcia
April 4, 2018

Reading Free-Write (5 minutes)

People with severe psychiatric problems find it harder to focus their attention; this theory predicts that psychiatrically normal subjects would be less affected than others by distracting sounds while they tried to remember things. In an experiment, participants were classified as 1) schizophrenic, 2) schizotypal, 3) borderline, or 4) psychiatrically normal. In each of a series of trials, subjects heard a voice read a series of digits and were asked to repeat them back. The digit strings were either 3, 4, or 7 digits long, and the trials either had a distracting male voice between the female voice reading the letters or it did not have interference. One response was defined by lumping together all the trials of a given type, such as four digits/no interference, and adding up the number of time the subject remembered the string correctly.

Announcements

  • Adriana Colom Cruz, Lab Instructor candidate, talk/demo today 12:00-1:00p in Bass 103 (Pizza will be served!)
  • Women in Statistics and Data Science Conference
    • Submit and anstract by April 19th
    • Conference is October 18-October 20 in Cincinnati, Ohio
    • SDS can maybe give you travel money!
  • HW 7 is due today at 5pm
  • Project data collection should start soon!

Agenda

  • Split plot designs
  • Analysis in R
  • Extending designs by factorial crossing

Formal ANOVA for the Latin Square

\[ {y}_{ijk}={\mu}+{\alpha}_{i}+{\beta}_{j}+{\tau}_{k}+{e}_{ijk} \]

  • \( {\mu} \) is the benchmark
  • \( {\alpha}_{i} \) is the row effect
  • \( {\beta}_{j} \) is the column effect
  • \( {\tau}_{k} \) is the treatment effect
  • There are p rows, columns, and treatments
Source SS df MS F
rows \( \sum_{i=1}^{p}p(\bar{y}_{i..}-\bar{y}_{...})^{2} \) \( p-1 \) \( \frac{{SS}_{A}}{{df}_{A}} \) \( \frac{{MS}_{A}}{{MS}_{E}} \)
columns \( \sum_{j=1}^{p}p(\bar{y}_{.j.}-\bar{y}_{...})^{2} \) \( p-1 \) \( \frac{{SS}_{B}}{{df}_{B}} \) \( \frac{{MS}_{B}}{{MS}_{E}} \)
treatment \( \sum_{k=1}^{p}p(\bar{y}_{..k}-\bar{y}_{...})^{2} \) \( p-1 \) \( \frac{{SS}_{T}}{{df}_{T}} \) \( \frac{{MS}_{T}}{{MS}_{E}} \)
Error \( \sum_{i=1}^{p}\sum_{j=1}^{p}({y}_{ijk}-\bar{y}_{i..}-\bar{y}_{.j.}-\bar{y}_{..k}+2\bar{y}_{..})^{2} \) \( (p-1)(p-2) \) \( \frac{{SS}_{E}}{{df}_{E}} \)

Residuals in the LS design (redux)

Split Plot/Repeated Measures Design

  • Can use split plot language if blocking is created by sud-dividing blocks (whole plot and subplot factors)
  • We can use the repeated measures language if blocking is created by reusing subjects/material (within and between subjects factors)
  • We can always use the terms blocks and thus, between-blocks and within-blocks factors

Split Plot Design

If you suspect a design in a split-plot design, you should be able to answer the following questions:

  1. What are the whole plots or blocks? That is, what is the nuisance factor?
  2. What is the between-blocks factor? Is it observational or experimental?
  3. What is the within-blocks factor? Is it observational or experimental?

Example from HW7: Parsnip Plants

Under the control conditions of this study, wild parsnip plants averaged about a thousand seeds from their first set of flowers (primary umbels), about twice as many from the second set of flowers, but only about 250 from the third set. For plants attacked by the parsnip webworm, which destroyed most of the primary umbels, the pattern was quite different: the seed production from primary, secondary, and tertiary umbels averaged about 200, 2400, and 1300, respectively.

Diabetic Dogs

The disease diabetes affects the rate of turnover of lactic acid in a system of biochemical reactions called the Cori cycle. This experiment compares two methods of using radioactive carbon-14 to measure rate of turnover. Method 1 is injection all at once, and method 2 is infused continuously. 10 dogs were sorted into two groups, 5 were controls and 5 had their pancreas removed (to make it diabetic). The rate of turnover was then measured twice for each dog, once for each method. The order of the two methods was randomly assigned.

Draw the factor diagram for the data on page 263. (ignore the ordering for now)

Crossing versus Nesting

  1. Crossing: Two sets of treatments are crossed if all possible combinations of treatments occur in the design. The design is called a two-way factorial and has factorial treatment structure.
  2. Nesting: One factor is nested within another if each level of the first (“inside”) factor occurs with exactly one level of the second (“outside”) factor.

Algebraic Notation for the Spilt Plot Design

\[ {y}_{ijk}={\mu}+{\alpha}_{i}+{\beta}_{j(i)}+{\gamma}_{k}+({\alpha\gamma})_{ik}+{e}_{ijk} \]

  • \( {\mu} \) is the benchmark
  • \( {\alpha}_{i} \) effect of level i of the between-blocks factor, \( i \) from \( 1 \) to \( a \)
  • \( {\beta}_{j(i)} \) effect of block \( j \) (for level \( i \) of the within block factor), \( j \) from \( 1 \) to \( n \)
  • \( {\gamma}_{k} \) effect of level \( k \) of the within-block factor, \( k \) from \( 1 \) to \( t \)
  • \( ({\alpha\gamma})_{ik} \) interaction effect for level \( i \) of the between-blocks factor with level \( k \) of the within-blocks factor

Formal ANOVA for the Spilt Plot Design

Source SS df MS F
Between \( \sum_{i=1}^{a}tn(\bar{y}_{i..}-\bar{y}_{...})^{2} \) \( a-1 \) \( \frac{{SS}_{A}}{{df}_{A}} \) \( \frac{{MS}_{A}}{{MS}_{B}} \)
Blocks \( t\sum_{i=1}^{a}\sum_{j=1}^{n}(\bar{y}_{ij.}-\bar{y}_{i..})^{2} \) \( N-a \) \( \frac{{SS}_{B}}{{df}_{B}} \) \( \frac{{MS}_{B}}{{MS}_{E}} \)
Within \( \sum_{k=1}^{t}N(\bar{y}_{..k}-\bar{y}_{...})^{2} \) \( t-1 \) \( \frac{{SS}_{T}}{{df}_{T}} \) \( \frac{{MS}_{T}}{{MS}_{E}} \)
Interaction \( \sum_{i=1}^{a}\sum_{k=1}^{t}n(\bar{y}_{i.k}-\bar{y}_{i..}-\bar{y}_{..k}+\bar{y}_{...})^{2} \) \( (a-1)(t-1) \) \( \frac{{SS}_{AT}}{{df}_{AT}} \) \( \frac{{MS}_{AT}}{{MS}_{E}} \)
Error \( \sum_{i=1}^{a}\sum_{j=1}^{n}\sum_{k=1}^{t}({y}_{ijk}-\bar{y}_{i.k}-\bar{y}_{ij.}+\bar{y}_{i..})^{2} \) \( (N-a)(t-1) \) \( \frac{{SS}_{E}}{{df}_{E}} \)

Extensions by Factorial Crossing

We can now imagine adding complexity to these four basic designs by including additional factors crossed with our structural factors.

Take our diabetic dogs example, and now let us add in the fact that the order of the two methods was randomly assigned. What design do we have now?
- We have an order factor and there are two levels: order 1 and order 2
- The new design is a SP/RM[2,1]

Example

The purpose of this experiment was to study the way one species of crabgrass competed with itself and with another species for nitrogen (N), phosphorus (P), and potassium (K). Bunches of crabgrass were planted in vermiculite, in 16 Styrofoam cups; after the seeds head srouted, the plants were thinned to 20 plants per cup. Each of the 16 cups were randomly assigned to get one of 8 nutrient combinations added to its vermiculite. For example, yes-nitrogen/no-phosphorus/yes-potassium. The response is mean dry weight per plant, in milligrams.

Example

Worms that live at the mouth of a river must deal with varying concentrations of salt. Osomoregulating worms are able to maintain reltaively constant concentration of salt in the body. An experiment wanted to test the effects of mixtures of salt water on two species of worms: Nereis virens (N) and Goldfingia gouldii (G). Eighteen worms of each species were weighted, then randomly assigned in equal numbers to one of three conditions. Six worms of each kind were placed in 100% sea water, 67% sea water, or 33% sea water. The worms were then weighted after 30, 60, and 90 minutes, then placed in 100% sea water and weighted one last time 30 minutes later. The response was body weight as percentage of initial body weight.

Analysis in R

Compound within Block Factors

In an experiment, researchers wanted to compare how easy it is to remember four different kinds of words: 1) concrete, frequent: fork, brtoher, radio,… 2) concrete, infrequent: blimp, warthog, fedora, … 3) abstract, frequent: truth, anger, foolishness, … and 4) abstract, infrequent: slot, vastness, apostasy, …

Ten students in a pscyhology lab served as subject. During each of the 4 time slots, subjects heard a list of words from one of the four kinds, and then was tested for recall.

Compound within Block Factors

There are two possible models for chance error in models with compound within-block factors.

  1. The additive model
  2. The non-additive model

Compound within Block Factors

  1. The additive model - assumes that chance error is the same for all within-block factors, thus we could pool residual terms.
  2. The non-additive model - does not make this (often incorrect) assuption, but tests using this model are lower in power.

How can we decide?

  • Think about whether or not you would expect block X treatment interaction effects. If you would, then the additive model will be wrong.

Rule for Compound within Block F-ratios (non-additive)

\[ F = \frac{{MS}_{Factor}}{{MS}_{Blocks\times Factor}} \]

Rule for Compound between Block F-ratios

\[ F = \frac{{MS}_{Factor}}{{MS}_{Blocks}} \]

Analysis in R