CRAN Task View: Design of Experiments (DoE) & Analysis of Experimental Data

Maintainer:Ulrike Groemping
Contact:groemping at bht-berlin.de
Version:2009-09-25

This task view collects information on R packages for experimental design and analysis of data from experiments. Please feel free to suggest enhancements, and please send information on new packages or major package updates if you think they belong here. Contact details are given on my Web page .

Experimental design is applied in many areas, and methods have been tailored to the needs of various fields. This task view starts out with a section on the most general packages, continues with specific sections on agricultural and industrial experimentation and closes with a section on the various special experimental design packages that have been developed for specific purposes. Of course, the division into fields is not always clear-cut, and some packages from the more specialized sections can also be applied in general contexts.

Experimental designs for general purposes

There are a few packages for creating and analyzing experimental designs for general purposes: First of all, the standard (generalized) linear model functions in the base package stats are of course very important for analyzing data from designed experiments (especially functions lm(), aov() and the methods and functions for the resulting linear model objects). These are concisely explained in Kuhnert and Venables (2005, p. 109 ff.); Vikneswaran (2005) points out specific usages for experimental design (using function contrasts(), multiple comparison functions and some convenience functions like model.tables(), replications() and plot.design()). granova offers some interesting non-standard graphical representations for results of simply-structured experiments (one-way and two-way layouts, paired data). AlgDesign creates full factorial designs with or without additional quantitative variables, creates mixture designs (i.e., designs where the levels of factors sum to 1=100%) and creates D-, A-, or I-optimal designs exactly or approximately. NOTE: Bob Wheeler, the author and maintainer of AlgDesign, would like to retire from this job and is looking for an "heir" whom he can entrust with continuing the package. Please contact Bob, if you are interested. Package conf.design allows to create a design with certain interaction effects confounded with blocks (function conf.design()) and allows to combine existing designs in several ways (e.g., useful for Taguchi's inner and outer array designs in industrial experimentation). blockTools assigns units to blocks in order to end up with homogeneous sets of blocks in case of too small block sizes.

Experimental designs for agricultural and plant breeding experiments

agricolae offers extensive functionality on experimental design especially for agricultural and plant breeding experiments, which can also be useful for other purposes. It supports planning of lattice designs, factorial designs, randomized complete block designs, completely randomized designs, (Graeco-)Latin square designs, balanced incomplete block designs and alpha designs. There are also various analysis facilities for experimental data, e.g. treatment comparison procedures and several non-parametric tests, but also some quite specialized possibilities for specific types of experiments.

Experimental designs for industrial experiments

Some further packages especially handle designs for industrial experiments that are often highly fractionated, intentionally confounded and have few extra degrees of freedom for error.

A suite of related packages (cf. Groemping 2009), DoE.base, FrF2 and DoE.wrapper, works with the same principal syntax and output structure, for which a central feature is the class design for experimental design objects. A fourth package, RcmdrPlugin.DoE (beta version!), integrates the functionality from these three packages into the R-Commander (package Rcmdr, Fox 2005) for the benefit of those R users who cannot or do not want to do command line programming.

DoE.base provides full factorial designs and some orthogonal arrays for main effects experiments (not very many yet) and provides the infrastructure for the other two packages of the suite, including a class design for experimental design data frames, and various methods for this class. FrF2 generates regular Fractional Factorial designs for factors with 2 levels as well as Plackett-Burman type screening designs. Regular fractional factorials default to maximum resolution minimum aberration designs and can be customized in various ways, supported by an incorporated catalogue of designs (including the designs catalogued by Chen, Sun and Wu 1993). Analysis-wise, FrF2 provides simple graphical analysis tools (normal and half-normal effects plots (modified from BsMD, cf. below), main effects plots and interaction plot matrices similar to those in Minitab software, and a cube plot for the combinations of three factors). It can also show the alias structure for regular fractional factorials of 2-level factors. DoE.wrapper brings the functionality from packages rsm and lhs (cf. also below) into the context of the other two packages and adds convenience features and other possibilities (especially regarding automatic choice and augmentation of the cube portion of central composite designs). It is planned (but may take some time) to also include AlgDesign into this package.

BHH2 accompanies the 2nd edition of the book by Box, Hunter and Hunter and provides various of its data sets. It can generate full and fractional factorial two-level-designs from a number of factors and a list of defining relations (function ffDesMatrix(), less comfortable than package FrF2). It also provides several functions for analyzing data from 2-level factorial experiments: The function anovaPlot assesses effect sizes relative to residuals, and the function lambdaPlot() assesses the effect of Box-Cox transformations on statistical significance of effects. BsMD provides Bayesian charts as proposed by Box and Meyer (1986) as well as effects plots (normal, half-normal and Lenth) for assessing which effects are active in a fractional factorial experiment with 2-level factors. Apart from tools for planning and analysing factorial designs, R also offers support for response surface optimization via package rsm. This package supports sequential optimization with first order and second order response surface models, offering optimization approaches like steepest ascent and visualization of the response function for linear model objects. Also, coding for response surface investigations is facilitated.

Experimental designs for special purposes

Various further packages handle special situations in experimental design: lhs provides latin hypercube designs which are especially useful for computer experimentation whenever changing levels is cheap so that factors can have many different levels. Furthermore, the package provides ways to analyse such computer experiments with emphasis on what follow-up experiments to conduct. desirability provides ways to combine several target criteria into a desirability function in order to simplify multi-criteria analysis. experiment contains tools for clinical experiments, e.g., a randomization tool, and it provides a few special analysis options for clinical trials. ldDesign suggests appropriate designs for linkage equilibrium studies, qtlDesign offers designs for quantitative trait locus experiments. crossdes creates and analyses cross-over designs of various types (including latin squares, mutually orthogonal latin squares and Youden squares) that can for example be used in sensometrics. Package SensoMineR contains special designs for sensometric studies, e.g., for the triangle test.

References

CRAN packages:

Related links: