BioPharmNet
Topics
BioPharmNet forum
 
Navigation
Book review
Review of Statistical Monitoring of Clinical Trials: A Unified Approach by Michael A. Proschan, K.K. Gordon Lan, Janet Turk Wittes [Springer, New York, 2006]
Written by Alex Dmitrienko
The book by Proschan, Lan and Wittes is a welcome addition to the recently published books on sequential clinical trials. The books in this area tend to fall in the theoretical or practical/case study categories. The books by Whitehead (1997), Jennison and Turnbull (2000) are examples of theoretical books with a textbook flavor. On the other hand, Ellenberg, Fleming and DeMets (2002) and DeMets, Furberg and Friedman (2005) focus on operating principles/case studies/lessons learned in clinical trials with interim assessments. Proschan, Lan and Wittes (2006) addresses both methodological and practical aspects of sequential monitoring of clinical studies by including a detailed description of relevant statistical methods, numerous case studies, numerical examples and advice from experts.

The book begins with a description of the underlying mathematical framework. This framework is built around Brownian motion and helps the authors develop a unified approach to virtually all types of clinical trials, including trials with continuous, binary and survival outcome variables, repeated measures, etc. The authors describe "classical" group-sequential methods in Chapters 3-7, related topics in Chapters 8-10 and a more advanced topic (sample size modification) in Chapter 11. In what follows, I will provide a short outline of each chapter with emphasis on points that I found particularly interesting.

Chapters 3, 4 and 5 form the backbone of the book and introduce key concepts such as conditional power, stopping rules for benefit, harm and futility, and error spending. The theory and applications of conditional power are covered in Chapter 3. The authors discuss the standard frequentist definition and a Bayesian-type extension that relies on averaging the conditional power over the posterior distribution of the treatment difference (predictive power approach). Chapter 4 deals with traditional group-sequential designs based on Pocock and O'Brien-Fleming boundaries and Chapter 5 describes the error spending function approach introduced in the seminal paper by Lan and DeMets (1983). It is important to note that the authors utilize error spending functions to both design and execute group-sequential trials. This approach enables a seamless transition between the design and monitoring stages and is more natural/efficient than an alternative approach that begins with a traditional group-sequential design and then relies on an approximate error spending function during the monitoring stage. Chapter 5 also contains an interesting discussion of benefit and harm/futility stopping rules in sequential trials. The authors discuss the inherent asymmetry of benefit and harm stopping boundaries (a negative treatment difference does not have to be statistically significant to justify an early termination of the clinical trial) and the relationship between benefit and futility stopping rules. They recommend computing the benefit and futility boundaries in an independent manner to give data monitoring committees (DMCs) more flexibility with respect to futility stopping. One exception to this rule is the comparison of two active treatments. Another exception that comes to mind is data monitoring in mortality clinical trials (in this case, it seems prudent to make futility stopping mandatory to avoid exposing patients to an inefficient treatment).

Chapter 6 discusses sequential monitoring in clinical trials with time-to-event outcomes, for example, survival trials. It includes a good discussion of issues specific to survival monitoring such as the computation of information fractions and analysis of non-proportional hazards. Chapter 7 describes adjusted inferences at the last look (bias-corrected point estimates, confidence intervals and p-values) that account for the sequential nature of the trial. Chapter 8 covers statistical methods that can be considered when the Brownian motion-based framework becomes unreliable (for instance, permutation methods in small samples).

Chapter 9 touches upon a topic which is often downplayed in statistical papers on trials with sequential designs --- monitoring of safety data. The chapter describes more formal safety decision rules that are similar to decision rules for efficacy variables and, in addition, provides a summary of recommendations that come in handy when a DMC reviews adverse events (for example, approaches to adverse event classification, including hierarchical classification schemes).

Chapter 10 briefly reviews Bayesian approaches to sequential data monitoring, including a Bayesian formulation of the frequentist decision rules described earlier in the book. The chapter also discusses the selection of prior distributions in Bayesian stopping rules and explores connections between frequentist and Bayesian monitoring in sequential trials.

Chapter 11 deals with a topic that has attracted much attention in recent years --- adaptive clinical trials. Specifically, it discusses sample size reassessment at an interim analysis based on updated estimates of nuisance parameters (for example, the variance of a continuous outcome variable) or an updated estimate of the treatment effect. The former case is quite straightforward whereas the latter one has caused some controversy. The authors should be commended for giving a detailed and objective assessment of available options in this important area of research and providing practical advice.

The last chapter (Chapter 12) quickly reviews topics not covered in the book. This includes, among other things, multiplicity-adjusted confidence intervals for the true treatment difference computed at each interim look (known as repeated confidence intervals). The authors argue that repeated confidence intervals are not generally informative and I agree that these confidence intervals are inferior to bias-adjusted confidence intervals derived at the last look. However, DMC members may find repeated confidence intervals useful in benefit/risk assessments when it is important to understand the likely magnitude of clinical benefit given the interim results.

Finally, the two appendices give a crash course in survival analysis and information on group-sequential software.

One of the attractive features of the book is an impressive collection of case studies. Examples of real clinical trial are used throughout the book to illustrate properties of statistical approaches discussed in each chapter and get important points across to the reader. To give an example, the CAST I study is first mentioned in the introduction and then reappears in the discussion of futility rules based on conditional power, benefit and harm error rates in sequential clinical trials and safety monitoring. Several case studies in this book are really thought-provoking and help the reader appreciate how much work goes into the design and monitoring of sequential trials (CAST I, CAST II, RALES and other trials). These examples make it clear that there are no cookie-cutter solutions in the world of sequential clinical trials.

Numerical examples are included in almost every section. They range from fairly basic examples such as the computation of conditional power (Chapter 3) to more complex ones arising in challenging statistical problems such as the derivation of bias-adjusted estimates of the treatment effect and confidence limits following a group sequential trial (Chapter 7). All of these examples include detailed instructions and will be appreciated by practitioners. It is worth noting that most of numerical examples involve straightforward calculations and reinforce an important point: sequential monitoring of clinical studies does not always require specialized software and in many cases calculations can be done on a pocket calculator. The book also features software-driven numerical examples; however, all of these examples rely on open-source software freely available on the Internet. This includes the R language and software developed by David Reboussin, David DeMets, KyungMann Kim and Gordon Lan (http://www.biostat.wisc.edu/landemets/).

My last comment pertains to the use of graphical displays in this book. Effective graphical displays are known to be helpful when one is dealing with complex concepts and Figures 3.3 and 3.6 serve as excellent examples of plots that are worth a thousand formulas. The figures give an elegant geometric interpretation of conditional power. It literally takes a few seconds to understand key properties of conditional power, for example, the relationship between conditional power and assumptions about the distribution of future data in a clinical trial (conditional power under the alternative hypothesis of a positive treatment effect is higher than that based on the null hypothesis). The reader will find quite a few other helpful plots and diagrams throughout the book.

The new book gives an excellent overview of issues related to the design and conduct of sequential clinical trials. Researchers working in this area will find this comprehensive book very useful.
References
DeMets DL, Furberg CD, Friedman LM (editors). Data Monitoring in Clinical Trials: A Case Studies Approach. Springer, New York, 2005.
Ellenberg SS, Fleming TR, DeMets DL. Data Monitoring Committees in Clinical Trials: A Practical Perspective. Wiley, New York, 2002.
Jennison C, Turnbull BW. Group Sequential Methods with Applications to Clinical Trials. Chapman and Hall/CRC Press, London/Boca Raton, FL, 2000.
Lan KKG, DeMets DL. Discrete sequential boundaries for clinical trials. Biometrika. 1983; 70:659-63.
Whitehead J. The Design and Analysis of Sequential Clinical Trials (Second edition). Wiley, London, 1997.