Analyzing self-controlled case series data when case confirmation rates are estimated from an internal validation sample

Stanley Xu, Christina L. Clarke, Sophia R. Newcomer, Matthew F. Daley, Jason M. Glanz

Research output: Contribution to journalArticlepeer-review

2 Scopus citations


Vaccine safety studies are often electronic health record (EHR)-based observational studies. These studies often face significant methodological challenges, including confounding and misclassification of adverse event. Vaccine safety researchers use self-controlled case series (SCCS) study design to handle confounding effect and employ medical chart review to ascertain cases that are identified using EHR data. However, for common adverse events, limited resources often make it impossible to adjudicate all adverse events observed in electronic data. In this paper, we considered four approaches for analyzing SCCS data with confirmation rates estimated from an internal validation sample: (1) observed cases, (2) confirmed cases only, (3) known confirmation rate, and (4) multiple imputation (MI). We conducted a simulation study to evaluate these four approaches using type I error rates, percent bias, and empirical power. Our simulation results suggest that when misclassification of adverse events is present, approaches such as observed cases, confirmed case only, and known confirmation rate may inflate the type I error, yield biased point estimates, and affect statistical power. The multiple imputation approach considers the uncertainty of estimated confirmation rates from an internal validation sample, yields a proper type I error rate, largely unbiased point estimate, proper variance estimate, and statistical power.

Original languageEnglish
Pages (from-to)748-760
Number of pages13
JournalBiometrical Journal
Issue number4
StatePublished - Jul 2018


  • confirmation rate of cases
  • internal validation sample
  • multiple imputation
  • self-controlled case series
  • vaccine safety


Dive into the research topics of 'Analyzing self-controlled case series data when case confirmation rates are estimated from an internal validation sample'. Together they form a unique fingerprint.

Cite this