TY - JOUR
T1 - A primer on quantitative bias analysis with positive predictive values in research using electronic health data
AU - Newcomer, Sophia R.
AU - Xu, Stan
AU - Kulldorff, Martin
AU - Daley, Matthew F.
AU - Fireman, Bruce
AU - Glanz, Jason M.
N1 - Publisher Copyright:
© 2019 The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: [email protected].
PY - 2019/11/15
Y1 - 2019/11/15
N2 - Objective: In health informatics, there have been concerns with reuse of electronic health data for research, including potential bias from incorrect or incomplete outcome ascertainment. In this tutorial, we provide a concise review of predictive value-based quantitative bias analysis (QBA), which comprises epidemiologic methods that use estimates of data quality accuracy to quantify the bias caused by outcome misclassification. Target Audience: Health informaticians and investigators reusing large, electronic health data sources for research. Scope: When electronic health data are reused for research, validation of outcome case definitions is recommended, and positive predictive values (PPVs) are the most commonly reported measure. Typically, case definitions with high PPVs are considered to be appropriate for use in research. However, in some studies, even small amounts of misclassification can cause bias. In this tutorial, we introduce methods for quantifying this bias that use predictive values as inputs. Using epidemiologic principles and examples, we first describe how multiple factors influence misclassification bias, including outcome misclassification levels, outcome prevalence, and whether outcome misclassification levels are the same or different by exposure. We then review 2 predictive value-based QBA methods and why outcome PPVs should be stratified by exposure for bias assessment. Using simulations, we apply and evaluate the methods in hypothetical electronic health record-based immunization schedule safety studies. By providing an overview of predictive value-based QBA, we hope to bridge the disciplines of health informatics and epidemiology to inform how the impact of data quality issues can be quantified in research using electronic health data sources.
AB - Objective: In health informatics, there have been concerns with reuse of electronic health data for research, including potential bias from incorrect or incomplete outcome ascertainment. In this tutorial, we provide a concise review of predictive value-based quantitative bias analysis (QBA), which comprises epidemiologic methods that use estimates of data quality accuracy to quantify the bias caused by outcome misclassification. Target Audience: Health informaticians and investigators reusing large, electronic health data sources for research. Scope: When electronic health data are reused for research, validation of outcome case definitions is recommended, and positive predictive values (PPVs) are the most commonly reported measure. Typically, case definitions with high PPVs are considered to be appropriate for use in research. However, in some studies, even small amounts of misclassification can cause bias. In this tutorial, we introduce methods for quantifying this bias that use predictive values as inputs. Using epidemiologic principles and examples, we first describe how multiple factors influence misclassification bias, including outcome misclassification levels, outcome prevalence, and whether outcome misclassification levels are the same or different by exposure. We then review 2 predictive value-based QBA methods and why outcome PPVs should be stratified by exposure for bias assessment. Using simulations, we apply and evaluate the methods in hypothetical electronic health record-based immunization schedule safety studies. By providing an overview of predictive value-based QBA, we hope to bridge the disciplines of health informatics and epidemiology to inform how the impact of data quality issues can be quantified in research using electronic health data sources.
KW - bias
KW - electronic health records
KW - medical informatics
KW - outcome assessment
UR - http://www.scopus.com/inward/record.url?scp=85075092465&partnerID=8YFLogxK
U2 - 10.1093/jamia/ocz094
DO - 10.1093/jamia/ocz094
M3 - Review article
C2 - 31365086
AN - SCOPUS:85075092465
SN - 1527-974X
VL - 26
SP - 1664
EP - 1674
JO - Journal of the American Medical Informatics Association : JAMIA
JF - Journal of the American Medical Informatics Association : JAMIA
IS - 12
ER -