Study Type |
Question |
Level I |
Level II |
Level III |
Level IV |
Level V |
Diagnostic—Investigating a diagnostic test |
Is this (early detection) test worthwhile? |
|
|
|
|
|
Is this diagnostic or monitoring test accurate? |
|
|
|
|
|
Prognostic—Investigating the effect of a patient characteristic on the outcome of a disease |
What is the natural history of the condition? |
|
|
|
|
|
Therapeutic—Investigating the results of a treatment |
Does this treatment help? What are the harms?g |
|
|
|
|
|
Economic |
Does the intervention offer good value for dollars spent? |
Computer simulation model (Monte Carlo simulation, Markov model) with inputs derived from level-I studies, lifetime time duration, outcomes expressed in dollars per quality-adjusted life years (QALYs) and uncertainty examined using probabilistic sensitivity analyses |
Computer simulation model (Monte Carlo simulation, Markov model) with inputs derived from level-II studies, lifetime time duration, outcomes expressed in dollars per QALYs and uncertainty examined using probabilistic sensitivity analyses |
Computer simulation model (Markov model) with inputs derived from level-II studies, relevant time horizon, less than lifetime, outcomes expressed in dollars per QALYs and stochastic multilevel sensitivity analyses |
Decision tree over the short time horizon with input data from original level-II and III studies and uncertainty is examined by univariate sensitivity analyses |
Decision tree over the short time horizon with input data informed by prior economic evaluation and uncertainty is examined by univariate sensitivity analyses |
aThis chart was adapted from OCEBM Levels of Evidence Working Group, “The Oxford 2011 Levels of Evidence,” Oxford Centre for Evidence-Based Medicine, http://www.cebm.net/ocebm-levels-of-evidence/. A glossary of terms can be found here: http://www.cebm.net/glossary/. |
bLevel-I through -IV studies may be graded downward on the basis of study quality, imprecision, indirectness, or inconsistency between studies or because the effect size is very small; these studies may be graded upward if there is a dramatic effect size. For example, a high-quality randomized controlled trial (RCT) should have ‡80% follow-up, blinding, and proper randomization. The level of evidence assigned to systematic reviews reflects the ranking of studies included in the review (ie, a systematic review of level-II studies is level II). A complete assessment of the quality of individual studies requires critical appraisal of all aspects of study design.
c Investigators formulated the study question before the first patient was enrolled.
d In these studies, “cohort” refers to a nonrandomized comparative study. For therapeutic studies, patients treated one way (eg, cemented hip prosthesis) are compared with those treated differently (eg, noncemented hip prosthesis).
e Investigators formulated the study question after the first patient was enrolled.
f Patients identified for the study on the basis of their outcome (eg, failed total hip arthroplasty), called “cases,” are compared with those who did not have the outcome (eg, successful total hip arthroplasty), called “controls.”
g Sufficient numbers are required to rule out a common harm (affects >20% of participants). For long-term harms, follow-up duration must be sufficient. |
Reproduced with permission from Marx RG, Wilson WS, Swiontokowski MF: Updating the assignment levels of evidence. J Bone Joint Surg Am 2015;64(1):1-2; Table 1. |