There are two broad categories into which OA tools can be assigned, subjective and objective.
50 Subjective OA tools are patient-driven, whereas objective measures are driven by the HCP. This chapter discusses the subjective OA tools and the objective tools are discussed in
Chapters 10 and
12. There are several outcomes assessment tools included in the appendix to this chapter. When available, the MDC score is reported.
There have been several classifications of the various domains or groups of OA tools.
51 Bombardier describes a core set of measures that should be considered when managing patients with spinal disorders—pain, generic health status, disability or functional status, work status, and patient satisfaction (
Table 8.2).
21 Psychological distress is a sixth domain that should also be addressed and emphasized.
Pain
In the assessment of pain, there are several measures to consider, including the pain severity, pain affect, pain location, and pain persistence (chronicity). The severity of pain is related to how much a person hurts, whereas pain affect measures the mental or emotional component of pain. When assessing pain severity for
chronic and recurrent pain conditions, assessing the pain severity during a specified time period such as 1 week, 1 month, 6 months, etc., may be more important than reporting the pain status at a particular point of time.
52 Von Korff describes key parameters of pain status based on a retrospective report to include: (a) the number of days pain is experienced during a specified time frame; (b) the average or usual pain intensity (PI) when in pain; (c) average interference with activities; and (d) the cumulative number of activity limitation days caused by pain.
52
Pain Severity/Intensity Measuring PI can be accomplished using verbal rating scales, VAS, and/or NRSs. Von Korff concludes that “… 0-10 NRSs have many advantages over the alternatives for clinical use and for research in clinical populations in which a simple and robust measurement method is needed.”
53 Hence, a 0-to-10 NRS anchored by “no pain” at the “0” end and “extreme pain” at the “10” end (or vice versa) is a commonly used and practical approach.
A VAS of current pain has been shown to be less responsive than a rating of pain over the past 24 hours, week, or 2 weeks.
29,54,55,56 Therefore, when asking a patient to rate pain, the usual or average pain level may be the best choice when limiting the number of questions asked regarding PI to one. The report of average PI has been found to correlate with a 3-month daily pain diary in a number of studies.
57,58,59 These validity studies support using measures of average or usual PI for up to a 3-month recall period with acceptable discrimination. An example of a simple 0-to-10 NRS for PI using the “usual,” “typical,” or “average” is depicted in
Figure 8.1.
Hagg and colleagues found a change of 18 to 19 out of 100 in the VAS of cLBP patients to be clinically significant.
40 Turner studied the correlation of pain with disability.
60 If the initial VAS was 5 or more, a change of at least 2 points was needed to influence disability scores significantly. If the initial VAS score was <5, then a VAS change of at least 1 point would have a clinically relevant effect on functioning.
Regarding the validity of weekly recall ratings of PI in neck pain patients, Bolton and colleagues reported the following: Average pain over the prior week (Pearson r = 0.95); worst pain over the prior week (Pearson r = 0.93); least pain over the prior week (Pearson r = 0.92).
61 This was calculated by having 78 patients with nonspecific neck pain complete a 7-day neck pain diary rating their pain levels four times a day on a 0- to 11-point NRS. From the 28 ratings, the patients’ “actual average” pain was computed. On day 8, they were asked to rate their current pain as well as their pain “on average,” at its “worst,” and its “least” over the prior week. Recall of average pain over the prior week was shown to be a valid measure using the data as stated above.
The MCIC for PI was reported studying 1,349 subacute and cLBP patients with and without leg pain (LP) as seen in routine clinical practice over a 12-week time frame in Spain to represent the Southern European LBP patient population.
62 Three different methods of calculating the MCIC were used: (a) the mean change score (MCS); (b) the MDC; and (c) the optimal cutoff point (OCP) in receiver operant curves. External criterion included the patient’s own “global perceived effect.” The effect on MCIC of initial scores, duration of pain, and existence of LBP was addressed. Different results were calculated with each method with OCP being the smallest. The MCIC for LBP ranged from 1.5 to 3.2 PI-NRS points when baseline scores were <7/10; from 2.5 to 4.3 points with a baseline score of ≥9/10.
Pain Affect PI may be defined as the amount a person hurts, whereas pain affect can be defined as the emotional arousal and disruption created by the pain experience.
63,64,65,66,67 The McGill pain questionnaire
68,69 includes 20 category scales of verbal pain descriptors categorized in order of severity and clustered into four subscales:
Sensory discrimination
Affective
Evaluative
Miscellaneous
A detailed description for scoring this instrument is available elsewhere.
51
Pain Diagrams The pain diagram or drawing is perhaps the best way to obtain the patient’s perception of the location of their symptoms.
51,70,71 Improvement or exacerbation can quickly be determined by comparing current to previously completed pain diagrams. Pain diagrams enhance the HCP’s ability to differentiate between a mechanical low back, nerve root, and psychogenic problem (
Fig. 8.2).
Abnormal illness behavior or somatization is suggested if the pain diagram shows multiple types of pain qualities (achy, stabbing, burning, numbness, pins and needles, etc.) in all four extremities and the trunk, and/or if markings outside of the body such as lightning bolts are present. This can then be correlated with other subjective information such as psychometric “yellow flags” that include poor coping strategies, depression, and anxiety, as well as objective tools such as the Waddell Non-Organic Low Back Pain signs (see
Chapter 7).
Though the pain drawing is usually used qualitatively, there are several validated methods for scoring pain drawings.
72,73,74,75,76,77 One scoring method is accomplished by overlapping the patient’s pain drawing with a transparency that includes the same drawing but with grid lines and adding up points based on the number of body regions/extremities marked and the quantity of pain quality markings used.
Summary The NRS and pain diagram have the greatest utility for the typical practitioner.
General Health
Patient-based general health OMs can be classified into two general categories: generic and disease or condition-specific measures.
78,79 Generic measures
include global ratings of health status and multidimensional measures of health-related quality of life, which include the Sickness Impact Profile (SIP),
80,81 SF-36 Health Survey,
82 Nottingham Health Profile,
83 Dartmouth COOP Health Charts,
83,84,85 and others. The strength of generic measures of general health is that these are not specific to any one condition or disease and, therefore, are applicable across populations regardless of their health status. However, this is also a weakness because they are not as responsive to change over time compared with condition-specific tools.
51 An example of a highly responsive condition-specific version of the SIP General Health Questionnaire is the Roland-Morris questionnaire.
The SF-36 is a popular generic outcome tool that has been used in outcomes-based research has been translated into >40 languages as part of the International Quality of Life Assessment, and it is often utilized in clinical settings.
86,87 The strength of the SF-36 lies in the fact that normative data exist for healthy and nonhealthy populations.
88,89,90
Both versions 1.0 and 2.0 are divided into eight scales representing different aspects of general health.
82,91,92 Utilizing the eight individual scales, version 2.0 yields two composite scores, which include mental health and physical health.
Table 8.3 lists the eight scale titles, the number of items or questions that are used to compute the score, the specific scale items, and the minimum number of items needed to compute a score.
The physical component summary (PCS) is made up of the following four scales: Physical Function, Role Physical, Bodily Pain, and General Health. The Mental Health component (MHC) is made up of Mental Health, Role Emotional, Social Function, and Vitality. The advantage of grouping all 36 questions into two rather than eight scales results in an improvement in reliability. The mean score for a healthy adult population regarding both scales is 50 ± 10 points, which carries a reliability level of 0.92 and 0.88 for the PCS and the MHC, respectively.
The SF-36 has generally been shown to be a responsive instrument for measuring clinically meaningful change in low back pain (LBP) and sciatica individuals in certain studies,
47,93,94 whereas in others it has not.
95 Even in the Taylor et al study in which it was found to be responsive, it was not as good as the ODI.
47 The scales with the greatest responsiveness were Physical Function, Bodily Pain, and Social Function. In fact, the Physical Function scale was more sensitive to change than the ODI.
47
The SF-12 is an abbreviated version derived from the SF-36 that was designed to improve the practicality and utility of the longer 36-item version introduced.
96,97,98
The SF-36 can also be utilized to form two distinct scales, the physical function and mental health scales. The advantage of the SF-12 over the SF-36 is that the length of time needed to complete the form is only 2 to 5 minutes. Standard and acute versions of the SF-12 and 36 are available in multiple languages.
87
Summary If the clinician is planning to assess other outcome domains, it may be more practical to use the SF-12 instead of the SF-36 for measuring general health status. If time is still deemed excessive, a single question about self-perceived health can be utilized as has been used in “yellow flags” questionnaire
99 (see
Chapter 8).
Patient-Reported Outcome Measure Information System
The Patient-Reported Outcome Measure Information System (PROMIS) is a continually evolving set of PROMS that are reliable and precise. The PROMIS Health Organization collects information from multiple domains within global, physical, mental, and social health.
100 All of the items in each of the PROMIS tools have been extensively tested to ensure the responsiveness, validity, and other psychometric properties are appropriate for use across diverse populations.
11,101,102,103,104,105,106,107
Assessment can be delivered in a number of different ways with the option of different short or long forms in each domain, and the option to conduct computer adaptive testing, which will tailor the questions depending on the responses given.
102,103,107,108,109,110 PROMIS also includes pediatric and parent proxy instruments, in multiple languages, enabling data to be captured across all age groups.
103,111,112,113,114
Each of the domains in PROMIS has extensive item banks. For example, in the domain of mental health, there are item banks for the Profile Domains of Depression and Anxiety, and further item banks in the Additional Domains of Anger, Cognitive Function, Alcohol Use, Consequences & Expectancies, Smoking, Substance Abuse, Psychosocial Illness Impact, and Self-efficacy. Each item bank includes a large number of items that can be used to assess the specific domain.
101,102,104,106,107,110,111,115 The variety of assessments available enables a researcher or clinician to appropriately select an instrument that will capture the data that is relevant. Factors that may influence instrument selection could include time available, the level of precision required, method of collection, the age of the patient, specific domains, etc.