The Importance of a Registry in Spinal Surgery



Fig. 54.1
Anatomy of medical research



Levels of evidence are defined by the study design and execution, and designs have been variously graded by their potential to eliminate bias [10, 11]. A hierarchy of study designs was first proposed by Campbell and Stanley [10]. The concept of evidence levels has been widely promoted since then to grade recommendations for clinical practice. The dogma of an RCT and RCT-based meta-analyses having superior evidence over an observational study has long been held as gospel despite numerous appeals to reconsider and to adapt the evidence levels [1116]. The reasons for this may be that the only available evidence pyramids focuses on efficacy studies, which allow for an accurate assessment of cause and effect. The terminology “observational” connotes that the data were collected without interference, so the decision to assign patients to an exposure is not random and may be self-determined by confounding variables that are highly correlated with the outcome. As a result, an observed significant difference in outcomes may not necessarily be attributable to the choice of treatment alone but to the confounding variable (or to some combination). Thus, according to the current evidence-based medicine practice, a small sample-sized RCT would bring undoubtful superior evidence than a large observational study. RCT study design has a high acceptance in all clinical disciplines despite a long list of its limitations such as ethical considerations, organizational burden, complexity of random allocation, complexity of blinding, time and personal resources consumption, compromised integrity of the clinical context, difficulty in studying rare events and outcomes in distant future, limitations in evaluation of population-based interventions [11], sample size limitation for definitive results [17], patient refusal of participation, narrowing of the studied population, and limited external validity [18], as well as financial efforts. This is also despite the fact that so many efforts are required to answer only one main posed hypothesis in an RCT and the fact that the number of non-randomizable clinical questions is by far much higher than the number of randomizable ones.

In orthopedic surgery and traumatology, among other medical disciplines, it is often difficult to provide an ethically justifiable patient randomization for differing and often insufficiently studied treatment methods. In medical disciplines other than surgical ones, like in conservative treatments or in pharmacology, the effect of a treatment is often assessed in comparison to a placebo. In surgical disciplines and particularly in orthopedic implantology, placebo comparisons are difficult to imagine. This is a nonnegligible and important organizational restriction of a randomized controlled study design. There are particular questions in implantology which may justify an RCT; however, the method of choice for implant comparisons is a registry with sufficient participation to ensure high external validity, i.e., generalizability of results to the routine clinical settings [19]. Particularly when introducing new implants and aiming at postmarket surveillance, an observational approach allowing for benchmarking of new and older implants appears to be the most reasonable one [19].

Moreover, one should be aware that a large observational cohort and an RCT are per se designed to answer different questions, and their different goals and setups result in their different characteristics (Table 54.1). Importantly, while an RCT answers the question “can it work?” an observational study addresses the question “does it work?” [20] In a large dataset, one could add “in whom does it work best?” as one of the key questions that the large observational comparisons can answer, since they include better and poorer indications for a certain therapy, as it is mostly the case when an innovation is adopted by the larger medical community. Table 54.1 summarizes characteristics of an RCT versus registry.


Table 54.1
Characteristics of an RCT and a registry




































































































Characteristics

RCT

Registry

Type of evidence

Efficacy

Effectivenessa

Principal question

Can it work? (the first step of evidence generation)

Does it work? (verification in clinical practice)

Internal validity (methodological quality)

+++

+/++ (expandable with, e.g., monitoring, audits, or validation with secondary data, etc.)

External validity (transferability/generalizability)


+++

Bias

Low to very low

High to low depending on set-up (appropriate set-up and evaluation methodology reduces bias)

Levels of evidence

1a, 1b

2b-4, depending on methodology

Hypothesis-based approach

Yes

Usually no

Duration of observation period

Predefined

Predefined or open end

Focus of research/measurement

Sharp, narrow (hypothesis driven)

Broad

Quality assessment

Not intended (strictly defined indications, process quality at least derivable, outcome quality depends on effectiveness, a given indication, and process)

Structure, indication, process, outcome

Early warning system

Not possible

Possible

Long-term follow-up

Feasible

Feasible (depending on set-up only for a representative sample)

Coverage

Only among participants

From individual center/surgeon over representative clinic sample to full national/regional coverage

Benchmarking

Only benchmarking of RCT arms

Various (depending on the final composition of participants regional to nationally representative benchmark)

Type of quality assurance

Internal, external vs. benchmark of participants

Internal, external vs. representative regional or national benchmark

Efforts

Very high

Low

Cost per case

High to very high

Low

Cost per study

High to very high

Low-cost basis

Availability of potential patients

Usually low

Usually high

Patient compliance

Potentially low

High

Use of generated data

Only in the framework of the scientific goal/hypothesis

Open-hypothesis generation possible

Comparator

Given per definition

Ranges between none to numerous comparators, depending on registry set-up


aUnclear terminology, Cochrane called it “efficiency,” better always specify what is meant (evidence derived from controlled experiment versus evidence derived from routine clinical practice)

On the other hand, strong evidence exists on the similarity of the results in RCTs and in observational studies. Benson and Hartz reviewed RCTs and observational studies between 1985 and 1998 and found that in a vast majority of therapeutic comparisons, the estimates of the treatment effects from observational studies and randomized controlled trials were similar [13]. In the same time, Concato et al. reviewed literature between 1991 and 1995 and concluded that the results of well-designed observational studies with either a cohort or a case-control design do not systematically overestimate the magnitude of the effects of treatment as compared with those in randomized controlled trials on the same topic, as it is anticipated [12]. In orthopedics, Bhandari et al. have shown that, when adjusted for important risk factors, the results of observational studies (n = 13) on revision and mortality rates after internal fixation of femoral neck fracture were similar to those in RCTs (n = 14) [21]. Equality of the results in an RCT and in a cohort study was shown also in spinal surgery based on the results from SWISSspine registry [22] and the Spine Tango results of a single center [23].

The proponents of an observational study design are often opponents of an experimental approach and vice versa. In 1996, Black wrote that there is no perfect study design and that “the false conflict between those who advocate randomized trials in all situations and those who believe observational data provides sufficient evidence needs to be replaced with mutual recognition of the complementary roles of the two approaches” [24]. Spinal surgery is one of the youngest and, in the same time, one of the fastest growing clinical disciplines with high and further growing epidemiological and financial impacts on society. The evidence demand in spinal surgery is therefore high, and there is a compelling need for quality assurance, outcome research, and postmarket surveillance of spinal implants.



54.4 “Spine Tango”: An International Spine Registry for Quality Assurance, Outcome Research, and Postmarket Surveillance of Implants



54.4.1 History and Objectives


In the late 1990s, the founder of the first spine unit in Switzerland – Prof. Dieter Grob – was asked by the cantonal government of Zurich to conduct an outcome study of the results of surgical interventions on the spine. The investigation was completed and the data provided to the sponsor. There was no further feedback or consequences as a result of this study. This triggered the idea of conducting a similar project on a larger scale. The Institute for Evaluative Research in Medicine in Bern, Switzerland, had by the time already an over 20-year experience with orthopedic registries, as the first detailed hip arthroplasty registry, the so-called IDES (International Documentation and Evaluation System), was set up in Bern in 1974 by Prof. M.E. Müller. Prof. Max Aebi, who took over the institute lead in 2000, and who was a close friend and collaborator of Prof. Dieter Grob, put the vision of a first international spine registry to reality.

All over the world, efforts were being made to set up orthopedic registries on regional, state, or even national levels. Spine surgery represents a challenge for all registry endeavors. The variety of levels, pathologies, accesses, and surgical techniques confounds all attempts to invent a short yet comprehensive questionnaire. Consequently, the project launched by Grob and Aebi was first based on sketches and discussions on form content, large personal commitment and motivation, and by the most modern approach of the time – the internet-based approach. This effort was later introduced to SSE, the Spine Society of Europe as “Spine Tango,” and the society adopted the registry in the early 2000s.

Goals of the Spine Tango are:



  • Quality assurance and quality improvement


  • Presentation of the state-of-the-art European spinal therapies, including all pathologies, levels, accesses, single as well as two-staged surgeries, and nonsurgical treatments


  • Outcome research and prospective observational evaluation of different surgical and nonsurgical techniques as an alternative to randomized controlled trials


  • Benchmarking on national and international levels


  • Postmarket surveillance of implants

In the mid- and late 1990s, Swedish spine surgeons also implemented a spine registry based on a short questionnaire dedicated to low back pain [25]. This effort become a national one and included other spinal pathologies with various treatments in the early 2000s [2628].

Spine Tango is probably the first spine registry initiative to face the challenge of developing a comprehensive questionnaire covering all major spine pathologies and treatments, as well as spanning all anatomical levels. To accomplish this task, a technically demanding computer application was a prerequisite. The consensus and piloting process for the Spine Tango surgical questionnaires’ “surgery,” “staged surgery,” and “follow-up” took about 5 years and needed around 4000 completed forms. The result are two double-sided A4 questionnaires (surgery, staged surgery) and one single-sided questionnaire for follow-up that can all be completed online or using scannable paper questionnaires. At the same time that the physician-based content was finalized, a working group at the Schulthess hospital in Zurich, Switzerland, had developed and validated the COMI (Core Outcome Measures Index) instruments for neck and low back based on a proposal by Deyo et al. [29], which became the officially recommended patient-based outcome instruments in the framework of the Spine Tango registry [30]. Until today, the Spine Tango database has grown to over 70,000 cases and currently includes 70 hospitals from 16 countries in Europe, the USA [31], but even Latin America, Asia, and Australia [32].

The cumulated numbers of the main Tango forms increase exponentially (Fig. 54.2). During the first three active years about 10,000, the second 3 years about 25,000, and during the last 3 years about 40,000 primary cases were documented.

A314055_1_En_54_Fig2_HTML.gif


Fig. 54.2
Growth rates of different form types in Spine Tango


54.4.2 Content


The refined set of questions still allows for a documentation of the broad spectrum of pathologies and treatments in spine surgery. This is made possible by means of a list of main pathologies and their specifications and the so-called surgical matrix, a terminology system reducing the interventions to their basic principles – decompression, fusion, stabilization rigid, stabilization motion preserving, percutaneous procedures, and others. The duplication and, hence, separation of these principles into anterior and posterior ones complete the matrix. The exact content of the surgeon- and patient-based Tango forms was already previously described [3335], and the forms are available on the Eurospine Web page [36]. Kessler et al. have developed the first version of documentation content for conservative spinal treatments for Spine Tango in 2011 [37].


54.4.3 Technology


Spine Tango has long left the early stage of a simple Web page for data entry and has grown into an international project with a sophisticated IT (information technology) structure and a multitude of clinical and scientific experts serving the user community and developing the registry further. The central database is part of the powerful scientific MEMdoc documentation portal hosted by the University of Bern; it offers various methods for clinical, implant, and radiographic data collection and a multitude of possibilities for data downloads and online statistical queries. An important step was the implementation of so-called modules, national satellite servers that filter out sensitive data for protecting user and patient privacy in the respective country before sending the clinical dataset to the central server in Switzerland [33]. Such modules are meanwhile installed in Germany, Austria, Italy, Poland, Belgium, Switzerland, Great Britain, Australia, USA, Mexico, and Brazil. Users whose country does not yet have such a national filter server may use the international module [36].


54.5 Benefits of Spine Tango


In 2004, Aebi and Grob wrote that it was increasingly understood that in any technology-driven surgical discipline, patients should be documented in a standardized way if they are recipients of novel inventions and implants. This to have a common language for reporting outcomes, complications, and unforeseen incidents which can be better recognized in a large, common data pool serving as an early warning system [38]. These words could be considered as the conception framework of the Spine Tango endeavor. The registry has now grown into one of the largest international clinical registries with one of the highest number of participating countries. Based on modern information technology, the participating spine surgeons have built a database in which numerous spinal treatments and, importantly, treatment indications are backed up by facts. Within the registry, each participating surgeon, department, and hospital has its own performance record. The analyses based on Spine Tango data reach from single center-based studies [3941], overbenchmarking against the pooled data of other hospitals [34] to comparative studies across spine registries [42]. Also, from a methodological point of view, the analyses range between methodological papers [43] and registry-based technical notes [34], over case series [3941] and subgroup analyses to propensity score-based weighted comparison of multiple treatment options [44].

The online tools such as data download, online statistics, follow-up calendar, and annual and benchmarking reports aim at facilitation of real-life quality assessment, outcome research, and performance monitoring for the users. Some European and non-European national societies recognizing the value of the registry are evaluating the possibility of introducing Spine Tango as their national quality assessment and outcome research tool and make it mandatory for certified spine centers.

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

May 4, 2017 | Posted by in ORTHOPEDIC | Comments Off on The Importance of a Registry in Spinal Surgery

Full access? Get Clinical Tree

Get Clinical Tree app for offline access