Documentos de Académico
Documentos de Profesional
Documentos de Cultura
research-article2020
HANXXX10.1177/1558944720948239HANDArnold et al
Surgery Article
HAND
Abstract
Background: The seesaw test consists of flexion and extension of the thumb metacarpal on the trapezium, with
continuous axial pressure to keep the metacarpal base reduced in the carpometacarpal (CMC) joint. We aim to evaluate
this maneuver compared with the grind test. Methods: We prospectively enrolled 80 participants from March 2017
to March 2018 at a single institution, excluding those who had previous thumb surgery or pathology. Each participant
underwent both seesaw and grind tests by 2 independent examiners. We included 24 patients with a mean age of 73 years
in the CMC osteoarthritis group (Eaton stages 2-4) and 44 patients with a mean age of 66 years in the control group (Eaton
stages 0 and 1). We calculated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV),
and multirater κ measure. Results: The seesaw test had a higher sensitivity than the grind test (42%-71% vs 13%-17%), but
a lower specificity (82%-86% vs 91%-98%). The PPV was more consistent between examiners for the seesaw test (63%-
68% vs 42%-80%), and the NPV was higher (73%-84% vs 66%-68%). There was a slight agreement between the attending
surgeon and the fellow performing the grind test (κ = 0.08) and a moderate agreement between the attending surgeon
and the fellow performing the seesaw test (κ = 0.59). Conclusions: The seesaw test is superior to the grind test, with a
much higher sensitivity, slightly lower specificity, more consistent PPV, and slightly higher NPV. Therefore, the seesaw test
could be a valuable addition or even replacement for the grind test, and we recommend considering it in daily practice.
Keywords: hand, CMC, arthritis, diagnosis, osteoarthritis, thumb, anatomy, cartilage, basic science, evaluation, research
and health outcomes, hand therapy, specialty, surgery, pain
Table 1. Inclusion and Exclusion Criteria. numbers (Stata 13). A member of study staff was present to
Inclusion criteria
collect the data on both tests.
35 years of age and older The grind test consists of grasping the thumb metacarpal
New presentation to hand clinic and exerting a combined rotation and axial compression of
English fluency and literacy the joint (Figure 1). The seesaw test consists of holding and
Suspicion of CMC OA/diagnosis unrelated to CMC or STT stabilizing the thumb metacarpal, reducing it in the CMC
arthritis joint; then, while continuous axial pressure is maintained to
Radiographs/fluoroscopic images as part of routine care hold reduction of the metacarpal base in the CMC joint,
available flexion and extension of the metacarpal on the trapezium
Exclusion criteria are performed (Figure 2). Both tests are considered positive
Previous arthroscopy or surgery for thumb CMC arthritis if they elicit pain in the examined joint.8 Any grinding or
Other thumb pathologya clicking felt by the examiner was also noted.
Pregnant or lactating women After performing both diagnostic tests, the attending
Note. CMC = carpometacarpal; OA = osteoarthritis; STT = hand surgeon scored the radiographic stage of the partici-
scaphotrapezial trapezoidal; UCL = ulnar collateral ligament. pant’s thumb for CMC OA using the Eaton and Littler clas-
a
Such as trigger thumb, UCL injury, STT arthrosis, and scaphoid fracture. sification (Table 3). The scoring of radiographs was repeated
by the same hand surgeon at a minimum of 1 month after
both tests and tested the null hypothesis that there is no the participant’s presentation to clinic. The second evalua-
difference in interrater reliability between both tests. tion was used to group all participants into 2 groups: par-
ticipants with radiographic evidence of CMC OA, defined
as Eaton stages 2 to 4, were included in the CMC OA group.
Methods Participants with no or minimal radiographic evidence of
After institutional review board approval, we prospectively CMC OA, in accordance with Eaton stage 0 or 1, were
enrolled a total of 80 participants from March 2017 to included in the control group.
March 2018. All new patients, 35 years or older, with Eng-
lish fluency and literacy, suspected of having thumb CMC
Statistical Analysis
OA or a diagnosis unrelated to CMC or scaphotrapezial
trapezoidal (STT) arthritis presenting at an urban hospital Based on the existing literature, the sensitivity for the grind
in the Northeastern United States, were asked to participate test was expected to be 0.40.6,7,9,10 Assuming that the sensi-
in this study. Patients who had previous arthroscopy or sur- tivity of a meaningful test is 0.65,7 we would need a sample
gery for thumb CMC OA, had other thumb pathology (such size of 46 patients (n = 23 per group) to detect a meaningful
as trigger thumb, ulnar collateral ligament injury, STT difference in sensitivity between the diagnostic tests, with a
arthrosis, and scaphoid fracture), or were either pregnant or type I error of 0.05 and 80% power. We planned on enroll-
lactating were excluded (Table 1). All potential partici- ing a minimum of 50 patients in total (n = 25 per group).
pants had radiographs or fluoroscopic images as part of Based on an empirical prevalence of new CMC OA patients
their routine care. After obtaining informed consent, demo- of about 30% within the patients selected from the schedule
graphic information was gathered by a member of the study in this clinic, we enrolled a total of 80 patients to ensure a
staff (age, sex, race, marital status, involved and dominant minimum of 25 confirmed patients with CMC OA. Twelve
hand, smoking history, and prior treatment) (Table 2). radiographs were only available for the first evaluation,
Afterward, each participant underwent the seesaw test and resulting in 12 patients being excluded from the sensitivity,
grind test twice—once by the attending hand surgeon and specificity, and predictive value analysis.
once by a fellow in hand surgery. Although a single attend- To compare both groups, we used the Fisher exact test
ing hand surgeon did all examinations, 5 different hand fel- for categorical variables and the Wilcoxon rank sum test for
lows participated in this study, at different stages in the nonparametric continuous variables. We calculated the sen-
year-long fellowship training. One fellow conducted 40 of sitivity, specificity, positive predictive value (PPV), and
the 80 examinations, followed by 15, 14, 7, and 4 examina- negative predictive value (NPV). We used the multirater κ
tions by the other 4 fellows, respectively. All fellows were measure described by Siegel and Castellan to estimate
treated as 1 group. The tests were performed prior to regu- agreement between the 2 rating care providers, a commonly
lar clinical assessment and in the absence of the other used statistic to describe chance-corrected agreement in a
assessor. Both care providers were blinded to the partici- variety of interobserver and intraobserver studies. The κ
pant’s reason for presenting to the clinic, and the order of values were interpreted using the guidelines proposed by
tests was randomized using a list of randomly generated Landis and Koch (Table 3).11
Arnold et al 3
Table 2. Demographics.
Figure 1. The grind test. The grind test consists of (a) grasping the thumb metacarpal while stabilizing the trapezium and (b) exerting
a combined rotation and axial compression of the joint.
Stage Description
I Slight joint space widening (prearthritis)
II Slight narrowing of CMC joint with sclerosis, osteophytes <2 mm
III Marked narrowing of CMC joint with osteophytes, osteophytes >2 mm
IV Pantrapezial arthritis (STT involved)
Note. No CMC = Eaton stages 0 and 1. CMC = Eaton stages 2 to 4. CMC = carpometacarpal; OA = osteoarthritis; PPV = positive predictive value;
NPV = negative predictive value.
stages 0 and 1. The mean age of patients was 66 years (SD = by the attending hand surgeon and hand fellow (P = .434),
10.8), which was significantly less than the case group (P = although the surgeon felt subtle grinding without pain twice
.009). Most patients (96%) were white, right-hand dominant as often as the fellow (Tables 4 and 5). All 6 patients with a
(93%), and married (73%). Just over half of all patients were negative grind test but positive grinding felt by the hand
men (59%) and had their dominant hand involved (61%). surgeon had radiographically confirmed CMC OA.
Three patients (6.8%) reported smoking. Most patients (93%) The seesaw test had a sensitivity of 71% to 42%, speci-
did not have prior treatment, 1 patient (2.3%) had hand sur- ficity of 82% to 86%, PPV of 68% to 63%, and NPV of 84%
gery unrelated to CMC OA, and 2 patients (4.6%) had injec- to 73%, performed by the attending hand surgeon and hand
tions of the ipsilateral hand or wrist elsewhere than the CMC fellow, respectively (Table 4). There was a significant dif-
joint. Apart from age, the groups were comparable. ference between the results of the seesaw test performed by
When performed by the attending hand surgeon and the attending hand surgeon and hand fellow (P < .001)
hand fellow, respectively, the grind test had a sensitivity of (Table 5). The attending surgeon rated more seesaw tests as
13% to 17%, specificity of 91% to 98%, PPV of 43% to positive (eliciting pain) and reported more grinding in the
80%, and NPV of 66% to 68%. There was no significant joint both when testing positive and negative. Eight of 16
difference between the results of the grind test performed patients who had no pain (negative test) but perceived to
6 HAND 00(0)
Expected Standard
Test Agreement, % agreement, % κ error P value
Grind 86.25 85.06 0.0795 0.1114 .238
Seesaw 83.75 60.06 0.5931 0.1073 .000
have grinding in the joint during the seesaw test had radio- but it is inconsistent in our experience. We have used
graphic CMC OA. Of the 12 negative tests with clicking flexion and extension of the metacarpal in the CMC joint
(without grinding), none had radiographic CMC OA. Of the (the “seesaw” test) for many years because they seemed to
28 positive tests with grinding, 21 had CMC OA on radio- be more useful in our experience. This prospective study
graphs and 7 did not. aimed to determine the sensitivity, specificity, accuracy,
Overall, the seesaw test had a higher sensitivity than the and interrater reliability of seesaw and grind tests in
grind test (42%-71% vs 13%-17%), especially when per- patients with and without radiographic CMC OA. We
formed by the attending hand surgeon (71%) compared found the seesaw test had a sensitivity of 42% to 71%,
with the hand fellow (42%). The specificity of the seesaw specificity of 82% to 86%, PPV of 63% to 68%, NPV of
test was slightly lower when compared with the grind test 73% to 84%, and moderate interrater reliability (κ =
(82%-86% vs 91%-98%). The PPV was more similar 0.59). The grind test had a sensitivity of 13% to 17%,
between the attending hand surgeon and fellow for the see- specificity of 91% to 98%, PPV of 42% to 80%, NPV of
saw test (68% and 63%) compared with the grind test (42% 66% to 68%, and slight interrater reliability (κ = 0.08).
and 80%). In case of a positive seesaw test by the attending The results of this study need to be interpreted in consid-
hand surgeon, there is a 68% chance the patient has CMC eration of its limitations. First, the tests were performed by
OA identifiable on radiographs; in case of a negative test, 5 different hand fellows, not in equal proportion, and at dif-
there is an 84% chance the patient does not have CMC OA ferent stages in the year-long fellowship training. One of
identifiable on radiographs. these fellows conducted half of the clinical examinations,
The κ values for interrater agreement indicate there is a which could have influenced the interrater agreement.
slight agreement between the attending surgeon and the fel- Although all fellows were trained in using the seesaw test,
low when performing the grind test (κ = 0.08, P = .24). the performance characteristics of the test may improve
There is a moderate agreement between the attending sur- over time, as the most experienced surgeon had better test
geon and fellow when performing the seesaw test (κ = characteristics. Second, both the grind and seesaw tests
0.59, P < .001) (Table 5). were considered positive if they elicited pain, but it is
unclear what subtle grinding, clicking or a combination of
both means in the absence of pain (which we considered a
Discussion negative test). The hand surgeon found grinding without
Thumb CMC OA is challenging to diagnose purely with pain in 6 patients (7.9%) during the grind test and in 16
clinical tests. The grind test is a commonly described test, patients (20%) during the seesaw test. All 6 of the grind
Arnold et al 7
tests with grinding but no pain and 7 of the 16 seesaw tests respectively), and interrater reliability (NA, 79%, and
with grinding but no pain demonstrated CMC OA on their 84%, respectively).9,10 The seesaw test also had a sub-
radiographs. When grinding without pain was considered a stantial higher sensitivity than the grind test (71%-42%)
positive test result, the sensitivity of the grind test improved but lower specificity (82%-86%), missing 29% of the
from 13%-17% to 29%-38%. The specificity remained patients with radiographic CMC OA when performed by
unchanged. The sensitivity of the seesaw test when per- an experienced hand surgeon. Our study was not designed
formed by the attending surgeon improved from 71% to to determine interrater reliability, but the interrater agree-
100%, whereas the specificity dropped from 82% to 64% ment between the attending hand surgeon and fellow was
(Supplemental Table 1). This may indicate that an experi- higher for the seesaw test compared with the grind test
enced surgeon has an even higher sensitivity with the see- (κ = 0.59 vs .08), suggesting that the seesaw test was
saw test if grinding without pain is considered a positive more consistent between the attending hand surgeon and
test result. Third, in this study, there was a relatively high the fellow. Further study to compare the seesaw test
prevalence of CMC OA because of patient selection; there- against other tests that seem to be superior to the grind
fore, it was more likely that patients with a positive test also test should be considered. Currently, the seesaw test can
had radiographic CMC OA. If the seesaw test would be per- be considered as an addition to the battery of clinical
formed in a clinic with a different population with lower examination tools.
prevalence of basilar thumb OA, this might negatively
influence the usefulness of the test. Further testing in more
general patient populations could yield different results.
Conclusion
When the seesaw test is used in high-prevalence clinics In conclusion, we have used the seesaw test for many years
such as hand clinics, we believe it could be a valuable addi- and decided to examine this in comparison with the grind
tion to the grind test. test. We found that the seesaw test had a higher sensitivity
For this study, we used the second evaluation of radio- (42%-71% vs 13%-17%), slightly lower specificity (82%-
graphs in statistical analysis to reduce bias inherent in evalu- 86% vs 91%-98%), more consistent PPV (63%-68% vs
ating the radiographs directly after clinical examination. 42%-80%), and slightly higher NPV (73%-84% vs 66%-
However, some of the radiographs made as part of routine 68%) compared with the grind test. We also found that the
care were only available for the first evaluation, resulting in seesaw test demonstrated more consistent results between
12 radiographs not available for the second evaluation. Even different health care providers (κ= 0.59 vs 0.08). We feel
though additional analysis showed no difference in Eaton that it is a worthwhile test to consider for clinical assess-
scoring between both evaluations, this should be noted. ment of CMC OA.
Clinicians have a battery of diagnostic tests available to
assess thumb CMC OA, including the grind, distraction, Acknowledgments
traction-shift, and metacarpal base compression tests. Prior We would like to thank Dr Daphne van Hooven for assisting in
studies have evaluated the grind test and have generally subject identification and inclusion. We would also like to thank
found a low sensitivity (30%-53%) and high specificity all hand fellows for participating in this study by performing the
(80%-100%).6,7,9,10 Subsequently, the PPV is high (91%- clinical examinations.
100%), and NPV is variable (24%-70%).6,9 In addition, the
interrater reliability is moderate (κ = 0.31-0.48).6,10 Our Ethical Approval
study found the grind test has a very low sensitivity (13%- This study was approved by our institutional review board.
17%) but high specificity (91%-98%). In other words, the
grind test is useful for ruling out CMC OA that would be
Statement of Human and Animal Rights
detected on radiographs, but does not effectively identify
patients who likely have CMC OA on radiographs. In our All procedures followed were in accordance with the ethical stan-
cohort, more than 80% of patients with confirmed CMC OA dards of the responsible committee on human experimentation
(institutional and national) and with the Helsinki Declaration of
on radiographs have a negative grind test. In case of a posi-
1975, as revised in 2008.
tive grind test, there is a 43% to 80% chance that the patient
has CMC OA detectable on radiographs, although the PPV
Statement of Informed Consent
differed greatly between attending surgeon and fellows.
With an NPV of 66% to 68%, a negative test does not Informed consent was obtained from all individual participants
exclude the possibility of radiographic CMC OA. included in the study.
More recently described diagnostic tests, such as the
lever, thumb adduction, and extension stress tests, dem- Declaration of Conflicting Interests
onstrate higher values for sensitivity (82%, 94%, and The author(s) declared no potential conflicts of interest with respect
94%, respectively), specificity (81%, 93%, and 95%, to the research, authorship, and/or publication of this article.
8 HAND 00(0)