Documentos de Académico
Documentos de Profesional
Documentos de Cultura
DOI: 10.1111/j.1471-0528.2012.03297.x
www.bjog.org
Conclusions The IOTA protocol was more accurate for triage than
mass.
(RMI < 25), moderate (25250), or high (above >250) risk. The
IOTA protocol uses LR2s estimated probability of malignancy
(<0.05 indicates low risk, 0.05 but <0.25 moderate risk, and
0.25 high risk).
Please cite this paper as: Van Calster B, Timmerman D, Valentin L, McIndoe A, Ghaem-Maghami S, Testa A, Vergote I, Bourne T. Triaging women with
ovarian masses for surgery: observational diagnostic study to compare RCOG guidelines with an International Ovarian Tumour Analysis (IOTA) group
protocol. BJOG 2012;119:662671.
Introduction
Ovarian cancer remains a significant problem with 6500
cases and 4400 deaths in 2008 among UK women.1 In the
whole of Europe the figures are 66 700 and 41 900, respectively.2 A significant contributor to prognosis is the quality
of the primary surgery, as survival from the disease is
related to residual tumour mass after debulking.3 Less
662
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
Methods
Design and setting
This is a multicentre observational diagnostic study to evaluate two triaging protocols on women presenting with an
adnexal (ovarian, para-ovarian, or tubal) mass who later
Participants
Women presenting to one of the recruitment centres with
at least one persistent adnexal mass that was selected for
surgical intervention by the managing clinicians were eligible for inclusion conditional on oral informed consent
before the ultrasound scan and surgery. In the event of
multiple masses, the mass with the most complex ultrasound morphology was used to collect information on
tumour characteristics for statistical analysis. When masses
with similar morphology were observed we included the
larger of the two masses or the one most easily visible by
ultrasonography. Exclusion criteria were pregnancy, refusal
of transvaginal ultrasonography, and surgical removal of
the mass more than 120 days after the ultrasound examination.
Data collection
A dedicated, secure data collection system was developed
for the study (IOTA 2 study screen; astraia GmbH,
Munich, Germany). An automatically generated unique
identifier was made for each womans record. Clinicians
could only view or update the records from their own centre. Data security was ensured by encrypting all data communication. Client-side checks in the astraia system and
manual checks by one biostatistician and two experienced
ultrasound examiners were used to ensure data integrity
and completeness.
Immediately before the ultrasound examination a standardised history was taken, including the patients age and
menopausal status, information on personal history of
ovarian and breast cancer, number of first-degree relatives
with ovarian or breast cancer, current hormonal therapy,
and previous gynaecological surgery. Women aged 50 years
or more who had undergone hysterectomy were defined as
postmenopausal. A standardised approach was used to
carry out transvaginal ultrasonography in all the women.
The examination technique and the ultrasound terms and
definitions used to describe the ultrasound findings have
been described elsewhere.21 In the event that a large mass
could not be seen in its entirety using a transvaginal probe,
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
663
Benign
n (%)
664
155
199
101
54
41
39
57
40
31
10
4
Borderline
n (%)
Invasive
n (%)
Total
n
(62)
(79)
(66)
(44)
(44)
(43)
(78)
(62)
(82)
(59)
(44)
24
17
3
11
10
15
1
9
1
0
0
73
35
50
57
43
36
15
16
6
7
5
252
251
154
122
94
90
73
65
38
17
9
124 (92)
45 (90)
51 (80)
3
0
2
8
5
11
135
50
64
173
134
110
17
11
(87)
(87)
(80)
(81)
(92)
5
3
6
1
0
22
17
21
3
1
200
154
137
21
12
731
220
445
1396
(63)
(88)
(85)
(72)
91 (8)
5 (2)
15 (3)
111 (6)
343 (29)
24 (10)
64 (12)
431 (22)
1165
249
524
1938
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
clinical variable, five ultrasound variables and one biochemistry variable. It is obtained as the product of the
menopausal status score M, the ultrasound score U, and
the serum CA125 level in U/ml: M*U*CA-125. M has value
3 if the woman is postmenopausal and 1 if the woman is
premenopausal. U has score 0, 1 or 3 depending on
whether none, one or more than one of the following five
ultrasound characteristics are present: multilocular cyst,
evidence of solid components, presence of ascites, bilateral
masses and presence of metastases. If the RMI is below 25
the guideline recommends considering the woman as at
low risk of malignancy, if the RMI is between 25 and 250
the guideline assumes moderate risk, and if the RMI is
above 250 high risk is assumed. According to Davies
et al.,24 the likelihood of having a malignant tumour in the
low, moderate and high risk groups is 2%, 21%, and 75%,
respectively.
The IOTA protocol is based on the logistic regression
model LR2.15 The data that were used to develop LR2 were
collected between 1999 and 2002 in nine centres (IOTA
phase 1), seven of which also participated in IOTA phase
2. LR2 contains one clinical variable (a), four grey scale
ultrasound variables (b, c, d, e) and one Doppler ultrasound variable (f): (a) patient age (years), (b) maximal
diameter of the solid components (mm), (c) presence of
ascites (yes = 1, no = 0), (d) irregular internal cyst walls
(yes = 1, no = 0), (e) presence of acoustic shadows
(yes = 1, no = 0) and (f) the presence of papillary structures with detectable flow (yes = 1, no = 0). The risk of
malignancy is derived as 1/(1 + e)z), with z = )5.3718 +
0.0354 * (a) + 0.0697 * (b) + 1.6159 * (c) + 0.9586 * (d) )
2.9486 * (e) + 1.1768 * (f). A priori, we determined the risk
thresholds for triaging women as low, moderate or high
risk to be 0.05 and 0.25, i.e. 5% (1 in 20) and 25% (1 in
4) estimated risk of malignancy according to LR2: a risk
below 0.05 corresponds to low risk; a risk of at least 0.05
but less than 0.25 corresponds to moderate risk; a risk of
0.25 or higher corresponds to high risk.
Statistical analysis
All statistical analyses were performed using sas version 9.2
(SAS Institute, Cary, NC, USA).
Patients were cross-tabulated based on the true outcome
(benign, borderline, invasive) and the assigned risk (low,
moderate, high), and the percentages of women classified
in each risk group were calculated for each tumour type to
assess the impact of the two protocols. Another index of a
protocols impact is the percentage of all women classified
as moderate risk and therefore not receiving a clear diagnosis. Differences in paired proportions were analysed using
95% confidence intervals obtained by the standard asymptotic method and the McNemar test. We compared the
protocols by assessing whether the IOTA protocol reclassi-
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
665
Index tests
RCOG protocol (n = 1938)
IOTA protocol (n = 1938)
Low risk
RCOG protocol (n = 906)
IOTA protocol (n = 1024)
Moderate risk
RCOG protocol (n = 609)
IOTA protocol (n = 345)
High risk
RCOG protocol (n = 423)
IOTA protocol (n = 569)
Reference standard
RCOG protocol (n = 906)
IOTA protocol (n = 1024)
Reference standard
RCOG protocol (n = 609)
IOTA protocol (n = 345)
Reference standard
RCOG protocol (n = 423)
IOTA protocol (n = 569)
Benign
RCOG
(n = 867)
IOTA
(n = 993)
Borderline
RCOG
(n = 17)
IOTA
(n = 18)
Invasive
RCOG
(n = 22)
IOTA
(n = 13)
Benign
RCOG
(n = 454)
IOTA
(n = 280)
Borderline
RCOG
(n = 63)
IOTA
(n = 29)
Invasive
RCOG
(n = 92)
IOTA
(n = 36)
Benign
RCOG
(n = 75)
IOTA
(n = 123)
Borderline
RCOG
(n = 31)
IOTA
(n = 64)
Invasive
RCOG
(n = 317)
IOTA
(n = 382)
Figure 1. Flow diagram of the RCOG and IOTA protocols as triage systems for women with adnexal masses.
Results
There were 1970 women enrolled in the study (see Figure 1
for a flow diagram). Thirty-two women were excluded for
the following reasons: 15 did not undergo surgical removal
of the mass within 120 days after the ultrasound examination, 12 were pregnant at the time of the examination, four
were excluded because of errors in data entry, and one was
excluded because of a protocol violation. The analysis dataset contained the remaining 1938 women, of whom 1396
had benign (72%), 111 borderline (6%), and 431 invasive
(22%) tumours. Tables 13 describe the pathological and
demographic details of the women. Table 1 presents the
distribution of the outcome (benign, borderline, invasive)
for the total sample and for different centres. Oncology
referral centres had more borderline (8%) and invasive
(29%) tumours than general hospitals or referral centres
for ultrasonography (23% borderline tumours and 1012%
invasive tumours). Table 2 summarises the specific histology
of the masses in the dataset. The most common benign
tumours were endometriomas, serous cystadenomas and
666
n (%)
1396
400
236
226
138
131
86
77
49
24
18
11
111
99
3
8
1
431
70
30
202
30
41
58
(72.0)
(20.6)
(12.2)
(11.7)
(7.1)
(6.8)
(4.4)
(4.0)
(2.5)
(1.2)
(0.9)
(0.6)
(5.7)
(5.1)
(0.2)
(0.4)
(0.1)
(22.2)
(3.6)
(1.6)
(10.4)
(1.6)
(2.1)
(3.0)
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
Table 3. Demographic characteristics, CA125 level, RMI, and the risk of malignancy estimated by logistic regression model LR2 for the women in
the study stratified by tumour type
Variable
Statistics
Age in years
Postmenopausal, %
Nulliparous, %
CA125 in U/mL
Risk of Malignancy Index (RMI)
LR2 estimated risk of malignancy
Median
n (%)
n (%)
Median
Median
Median
Benign
(n = 1396)
(IQR)
41
382
651
18
12
2.3%
(IQR)
(IQR)
(IQR)
(3152)
(27)
(47)
(1144)
(047)
(1.35.8)
Borderline
(n = 111)
48
49
44
33
95
30.7%
(3564)
(44)
(40)
(1891)
(41312)
(8.474.4)
Invasive
(n = 431)
57
311
98
218
1232
69.8%
(5066)
(72)
(23)
(58772)
(2145423)
(48.887.0)
Table 4. Triage results for women when using the RCOG protocol
or the IOTA protocol depending on tumour type
Triage result
Type of tumour
All women
n (%)
RCOG protocol
Low risk
906
Moderate risk
609
High risk
423
IOTA protocol
Low risk
1024
Moderate risk
345
High risk
569
Benign
n (%)
Borderline
n (%)
Invasive
n (%)
(46.7)
(31.4)
(21.8)
867 (62.1)
454 (32.6)
75 (5.3)
17 (14.9)
63 (57.0)
31 (28.1)
22 (5.2)
92 (21.2)
317 (73.6)
(52.8)
(17.8)
(29.4)
993 (71.1)
280 (20.1)
123 (8.8)
18 (16.2)
29 (26.1)
64 (57.7)
13 (3.0)
36 (8.4)
382 (88.6)
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
667
Table 5. Triage results for women with an invasive tumour when using the RCOG protocol or IOTA protocol depending on tumour histology
Triage result
RCOG protocol
Low risk
Moderate risk
High risk
IOTA protocol
Low risk
Moderate risk
High risk
Total
Metastatic invasive
Serous
n (%)
Mucinous
n (%)
Endometrioid
n (%)
Clear cell
n (%)
Epithelial
NOS
n (%)
Non-epithelial
n (%)
4 (1.9)
22 (10.3)
190 (87.8)
1 (3.6)
12 (43.4)
15 (53.1)
0 (0)
12 (27.2)
33 (72.8)
0 (0)
7 (27.2)
17 (72.8)
2 (14.3)
2 (14.3)
10 (71.4)
9 (20.0)
17 (36.7)
20 (43.3)
6 (10.5)
20 (33.4)
32 (56.2)
3 (1.4)
11 (5.1)
202 (93.5)
216
1 (3.6)
4 (14.3)
23 (82.1)
28
0 (0)
3 (6.7)
42 (93.3)
45
0 (0)
0 (0)
24 (100)
24
2 (14.3)
2 (14.3)
10 (71.4)
14
4 (8.7)
9 (19.6)
33 (71.7)
46
3 (5.2)
7 (12.1)
48 (82.8)
58
Table 6. Classification of tumours by the RCOG and IOTA protocols in all women and in various subgroups
Percentage of all
tumours classified
as moderate risk
Percentage
of benign tumours
classified as low risk
Percentage of
benign tumours
classified as high risk
668
Percentage of
invasive tumours
classified as low risk
Percentage of
invasive tumours
classified as high risk
5.3
8.8
5.2
3.0
73.6
88.6
3.9
5.1
10.1
6.7
56.7
78.3
9.2
18.6
3.3
1.6
80.1
92.6
6.0
10.4
5.6
3.3
73.3
88.0
6.9
11.7
4.7
3.2
74.0
88.2
6.8
11.5
4.8
2.9
75.5
88.0
3.3
4.1
16.7
0
47.3
100
3.9
6.7
3.1
4.7
73.2
87.5
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
for the IOTA protocol (three out of 64), but this difference
was based on only one woman.
Discussion
This study has shown that the IOTA protocol is more
accurate than the RMI-based protocol recommended by
the RCOG for classifying adnexal masses as being at low,
moderate or high risk of malignancy. The IOTA protocol
resulted in a substantial increase in the number of benign
ovarian masses classified as low risk and in the number of
invasive tumours classified as high risk. It was also associated with a reduction in the number of invasive tumours
classified as low risk. The difference between the protocols
was similar in all subgroups considered. Further, the IOTA
protocol performed fairly consistently in all histological
types of invasive tumours, whereas the RCOG protocol
showed poorer performance in mucinous, non-epithelial
primary invasive, and metastatic invasive tumours. This is
likely to be explained by the lower expression of CA125 in
these tumour types (data not shown).
The major strengths of this study are that the protocols
were applied on a large, multicentre, international database.
This means that the results are likely to be robust. Also,
the performance of the RMI and the LR2 logistic regression
model was evaluated in a way that corresponds to their use
in clinical practice instead ofas in many other evaluation
studiesrelying on the area under the receiver operating
charactieristics curve.
Our study also has limitations, one being the missing
values for CA125. This issue is appropriately tackled with
the technique of multiple imputation,27 and with a comparison of the results based on multiple imputation
(Table 4) with those based only on cases with available
CA125 levels. It is reassuring that the results based on
women with available information were very similar to
those obtained after multiple imputation. Another weakness is that information on the presence of metastases
was sometimes missing. As described in the Methods section, different strategies to manage the missing values
were compared, and this did not affect the conclusion of
this study. Results based on women with available CA125
levels as well as information on the presence of metastases
(Table 6) were similar to the main results presented in
Table 4. A third criticism could be that both the RMI
and the LR2 were used in the hands of experienced ultrasound operators with a specific interest in adnexal pathology, and that therefore, the results of this study might
not be generalisable to less experienced ultrasound examiners. However, we believe that it should be possible for
any qualified ultrasound practitioner to obtain reliable
information on all the ultrasound variables required for
both models.
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
669
Conclusion
In summary, we have shown that a management protocol
based on triaging women using the IOTA logistic regression model LR2 performs significantly better than the
RMI-based protocol that is currently proposed by the
RCOG. LR2 has been developed on a large database of
adnexal masses and undergone external validation in several different centres. We believe that the LR2 model
should be considered as an alternative to the RMI for
inclusion in triaging protocols for adnexal pathology.
Disclosure of interest
The authors declare that they have no competing interests.
Contribution to authorship
BVC, DT, LV, AM and TB designed the study. DT, LV,
ACT and IV were involved in data acquisition. BVC performed the statistical analysis. BVC, DT, LV, AM, SGM
and TB interpreted the results and BVC and TB drafted the
manuscript. BVC, DT, LV, AM, SGM, ACT, IV, TB revised
the manuscript and approved the final version of the manuscript.
Supporting Information
Additional Supporting Information may be found in the
online version of this article.
Table S1. Triage results for patients when using the
RCOG protocol, the IOTA protocol, and the IOTA+CA125
protocol.
Please note: Wiley-Blackwell is not responsible for the
content or functionality of any supporting information
supplied by the authors. Any queries (other than missing
material) should be directed to the corresponding
author. j
References
Funding
This work was supported by the Research Foundation
Flanders (FWO) (grants 1251609N, 1251612N, G049312N),
Flanders Agency for Innovation by Science and Technology
(IWT) (grant IWT-TBM070706-IOTA3); Swedish Medical
Research Council: grants numbers K2001-72X 11605-06A,
K2002-72X-11605-07B, K2004-73X-11605-09A and K200673X-11605-11-3; funds administered by Malmo University
Hospital; and two Swedish governmental grants: ALFmedel and Landstingsfinansierad Regional Forskning. Tom
Bourne is supported by Imperial Healthcare NHS Trust
NIHR Biomedical Research Centre.
Acknowledgements
B. Van Calster is a postdoctoral fellow of the Research FoundationFlanders (FWO). We acknowledge the free use of a dedicated
study screen developed by astraia GMBH. The Steering Committee of the International Ovarian Tumor Analysis (IOTA) study
group consists of Dirk Timmerman (University Hospitals
K.U.Leuven, Leuven, Belgium), Lil Valentin (Skane University
670
Hospital Malmo, Lund University, Sweden), Tom Bourne (Imperial College Hammersmith Campus, London, UK), Antonia C
Testa (Universita` Cattolica di Sacre Cuore, Rome, Italy), Sabine
Van Huffel (Katholieke Universiteit Leuven, Leuven, Belgium)
Ignace Vergote (University Hospitals K.U.Leuven, Leuven, Belgium). The IOTA principal investigators are (in alphabetical
order) Jean-Pierre Bernard (Maurepas, France), Artur Czekierdowski (Lublin, Poland), Elisabeth Epstein (Lund and Stockholm,
Sweden), Enrico Ferrazzi (Milano, Italy), Daniela Fischerova
(Prague, Czech Republic), Dorella Franchi (Milano, Italy), Robert
Fruscio (Monza, Italy), Stefano Greggi (Napoli, Italy), Stefano
Guerriero (Cagliari, Italy), Jingzhang (Beijing, China), Davor Jurkovic (London, UK), Fabrice Lecuru (Paris, France), Francesco
PG Leone (Milano, Italy), Andrea A Lissoni (Monza, Italy), Ulrike Metzger (Paris, France), Henry Muggah (Hamilton, ON,
Canada), Dario Paladini (Napoli, Italy), Alberto Rossi (Udine,
Italy), Luca Savelli (Bologna, Italy), Antonia C Testa (Roma,
Italy), Dirk Timmerman (Leuven, Belgium), Diego Trio (Milano,
Italy), Lil Valentin (Malmo, Sweden), Caroline Van Holsbeke
(Genk, Belgium), Gerardo Zanetta [deceased] (Monza, Italy).
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
8 Engelen MJA, Kos HE, Willemse PHB, Aalders JG, de Vries EGE,
Schaapveld M, et al. Surgery by consultant gynecologic oncologists
improves survival in patients with ovarian carcinoma. Cancer
2006;106:58998.
9 Jacobs I, Oram D, Fairbanks J, Turner J, Frost C, Grudzinskas JG. A
risk of malignancy index incorporating CA 125, ultrasound and menopausal status for the accurate preoperative diagnosis of ovarian
cancer. Br J Obstet Gynaecol 1990;97:9229.
10 Geomini P, Kruitwagen R, Bremer GL, Cnossen J, Mol BWJ. The
accuracy of risk scores in predicting ovarian malignancy. A systematic review. Obstet Gynecol 2009;113:38494.
11 Clarke SE, Grimshaw R, Rittenberg P, Kieser K, Bentley J. Risk of
malignancy index in the evaluation of patients with adnexal masses.
J Obstet Gynaecol Can 2009;31:4405.
12 Bailey J, Tailor A, Naik R, Lopes A, Godfrey K, Hatem HM, et al. Risk
of malignancy index for referral of ovarian cancer cases to a tertiary
center: does it identify the correct cases? Int J Gynecol Cancer
2006;16(Suppl. 1):304.
13 National Collaborating Centre for Cancer. Recognition and initial
management of ovarian cancer: NICE clinical guideline 122. London,
UK: National Institute for Health and Clinical Excellence, 2011, Available from: www.nice.org.uk/CG122. Last accessed 13 February
2012.
14 Rufford BD, Jacobs IJ. Green-top Guideline No. 34. Ovarian cysts in
postmenopausal women. London, UK: Royal College of Obstetricians
and Gynaecologists, 2003, Available from: http://www.rcog.org.uk/
files/rcog-corp/GTG3411022011.pdf. Last accessed 13 February
2012.
15 Timmerman D, Testa AC, Bourne T, Ferrazzi E, Ameye L, Konstantinovic ML, et al. A logistic regression model to distinguish between
the benign and malignant adnexal mass before surgery: a multicenter study by the International Ovarian Tumor Analysis (IOTA) group.
J Clin Oncol 2005;23:8794801.
16 Van Holsbeke C, Van Calster B, Testa AC, Domali E, Lu C, Van Huffel S, et al. Predicting malignancy of ovarian tumors: prospective
evaluation of models from the IOTA study. Clin Cancer Res 2009;
15:68491.
17 Timmerman D, Van Calster B, Testa AC, Guerriero S, Fischerova D,
Lissoni AA, et al. Ovarian cancer prediction by logistic regression
models: a prospective evaluation of diagnostic accuracy. Ultrasound
Obstet Gynecol 2010;36:22634.
18 Van Holsbeke C, Van Calster B, Bourne T, Ajossa S, Testa AC,
Guerriero S, et al. External validation of diagnostic models to
19
20
21
22
23
24
25
26
27
28
29
30
2012 The Authors BJOG An International Journal of Obstetrics and Gynaecology 2012 RCOG
671