External Validation of the "2021 AAGL Endometriosis Classification": A Retrospective Cohort Study. Mak, J., Eathorne, A., Leonardi, M., Espada, M., Reid, S., Zanardi, J. V., Uzuner, C., Rocha, R., Armour, M., & Condous, G. Journal of Minimally Invasive Gynecology, 30(5):374–381, May, 2023.
doi  abstract   bibtex   
STUDY OBJECTIVE: Externally validate the American Association of Gynecologic Laparoscopists (AAGL) staging system against surgical complexity and compare diagnostic accuracy with revised American Society for Reproductive Medicine (rASRM) stage, as was done in original publication. DESIGN: Retrospective, diagnostic accuracy study. SETTING: Multicenter (Sydney, Australia). PATIENTS: A total of 317 patients (January 2016-October 2021) were used in the final analysis. INTERVENTIONS: A database of patients with coded surgical data was analyzed. MEASUREMENTS AND MAIN RESULTS: Three independent observers assigned an AAGL surgical stage (1-4) as the index test and surgical complexity level (A-D) as the reference standard. Results from the most accurate of the 3 observers were used in the final analysis. The weighted kappa score for the overall performance of AAGL stage and rASRM to predict AAGL level was 0.48 and 0.48, respectively (no difference). This represents weaker agreement with AAGL level than was observed in the reference paper, which reported a weighted kappa of 0.62. Diagnostic accuracy (sensitivity, specificity, positive predictive value, and negative predictive value) for stage 1 to predict level A was 98.5%, 64.3%, 66.3%, and 98.3%; stage 2 to predict level B 31.2%, 90.5%, 27.0%, and 92.1 %; stage 3 to predict level C 12.3%, 94.1%, 59.3%, and 60.7%; stage 4 to predict level D 95.65%, 88.10%, 38.60%, and 99.62%. Area under the receiver operating characteristic curve for A vs B/C/D (cut point 9) was 0.87, A/B vs C/D (cut point 16) was 0.78, and A/B/C vs D (cut point 22) was 0.94. CONCLUSION: There was weak to moderate agreement between AAGL stage and AAGL surgical complexity level. Across all key indicators, the AAGL system did not perform as well in this external validation, nor did it outperform rASRM as it did in the reference paper. Results suggest the system is not generalizable.
@article{mak_external_2023,
	title = {External {Validation} of the "2021 {AAGL} {Endometriosis} {Classification}": {A} {Retrospective} {Cohort} {Study}},
	volume = {30},
	issn = {1553-4669},
	shorttitle = {External {Validation} of the "2021 {AAGL} {Endometriosis} {Classification}"},
	doi = {10.1016/j.jmig.2022.12.012},
	abstract = {STUDY OBJECTIVE: Externally validate the American Association of Gynecologic Laparoscopists (AAGL) staging system against surgical complexity and compare diagnostic accuracy with revised American Society for Reproductive Medicine (rASRM) stage, as was done in original publication.
DESIGN: Retrospective, diagnostic accuracy study.
SETTING: Multicenter (Sydney, Australia).
PATIENTS: A total of 317 patients (January 2016-October 2021) were used in the final analysis.
INTERVENTIONS: A database of patients with coded surgical data was analyzed.
MEASUREMENTS AND MAIN RESULTS: Three independent observers assigned an AAGL surgical stage (1-4) as the index test and surgical complexity level (A-D) as the reference standard. Results from the most accurate of the 3 observers were used in the final analysis. The weighted kappa score for the overall performance of AAGL stage and rASRM to predict AAGL level was 0.48 and 0.48, respectively (no difference). This represents weaker agreement with AAGL level than was observed in the reference paper, which reported a weighted kappa of 0.62. Diagnostic accuracy (sensitivity, specificity, positive predictive value, and negative predictive value) for stage 1 to predict level A was 98.5\%, 64.3\%, 66.3\%, and 98.3\%; stage 2 to predict level B 31.2\%, 90.5\%, 27.0\%, and 92.1 \%; stage 3 to predict level C 12.3\%, 94.1\%, 59.3\%, and 60.7\%; stage 4 to predict level D 95.65\%, 88.10\%, 38.60\%, and 99.62\%. Area under the receiver operating characteristic curve for A vs B/C/D (cut point 9) was 0.87, A/B vs C/D (cut point 16) was 0.78, and A/B/C vs D (cut point 22) was 0.94.
CONCLUSION: There was weak to moderate agreement between AAGL stage and AAGL surgical complexity level. Across all key indicators, the AAGL system did not perform as well in this external validation, nor did it outperform rASRM as it did in the reference paper. Results suggest the system is not generalizable.},
	language = {eng},
	number = {5},
	journal = {Journal of Minimally Invasive Gynecology},
	author = {Mak, Jason and Eathorne, Allie and Leonardi, Mathew and Espada, Mercedes and Reid, Shannon and Zanardi, Jose Vitor and Uzuner, Cansu and Rocha, Rodrigo and Armour, Mike and Condous, George},
	month = may,
	year = {2023},
	pmid = {36621635},
	keywords = {Australia, Endometriosis, Female, Humans, Laparoscopy, ROC Curve, Retrospective Studies, Staging, Ultrasound, United States},
	pages = {374--381},
}

Downloads: 0