Reliability and agreement of manual and automated morphological radiographic hip measurements

Authors:

F. Boel, N.S. Riedstra, J. Tang, D.F. Hanff, H. Ahedi, V. Arbabi, N.K. Arden, S.M.A. Bierma-Zeinstra, M.M.A. van Buuren, F.M. Cicuttini, T.F. Cootes, K. Crossley, D. Eygendaal, D.T. Felson, W.P. Gielis, J. Heerey, G. Jones, S. Kluzek, N.E. Lane, C. Lindner, J. Lynch, J. van Meurs, A.E. Nelson, A.B. Mosler, M.C. Nevitt, E.H. Oei, J. Runhaar, H. Weinans, R. Agricola

Abstract

Objective

To determine the reliability and agreement of manual and automated morphological measurements, and agreement in morphological diagnoses.

Methods

Thirty pelvic radiographs were randomly selected from the World COACH consortium. Manual and automated measurements of acetabular depth-width ratio (ADR), modified acetabular index (mAI), alpha angle (AA), Wiberg center edge angle (WCEA), lateral center edge angle (LCEA), extrusion index (EI), neck-shaft angle (NSA), and triangular index ratio (TIR) were performed. Bland-Altman plots and intraclass correlation coefficients (ICCs) were used to test reliability. Agreement in diagnosing acetabular dysplasia, pincer and cam morphology by manual and automated measurements was assessed using percentage agreement. Visualizations of all measurements were scored by a radiologist.
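The three statistics named in the Methods can be sketched as follows. This is a minimal illustration and not the authors' analysis code. The function names are hypothetical. The abstract does not state which ICC form was used, so a two-way random-effects, absolute-agreement, single-measurement ICC (often written ICC(2,1)) is assumed here. The Bland-Altman limits of agreement use the conventional mean difference plus or minus 1.96 standard deviations of the differences.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the paired differences a - b; the limits of
    agreement are bias +/- 1.96 * SD of those differences.
    """
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    ratings is a list of rows, one row per subject (e.g. per hip) and one
    column per rater or method. The ICC form is an assumption; the
    abstract does not specify it.
    """
    n = len(ratings)      # number of subjects
    k = len(ratings[0])   # number of raters or methods
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


def percentage_agreement(diagnoses_a, diagnoses_b):
    """Percentage of cases in which two sets of diagnoses coincide."""
    matches = sum(a == b for a, b in zip(diagnoses_a, diagnoses_b))
    return 100.0 * matches / len(diagnoses_a)


# Made-up values for illustration only (not data from the study).
manual_lcea = [28.0, 31.5, 24.0, 35.0, 29.5]
automated_lcea = [27.0, 32.0, 25.5, 34.0, 29.0]
bias, lower, upper = bland_altman(automated_lcea, manual_lcea)
icc = icc_2_1([list(pair) for pair in zip(manual_lcea, automated_lcea)])
```

Here each row passed to `icc_2_1` is one hip and each column is one observer or method, so the same function can serve for intraobserver, interobserver, and intermethod comparisons by changing which columns are supplied.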

Results

The Bland-Altman plots showed no or only small mean differences between automated and manual measurements for all measurements except ADR. Intraobserver ICCs of manual measurements ranged from 0.26 (95%-CI 0–0.57) for TIR to 0.95 (95%-CI 0.87–0.98) for LCEA. Interobserver ICCs of manual measurements ranged from 0.43 (95%-CI 0.10–0.68) for AA to 0.95 (95%-CI 0.86–0.98) for LCEA. Intermethod ICCs ranged from 0.46 (95%-CI 0.12–0.70) for AA to 0.89 (95%-CI 0.78–0.94) for LCEA. Radiographic diagnostic agreement ranged from 47% to 100% for the manual observers and from 63% to 96% for the automated method as assessed by the radiologist.

Conclusion

The automated algorithm performed as well as manual measurement by trained observers, attesting to its reliability and efficiency in rapidly computing morphological measurements. This validated method can aid clinical practice and accelerate hip osteoarthritis research.

Read full text https://doi.org/10.1016/j.ocarto.2024.100510