News Release

Deep learning for extremity radiographs confounded by labels

Convolutional neural networks trained to identify abnormalities on upper extremity radiographs are susceptible to a ubiquitous confounding image feature that could limit their clinical utility: radiograph labels

Peer-Reviewed Publication

American Roentgen Ray Society

Posteroanterior Radiograph of Wrist in Patient With Multifocal Osteoarthritis

image: Grad-CAM heatmaps for deep learning models trained on (A) original radiograph, shows emphasis on laterality and/or technologist initial labels; (B) radiograph with label covered by black box, shows emphasis on anatomic features, such as bones. (Colors toward red end of spectrum indicate greater emphasis, whereas colors toward blue end of spectrum indicate less importance.) view more 

Credit: American Roentgen Ray Society (ARRS), American Journal of Roentgenology (AJR)

Leesburg, VA, November 15, 2021According to an open-access Editor’s Choice article in ARRS’ American Journal of Roentgenology (AJR), convolutional neural networks (CNN) trained to identify abnormalities on upper extremity radiographs are susceptible to a ubiquitous confounding image feature that could limit their clinical utility: radiograph labels.

“We recommend that such potential image confounders be collected when possible during dataset curation, and that covering these labels be considered during CNN training,” wrote corresponding author Paul H. Yi from the University of Maryland’s Medical Intelligent Imaging Center in Baltimore.

Yi and team’s retrospective study evaluated 40,561 upper extremity musculoskeletal radiographs from Stanford’s MURA dataset that were used to train three DenseNet-121 CNN classifiers. Three inputs were used to distinguish normal from abnormal radiographs: original images with both anatomy and labels; images with laterality and/or technologist labels subsequently covered by a black box; images where anatomy had been removed and only labels remained.

For the original radiographs, AUC was 0.844, frequently emphasizing laterality and/or technologist labels for decision-making. Covering these labels increased AUC to 0.857 (p=.02) and redirected CNN attention from the labels to the bones. Using labels alone, AUC was 0.638, indicating that radiograph labels are associated with abnormal examinations.

“While we can infer that labels are associated with normal versus abnormal disease categories,” the authors of this AJR article added, “we cannot determine the specific aspect of the labels that resulted in their being confounding factors.”

An electronic supplement to this AJR article is available here.


Founded in 1900, the American Roentgen Ray Society (ARRS) is the first and oldest radiological society in North America, dedicated to the advancement of medicine through the profession of radiology and its allied sciences. An international forum for progress in medical imaging since the discovery of the x-ray, ARRS maintains its mission of improving health through a community committed to advancing knowledge and skills with an annual scientific meeting, monthly publication of the peer-reviewed American Journal of Roentgenology (AJR), quarterly issues of InPractice magazine, AJR Live Webinars and Podcasts, topical symposia, print and online educational materials, as well as awarding scholarships via The Roentgen Fund®.

MEDIA CONTACT:

Logan K. Young, PIO

44211 Slatestone Court

Leesburg, VA 20176

703-858-4332

lyoung@arrs.org


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.