An Accurate and Explainable Deep Learning System Improves Interobserver Agreement in the Interpretation of Chest Radiograph

Type: Preprint

Publication Date: 2021-09-30

Citations: 4

DOI: https://doi.org/10.1101/2021.09.28.21264286

Abstract

ABSTRACT Interpretation of chest radiographs (CXR) is a difficult but essential task for detecting thoracic abnormalities. Recent artificial intelligence (AI) algorithms have achieved radiologist-level performance on various medical classification tasks. However, only a few studies addressed the localization of abnormal findings from CXR scans, which is essential in explaining the image-level classification to radiologists. Additionally, the actual impact of AI algorithms on the diagnostic performance of radiologists in clinical practice remains relatively unclear. To bridge these gaps, we developed an explainable deep learning system called VinDr-CXR that can classify a CXR scan into multiple thoracic diseases and, at the same time, localize most types of critical findings on the image. VinDr-CXR was trained on 51,485 CXR scans with radiologist-provided bounding box annotations. It demonstrated a comparable performance to experienced radiologists in classifying 6 common thoracic diseases on a retrospective validation set of 3,000 CXR scans, with a mean area under the receiver operating characteristic curve (AUROC) of 0.967 (95% confidence interval [CI]: 0.958–0.975). The sensitivity, specificity, F1 -score, false-positive rate (FPR), and false-negative rate (FNR) of the system at the optimal cutoff value were 0.933 (0.898–0.964), 0.900 (0.887–0.911), 0.631 (0.589–0.672), 0.101 (0.089– 0.114) and 0.067 (0.057–0.102), respectively. For the localization task with 14 types of lesions, our free-response receiver operating characteristic (FROC) analysis showed that the VinDr-CXR achieved a sensitivity of 80.2% at the rate of 1.0 false-positive lesion identified per scan. A prospective study was also conducted to measure the clinical impact of the VinDr-CXR in assisting six experienced radiologists. The results indicated that the proposed system, when used as a diagnosis supporting tool, significantly improved the agreement between radiologists themselves with an increase of 1.5% in mean Fleiss’ Kappa. We also observed that, after the radiologists consulted VinDr-CXR’s suggestions, the agreement between each of them and the system was remarkably increased by 3.3% in mean Co-hen’s Kappa. Altogether, our results highlight the potentials of the proposed deep learning system as an effective assistant to radiologists in clinical practice. Part of the dataset used for developing the VinDr-CXR system has been made publicly available at https://physionet.org/content/vindr-cxr/1.0.0/ .

Locations

  • arXiv (Cornell University) - View - PDF
  • medRxiv (Cold Spring Harbor Laboratory) - View - PDF

Similar Works

Action Title Year Authors
+ PDF Chat An Accurate and Explainable Deep Learning System Improves Interobserver Agreement in the Interpretation of Chest Radiograph 2022 Hieu H. Pham
Ha Q. Nguyen
Hieu Nguyen
Linh Le
Khanh Lam
+ An Accurate and Explainable Deep Learning System Improves Interobserver Agreement in the Interpretation of Chest Radiograph 2022 Hieu H. Pham
Ha Q. Nguyen
Hieu T. Nguyen
Linh Le
Khanh Lam
+ Evaluating the impact of an explainable machine learning system on the interobserver agreement in chest radiograph interpretation 2023 Hieu H. Pham
Ha Q. Nguyen
Hieu T. Nguyen
Linh Le
Khanh Lam
+ Can Deep Learning Reliably Recognize Abnormality Patterns on Chest X-rays? A Multi-Reader Study Examining One Month of AI Implementation in Everyday Radiology Clinical Practice 2023 Daniel Kvak
Anna Chromcová
Petra Ovesná
Jakub Dandár
M. Biros
Robert Hrubý
Daniel Dufek
Marija Pajdaković
+ Can Artificial Intelligence Reliably Report Chest X-Rays?: Radiologist Validation of an Algorithm trained on 2.3 Million X-Rays 2018 Preetham Putha
Manoj Tadepalli
Bhargava Reddy
Tarun Raj
Justy Antony Chiramal
Shalini Govil
Namita Sinha
Manjunath Ks
Sundeep Reddivari
Ammar Jagirdar
+ PDF Chat SHAMSUL: Systematic Holistic Analysis to investigate Medical Significance Utilizing Local interpretability methods in deep learning for chest radiography pathology prediction 2023 Mahbub Ul Alam
Jaakko Hollmén
Jón Rúnar Baldvinsson
Rahim Rahmani
+ PDF Chat Rethinking Annotation Granularity for Overcoming Shortcuts in Deep Learning–based Radiograph Diagnosis: A Multicenter Study 2022 Luyang Luo
Hao Chen
Yongjie Xiao
Yanning Zhou
Xi Wang
Varut Vardhanabhuti
Mingxiang Wu
Chu Han
Zaiyi Liu
Benjamin Fang
+ PDF Chat Computer-Aided Diagnosis of Thoracic Diseases in Chest X-rays using hybrid CNN-Transformer Architecture 2024 Sonit Singh
+ PDF Chat A foundation model for generalizable disease diagnosis in chest X-ray images 2024 Lijian Xu
Ziyu Ni
Haowei Sun
Hongsheng Li
Shaoting Zhang
+ PDF Chat Automated identification of thoracic pathology from chest radiographs with enhanced training pipeline 2019 Adora M. DSouza
Anas Z. Abidin
Axel Wismüller
+ Effect of Radiology Report Labeler Quality on Deep Learning Models for Chest X-Ray Interpretation 2021 Saahil Jain
Akshay Smit
Andrew Y. Ng
Pranav Rajpurkar
+ Local Adaptation Improves Accuracy of Deep Learning Model for Automated X-Ray Thoracic Disease Detection : A Thai Study 2020 Isarun Chamveha
Trongtum Tongdee
Pairash Saiviroonporn
Warasinee Chaisangmongkon
+ A clinical validation of VinDr-CXR, an AI system for detecting abnormal chest radiographs 2021 Ngoc Huy Nguyen
Ha Q. Nguyen
Nghia T. Nguyen
Thắng Việt Nguyễn
Hieu Huy Pham
Tuan Ngoc-Minh Nguyen
+ A Deep Learning Based Workflow for Detection of Lung Nodules With Chest Radiograph 2021 Yang Tai
Yu-Wen Fang
Fang‐Yi Su
Jung-Hsien Chiang
+ Enhancing Early Lung Cancer Detection on Chest Radiographs with AI-assistance: A Multi-Reader Study 2022 Gaëtan Dissez
Nicole Tay
Tom Dyer
Matthew Tam
Richard Dittrich
David Doyne
James Hoare
Jackson J. Pat
Stephanie K. Patterson
Amanda Stockham
+ XProtoNet: Diagnosis in Chest Radiography with Global and Local Explanations 2021 Eunji Kim
Siwon Kim
Minji Seo
Sungroh Yoon
+ PDF Chat XProtoNet: Diagnosis in Chest Radiography with Global and Local Explanations 2021 Eunji Kim
Siwon Kim
Minji Seo
Sungroh Yoon
+ PDF Chat Deep Learning-Powered Classification of Thoracic Diseases in Chest X-Rays 2025 Yiming Lei
Michael Nguyen
T. Liu
Hyounkyun Oh
+ Rethinking Annotation Granularity for Overcoming Shortcuts in Deep Learning-based Radiograph Diagnosis: A Multicenter Study 2021 Luyang Luo
Hao Chen
Yongjie Xiao
Yanning Zhou
Xi Wang
Varut Vardhanabhuti
Mingxiang Wu
Chu Han
Zaiyi Liu
Benjamin Fang
+ PDF Chat The impact of deep learning aid on the workload and interpretation accuracy of radiologists on chest computed tomography: a cross-over reader study 2024 Anvar Kurmukov
Valeria Chernina
Regina Gareeva
Maria Dugova
EA Petrash
Olga Aleshina
Maxim Pisov
Boris Shirokikh
Valentin Samokhin
Vladislav Proskurov

Works Cited by This (19)

Action Title Year Authors
+ Measuring nominal scale agreement among many raters. 1971 Joseph L. Fleiss
+ PDF Chat The Measurement of Observer Agreement for Categorical Data 1977 J. Richard Landis
Gary G. Koch
+ PDF Chat A survey on deep learning in medical image analysis 2017 Geert Litjens
Thijs Kooi
Babak Ehteshami Bejnordi
Arnaud Arindra Adiyoso Setio
Francesco Ciompi
Mohsen Ghafoorian
Jeroen van der Laak
Bram van Ginneken
Clara I. Sá‎nchez
+ PDF Chat Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study 2018 John R. Zech
Marcus A. Badgeley
Manway Liu
Anthony Costa
J. Titano
Eric K. Oermann
+ EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks 2019 Mingxing Tan
Quoc V. Le
+ PDF Chat Focal Loss for Dense Object Detection 2017 Tsung-Yi Lin
Priya Goyal
Ross Girshick
Kaiming He
Piotr Dollár
+ PDF Chat CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison 2019 Jeremy Irvin
Pranav Rajpurkar
Michael Ko
Yifan Yu
Silviana Ciurea-Ilcus
Chris Chute
Henrik Marklund
Behzad Haghgoo
Robyn L. Ball
Katie Shpanskaya
+ Adam: A Method for Stochastic Optimization 2014 Diederik P. Kingma
Jimmy Ba
+ PDF Chat Deep Neural Networks Improve Radiologists’ Performance in Breast Cancer Screening 2019 Nan Wu
Jason Phang
Jungkyu Park
Yiqiu Shen
Zhe Huang
Masha Zorin
Stanisław Jastrzȩbski
Thibault Févry
Joe Katsnelson
Eric Kim
+ PDF Chat Exploring Large-scale Public Medical Image Datasets 2019 Luke Oakden‐Rayner