VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography

Type: Article

Publication Date: 2023-05-12

Citations: 48

DOI: https://doi.org/10.1038/s41597-023-02100-7

Abstract

Abstract Mammography, or breast X-ray imaging, is the most widely used imaging modality to detect cancer and other breast diseases. Recent studies have shown that deep learning-based computer-assisted detection and diagnosis (CADe/x) tools have been developed to support physicians and improve the accuracy of interpreting mammography. A number of large-scale mammography datasets from different populations with various associated annotations and clinical data have been introduced to study the potential of learning-based methods in the field of breast radiology. With the aim to develop more robust and more interpretable support systems in breast imaging, we introduce VinDr-Mammo, a Vietnamese dataset of digital mammography with breast-level assessment and extensive lesion-level annotations, enhancing the diversity of the publicly available mammography data. The dataset consists of 5,000 mammography exams, each of which has four standard views and is double read with disagreement (if any) being resolved by arbitration. The purpose of this dataset is to assess Breast Imaging Reporting and Data System (BI-RADS) and breast density at the individual breast level. In addition, the dataset also provides the category, location, and BI-RADS assessment of non-benign findings. We make VinDr-Mammo publicly available as a new imaging resource to promote advances in developing CADe/x tools for mammography interpretation.

Locations

  • Scientific Data - View - PDF
  • PubMed Central - View
  • arXiv (Cornell University) - View - PDF
  • medRxiv (Cold Spring Harbor Laboratory) - View
  • PubMed - View

Similar Works

Action Title Year Authors
+ PDF Chat VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography 2022 Hieu T. Nguyen
Ha Q. Nguyen
Hieu H. Pham
Khanh Lam
Linh Le
Minh Quang Dao
Van Vu
+ VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography 2022 Hieu T. Nguyen
Ha Q. Nguyen
Hieu H. Pham
Khanh Lam
Linh Le
Minh Quang Dao
Van Vu
+ Performance of Machine Learning Classification in Mammography Images using BI-RADS 2023 Malitha Gunawardhana
Norbert Ć»oƂek
+ Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach 2019 William Lotter
Abdul Rahman Diab
Bryan Haslam
Jiye G. Kim
Giorgia Grisot
Eric Q. Wu
Kevin C.‐W. Wu
Jorge Onieva Onieva
Jerrold L. Boxerman
Meiyun Wang
+ Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach 2019 William Lotter
Abdul Rahman Diab
Bryan Haslam
Jiye G. Kim
Giorgia Grisot
Eric Hsiao‐Kuang Wu
Kevin Wu
Jorge Onieva Onieva
Jerrold L. Boxerman
Meiyun Wang
+ MAMMO: A Deep Learning Solution for Facilitating Radiologist-Machine Collaboration in Breast Cancer Diagnosis 2018 Trent Kyono
Fiona J. Gilbert
Mihaela van der Schaar
+ Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions 2023 Luyang Luo
Xi Wang
Yi Lin
Xiaoqi Ma
Andong Tan
Ronald Chan
Varut Vardhanabhuti
Chiu‐Wing Winnie Chu
Kwang‐Ting Cheng
Hao Chen
+ PDF Chat Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions 2024 Luyang Luo
Xi Wang
Yi Lin
Xiaoqi Ma
Andong Tan
Ronald Chan
Varut Vardhanabhuti
Chiu‐Wing Winnie Chu
Kwang‐Ting Cheng
Hao Chen
+ PDF Chat Deep-Learning-Based Computer-Aided Systems for Breast Cancer Imaging: A Critical Review 2020 Yuliana Jiménez-Gaona
María José Rodríguez-Álvarez
Vasudevan Lakshminarayanan
+ MammoDG: Generalisable Deep Learning Breaks the Limits of Cross-Domain Multi-Center Breast Cancer Screening 2023 Yijun Yang
Shujun Wang
Lihao Liu
Sarah Hickman
Fiona J. Gilbert
Carola‐Bibiane Schönlieb
Angelica I. Aviles‐Rivero
+ Reduction of Surgical Risk Through the Evaluation of Medical Imaging Diagnostics. 2020 Marco A. V. M. Grinet
Nuno M. GarcĂ­a
Ana Gouveia
J. Moutinho
Abel Gomes
+ Reduction of Surgical Risk Through the Evaluation of Medical Imaging Diagnostics 2020 Marco A. V. M. Grinet
Nuno M. GarcĂ­a
Ana I. R. Gouveia
J. Moutinho
Abel Gomes
+ Meta-repository of screening mammography classifiers 2021 Benjamin Stadnick
Jan Witowski
Vishwaesh Rajiv
Jakub ChƂędowski
Farah E. Shamout
Kyunghyun Cho
Krzysztof J. Geras
+ PDF Chat Deep Learning to Improve Breast Cancer Detection on Screening Mammography 2019 Li Shen
Laurie R. Margolies
Joseph H. Rothstein
Eugene Fluder
Russell B. McBride
Weiva Sieh
+ Deep Learning Based Computer-Aided Systems for Breast Cancer Imaging : A Critical Review 2020 Yuliana Jiménez-Gaona
María José Rodríguez-Álvarez
Vasudevan Lakshminarayanan
+ Deep Learning Based Computer-Aided Systems for Breast Cancer Imaging : A Critical Review 2020 Yuliana Jiménez-Gaona
María José Rodríguez-Álvarez
Vasudevan Lakshminarayanan
+ Detecting and classifying lesions in mammograms with Deep Learning 2017 DezsƑ Ribli
Anna HorvĂĄth
Zsuzsa Unger
PĂ©ter Pollner
IstvĂĄn Csabai
+ Detecting and classifying lesions in mammograms with Deep Learning 2017 DezsƑ Ribli
Anna HorvĂĄth
Zsuzsa Unger
PĂ©ter Pollner
IstvĂĄn Csabai
+ Publicly available datasets of breast histopathology H&E whole-slide images: A scoping review 2023 Masoud Tafavvoghi
Lars Ailo Bongo
Nikita Shvetsov
Lill‐Tove Busund
Kajsa MĂžllersen
+ The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.5M Screening and Diagnostic Mammograms 2022 Jiwoong Jeong
Brianna L. Vey
A. Buvan Reddy
Thomas Kim
Thiago de Santana Santos
RamĂłn Correa
Raman Dutt
Marina MoĆĄunjac
Gabriela Oprea‐Ilies
Geoffrey Smith

Works That Cite This (9)

Action Title Year Authors
+ PDF Chat MamT<sup>4</sup>: Multi-View Attention Networks for Mammography Cancer Classification 2024 Alisher Ibragimov
ĐĄ.А. ĐĄĐ”ĐœĐŸŃ‚Ń€ŃƒŃĐŸĐČĐ°
Arsenii A. Litvinov
Egor Ushakov
Evgeny Karpulevich
Yury Markin
+ PDF Chat Learning From Multiple Expert Annotators for Enhancing Anomaly Detection in Medical Image Analysis 2023 Khiem H. Le
Tuan V. Tran
Hieu H. Pham
Hieu T. Nguyen
Tung Le
Ha Q. Nguyen
+ PDF Chat A Novel Transparency Strategy-based Data Augmentation Approach for BI-RADS Classification of Mammograms 2023 Sam B. Tran
Huyen T. X. Nguyen
Chi M. Phan
Ha Q. Nguyen
Hieu H. Pham
+ PDF Chat In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities Detection 2023 Thanh-Huy Nguyen
Thinh B. Lam
Quan T. D. Tran
Minh T. Nguyen
Dat T. Chung
Vinh Quang Dinh
+ PDF Chat VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography 2023 Hieu Nguyen
Ha Q. Nguyen
Hieu H. Pham
Khanh Lam
Linh Le
Minh Quang Dao
Van Vu
+ PDF Chat Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis 2023 Thanh-Huy Nguyen
Quang Hien Kha
Thai Ngoc Toan Truong
Ba Thinh Lam
Ba Hung Ngo
Quang Dinh
Nguyen Quoc Khanh Le
+ MV-Swin-T: Mammogram Classification with Multi-View Swin Transformer 2024 Sushmita Sarker
Prithul Sarker
George Bebis
Alireza Tavakkoli
+ MAM-E: Mammographic synthetic image generation with diffusion models 2023 Ricardo Montoya-del-Angel
Karla Sam-Millan
Joan C. Vilanova
Robert MartĂ­
+ PDF Chat MAM-E: Mammographic Synthetic Image Generation with Diffusion Models 2024 Ricardo Montoya-del-Angel
Karla Sam-Millan
Joan C. Vilanova
Robert MartĂ­