Uncovering Misattributed Suicide Causes through Annotation Inconsistency Detection in Death Investigation Notes

Type: Preprint

Publication Date: 2024-03-28

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2403.19432

Abstract

Data accuracy is essential for scientific research and policy development. The National Violent Death Reporting System (NVDRS) data is widely used for discovering the patterns and causes of death. Recent studies suggested the annotation inconsistencies within the NVDRS and the potential impact on erroneous suicide-cause attributions. We present an empirical Natural Language Processing (NLP) approach to detect annotation inconsistencies and adopt a cross-validation-like paradigm to identify problematic instances. We analyzed 267,804 suicide death incidents between 2003 and 2020 from the NVDRS. Our results showed that incorporating the target state's data into training the suicide-crisis classifier brought an increase of 5.4% to the F-1 score on the target state's test set and a decrease of 1.1% on other states' test set. To conclude, we demonstrated the annotation inconsistencies in NVDRS's death investigation notes, identified problematic instances, evaluated the effectiveness of correcting problematic instances, and eventually proposed an NLP improvement solution.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ PDF Chat Associations Between Natural Language Processingā€“Enriched Social Determinants of Health and Suicide Death Among US Veterans 2023 Avijit Mitra
Richeek Pradhan
Rachel Melamed
Kun Chen
David C. Hoaglin
Katherine L. Tucker
Joel I. Reisman
Zhichao Yang
Weisong Liu
Jack Tsai
+ PDF Chat Am I No Good? Towards Detecting Perceived Burdensomeness and Thwarted Belongingness from Suicide Notes 2022 Soumitra Ghosh
Asif Ekbal
Pushpak Bhattacharyya
+ Am I No Good? Towards Detecting Perceived Burdensomeness and Thwarted Belongingness from Suicide Notes 2022 Soumitra Ghosh
Asif Ekbal
Pushpak Bhattacharyya
+ The Hitchhiker's Guide to Computational Linguistics in Suicide Prevention 2021 Yaakov Ophir
Refael Tikochinski
Anat Brunstein Klomek
Roi Reichart
+ Leveraging Contextual Relatedness to Identify Suicide Documentation in Clinical Notes through Zero Shot Learning 2023 T. Elizabeth Workman
Joseph L. Goulet
Cynthia Brandt
Allison R. Warren
Jacob Eleazer
Melissa Skanderson
Luke Lindemann
John R. Blosnich
John O Leary
Qing Zeng Treitler
+ PDF Chat Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models 2024 Zehan Li
Yan Hu
Scott D. Lane
Salih Selek
Lokesh Shahani
Rodrigo Machadoā€Vieira
Jair C. Soares
Hua Xu
Hongfang Liu
Ming Huang
+ PDF Chat ScAN: Suicide Attempt and Ideation Events Dataset 2022 Bhanu Pratap Singh Rawat
Samuel Kovaly
Hong Yu
Wilfred R. Pigeon
+ Associations Between Natural Language Processing (NLP) Enriched Social Determinants of Health and Suicide Death among US Veterans 2022 Avijit Mitra
Richeek Pradhan
Rachel Melamed
Kun Chen
David C. Hoaglin
Katherine L. Tucker
Joel I. Reisman
Zhichao Yang
Weisong Liu
Jack Tsai
+ ScAN: Suicide Attempt and Ideation Events Dataset 2022 Bhanu Pratap Singh Rawat
Samuel Kovaly
Wilfred R. Pigeon
Hong Yu
+ A Systematic Literature Review of Automated ICD Coding and Classification Systems using Discharge Summaries 2021 Rajvir Kaur
Jeewani Anupama Ginige
Oliver Obst
+ Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains 2021 Chenghao Yang
Yudong Zhang
Smaranda Muresan
+ Verbal Autopsy in Civil Registration and Vital Statistics: The Symptom-Cause Information Archive. 2019 Samuel J. Clark
Martin W. Bratschi
Philip Setel
Carla AbouZahr
Don de Savigny
Zehang Li
Tyler H. McCormick
Peter Byass
Daniel Chandramohan
+ PDF Chat Data Quality Matters: Suicide Intention Detection on Social Media Posts Using a RoBERTa-CNN Model 2024 Emily Lin
Jian Sun
Hsingyu Chen
Mohammad H. Mahoor
+ Verbal Autopsy in Civil Registration and Vital Statistics: The Symptom-Cause Information Archive 2019 Samuel J. Clark
Martin W. Bratschi
Philip Setel
Carla AbouZahr
Don de Savigny
Zehang Li
Tyler H. McCormick
Peter Byass
Daniel Chandramohan
+ PDF Chat Improving Cause-of-Death Classification from Verbal Autopsy Reports 2022 Thokozile Manaka
Terence L. van Zyl
D. Kar
+ PDF Chat Academic Case Reports Lack Diversity: Assessing the Presence and Diversity of Sociodemographic and Behavioral Factors related with Post COVID-19 Condition 2025 Juan Florez
Shaina Raza
Richard Lynn
Zahra Shakeri
Brendan T. Smith
Elham Dolatabadi
+ PDF Chat Adapting Coreference Resolution for Processing Violent Death Narratives 2021 Ankith Uppunda
Susan D. Cochran
Jacob G. Foster
Alina Arseniev-Koehler
Vickie M. Mays
Kai-Wei Chang
+ Indonesia's first suicide statistics profile: an analysis of suicide and attempt rates, underreporting, geographic distribution, gender, method, and rurality 2024 Sandersan Onie
Yuslely Usman
Retno Widyastuti
Merry Lusiana
Tri Juni Angkasawati
D. Anwar Musadad
Jessica Felisa Nilam
Ashra Vina
Rizal Kamsurya
Philip J. Batterham
+ PDF Chat Suicide Risk Assessment on Social Media with Semi-Supervised Learning 2024 Max Lovitt
Hui Ma
Song Wang
Yifan Peng
+ SUICIDE STATISTICS. 1905

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors