Measuring interpersonal firearm violence: natural language processing methods to address limitations in criminal charge data

衡量人际枪支暴力:利用自然语言处理方法解决刑事指控数据的局限性

阅读:1

Abstract

OBJECTIVE: Firearm violence constitutes a public health crisis in the United States, but comprehensive data infrastructure is lacking to study this problem. To address this challenge, we used natural language processing (NLP) to classify court record documents from alleged violent crimes as firearm-related or non-firearm-related. MATERIALS AND METHODS: We accessed and digitized court records from the state of Washington (n = 1472). Human review established a gold standard label for firearm involvement (yes/no). We developed a key term search and trained supervised machine learning classifiers for this labeling task. Results were evaluated in a held-out test set. RESULTS: The decision tree performed best (F1 score: 0.82). The key term list had perfect recall (1.0) and a modest F1 score (0.65). DISCUSSION AND CONCLUSION: This case report highlights the accuracy, feasibility, and potential time-saved by using NLP to identify firearm involvement in alleged violent crimes based on digitized narratives from court documents.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。