The development of a machine learning algorithm to identify occupational injuries in agriculture using pre-hospital care reports.
-
2021/07/29
-
File Language:
English
Details
-
Personal Author:
-
Description:Purpose: Current injury surveillance efforts in agriculture are considerably hampered by the limited quantity of occupation or industry data in current health records. This has impeded efforts to develop more accurate injury burden estimates and has negatively impacted the prioritization of workplace health and safety in state and federal public health efforts. This paper describes the development of a Naïve Bayes machine learning algorithm to identify occupational injuries in agriculture using existing administrative data, specifically in pre-hospital care reports (PCR). Methods: A Naïve Bayes machine learning algorithm was trained on PCR datasets from 2008-2010 from Maine and New Hampshire and tested on newer data from those states between 2011 and 2016. Further analyses were devoted to establishing the generalizability of the model across various states and various years. Dual visual inspection was used to verify the records subset by the algorithm. Results: The Naïve Bayes machine learning algorithm reduced the volume of cases that required visual inspection by 69.5 percent over a keyword search strategy alone. Coders identified 341 true agricultural injury records (Case class = 1) (Maine 2011-2016, New Hampshire 2011-2015). In addition, there were 581 (Case class = 2 or 3) that were suspected to be agricultural acute/traumatic events, but lacked the necessary detail to make a certain distinction. Conclusions: The application of the trained algorithm on newer data reduced the volume of records requiring visual inspection by two thirds over the previous keyword search strategy, making it a sustainable and cost-effective way to understand injury trends in agriculture. [Description provided by NIOSH]
-
Subjects:
-
Keywords:
-
ISSN:2047-2501
-
Document Type:
-
Funding:
-
Genre:
-
Place as Subject:
-
CIO:
-
Topic:
-
Location:
-
Pages in Document:31
-
Volume:9
-
Issue:1
-
NIOSHTIC Number:nn:20064776
-
Citation:Health Inf Sci Syst 2021 Jul; 9(1):31
-
Contact Point Address:Erika Scott, Northeast Center for Occupational Health and Safety in Agriculture, Forestry, and Fishing, Bassett Medical Center, Cooperstown, NY, USA
-
Email:Erika.scott@bassett.org
-
Federal Fiscal Year:2021
-
NORA Priority Area:
-
Performing Organization:Mary Imogene Bassett Hospital, Cooperstown, New York
-
Peer Reviewed:True
-
Start Date:20010930
-
Source Full Name:Health Information Science and Systems
-
End Date:20270831
-
Collection(s):
-
Main Document Checksum:urn:sha-512:d5f64501d29c7afdac415e53f507217a52d46c2a39991837d93ec06d90f06b50a03af9c02a00daa9dd7b3c4b5a0808dbdd8396758dbf027598c0c9580f9c915b
-
Download URL:
-
File Type:
File Language:
English
ON THIS PAGE
CDC STACKS serves as an archival repository of CDC-published products including
scientific findings,
journal articles, guidelines, recommendations, or other public health information authored or
co-authored by CDC or funded partners.
As a repository, CDC STACKS retains documents in their original published format to ensure public access to scientific information.
As a repository, CDC STACKS retains documents in their original published format to ensure public access to scientific information.
You May Also Like