Development and evaluation of a Naïve Bayesian model for coding causation of workers’ compensation claims

Supporting Files
File Language:
English


Details

  • Alternative Title:
    J Safety Res
  • Personal Author:
  • Description:
    Introduction

    Tracking and trending the rates of injuries and illnesses classified as musculoskeletal disorders (MSDs) caused by ergonomic risk factors such as overexertion and repetitive motion, and as slips, trips, or falls (STFs), across industry sectors is of high interest to many researchers. Unfortunately, identifying the cause of injuries and illnesses in large datasets such as workers’ compensation systems often requires reading and coding the free-form accident narrative for potentially millions of records.

    Method

    To alleviate the need for manual coding, this paper describes and evaluates a Naïve Bayesian auto-coding algorithm that learns from a set of previously manually coded claims and demonstrated the ability to code millions of claims quickly and accurately.

    Conclusions

    The auto-coding program coded claims as an MSD, an STF, or other with approximately 90% accuracy.

    Impact on industry

    The program developed and discussed in this paper provides an accurate and efficient method for identifying the causation of workers’ compensation claims as an STF or MSD in a large database, based on the unstructured text narrative and the resulting injury diagnoses. The program coded thousands of claims in minutes. Researchers and practitioners can use the method to relieve the manual burden of reading claims and identifying their causation as an STF or MSD. Furthermore, the method can be easily generalized to code or classify other unstructured text narratives.

  • Subjects:
  • Source:
    J Safety Res. 43(0):327-332.
  • Pubmed ID:
    23206504
  • Pubmed Central ID:
    PMC4550086
  • Document Type:
  • Funding:
  • Volume:
    43
  • Collection(s):
  • Main Document Checksum:
    urn:sha256:c50a09bfb6051442ccd4bcbbb4416b61d50b921642ac3070b92eb7d993012271
  • Download URL:
  • File Type:
    PDF - 716.05 KB
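
The Method section above describes learning from manually coded claims in order to code new ones. As a rough illustration of that idea, the following is a minimal sketch of a multinomial Naïve Bayes text classifier with add-one smoothing, written with the Python standard library only. It is not the authors' program: the function names, the whitespace tokenizer, the smoothing choice, and the six example narratives are all assumptions invented for this sketch, and it uses only narrative words, whereas the abstract says the program also draws on injury diagnoses.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a narrative and split it on whitespace (a simplifying assumption)."""
    return text.lower().split()


def train(examples, alpha=1.0):
    """Count labels and per-label words from (narrative, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocab.add(word)
    return {"labels": label_counts, "words": word_counts,
            "vocab": vocab, "alpha": alpha}


def predict(model, text):
    """Return the label with the highest log prior plus summed log likelihoods."""
    total_docs = sum(model["labels"].values())
    vocab_size = len(model["vocab"])
    alpha = model["alpha"]
    scores = {}
    for label, n_docs in model["labels"].items():
        counts = model["words"][label]
        total_words = sum(counts.values())
        score = math.log(n_docs / total_docs)
        for word in tokenize(text):
            if word not in model["vocab"]:
                continue  # words never seen in training carry no evidence
            score += math.log((counts[word] + alpha)
                              / (total_words + alpha * vocab_size))
        scores[label] = score
    return max(scores, key=scores.get)


def accuracy(model, examples):
    """Fraction of (narrative, label) pairs whose predicted label matches."""
    correct = sum(predict(model, text) == label for text, label in examples)
    return correct / len(examples)


# Hypothetical, hand-written narratives for illustration only; not real claims data.
TRAIN = [
    ("slipped on wet floor and fell", "STF"),
    ("tripped over cord and fell on knee", "STF"),
    ("lifting heavy box strained lower back", "MSD"),
    ("repetitive typing caused wrist pain", "MSD"),
    ("cut finger with knife", "OTHER"),
    ("struck by falling object on head", "OTHER"),
]

model = train(TRAIN)
print(predict(model, "fell on icy floor"))
print(predict(model, "strained back lifting patient"))
```

In practice, an accuracy figure such as the one reported above would be estimated by calling something like `accuracy` on manually coded claims held out from training, not on the training set itself.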