Comparison of Machine Learning Classifiers for Influenza Detection from Emergency Department Free-text Reports

Pineda, Arturo López; Ye, Ye; Visweswaran, Shyam; Cooper, Gregory F.; Wagner, Michael M.; Tsui, Fuchiang (Rich)

doi:10.1016/j.jbi.2015.08.019

Advanced Search

Select up to three search categories and corresponding keywords using the fields to the right. Refer to the Help section for more detailed instructions.

Search our Collections & Repository

Advanced Search
Custom Query

All these words:

For very narrow results

This exact word or phrase:

When looking for a specific result

Any of these words:

Best used for discovery & interchangable words

None of these words:

Recommended to be used in conjunction with other fields

Language:

Dates

Publication Date Range:

to

Document Data

Title:

Document Type:

Library

Collection:

Series:

People

Author:

Clear All

Query Builder

Query box

Clear All

For additional assistance using the Custom Query please check out our Help Page

CDC STACKS serves as an archival repository of CDC-published products including scientific findings, journal articles, guidelines, recommendations, or other public health information authored or co-authored by CDC or funded partners. As a repository, CDC STACKS retains documents in their original published format to ensure public access to scientific information.

i

Comparison of Machine Learning Classifiers for Influenza Detection from Emergency Department Free-text Reports

12 2015
By Pineda, Arturo López ; Ye, Ye ; Visweswaran, Shyam ; ...
http://dx.doi.org/10.1016/j.jbi.2015.08.019
Source: J Biomed Inform. 58:60-69

English

Details Supporting Files You May Also Like

Details:

Alternative Title:

J Biomed Inform
Personal Author:

Pineda, Arturo López ; Ye, Ye ; Visweswaran, Shyam ; Cooper, Gregory F. ; Wagner, Michael M. ; ... More +

Pineda, Arturo López ; Ye, Ye ; Visweswaran, Shyam ; Cooper, Gregory F. ; Wagner, Michael M. ; Tsui, Fuchiang (Rich) Less -
Description:

Influenza is a yearly recurrent disease that has the potential to become a pandemic. An effective biosurveillance system is required for early detection of the disease. In our previous studies, we have shown that electronic Emergency Department (ED) free-text reports can be of value to improve influenza detection in real time. This paper studies seven machine learning (ML) classifiers for influenza detection, compares their diagnostic capabilities against an expert-built influenza Bayesian classifier, and evaluates different ways of handling missing clinical information from the free-text reports. We identified 31,268 ED reports from 4 hospitals between 2008 and 2011 to form two different datasets: training (468 cases, 29,004 controls), and test (176 cases and 1620 controls). We employed Topaz, a natural language processing (NLP) tool, to extract influenza-related findings and to encode them into one of three values: Acute, Non-acute, and Missing. Results show that all ML classifiers had areas under ROCs (AUC) ranging from 0.88 to 0.93, and performed significantly better than the expert-built Bayesian model. Missing clinical information marked as a value of missing (not missing at random) had a consistently improved performance among 3 (out of 4) ML classifiers when it was compared with the configuration of not assigning a value of missing (missing completely at random). The case/control ratios did not affect the classification performance given the large number of training cases. Our study demonstrates ED reports in conjunction with the use of ML and NLP with the handling of missing value information have a great potential for the detection of infectious diseases.
Subjects:

[+]

Emergency Service, Hospital Humans Influenza, Human Machine Learning
Keywords:

[+]

Bayesian Case Detection Emergency Department Reports Influenza
Source:

J Biomed Inform. 58:60-69
Pubmed ID:

26385375
Pubmed Central ID:

PMC4684714
Document Type:

Journal Article
Funding:

U38HK000063/HK/PHITPO CDC HHSUnited States/ ; P01HK000086/HK/PHITPO CDC HHSUnited States/ ; R01LM010020/LM/NLM NIH HHSUnited States/ ; U38 HK000063/HK/PHITPO CDC HHSUnited States/ ; R01 LM010020/LM/NLM NIH HHSUnited States/ ; ... More +

U38HK000063/HK/PHITPO CDC HHSUnited States/ ; P01HK000086/HK/PHITPO CDC HHSUnited States/ ; R01LM010020/LM/NLM NIH HHSUnited States/ ; U38 HK000063/HK/PHITPO CDC HHSUnited States/ ; R01 LM010020/LM/NLM NIH HHSUnited States/ ; R01LM011370/LM/NLM NIH HHSUnited States/ ; R01LM012095/LM/NLM NIH HHSUnited States/ ; P01 HK000086/HK/PHITPO CDC HHSUnited States/ ; R01 LM012095/LM/NLM NIH HHSUnited States/ ; R01 LM011370/LM/NLM NIH HHSUnited States/ Less -
Volume:

58
Collection(s):

CDC Public Access
Main Document Checksum:

[+]

urn:sha256:c2a12c075639ed6576259d979468db402b8003f150a8a4fc7a5be6b248480038
Download URL:

https://stacks.cdc.gov/view/cdc/36922/cdc_36922_DS1.pdf
File Type:

[PDF-1016.35 KB]

	nihms723662.nxml	xml
	nihms723662f1.gif	gif
	nihms723662f1.jpg	jpeg
	nihms723662f2.gif	gif
	nihms723662f2.jpg	jpeg
	nihms723662f3.gif	gif
	nihms723662f3.jpg	jpeg
	nihms723662u1.jpg	jpeg

More +