Back to Previous Page

Natural language generation for electronic health records

November 19 2018

By Lee, Scott H.

Source: NPJ Digit Med. 1:63

[PDF-171.81 KB]

English

Details Supporting Files You May Also Like

Details:

Alternative Title:

NPJ Digit Med

Personal Author:

Lee, Scott H.

Description:

One broad goal of biomedical informatics is to generate fully-synthetic, faithfully representative electronic health records (EHRs) to facilitate data sharing between healthcare providers and researchers and promote methodological research. A variety of methods existing for generating synthetic EHRs, but they are not capable of generating unstructured text, like emergency department (ED) chief complaints, history of present illness, or progress notes. Here, we use the encoder-decoder model, a deep learning algorithm that features in many contemporary machine translation systems, to generate synthetic chief complaints from discrete variables in EHRs, like age group, gender, and discharge diagnosis. After being trained end-to-end on authentic records, the model can generate realistic chief complaint text that appears to preserve the epidemiological information encoded in the original record-sentence pairs. As a side effect of the model's optimization goal, these synthetic chief complaints are also free of relatively uncommon abbreviation and misspellings, and they include none of the personally identifiable information (PII) that was in the training data, suggesting that this model may be used to support the de-identification of text in EHRs. When combined with algorithms like generative adversarial networks (GANs), our model could be used to generate fully-synthetic EHRs, allowing healthcare providers to share faithful representations of multimodal medical data without compromising patient privacy. This is an important advance that we hope will facilitate the development of machine-learning methods for clinical decision support, disease surveillance, and other data-hungry applications in biomedical informatics.

Subjects:

[+]

Source:

NPJ Digit Med. 1:63

Pubmed ID:

30687797

Pubmed Central ID:

PMC6345174

Document Type:

Journal Article

Funding:

CC999999/ImCDC/Intramural CDC HHS/United States

Collection(s):

CDC Public Access

Main Document Checksum:

[+]

Download URL:

https://stacks.cdc.gov/view/cdc/75429/cdc_75429_DS1.pdf

File Type:

	nihms-1004482-f0001.gif	gif
	nihms-1004482-f0001.jpg	jpeg
	nihms-1004482.nxml	xml
	NIHMS1004482-supplement-Supplemental_methods.pdf	pdf

More +

You May Also Like

Case Report and Literature Review of Prosthetic Cardiovascular Mucormycosis

Cite

Hoellinger, Baptiste ;

Magnus, Louis

...

11 2023 | Emerg Infect Dis. 2023; 29(11):2388-2390

We report a rare case of aorto-bi-iliac prosthetic allograft mucormycosis in a 57-year-old immunocompetent patient in France. Outcome was favorable af...

[PDF - 611.99 KB]

Uptake of online HIV-related continuing medical education training among primary care providers in Southeast United States, 2017–2018

Cite

Henny, Kirk D. ;

Duke, Christopher C.

...

12 2021 | AIDS Care. 33(12):1515-1524

Primary care providers play a vital role for HIV prevention and care in high burden areas of the Southeast United States. Studies reveal that only a t...

[PDF - 267.78 KB]

Checkout today's featured content at stacks.cdc.gov

Natural language generation for electronic health records

Details:

You May Also Like

Have Questions?

CDC INFORMATION

CONNECT WITH CDC