Using language models to identify relevant new information in inpatient clinical notes

Rui Zhang; Serguei V Pakhomov; Janet T Lee; Genevieve B Melton

Using language models to identify relevant new information in inpatient clinical notes

AMIA Annu Symp Proc. 2014 Nov 14:2014:1268-76. eCollection 2014.

Authors

Rui Zhang¹, Serguei V Pakhomov², Janet T Lee³, Genevieve B Melton¹

Affiliations

¹ Institute for Health Informatics, University of Minnesota, Minneapolis, MN, USA ; Department of Surgery, University of Minnesota, Minneapolis, MN, USA.
² Institute for Health Informatics, University of Minnesota, Minneapolis, MN, USA ; College of Pharmacy, University of Minnesota, Minneapolis, MN, USA.
³ Department of Surgery, University of Minnesota, Minneapolis, MN, USA.

PMID: 25954438
PMCID: PMC4419897

Abstract

Redundant information in clinical notes within electronic health record (EHR) systems is ubiquitous and may negatively impact the use of these notes by clinicians, and, potentially, the efficiency of patient care delivery. Automated methods to identify redundant versus relevant new information may provide a valuable tool for clinicians to better synthesize patient information and navigate to clinically important details. In this study, we investigated the use of language models for identification of new information in inpatient notes, and evaluated our methods using expert-derived reference standards. The best method achieved precision of 0.743, recall of 0.832 and F1-measure of 0.784. The average proportion of redundant information was similar between inpatient and outpatient progress notes (76.6% (SD=17.3%) and 76.7% (SD=14.0%), respectively). Advanced practice providers tended to have higher rates of redundancy in their notes compared to physicians. Future investigation includes the addition of semantic components and visualization of new information.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Electronic Health Records*
Humans
Inpatients
Language*
Models, Statistical*
Natural Language Processing

Abstract

Publication types

MeSH terms

Grants and funding