A case study on generative artificial intelligence to extract the fundamental sleep parameters from polysomnography notes

Arash Maghsoudi; Amir Sharafkhaneh; Mehrnaz Azarian; Amin Ramezani; Max Hirshkowitz; Javad Razjouyan

doi:10.5664/jcsm.11594

A case study on generative artificial intelligence to extract the fundamental sleep parameters from polysomnography notes

J Clin Sleep Med. 2025 Jun 1;21(6):1123-1127. doi: 10.5664/jcsm.11594.

Authors

Arash Maghsoudi^{1

2}, Amir Sharafkhaneh^{2

3}, Mehrnaz Azarian^{1

2}, Amin Ramezani^{1

2}, Max Hirshkowitz^{1

4}, Javad Razjouyan^{1

2

5}

Affiliations

¹ Center for Innovations in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas.
² Department of Medicine, Baylor College of Medicine, Houston, Texas.
³ Pulmonary, Critical Care and Sleep Medicine Section, Medical Care Line, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas.
⁴ Consulting Professor (retired), Stanford University School of Medicine, Stanford, California.
⁵ Big Data Scientist Training Enhancement Program (BD-STEP), Veterans Affairs Office of Research and Development, Washington, DC.

PMID: 40012317
PMCID: PMC12134583 (available on 2026-06-01)
DOI: 10.5664/jcsm.11594

Abstract

Generative artificial intelligence utilizing transformer technology is widely seen as a groundbreaking advancement in applied artificial intelligence. The technology creates a unique opportunity to extract unstructured data from medical notes. In the current experiments, we extracted fundamental sleep parameters from polysomnography notes of veterans in the Corporate Data Warehouse national database using large language models. The "SOLAR-10.7B-Instruct" model extracted values associated with total sleep time, sleep onset latency, and sleep efficiency from the polysomnography notes. The model's performance was evaluated using 464 human annotated notes. The analysis showed close accuracy for the large language model compared to the human total sleep time and sleep efficiency extraction, and a considerable accuracy improvement (7.6%) in extracting sleep onset latency for the machine compared to human annotation. The large language model shows negligible hallucination (no more than 3.6%), and it has the capability to perform complicated reasoning to extract the desired sleep parameter.

Citation: Maghsoudi A, Sharafkhaneh A, Azarian M, Ramezani A, Hirshkowitz M, Razjouyan J. A case study on generative artificial intelligence to extract the fundamental sleep parameters from polysomnography notes. J Clin Sleep Med. 2025;21(6):1123-1127.

Keywords: artificial intelligence; large language model; polysomnography; sleep notes.

MeSH terms

Artificial Intelligence*
Generative Artificial Intelligence
Humans
Polysomnography* / methods
Sleep* / physiology

Grants and funding

K25 HL152006/HL/NHLBI NIH HHS/United States