SleepFMNew AI Model Predicts Disease Risk While You Sleep
Source: Press release
Stanford Medicine
5 min Reading Time
A new AI model developed at Stanford can analyse data from a single night’s sleep to predict the risk of more than 100 diseases — years before symptoms appear — opening a new window on preventive and precision medicine.
“We record an amazing number of signals when we study sleep,” Emmanuel Mignot said. He and James Zou mined sleep data and found those signals could predict disease.
A poor night’s sleep portends a bleary-eyed next day, but it could also hint at diseases that will strike years down the road. A new artificial intelligence model developed by Stanford Medicine researchers and their colleagues can use physiological recordings from one night’s sleep to predict a person’s risk of developing more than 100 health conditions.
Known as SleepFM, the model was trained on nearly 600,000 hours of sleep data collected from 65,000 participants. The sleep data comes from polysomnography, a comprehensive sleep assessment that uses various sensors to record brain activity, heart activity, respiratory signals, leg movements, eye movements and more.
Polysomnography is the gold standard in sleep studies that monitor patients overnight in a lab. It is also, the researchers realized, an untapped gold mine of physiological data.
“We record an amazing number of signals when we study sleep,” said Emmanual Mignot, MD, PhD, the Craig Reynolds Professor in Sleep Medicine and co-senior author of the new study, which will publish Jan. 6 in Nature Medicine. “It’s a kind of general physiology that we study for eight hours in a subject who’s completely captive. It’s very data rich.”
Only a fraction of that data is used in current sleep research and sleep medicine. With advances in artificial intelligence, it’s now possible to make sense of much more of it. The new study is the first to use AI to analyze such large-scale sleep data.
“From an AI perspective, sleep is relatively understudied. There’s a lot of other AI work that’s looking at pathology or cardiology, but relatively little looking at sleep, despite sleep being such an important part of life,” said James Zou, PhD, associate professor of biomedical data science and co-senior author of the study.
Key Results
Disease risk predicted from a single night’s sleep: The AI model SleepFM can predict the future risk of more than 130 diseases using physiological data from just one overnight sleep study.
Large-scale training dataset: The model was trained on ~585,000 hours of polysomnography data from 65,000 individuals, making it one of the largest AI studies of sleep to date.
Strong predictive accuracy: SleepFM achieved C-indices above 0.8 for many conditions, including cancer, cardiovascular disease, neurological disorders and mortality.
High performance for major diseases: Particularly strong predictions were observed for Parkinson’s disease (0.89), dementia (0.85), heart attack (0.81), prostate cancer (0.89) and breast cancer (0.87).
Learning the Language of Sleep
To take advantage of the sleep data trove, the researchers built a foundation model, a type of AI model that can train itself on vast amounts of data and apply what it has learned to a wide range of tasks. Large language models like ChatGPT are examples of foundation models trained on huge amounts of text. The 585,000 hours of polysomnography data that SleepFM was trained on came from patients who’d had their sleep assessed at various sleep clinics. The sleep data is split into five-second increments, analogous to words that large language models use to train on.
“SleepFM is essentially learning the language of sleep,” Zou said. The model was able to incorporate multiple streams of data — electroencephalography, electrocardiography, electromyography, pulse reading and breathing airflow, for example — and glean how they relate to each other. To achieve this, the researchers developed a new training technique, called leave-one-out contrastive learning, that essentially hides one modality of data and challenges the model to reconstruct the missing piece based on the other signals.
“One of the technical advances that we made in this work is to figure out how to harmonize all these different data modalities so they can come together to learn the same language,” Zou said.
Forecasting Disease
After the training phase, the researchers could fine-tune the model to different tasks First, they tested the model on standard sleep analysis tasks, such as classifying different stages of sleep and diagnosing the severity of sleep apnea. SleepFM performed as well as or better than state-of-the-art models used today.
Date: 08.12.2025
Naturally, we always handle your personal data responsibly. Any personal data we receive from you is processed in accordance with applicable data protection legislation. For detailed information please see our privacy policy.
Consent to the use of data for promotional purposes
I hereby consent to Vogel Communications Group GmbH & Co. KG, Max-Planck-Str. 7-9, 97082 Würzburg including any affiliated companies according to §§ 15 et seq. AktG (hereafter: Vogel Communications Group) using my e-mail address to send editorial newsletters. A list of all affiliated companies can be found here
Newsletter content may include all products and services of any companies mentioned above, including for example specialist journals and books, events and fairs as well as event-related products and services, print and digital media offers and services such as additional (editorial) newsletters, raffles, lead campaigns, market research both online and offline, specialist webportals and e-learning offers. In case my personal telephone number has also been collected, it may be used for offers of aforementioned products, for services of the companies mentioned above, and market research purposes.
Additionally, my consent also includes the processing of my email address and telephone number for data matching for marketing purposes with select advertising partners such as LinkedIn, Google, and Meta. For this, Vogel Communications Group may transmit said data in hashed form to the advertising partners who then use said data to determine whether I am also a member of the mentioned advertising partner portals. Vogel Communications Group uses this feature for the purposes of re-targeting (up-selling, cross-selling, and customer loyalty), generating so-called look-alike audiences for acquisition of new customers, and as basis for exclusion for on-going advertising campaigns. Further information can be found in section “data matching for marketing purposes”.
In case I access protected data on Internet portals of Vogel Communications Group including any affiliated companies according to §§ 15 et seq. AktG, I need to provide further data in order to register for the access to such content. In return for this free access to editorial content, my data may be used in accordance with this consent for the purposes stated here. This does not apply to data matching for marketing purposes.
Right of revocation
I understand that I can revoke my consent at will. My revocation does not change the lawfulness of data processing that was conducted based on my consent leading up to my revocation. One option to declare my revocation is to use the contact form found at https://contact.vogel.de. In case I no longer wish to receive certain newsletters, I have subscribed to, I can also click on the unsubscribe link included at the end of a newsletter. Further information regarding my right of revocation and the implementation of it as well as the consequences of my revocation can be found in the data protection declaration, section editorial newsletter.
Then the researchers tackled a more ambitious goal: predicting future disease onset from sleep data. To identify which conditions could be forecast, they needed to pair the training polysomnography data with the long-term health outcomes of the same participants. Fortunately, they had access to more than half a century’s worth of health records from a sleep clinic.
The Stanford Sleep Medicine Center was founded in 1970 by the late William Dement, MD, PhD, widely considered the father of sleep medicine. The largest cohort of patients used to train SleepFM — some 35,000 patients ranging in age from 2 to 96 — had their polysomnography data recorded at the clinic between 1999 and 2024. The researchers paired these patients’ polysomnography data with their electronic health records, which provided up to 25 years of follow-up for some patients. (The clinic’s polysomnography recordings go back even further, but only on paper, said Mignot, who directed the sleep center from 2010 to 2019.)
SleepFM analyzed more than 1,000 disease categories in the health records and found 130 that could be predicted with reasonable accuracy by a patient’s sleep data. The model’s predictions were particularly strong for cancers, pregnancy complications, circulatory conditions and mental disorders, achieving a C-index higher than 0.8.
The C-index, or concordance index, is a common measure of a model’s predictive performance, specifically, its ability to predict which of any two individuals in a group will experience an event first. “For all possible pairs of individuals, the model gives a ranking of who’s more likely to experience an event — a heart attack, for instance — earlier. A C-index of 0.8 means that 80% of the time, the model’s prediction is concordant with what actually happened,” Zou said.
SleepFM excelled at predicting Parkinson’s disease (C-index 0.89), dementia (0.85), hypertensive heart disease (0.84), heart attack (0.81), prostate cancer (0.89), breast cancer (0.87) and death (0.84). “We were pleasantly surprised that for a pretty diverse set of conditions, the model is able to make informative predictions,” Zou said. Models of less accuracy, with C-indices around 0.7, such as those that predict a patient’s response to different cancer treatments, have proven useful in clinical settings, he added.
The team is working on ways to further improve SleepFM’s predictions, perhaps by adding data from wearables, and to understand exactly what the model is interpreting.
“It doesn’t explain that to us in English,” Zou said. “But we have developed different interpretation techniques to figure out what the model is looking at when it’s making a specific disease prediction.”
The researchers note that even though heart signals factor more prominently in heart disease predictions and brain signals factor more prominently in mental health predictions, it was the combination of all the data modalities that achieved the most accurate predictions.
“The most information we got for predicting disease was by contrasting the different channels,” Mignot said. Body constituents that were out of sync — a brain that looks asleep but a heart that looks awake, for example — seemed to spell trouble.
Original Article: A multimodal sleep foundation model for disease prediction; Nature Medicine; DOI:10.1038/s41591-025-04133-4