First steps toward building natural history of diseases computationally: Lessons learned from the Noonan syndrome use case

Tudor Groza, Warittha Rayabsri, Dylan Gration, Harshini Hariram, Saumya Shekhar Jamuar, Gareth Baynam

Research output: Contribution to journalArticlepeer-review

Abstract

Rare diseases (RDs) are conditions affecting fewer than 1 in 2,000 people, with over 7,000 identified, primarily genetic in nature, and more than half impacting children. Although each RD affects a small population, collectively, between 3.5% and 5.9% of the global population, or 262.9–446.2 million people, live with an RD. Most RDs lack established treatment protocols, highlighting the need for proper care pathways addressing prognosis, diagnosis, and management. Advances in generative AI and large language models (LLMs) offer new opportunities to document the temporal progression of phenotypic features, addressing gaps in current knowledge bases. This study proposes an LLM-based framework to capture the natural history of diseases, specifically focusing on Noonan syndrome. The framework aims to document phenotypic trajectories, validate against RD knowledge bases, and integrate insights into care coordination using electronic health record (EHR) data from the Undiagnosed Diseases Program Singapore.

Original languageEnglish
Pages (from-to)1158-1172
Number of pages15
JournalAmerican Journal of Human Genetics
Volume112
Issue number5
Early online date16 Apr 2025
DOIs
Publication statusPublished - 1 May 2025

Funding

FundersFunder number
NHMRC National Health and Medical Research Council

    Fingerprint

    Dive into the research topics of 'First steps toward building natural history of diseases computationally: Lessons learned from the Noonan syndrome use case'. Together they form a unique fingerprint.

    Cite this