TY - JOUR
T1 - First steps toward building natural history of diseases computationally
T2 - Lessons learned from the Noonan syndrome use case
AU - Groza, Tudor
AU - Rayabsri, Warittha
AU - Gration, Dylan
AU - Hariram, Harshini
AU - Jamuar, Saumya Shekhar
AU - Baynam, Gareth
N1 - Publisher Copyright:
© 2025
PY - 2025/5/1
Y1 - 2025/5/1
N2 - Rare diseases (RDs) are conditions affecting fewer than 1 in 2,000 people, with over 7,000 identified, primarily genetic in nature, and more than half impacting children. Although each RD affects a small population, collectively, between 3.5% and 5.9% of the global population, or 262.9–446.2 million people, live with an RD. Most RDs lack established treatment protocols, highlighting the need for proper care pathways addressing prognosis, diagnosis, and management. Advances in generative AI and large language models (LLMs) offer new opportunities to document the temporal progression of phenotypic features, addressing gaps in current knowledge bases. This study proposes an LLM-based framework to capture the natural history of diseases, specifically focusing on Noonan syndrome. The framework aims to document phenotypic trajectories, validate against RD knowledge bases, and integrate insights into care coordination using electronic health record (EHR) data from the Undiagnosed Diseases Program Singapore.
AB - Rare diseases (RDs) are conditions affecting fewer than 1 in 2,000 people, with over 7,000 identified, primarily genetic in nature, and more than half impacting children. Although each RD affects a small population, collectively, between 3.5% and 5.9% of the global population, or 262.9–446.2 million people, live with an RD. Most RDs lack established treatment protocols, highlighting the need for proper care pathways addressing prognosis, diagnosis, and management. Advances in generative AI and large language models (LLMs) offer new opportunities to document the temporal progression of phenotypic features, addressing gaps in current knowledge bases. This study proposes an LLM-based framework to capture the natural history of diseases, specifically focusing on Noonan syndrome. The framework aims to document phenotypic trajectories, validate against RD knowledge bases, and integrate insights into care coordination using electronic health record (EHR) data from the Undiagnosed Diseases Program Singapore.
KW - generative AI
KW - Human Phenotype Ontology
KW - Large Language Models
KW - natural history of disease
KW - Noonan syndrome
KW - rare diseases
UR - https://www.scopus.com/pages/publications/105003553182
U2 - 10.1016/j.ajhg.2025.03.014
DO - 10.1016/j.ajhg.2025.03.014
M3 - Article
C2 - 40245863
SN - 0002-9297
VL - 112
SP - 1158
EP - 1172
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 5
ER -