TY - JOUR
T1 - Stratifying individuals into non-alcoholic fatty liver disease risk levels using time series machine learning models
AU - Ben-Assuli, Ofir
AU - Jacobi, Arie
AU - Goldman, Orit
AU - Shenhar-Tsarfaty, Shani
AU - Rogowski, Ori
AU - Zeltser, David
AU - Shapira, Itzhak
AU - Berliner, Shlomo
AU - Zelber-Sagi, Shira
N1 - Publisher Copyright:
© 2022 Elsevier Inc.
PY - 2022/2
Y1 - 2022/2
N2 - Non-alcoholic fatty liver disease (NAFLD) affects 25% of the population worldwide, and its prevalence is anticipated to increase globally. While most NAFLD patients are asymptomatic, NAFLD may progress to fibrosis, cirrhosis, cardiovascular disease, and diabetes. Research reports, with daunting results, show the challenge that NAFLD's burden causes to global population health. The current process for identifying fibrosis risk levels is inefficient, expensive, does not cover all potential populations, and does not identify the risk in time. Instead of invasive liver biopsies, we implemented a non-invasive fibrosis assessment process calculated from clinical data (accessed via EMRs/EHRs). We stratified patients' risks for fibrosis from 2007 to 2017 by modeling the risk in 5579 individuals. The process involved time-series machine learning models (Hidden Markov Models and Group-Based Trajectory Models) profiled fibrosis risk by modeling patients’ latent medical status resulted in three groups. The high-risk group had abnormal lab test values and a higher prevalence of chronic conditions. This study can help overcome the inefficient, traditional process of detecting fibrosis via biopsies (that are also medically unfeasible due to their invasive nature, the medical resources involved, and costs) at early stages. Thus longitudinal risk assessment may be used to make population-specific medical recommendations targeting early detection of high risk patients, to avoid the development of fibrosis disease and its complications as well as decrease healthcare costs.
AB - Non-alcoholic fatty liver disease (NAFLD) affects 25% of the population worldwide, and its prevalence is anticipated to increase globally. While most NAFLD patients are asymptomatic, NAFLD may progress to fibrosis, cirrhosis, cardiovascular disease, and diabetes. Research reports, with daunting results, show the challenge that NAFLD's burden causes to global population health. The current process for identifying fibrosis risk levels is inefficient, expensive, does not cover all potential populations, and does not identify the risk in time. Instead of invasive liver biopsies, we implemented a non-invasive fibrosis assessment process calculated from clinical data (accessed via EMRs/EHRs). We stratified patients' risks for fibrosis from 2007 to 2017 by modeling the risk in 5579 individuals. The process involved time-series machine learning models (Hidden Markov Models and Group-Based Trajectory Models) profiled fibrosis risk by modeling patients’ latent medical status resulted in three groups. The high-risk group had abnormal lab test values and a higher prevalence of chronic conditions. This study can help overcome the inefficient, traditional process of detecting fibrosis via biopsies (that are also medically unfeasible due to their invasive nature, the medical resources involved, and costs) at early stages. Thus longitudinal risk assessment may be used to make population-specific medical recommendations targeting early detection of high risk patients, to avoid the development of fibrosis disease and its complications as well as decrease healthcare costs.
KW - Clustering models
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85122611538&partnerID=8YFLogxK
U2 - 10.1016/j.jbi.2022.103986
DO - 10.1016/j.jbi.2022.103986
M3 - Article
C2 - 35007752
AN - SCOPUS:85122611538
SN - 1532-0464
VL - 126
JO - Journal of Biomedical Informatics
JF - Journal of Biomedical Informatics
M1 - 103986
ER -