Novel Machine Learning Model Effectively Predicts NASH
Click Here to Manage Email Alerts
BOSTON — A machine learning approach demonstrated a relatively high sensitivity rate for predicting the probability of non-alcoholic steatohepatitis in at-risk patients, according to data presented at The Liver Meeting 2019.
“I have been involved in NASH research for the last 15 years and one of the striking findings is the low awareness of the disease; both from patients, but also from a primary care physician’s perspective,” Jörn M. Schattenberg, MD, of the University Medical Center in Mainz, Germany, said during a press conference.
Schattenberg noted that the researchers asked whether novel machine learning tools could help identify patients at risk for NASH.
“That’s relevant because patients with NASH will progress and eventually ... develop end-stage liver disease with occurring complications,” he said.
Schattenberg and colleagues conducted an exploratory analysis, feature extraction, model training and parameter tuning on the NAFLD Adult Database from the National Institute of Diabetes, Digestive Diseases (NIDDK). The database consisted of 422 patients with histologic NASH and 282 patients confirmed to not have NASH.
The researchers then tested the best-performing model from NIDDK on the Optum HER database to understand model performance.
Data from 1,016 patients with NASH confirmed by liver biopsy within the Optum HER database were then used to evaluate model performance.
The model, known as NASHmap, includes 14 variables deemed the most important features for predicting NASH.
The model then ranked the features, as follows, in order of importance: HbA1c, AST, ALT, total protein, AST/ALT, BMI, triglycerides, height, platelets, WBC, hematocrit, albumin, hypertension and gender.
A simplified version of the model was also developed to include five features: HbA1c, AST, ALT, total protein and triglycerides.
The 14-feature model correctly identified 81% of patients (AUC = 0.82) with NASH in the NIDDK database. The simplified five-feature model correctly identified 76% of patients (AUC = 0.8) with NASH in that database.
In the Optum HER database, the 14-feature model identified 72% of patients with NASH (AUC = 0.76), while the five-feature model identified 66% of patients with NASH (AUC = 0.74).
“While a lot of talk and a lot of focus is on the degree of fibrosis as being the most important predictor of outcomes in these patients, we are not doing a good job at this time to identify at-risk populations and actually propose screening to them,” Schattenberg said. “The way I see the algorithm ... in the end working, is you do apply this to a large dataset that is available and ... you use every day clinical parameters that the doctor has to identify at-risk population ... to then order a specific test and refine the at-risk population.”
Schattenberg stressed that it’s not considered a diagnostic test, but rather should be used complementary to other tests.
“This will allow us to use resources wisely instead of testing every diabetic patient, for example, that is out there for advanced fibrosis,” he said. “So, while HbA1c is one of the parameters that’s important in the algorithm, the algorithm refines so finely that it will give us a subgroup and allow us a more sophisticated risk stratification.” – by Ryan McDonald
Reference: Schattenberg J, et al. Abstract 190. Presented at: The Liver Meeting; Nov. 7-12, 2019; Boston.
Disclosure: Schattenberg reports serving as a consultant for AbbVie, Boehringer Ingelheim, Bristol-Myers Squibb, Genfit, Gilead, Intercept Pharmaceuticals, MSD and Novartis.