BPG is committed to discovery and dissemination of knowledge
Editorial
Copyright ©The Author(s) 2025. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Gastroenterol. Sep 28, 2025; 31(36): 112217
Published online Sep 28, 2025. doi: 10.3748/wjg.v31.i36.112217
Machine learning fibrosis score for pediatric metabolic dysfunction-associated steatotic liver disease: Promising but premature
Toshifumi Yodoshi
Toshifumi Yodoshi, Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, United States
Author contributions: Yodoshi T contributed to the concept, design, manuscript writing, and editing, as well as the review of the literature.
Conflict-of-interest statement: The author declares no conflicts of interest.
Open Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Toshifumi Yodoshi, MD, PhD, Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, 3333 Burnet Ave, Cincinnati, OH 45229, United States. toshifumi.yodoshi@cchmc.org
Received: July 22, 2025
Revised: August 14, 2025
Accepted: September 3, 2025
Published online: September 28, 2025
Processing time: 60 Days and 1.5 Hours
Abstract

Metabolic dysfunction-associated steatotic liver disease (MASLD) is now the leading cause of chronic liver disease in children, affecting up to 38% with obesity of children. With the global shift from non-alcoholic fatty liver disease (NAFLD) to MASLD using affirmative criteria (hepatic steatosis plus ≥ 1 cardiometabolic risk factor) and approximately 99% concordance in pediatrics, the development of non-invasive fibrosis tools is accelerating. Yao et al report a machine-learning “chronic MASLD with fibrosis (CH-MASLD-Fib)” score for advanced fibrosis with area under the receiver operating characteristic curve (AUROC) of 0.92. While timely, we urge caution. First, high accuracy from a single-center cohort signals overfitting: Complex models can learn cohort-specific noise and fail to generalize. Consistent with this, established pediatric scores (NAFLD fibrosis score, fibrosis-4, pediatric NAFLD fibrosis score) perform modestly (AUROC: Approximately 0.6-0.7), and aspartate aminotransferase-to-platelet ratio index is variable, raising concern that CH-MASLD-Fib’s result reflects a statistical artifact. Second, MASLD epidemiology varies by ethnicity (highest in Hispanic, lower in Black children); a model derived in a mono-ethnic Chinese cohort may misclassify other populations. Third, clinical utility and cost-effectiveness are unproven; dependence on specialized assays (e.g., bile acids, cholinesterase) would limit access and increase cost. We recommend external validation in multi-ethnic cohorts, head-to-head comparisons with simple serum indices and elastography, and formal economic analyses. Until then, clinical judgment anchored in readily available markers and judicious, targeted liver biopsy remains paramount.

Keywords: Liver fibrosis; Machine learning; Non-invasive biomarkers; Overfitting; Ethnic diversity; Cost-effectiveness; External validation; Health economics; Metabolic dysfunction-associated steatotic liver disease

Core Tip: Machine-learning models for detecting advanced fibrosis in pediatric metabolic dysfunction-associated steatotic liver disease routinely report striking accuracy, yet three recurring flaws limit clinical impact: Overfitting to single-center datasets, absence of multi-ethnic external validation, and dependence on costly, non-routine biomarkers. We outline a pragmatic roadmap: Prospective, multi-site cohorts with standardized liver histology, decision-curve and cost-utility analyses, and transparent model explainability to transform promising algorithms into trustworthy tools. Until these steps are fulfilled, the most reliable strategy combines simple serum tests (alanine aminotransferase, platelet-based indices), vibration-controlled transient elastography, and judicious, targeted liver biopsy for indeterminate or high-risk cases.