TO THE EDITOR
Metabolic dysfunction-associated fatty liver disease (MAFLD), formerly known as non-alcoholic fatty liver disease (NAFLD), affects approximately one-quarter of the adult population worldwide, thereby imposing a significant health and economic burden on all societies[1]. The issue has been demonstrated to result in increased instances of cirrhosis, hepatocellular carcinoma, and cardiovascular mortality, consequently engendering considerable healthcare expenditure, a phenomenon that is especially prevalent in regions characterised by a paucity of screening resources[2]. We read with interest the prospective cohort study by Tian et al[3]. This work pioneers an integrated traditional Chinese medicine (TCM)-machine learning (ML) approach for predicting MAFLD in high-risk populations. While the methodological innovation warrants commendation, we seek to contextualize its contributions, address critical limitations, and propose translational pathways to facilitate its integration into clinical practice.
STUDY OVERVIEW AND DISCUSSION
Tian et al[3] developed an XGBoost model (area under the curve: 0.82) utilizing dual feature selection (LASSO + recursive feature elimination) to identify hepatic steatosis within a cohort of 711 individuals at high metabolic risk. The model incorporated ten predictors, encompassing conventional biomarkers [e.g., aspartate aminotransferase (AST)/alanine aminotransferase (ALT) ratio, low-density lipoprotein cholesterol (LDL-C), triglycerides] alongside novel TCM indicators (tongue edge redness, greasy coating). By prospectively recruiting patients exhibiting metabolic dysregulation (representing 86.2% of the initial cohort, n = 1011) and employing FibroScan-CAP ≥ 238 dB/m as a steatosis confirmation criterion, the authors addressed a significant clinical need: The demand for scalable, non-invasive screening tools suitable for resource-constrained settings. The inclusion of TCM diagnostics aligns well with the multisystem pathophysiology characteristic of MAFLD, thereby offering a holistic perspective complementary to biochemical profiling. SHAP analysis notably positioned TCM tongue features among the leading predictors, underscoring their potential clinical utility. However, the absence of MAFLD subtyping (e.g., lean or diabetic MAFLD) overlooks established pathophysiological heterogeneity. As emphasized by Eslam et al[1], MAFLD is not a monolithic disease entity; distinct subtypes exhibit divergent fibrosis progression trajectories and cardiovascular risk profiles, necessitating stratified diagnostic and management strategies[1]. Early detection is critical, as MAFLD is often asymptomatic in its initial stages yet can progress to severe complications including cirrhosis, liver failure, and hepatocellular carcinoma, while also independently increasing cardiovascular mortality risk[1,4].
STRENGTHS AND LIMITATIONS
The study by Tian et al[3] demonstrates considerable methodological rigor, including its prospective cohort design and dual feature selection strategy (LASSO + recursive feature elimination), which collectively mitigate the risk of overfitting prevalent in high-dimensional biomarker research. The innovative integration of TCM diagnostics-specifically tongue edge redness and greasy coating-constitutes a pioneering effort to address the inherent multisystem heterogeneity of MAFLD from a holistic perspective. This approach resonates with findings from network pharmacology studies revealing multicomponent synergistic effects within TCM formulations (e.g., modulation of AKT1/IL-6/TNF-α pathways by XingQiChuShiYin)[5]. The synergistic application of TCM and ML analytics offers a promising low-cost screening alternative to imaging-dependent modalities like magnetic resonance imaging-derived proton density fat fraction (MRI-PDFF), aligning with urgent demands for accessible tools in resource-limited environments[6]. Nevertheless, critical limitations impede immediate clinical translation. Validation confined to a single center (n = 711) lacks generalizability assessment across diverse populations, introducing potential bias derived from local demographic characteristics (e.g., 77% male participants, median age 43–44 years). Furthermore, the omission of MAFLD subtyping disregards established pathophysiological heterogeneity, potentially diminishing the model's applicability within precision medicine frameworks.
A paramount concern is the limited biological anchoring of the TCM tongue features, despite their high ranking in SHAP analysis-a gap paralleled in herbal medicine research where active compound pharmacokinetics often remain unverified[7]. While Lu et al[8] demonstrated associations between yellow tongue coatings and gut microbiome dysbiosis in MAFLD patients, the mechanistic links for greasy coatings/edge redness -key predictors in this model-remain incompletely defined. The reduction of TCM's holistic framework to two imaging-derived variables (tongue redness/greasiness) may oversimplify its diagnostic richness. It has been suggested that the color of the tongue coating may be associated with insulin resistance (IR), oral microbiota[9], and metabolic disorders[10]. Furthermore, the presence of a thick and greasy tongue coating may be associated with the formation and permeability of vascular endothelial cells, as well as the protein expression of tight junction protein-1 (zonula occludens-1)[11,12]. In the absence of validated mechanistic links to underlying metabolic dysregulation (e.g., gut-liver axis dysfunction or microbial compositional shifts), these indicators risk functioning as "black-box" contributors within the model.
Additionally, while Tian et al[3] quantified TCM tongue features using the Intelligent Constitution Identifier - an imaging system with physician verification-the proprietary algorithms lack transparency. Furthermore, TCM tongue indicators risk capturing confounders unrelated to MAFLD pathophysiology: Greasy coatings may reflect transient dietary lipids or dehydration; redness could signal oral microbiome dysbiosis or systemic inflammation from non-hepatic conditions. Without controlling for these variables, their inclusion may introduce spurious correlations. Critically, TCM markers’ relationship with hepatic steatosis in metabolically high-risk populations (e.g., diabetics) remains poorly defined. Though statistically predictive, they likely capture systemic confounders-IR, dyslipidemia, or oral dysbiosis[12,13]-rather than direct steatotic processes. Without validation against histology or liver-specific biomarkers (e.g., MRI-PDFF, CK-18), their clinical utility for precise steatosis.
FUTURE RESEARCH DIRECTIONS
To address these limitations and advance the field, we propose a multifaceted research agenda. First, external validation of the identified feature set (e.g., AST/ALT, LDL-C, TCM tongue markers) within multiethnic cohorts is imperative to evaluate robustness across varying metabolic phenotypes. Collaborative initiatives, such as the MAFLD Consortium[2,14], could expedite this process. Second, prioritizing the development of subtype-specific predictive models is crucial to capture disease heterogeneity effectively. For instance, lean MAFLD [body mass index (BMI) < 23 kg/m2] demonstrates distinct biomarker signatures [e.g., predominance of uric acid (UA)/creatinine (Cr) ratio], suggesting that tailored algorithms could enhance diagnostic accuracy within high-risk subgroups. Specifically, the omission of MAFLD subtyping-despite known biomarker divergences (e.g., UA/Cr dominance in lean MAFLD[15] vs glycated hemoglobin in diabetic MAFLD[16])-limits the model’s precision in high-risk subgroups. Subtype-specific algorithms could enhance accuracy, particularly for lean MAFLD where conventional metabolic markers (e.g., BMI) are less reliable. Third, elucidating the biological foundations underpinning TCM indicators necessitates multi-omics correlation studies. Investigating potential links between specific tongue phenotypes (e.g., edge redness) and corresponding alterations in the gut microbiome and metabolomic profiles-as demonstrated by Lu et al[8] for yellow coatings-followed by validation of their metabolic impact (e.g., regulation of lipid droplet formation via PLIN-2/ATGL pathways, evidenced in NAFLD models[17]) would establish mechanistic grounding for these features, thereby enhancing model credibility. Finally, conducting real-world cost-effectiveness analyses comparing this integrated TCM-ML approach against established screening paradigms (e.g., FibroScan[18] or FIB-4[19]) is essential to evaluate long-term feasibility, particularly within primary care settings where low-cost tools yield the greatest impact. Future studies should stratify high-risk cohorts by age/comorbidity (e.g., type 2 diabetes mellitus, chronic kidney disease) to assess biomarker stability.
CONCLUSION
The study by Tian et al[3] represents a significant advancement toward accessible MAFLD screening through the synergistic application of TCM diagnostics and ML analytics. The model’s non-invasive nature and potential for cost reduction hold particular promise for resource-limited regions. However, successful clinical translation necessitates multi-center validation, refinement incorporating disease heterogeneity through subtyping, and biological validation of the integrated TCM indicators. Future research iterations addressing these critical gaps possess the potential to revolutionize the early detection of MAFLD, transforming its often asymptomatic progression into a more readily manageable condition.
Provenance and peer review: Invited article; Externally peer reviewed.
Peer-review model: Single blind
Specialty type: Gastroenterology and hepatology
Country of origin: China
Peer-review report’s classification
Scientific Quality: Grade A, Grade A, Grade B, Grade B
Novelty: Grade B, Grade B, Grade B, Grade B
Creativity or Innovation: Grade B, Grade B, Grade B, Grade B
Scientific Significance: Grade A, Grade A, Grade B, Grade B
P-Reviewer: Gutiérrez-Cuevas J, PhD, Professor, Mexico; Othman AA, MD, PhD, Egypt S-Editor: Li L L-Editor: A P-Editor: Wang WB