Published online Jul 28, 2018. doi: 10.3748/wjg.v24.i28.3145
Peer-review started: March 22, 2018
First decision: April 24, 2018
Revised: June 13, 2018
Accepted: June 25, 2018
Article in press: June 25, 2018
Published online: July 28, 2018
Processing time: 127 Days and 17.9 Hours
Liver cancer is the fourth most common digestive cancer worldwide. Prognostic markers can help to make better clinical decision by selecting patients who respond well to some specific treatment. Besides traditional clinical markers, genetic biomarkers are emerging as novel indicators in cancer diagnosis and prognosis. The Cancer Genome Atlas (TCGA) is funded by the National Institute of Health (NIH) to describe the genomic alterations across cancer types. It provides tremendous amount of “omics” data, including mRNA sequencing, miRNA sequencing, reverse phase protein arrays, copy number change and DNA sequencing.
Although there seems to be great potential value of the clinical and genetic markers, there is no consensus on the predictive power of these indicators, especially the molecular markers.
By utilizing the TCGA data, we aimed to evaluate the prognostic power of liver cancer by molecular markers, and also to assess the predictive power of liver cancer by combining molecular markers and clinical data.
Cox regression screen and least absolute shrinkage and selection operator (LASSO) were performed to select significant prognostic variables. Then the concordance index was calculated to evaluate the prognostic power. For the combination data, based on the clinical cox model, molecular features that better fit the model were combined to calculate the concordance index. Prognostic models were built based on the arithmetic summation of the significant variables. Kaplan-Meier survival curve and log-rank test were performed to compare the survival difference. Then the heatmap was constructed and gene set enrichment analysis was performed for pathway analysis.
The mRNA data was the most informative prognostic variables in all kinds of omics data in liver cancer. In the copy number variation (CNV), methylation and miRNA data, the combination of molecular data with clinical data could significantly boost the prediction accuracy of the molecular data alone. On the other hand, the combination of clinical data with methylation, miRNA and mRNA data could significantly boost the prediction accuracy of the clinical data itself. Based on the significant prognostic variables, several prognostic models were built. For the CNV data, score = 10p15.1 - 15q26.3. For the methylation data, score = - REL - MCM2. For the miRNA data, score = miR-3690 + miR-561 - miR-621. For the mRNA data, score = CCDC21 + GTF3C2 + DBF4.
In all kinds of omics data in liver cancer, the mRNA data might be the most informative prognostic variables.
The combination of clinical data with molecular data might be the future direction for cancer prognosis and prediction.