Published online Oct 7, 2014. doi: 10.3748/wjg.v20.i37.13325
Revised: June 4, 2014
Accepted: June 26, 2014
Published online: October 7, 2014
Processing time: 343 Days and 16.2 Hours
Pancreatic cancer (PC) is one of the most aggressive and lethal neoplastic diseases. A valid alternative to the usual invasive diagnostic tools would certainly be the determination of biomarkers in peripheral fluids to provide less invasive tools for early diagnosis. Nowadays, biomarkers are generally investigated mainly in peripheral blood and tissues through high-throughput omics techniques comparing control vs pathological samples. The results can be evaluated by two main strategies: (1) classical methods in which the identification of significant biomarkers is accomplished by monovariate statistical tests where each biomarker is considered as independent from the others; and (2) multivariate methods, taking into consideration the correlations existing among the biomarkers themselves. This last approach is very powerful since it allows the identification of pools of biomarkers with diagnostic and prognostic performances which are superior to single markers in terms of sensitivity, specificity and robustness. Multivariate techniques are usually applied with variable selection procedures to provide a restricted set of biomarkers with the best predictive ability; however, standard selection methods are usually aimed at the identification of the smallest set of variables with the best predictive ability and exhaustivity is usually neglected. The exhaustive search for biomarkers is instead an important alternative to standard variable selection since it can provide information about the etiology of the pathology by producing a comprehensive set of markers. In this review, the most recent applications of the omics techniques (proteomics, genomics and metabolomics) to the identification of exploratory biomarkers for PC will be presented with particular regard to the statistical methods adopted for their identification. The basic theory related to classical and multivariate methods for identification of biomarkers is presented and then, the most recent applications in this field are discussed.
Core tip: Biomarkers are statistically identified as significant by: (1) classical statistical tests where each biomarker is independent from the others; and (2) multivariate methods that take into consideration the correlation among the biomarkers. This last approach provides pools of biomarkers with superior diagnostic and prognostic performances. Multivariate techniques are often applied with variable selection procedures to provide the smallest set of biomarkers with the best predictive ability. The exhaustive identification is instead a valid alternative since it can provide comprehensive information about the etiology of the pathology. The most recent applications of the omics approaches to the identification of biomarkers for PC are presented, with particular regard to the statistical methods adopted.