Published online Aug 14, 2013. doi: 10.3748/wjg.v19.i30.5006
Revised: June 4, 2013
Accepted: June 8, 2013
Published online: August 14, 2013
Processing time: 119 Days and 21.3 Hours
AIM: To apply a new, integrated technique for visualizing bacterial genomes to identify novel pathogenicity islands in Helicobacter pylori (H. pylori).
METHODS: A genomic barcode imaging method (converting frequency matrices to grey-scale levels) was designed to visually distinguish origin-specific genomic regions in H. pylori. The complete genome sequences of the six H. pylori strains published in the National Center for Biotechnological Information prokaryotic genome database were scanned, and compared to the genome barcodes of Escherichia coli (E. coli) O157:H7 strain EDL933 and a random nucleotide sequence. The following criteria were applied to identify potential pathogenicity islands (PAIs): (1) barcode distance distinct from that of the general background; (2) length greater than 10000 continuous base pairs; and (3) containing genes with known virulence-related functions (as determined by PfamScan and Blast2GO).
RESULTS: Comparison of the barcode images generated for the 26695, HPAG1, J99, Shi470, G27 and P12 H. pylori genomes with those for the E. coli and random sequence controls revealed that H. pylori genomes contained fewer anomalous regions. Among the H. pylori-specific continuous anomalous regions (longer than 20 kbp in each strain’s genome), two fit the criteria for identifying candidate PAIs. The bioinformatic-based functional analyses revealed that one of the two anomalous regions was the known pathogenicity island cag-PAI, this finding also served as proof-of-principle for the utility of the genomic barcoding approach for identifying PAIs, and characterized the other as a novel PAI, which was designated as tfs3-PAI. Furthermore, the cag-PAI and tfs3-PAI harbored genes encoding type IV secretion system proteins and were predicted to have potential for functional synergy.
CONCLUSION: Genomic barcode imaging represents an effective bioinformatic-based approach for scanning bacterial genomes, such as H. pylori, to identify candidate PAIs.
Core tip: The genomic barcoding technology was recently developed to increase the accuracy of genome analysis, and has facilitated the identification of origin-specific genomic regions of both eukaryotic and prokaryotic lifeforms. In this study, we applied the genomic barcode imaging approach to screen for pathogenicity islands (PAIs) in Helicobacter pylori using the six strains for which the complete genome sequences have been published and performing comparison to a common Enterobacter species. Bioinformatic-based functional analysis not only provided proof-of-principle (identifying the known cag-PAI) but also identified a novel PAI (designated as tsf3-PAI).