1
|
Pattan V, Kashyap R, Bansal V, Candula N, Koritala T, Surani S. Genomics in medicine: A new era in medicine. World J Methodol 2021; 11:231-242. [PMID: 34631481 PMCID: PMC8472545 DOI: 10.5662/wjm.v11.i5.231] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 06/18/2021] [Accepted: 07/19/2021] [Indexed: 02/06/2023] Open
Abstract
The sequencing of complete human genome revolutionized the genomic medicine. However, the complex interplay of gene-environment-lifestyle and influence of non-coding genomic regions on human health remain largely unexplored. Genomic medicine has great potential for diagnoses or disease prediction, disease prevention and, targeted treatment. However, many of the promising tools of genomic medicine are still in their infancy and their application may be limited because of the limited knowledge we have that precludes its use in many clinical settings. In this review article, we have reviewed the evolution of genomic methodologies/tools, their limitations, and scope, for current and future clinical application.
Collapse
Affiliation(s)
- Vishwanath Pattan
- Division of Endocrinology, Wyoming Medical Center, Casper, WY 82601, United States
| | - Rahul Kashyap
- Department of Anesthesiology and Peri-operative Medicine, Mayo Clinic, Rochester, MN 55905, United States
| | - Vikas Bansal
- Department of Anesthesiology and Peri-operative Medicine, Mayo Clinic, Rochester, MN 55905, United States
| | - Narsimha Candula
- Hospital Medicine, University Florida Health, Jacksonville, FL 32209, United States
| | - Thoyaja Koritala
- Hospital Medicine, Mayo Clinic Health System, Mankato, MN 56001, United States
| | - Salim Surani
- Department of Internal Medicine, Texas A&M University, Corpus Christi, TX 78405, United States
| |
Collapse
|
2
|
Rediscovering the value of families for psychiatric genetics research. Mol Psychiatry 2019; 24:523-535. [PMID: 29955165 PMCID: PMC7028329 DOI: 10.1038/s41380-018-0073-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Revised: 01/11/2018] [Accepted: 03/26/2018] [Indexed: 01/09/2023]
Abstract
As it is likely that both common and rare genetic variation are important for complex disease risk, studies that examine the full range of the allelic frequency distribution should be utilized to dissect the genetic influences on mental illness. The rate limiting factor for inferring an association between a variant and a phenotype is inevitably the total number of copies of the minor allele captured in the studied sample. For rare variation, with minor allele frequencies of 0.5% or less, very large samples of unrelated individuals are necessary to unambiguously associate a locus with an illness. Unfortunately, such large samples are often cost prohibitive. However, by using alternative analytic strategies and studying related individuals, particularly those from large multiplex families, it is possible to reduce the required sample size while maintaining statistical power. We contend that using whole genome sequence (WGS) in extended pedigrees provides a cost-effective strategy for psychiatric gene mapping that complements common variant approaches and WGS in unrelated individuals. This was our impetus for forming the "Pedigree-Based Whole Genome Sequencing of Affective and Psychotic Disorders" consortium. In this review, we provide a rationale for the use of WGS with pedigrees in modern psychiatric genetics research. We begin with a focused review of the current literature, followed by a short history of family-based research in psychiatry. Next, we describe several advantages of pedigrees for WGS research, including power estimates, methods for studying the environment, and endophenotypes. We conclude with a brief description of our consortium and its goals.
Collapse
|
3
|
Hernandez-Pacheco N, Pino-Yanes M, Flores C. Genomic Predictors of Asthma Phenotypes and Treatment Response. Front Pediatr 2019; 7:6. [PMID: 30805318 PMCID: PMC6370703 DOI: 10.3389/fped.2019.00006] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 01/10/2019] [Indexed: 12/11/2022] Open
Abstract
Asthma is a complex respiratory disease considered as the most common chronic condition in children. A large genetic contribution to asthma susceptibility is predicted by the clustering of asthma and allergy symptoms among relatives and the large disease heritability estimated from twin studies, ranging from 55 to 90%. Genetic basis of asthma has been extensively investigated in the past 40 years using linkage analysis and candidate-gene association studies. However, the development of dense arrays for polymorphism genotyping has enabled the transition toward genome-wide association studies (GWAS), which have led the discovery of several unanticipated asthma genes in the last 11 years. Despite this, currently known risk variants identified using many thousand samples from distinct ethnicities only explain a small proportion of asthma heritability. This review examines the main findings of the last 2 years in genomic studies of asthma using GWAS and admixture mapping studies, as well as the direction of studies fostering integrative perspectives involving omics data. Additionally, we discuss the need for assessing the whole spectrum of genetic variation in association studies of asthma susceptibility, severity, and treatment response in order to further improve our knowledge of asthma genes and predictive biomarkers. Leveraging the individual's genetic information will allow a better understanding of asthma pathogenesis and will facilitate the transition toward a more precise diagnosis and treatment.
Collapse
Affiliation(s)
- Natalia Hernandez-Pacheco
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, Santa Cruz de Tenerife, Spain.,Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna, Santa Cruz de Tenerife, Spain
| | - Maria Pino-Yanes
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, Santa Cruz de Tenerife, Spain.,Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna, Santa Cruz de Tenerife, Spain.,CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain
| | - Carlos Flores
- Research Unit, Hospital Universitario N.S. de Candelaria, Universidad de La Laguna, Santa Cruz de Tenerife, Spain.,CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain.,Genomics Division, Instituto Tecnológico y de Energías Renovables, Santa Cruz de Tenerife, Spain
| |
Collapse
|
4
|
Abstract
Participants in the family-based analysis group at Genetic Analysis Workshop 19 addressed diverse topics, all of which used the family data. Topics addressed included questions of study design and data quality control (QC), genotype imputation to augment available sequence data, and linkage and/or association analyses. Results show that pedigree-based tests that are sensitive to genotype error may be useful for QC. Imputation quality improved with inclusion of small amounts of pedigree information used to phase the data in evaluation of 5 commonly used approaches for imputation in samples of (typically) unrelated subjects. It improved still further when pedigree-based imputation using larger pedigrees was also added. An important distinction was made between methods that do versus do not make use of Mendelian transmission in pedigrees, because this serves as a key difference between underlying models and assumptions. Methods that model relatedness generally had higher power in association testing than did analyses that carry out testing in the presence of a transmission model, but this may reflect details of implementation and/or ability of more general methods to jointly include data from larger pedigrees. In either case, for single nucleotide polymorphism-set approaches, weights that incorporate information on functional effects may be more useful than those that are based only on allele frequencies. The overall results demonstrate that family data continue to provide important information in the search for trait loci.
Collapse
Affiliation(s)
- Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98195, USA.
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA.
| |
Collapse
|
5
|
Nato AQ, Chapman NH, Sohi HK, Nguyen HD, Brkanac Z, Wijsman EM. PBAP: a pipeline for file processing and quality control of pedigree data with dense genetic markers. Bioinformatics 2015; 31:3790-8. [PMID: 26231429 PMCID: PMC4668752 DOI: 10.1093/bioinformatics/btv444] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Revised: 07/07/2015] [Accepted: 07/25/2015] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Huge genetic datasets with dense marker panels are now common. With the availability of sequence data and recognition of importance of rare variants, smaller studies based on pedigrees are again also common. Pedigree-based samples often start with a dense marker panel, a subset of which may be used for linkage analysis to reduce computational burden and to limit linkage disequilibrium between single-nucleotide polymorphisms (SNPs). Programs attempting to select markers for linkage panels exist but lack flexibility. RESULTS We developed a pedigree-based analysis pipeline (PBAP) suite of programs geared towards SNPs and sequence data. PBAP performs quality control, marker selection and file preparation. PBAP sets up files for MORGAN, which can handle analyses for small and large pedigrees, typically human, and results can be used with other programs and for downstream analyses. We evaluate and illustrate its features with two real datasets. AVAILABILITY AND IMPLEMENTATION PBAP scripts may be downloaded from http://faculty.washington.edu/wijsman/software.shtml. CONTACT wijsman@uw.edu. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | | | - Hiep D Nguyen
- Division of Medical Genetics, Department of Medicine
| | | | - Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, Department of Biostatistics and Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
6
|
Saad M, Wijsman EM. Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes. Genet Epidemiol 2013; 38:1-9. [PMID: 24243664 DOI: 10.1002/gepi.21776] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2013] [Revised: 09/30/2013] [Accepted: 10/15/2013] [Indexed: 01/09/2023]
Abstract
Recently, the "Common Disease-Multiple Rare Variants" hypothesis has received much attention, especially with current availability of next-generation sequencing. Family-based designs are well suited for discovery of rare variants, with large and carefully selected pedigrees enriching for multiple copies of such variants. However, sequencing a large number of samples is still prohibitive. Here, we evaluate a cost-effective strategy (pseudosequencing) to detect association with rare variants in large pedigrees. This strategy consists of sequencing a small subset of subjects, genotyping the remaining sampled subjects on a set of sparse markers, and imputing the untyped markers in the remaining subjects conditional on the sequenced subjects and pedigree information. We used a recent pedigree imputation method (GIGI), which is able to efficiently handle large pedigrees and accurately impute rare variants. We used burden and kernel association tests, famWS and famSKAT, which both account for family relationships and heterogeneity of allelic effect for famSKAT only. We simulated pedigree sequence data and compared the power of association tests for pseudosequence data, a subset of sequence data used for imputation, and all subjects sequenced. We also compared, within the pseudosequence data, the power of association test using best-guess genotypes and allelic dosages. Our results show that the pseudosequencing strategy considerably improves the power to detect association with rare variants. They also show that the use of allelic dosages results in much higher power than use of best-guess genotypes in these family-based data. Moreover, famSKAT shows greater power than famWS in most of scenarios we considered.
Collapse
Affiliation(s)
- Mohamad Saad
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, United States of America; Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
| | | |
Collapse
|
7
|
Marchani EE, Chapman NH, Cheung CYK, Ankenman K, Stanaway IB, Coon HH, Nickerson D, Bernier R, Brkanac Z, Wijsman EM. Identification of rare variants from exome sequence in a large pedigree with autism. Hum Hered 2013; 74:153-64. [PMID: 23594493 DOI: 10.1159/000346560] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
We carried out analyses with the goal of identifying rare variants in exome sequence data that contribute to disease risk for a complex trait. We analyzed a large, 47-member, multigenerational pedigree with 11 cases of autism spectrum disorder, using genotypes from 3 technologies representing increasing resolution: a multiallelic linkage marker panel, a dense diallelic marker panel, and variants from exome sequencing. Genome-scan marker genotypes were available on most subjects, and exome sequence data was available on 5 subjects. We used genome-scan linkage analysis to identify and prioritize the chromosome 22 region of interest, and to select subjects for exome sequencing. Inheritance vectors (IVs) generated by Markov chain Monte Carlo analysis of multilocus marker data were the foundation of most analyses. Genotype imputation used IVs to determine which sequence variants reside on the haplotype that co-segregates with the autism diagnosis. Together with a rare-allele frequency filter, we identified only one rare variant on the risk haplotype, illustrating the potential of this approach to prioritize variants. The associated gene, MYH9, is biologically unlikely, and we speculate that for this complex trait, the key variants may lie outside the exome.
Collapse
Affiliation(s)
- E E Marchani
- Division of Medical Genetics, University of Washington, Seattle, WA 98195, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Kilpinen H, Barrett JC. How next-generation sequencing is transforming complex disease genetics. Trends Genet 2013; 29:23-30. [DOI: 10.1016/j.tig.2012.10.001] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2012] [Revised: 09/25/2012] [Accepted: 10/01/2012] [Indexed: 02/02/2023]
|
9
|
Linkage analysis identifies a locus for plasma von Willebrand factor undetected by genome-wide association. Proc Natl Acad Sci U S A 2012; 110:588-93. [PMID: 23267103 DOI: 10.1073/pnas.1219885110] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The plasma glycoprotein von Willebrand factor (VWF) exhibits fivefold antigen level variation across the normal human population determined by both genetic and environmental factors. Low levels of VWF are associated with bleeding and elevated levels with increased risk for thrombosis, myocardial infarction, and stroke. To identify additional genetic determinants of VWF antigen levels and to minimize the impact of age and illness-related environmental factors, we performed genome-wide association analysis in two young and healthy cohorts (n = 1,152 and n = 2,310) and identified signals at ABO (P < 7.9E-139) and VWF (P < 5.5E-16), consistent with previous reports. Additionally, linkage analysis based on sibling structure within the cohorts, identified significant signals at chromosome 2q12-2p13 (LOD score 5.3) and at the ABO locus on chromosome 9q34 (LOD score 2.9) that explained 19.2% and 24.5% of the variance in VWF levels, respectively. Given its strong effect, the linkage region on chromosome 2 could harbor a potentially important determinant of bleeding and thrombosis risk. The absence of a chromosome 2 association signal in this or previous association studies suggests a causative gene harboring many genetic variants that are individually rare, but in aggregate common. These results raise the possibility that similar loci could explain a significant portion of the "missing heritability" for other complex genetic traits.
Collapse
|
10
|
Genetic heterogeneity in Finnish hereditary prostate cancer using ordered subset analysis. Eur J Hum Genet 2012; 21:437-43. [PMID: 22948022 DOI: 10.1038/ejhg.2012.185] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Prostate cancer (PrCa) is the most common male cancer in developed countries and the second most common cause of cancer death after lung cancer. We recently reported a genome-wide linkage scan in 69 Finnish hereditary PrCa (HPC) families, which replicated the HPC9 locus on 17q21-q22 and identified a locus on 2q37. The aim of this study was to identify and to detect other loci linked to HPC. Here we used ordered subset analysis (OSA), conditioned on nonparametric linkage to these loci to detect other loci linked to HPC in subsets of families, but not the overall sample. We analyzed the families based on their evidence for linkage to chromosome 2, chromosome 17 and a maximum score using the strongest evidence of linkage from either of the two loci. Significant linkage to a 5-cM linkage interval with a peak OSA nonparametric allele-sharing LOD score of 4.876 on Xq26.3-q27 (ΔLOD=3.193, empirical P=0.009) was observed in a subset of 41 families weakly linked to 2q37, overlapping the HPCX1 locus. Two peaks that were novel to the analysis combining linkage evidence from both primary loci were identified; 18q12.1-q12.2 (OSA LOD=2.541, ΔLOD=1.651, P=0.03) and 22q11.1-q11.21 (OSA LOD=2.395, ΔLOD=2.36, P=0.006), which is close to HPC6. Using OSA allows us to find additional loci linked to HPC in subsets of families, and underlines the complex genetic heterogeneity of HPC even in highly aggregated families.
Collapse
|
11
|
Kazma R, Bailey JN. Population-based and family-based designs to analyze rare variants in complex diseases. Genet Epidemiol 2012; 35 Suppl 1:S41-7. [PMID: 22128057 DOI: 10.1002/gepi.20648] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genotyping of rare variants on a large scale is now possible using next-generation sequencing. Sample selection is a crucial step in designing the genetic study of a complex disease, and knowledge of the efficiency and limitations of population-based and family-based designs can help researchers make the appropriate choice. The nine contributions to Group 5 of Genetic Analysis Workshop 17 evaluate population-based and family-based designs by comparing the results obtained with various methods applied to the mini-exome simulations. These simulations consisted of 200 replicates composed of unrelated individuals and eight extended pedigrees with genotypes and various phenotypes. The methods tested for association with a population-based and/or a family-based design, tested for linkage with a family-based design, or estimated heritability. We summarize the strengths and weaknesses of both designs. Although population-based designs seem more suitable for detecting the effect of multiple rare variants, family-based designs can potentially enrich the sample in rare variants, for which the effect would be concealed at the population level. However, as of today, the main limitation is still the high cost of next-generation sequencing.
Collapse
Affiliation(s)
- Rémi Kazma
- Department of Epidemiology and Biostatistics and Institute for Human Genetics, University of California, San Francisco, CA 94143-3110, USA.
| | | |
Collapse
|
12
|
Wilson AF, Ziegler A. Lessons learned from Genetic Analysis Workshop 17: transitioning from genome-wide association studies to whole-genome statistical genetic analysis. Genet Epidemiol 2011; 35 Suppl 1:S107-14. [PMID: 22128050 PMCID: PMC3277851 DOI: 10.1002/gepi.20659] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Genetic Analysis Workshop 17 (GAW17) focused on the transition from genome-wide association study designs and methods to the study designs and statistical genetic methods that will be required for the analysis of next-generation sequence data including both common and rare sequence variants. In the 166 contributions to GAW17, a wide variety of statistical methods were applied to simulated traits in population- and family-based samples, and results from these analyses were compared to the known generating model. In general, many of the statistical genetic methods used in the population-based sample identified causal sequence variants (SVs) when the estimated locus-specific heritability, as measured in the population-based sample, was greater than about 0.08. However, SVs with locus-specific heritabilities less than 0.03 were rarely identified consistently. In the family-based samples, many of the methods detected SVs that were rarer than those detected in the population-based sample, but the estimated locus-specific heritabilities for these rare SVs, as measured in the family-based samples, were substantially higher (>0.2) than their corresponding heritabilities in the population-based samples. Substantial inflation of the type I error rate was observed across a wide variety of statistical methods. Although many of the contributions found little inflation in type I error for Q4, a trait with no causal SVs, type I error rates for Q1 and Q2 were well above their nominal levels with the inflation for Q1 being higher than that for Q2. It seems likely that this inflation in type I error is due to correlations among SVs.
Collapse
Affiliation(s)
- Alexander F Wilson
- Genometrics Section, Inherited Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD 21224, USA.
| | | |
Collapse
|