1
|
Shadrina M, Kalay Ö, Demirkaya-Budak S, LeDuc CA, Chung WK, Turgut D, Budak G, Arslan E, Semenyuk V, Davis-Dusenbery B, Seidman CE, Yost HJ, Jain A, Gelb BD. Efficient identification of de novo mutations in family trios: a consensus-based informatic approach. Life Sci Alliance 2025; 8:e202403039. [PMID: 40155050 PMCID: PMC11953573 DOI: 10.26508/lsa.202403039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 03/19/2025] [Accepted: 03/20/2025] [Indexed: 04/01/2025] Open
Abstract
Accurate identification of de novo variants (DNVs) remains challenging despite advances in sequencing technologies, often requiring ad hoc filters and manual inspection. Here, we explored a purely informatic, consensus-based approach for identifying DNVs in proband-parent trios using short-read genome sequencing data. We evaluated variant calls generated by three sequence analysis pipelines-GATK HaplotypeCaller, DeepTrio, and Velsera GRAF-and examined the assumption that a requirement of consensus can serve as an effective filter for high-quality DNVs. Comparison with a highly accurate DNV set, validated previously by manual inspection and Sanger sequencing, demonstrated that consensus filtering, followed by a force-calling procedure, effectively removed false-positive calls, achieving 98.0-99.4% precision. At the same time, sensitivity of the workflow based on the previously established DNVs reached 99.4%. Validation in the HG002-3-4 Genome-in-a-Bottle trio confirmed its robustness, with precision reaching 99.2% and sensitivity up to 96.6%. We believe that this consensus approach can be widely implemented as an automated bioinformatics workflow suitable for large-scale analyses without the need for manual intervention, especially when very high precision is valued over sensitivity.
Collapse
Affiliation(s)
- Mariya Shadrina
- Mindich Child Health and Development Institute and the Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York, NY, USA
| | | | | | - Charles A LeDuc
- Department of Pediatrics, Columbia University, New York, NY, USA
| | - Wendy K Chung
- Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | | | | | | | | | | | - Christine E Seidman
- Division of Cardiovascular Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - H Joseph Yost
- Molecular Medicine Program, University of Utah, Salt Lake City, UT, USA
| | | | - Bruce D Gelb
- Mindich Child Health and Development Institute and the Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York, NY, USA
- Department of Pediatrics, Icahn School of Medicine, New York, NY, USA
| |
Collapse
|
2
|
Liang Q, Lan Q, Liu Q, Wu X, Luo L, Shen C, Zhu B. Systematic evaluations of forensic effectiveness and genetic structures of two ethnic groups in Northwest China using a self-developed Multi-InDel panel. Hereditas 2025; 162:80. [PMID: 40380291 DOI: 10.1186/s41065-025-00416-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Accepted: 03/12/2025] [Indexed: 05/19/2025] Open
Abstract
BACKGROUND The use of compound markers has gained significant interest among forensic practitioners, due to their ability to enhance genetic marker polymorphisms by introducing new alleles. Two or more closely linked insertion/deletion (InDel) markers form a compound marker termed Multi-InDel, which offers the advantages of microhaplotype (MH) and can be genotyped using capillary electrophoresis (CE) platform. A multiplex amplification panel, including 41 Multi-InDel markers and the sex-determination locus Amelogenin, was developed and validated as an effective tool for forensic and population genetics applications. METHODS A total of 245 Kazakh and Kyrgyz samples from China were genotyped based on the 41 Multi-InDel markers to evaluate the forensic efficacy of the panel. In addition, Multi-InDel genotyping data from 28 reference populations were collected, and population genetic analyses were performed to elucidate the genetic backgrounds of Chinese Kazakh and Kyrgyz groups. CONCLUSIONS The Multi-InDel markers demonstrated high genetic polymorphisms in Chinese Kazakh and Kyrgyz ethnic groups, indicating their suitability for forensic applications. For the two ethnic groups, the cumulative power of discrimination (CPD) values were 0.999999999999999999999999835993 and 0.999999999999999999999999717184, respectively, while the cumulative power of exclusion (CPE) values were 0.999998887418153 and 0.999999348634116, respectively. Using this Multi-InDel panel, an average of 98.82% of full sibling (FS) pairs could be distinguished from unrelated individual pairs (likelihood ratio > 1). Regarding population genetics, Chinese Kazakh and Kyrgyz groups were found to exhibit an East Asia-Europe admixed ancestry pattern, while maintaining closer genetic affinities with East Asian populations.
Collapse
Affiliation(s)
- Qinglin Liang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China
| | - Qiong Lan
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China
| | - Qinglin Liu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China
| | - Xiaolian Wu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China
| | - Lisiteng Luo
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China
| | - Chunmei Shen
- Department of Clinical Laboratory, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong, 510515, China.
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China.
| |
Collapse
|
3
|
Koh GCC, Nanda AS, Rinaldi G, Boushaki S, Degasperi A, Badja C, Pregnall AM, Zhao SJ, Chmelova L, Black D, Heskin L, Dias J, Young J, Memari Y, Shooter S, Czarnecki J, Brown MA, Davies HR, Zou X, Nik-Zainal S. A redefined InDel taxonomy provides insights into mutational signatures. Nat Genet 2025; 57:1132-1141. [PMID: 40210680 DOI: 10.1038/s41588-025-02152-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 03/04/2025] [Indexed: 04/12/2025]
Abstract
Despite their deleterious effects, small insertions and deletions (InDels) have received far less attention than substitutions. Here we generated isogenic CRISPR-edited human cellular models of postreplicative repair dysfunction (PRRd), including individual and combined gene edits of DNA mismatch repair (MMR) and replicative polymerases (Pol ε and Pol δ). Unique, diverse InDel mutational footprints were revealed. However, the prevailing InDel classification framework was unable to discriminate these InDel signatures from background mutagenesis and from each other. To address this, we developed an alternative InDel classification system that considers flanking sequences and informative motifs (for example, longer homopolymers), enabling unambiguous InDel classification into 89 subtypes. Through focused characterization of seven tumor types from the 100,000 Genomes Project, we uncovered 37 InDel signatures; 27 were new. In addition to unveiling previously hidden biological insights, we also developed PRRDetect-a highly specific classifier of PRRd status in tumors, with potential implications for immunotherapies.
Collapse
Affiliation(s)
- Gene Ching Chiek Koh
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
- School of Medical and Life Sciences, Sunway University, Sunway City, Malaysia
| | - Arjun Scott Nanda
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Giuseppe Rinaldi
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Soraya Boushaki
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Andrea Degasperi
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Cherif Badja
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Andrew Marcel Pregnall
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Salome Jingchen Zhao
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Lucia Chmelova
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Daniella Black
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Laura Heskin
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - João Dias
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Jamie Young
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Yasin Memari
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Scott Shooter
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Jan Czarnecki
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Matthew Arthur Brown
- Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, UK
| | - Helen Ruth Davies
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Xueqing Zou
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Serena Nik-Zainal
- Department of Genomic Medicine, School of Clinical Medicine, University of Cambridge, Cambridge, UK.
- Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK.
| |
Collapse
|
4
|
Zhao S, Xu D, Cai J, Shen Q, He M, Pan X, Gao Y, Li J, Yuan X. Benchmarking strategies for CNV calling from whole genome bisulfite data in humans. Comput Struct Biotechnol J 2025; 27:912-919. [PMID: 40123798 PMCID: PMC11929052 DOI: 10.1016/j.csbj.2025.02.040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2025] Open
Abstract
It's important to dissect the relationship between copy number variations (CNVs) and DNA methylation, because both greatly change the dosages of genes and are responsible for diverse human cancers. Although whole genome bisulfite sequencing (WGBS) informs CNVs and DNA methylation, no study has provided a systematic benchmark for detecting CNVs from WGBS data. Herein, based on simulated and real WGBS datasets of 84.62 billion reads, we undertook 714 CNV detections to comprehensively benchmark the performance of 35 strategies, 5 alignment algorithms (bismarkbt2, bsbolt, bsmap, bwameth, and walt) wrapping with 7 CNV detection applications (BreakDancer, cn.mops, CNVkit, CNVnator, DELLY, GASV and Pindel). The results highlighted a subset of strategies that accurately called CNVs depending on numbers, lengths, precision, recall, and F1 scores of CNV detections. We found that bwameth-DELLY and bwameth-BreakDancer were the best strategies for calling deletions, and walt-CNVnator and bismarkbt2-CNVnator were the best strategies for calling duplications. These works provided investigators with useful information to accurately explore CNVs from WGBS data in humans.
Collapse
Affiliation(s)
- Shanghui Zhao
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Dantong Xu
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Jiali Cai
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Qingpeng Shen
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Mingran He
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Xiangchun Pan
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Yahui Gao
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Jiaqi Li
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
- National Center of Technology Innovation for Pigs, Chongqing 402460, China
| | - Xiaolong Yuan
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong 510642, China
- National Center of Technology Innovation for Pigs, Chongqing 402460, China
- Centre for Healthy Ageing, Health Futures Institute, Murdoch University, Murdoch, WA 6150, Australia
| |
Collapse
|
5
|
Mei S, Cui W, Chen M, Cai M, Lei F, Wang X, Yang S, Zhu B. Comprehensive elucidation on the genetic profile of the Hezhou Han population via an efficient InDel panel. Forensic Sci Res 2025; 10:owae021. [PMID: 40007634 PMCID: PMC11850652 DOI: 10.1093/fsr/owae021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 03/27/2024] [Indexed: 02/27/2025] Open
Abstract
The Han nationality is widely distributed in different regions, and it is one of the most populous nationalities in China. Compared with the ethnic minorities in Guangxi Zhuang Autonomous Region, there is relatively less research on Han individuals dwelled in Guangxi as a part of Chinese Han population. In this study, the genetic polymorphisms of 57 autosomal insertion/deletion (InDel) loci were explored in Hezhou Han (HZH) population. Forensic-related parameters revealed that these 57 InDel loci had high forensic validity and could be used in forensic practice application. In addition, the genetic relationships between the HZH population and 30 worldwide reference populations were explored using a variety of analytical methods, such as phylogenetic tree, principal component analysis, and genetic structure analysis. These results demonstrated that there were closer genetic relationships between the HZH and nine populations from East Asia (EAS). The prediction accuracy rates of five inter-continental cross-validation analyses for individuals from EAS was >0.9, and the prediction accuracy rates of three inter-continental cross-validation analyses for individuals from EAS, Europe, and Africa were all >0.95. In addition, 24 of the 57 InDel loci could be served as ancestral information inference loci, which could effectively distinguish individuals of EAS, Europe, and Africa. In conclusion, these InDel loci could be used not only as a good tool for individual identification and paternity testing in HZH population, but also as an auxiliary tool for ancestry information inference research.
Collapse
Affiliation(s)
- Shuyan Mei
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, China
- College of Basic Medicine and Forensic Medicine, Henan University of Science and Technology, Luoyang, China
| | - Wei Cui
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Man Chen
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Meiming Cai
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Fanzhang Lei
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Xi Wang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | | | - Bofeng Zhu
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, China
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi’an Jiaotong University, Xi’an, China
| |
Collapse
|
6
|
Ramos LC, Kobachuk LDG, Nadur DM, Sabbag LR, Rosário MMTD, Naslavsky MS, Mendes-Junior CT, Castelli EC. Assessing the performance of multi-InDel panels for human identification among admixed Brazilians. Forensic Sci Int Genet 2025; 74:103161. [PMID: 39418769 DOI: 10.1016/j.fsigen.2024.103161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 09/05/2024] [Accepted: 10/07/2024] [Indexed: 10/19/2024]
Abstract
Insertion/deletion polymorphisms, or InDels, are widely present in the human genome. They have been considered as potential markers for forensic analysis because they can be genotyped using the CE platform and compatible typing techniques used in forensic laboratories. Additionally, InDels have lower mutation rates and often short amplicon sizes, making them ideal for detecting degraded samples. However, most InDels are bi-allelic; therefore, their discrimination power is relatively low. A new set of genetic marker called multi-InDels was reported to improve InDel's informativeness. Multi-InDel markers are generally designated as microhaplotypes encompassing two or more InDels within a short distance, usually less than 200 bp. In this study, we evaluated the applicability of three previously proposed panels of multi-InDel markers, designed for Asian populations, for human identification in Brazil. We assessed all the multi-InDel markers using high-coverage whole-genome sequencing data from a census-based cohort of 1171 Brazilians residing in São Paulo, the largest Brazilian capital. The results showed that most markers are informative for Brazilian individuals since they present more than three frequent haplotypes with different sizes. However, most markers are prone to amplification/sequencing errors due to repetitive or low-complexity regions. Among the tested panels, the one from Huang et al. (2014) is the most promising for forensic use in Brazil, with a combined match probability and cumulative power of exclusion of 4.92 ×10-14 and 0.9991, respectively. Nevertheless, these values are low compared to the ones obtained with CODIS STRs (short tandem repeats) and larger SNP (single nucleotide polymorphisms) panels. Therefore, new attempts to scan the human genome for highly polymorphic multi-InDel markers are still necessary to obtain a suitable panel of multi-InDels for worldwide populations.
Collapse
Affiliation(s)
- Livia Carla Ramos
- São Paulo State University (UNESP), Institute of Biosciences, Botucatu, São Paulo, Brazil; Molecular Genetics and Bioinformatics Laboratory, Experimental Research Unit - Unipex, School of Medicine, São Paulo State University - UNESP, Botucatu, São Paulo, Brazil; Superintendência da Polícia Técnico-Científica, Núcleo de Perícias Criminalísticas de Bauru, São Paulo, Brazil.
| | - Luciellen D G Kobachuk
- Polícia Científica do Paraná, Seção de Genética Molecular Forense, Curitiba, Paraná, Brazil; Department of Genetics, School of Medicine, University of São Paulo (USP), Ribeirão Preto, São Paulo, Brazil
| | - Douglas Mendes Nadur
- São Paulo State University (UNESP), Institute of Biosciences, Botucatu, São Paulo, Brazil
| | - Luiza Rauen Sabbag
- Department of Genetics, School of Medicine, University of São Paulo (USP), Ribeirão Preto, São Paulo, Brazil
| | | | - Michel S Naslavsky
- Department of Genetics and Evolutionary Biology, Biosciences Institute, University of São Paulo (USP), São Paulo, São Paulo, Brazil; Hospital Israelita Albert Einstein, São Paulo, São Paulo, Brazil
| | - Celso Teixeira Mendes-Junior
- Department of Chemistry, Forensic and Genomics Research Laboratory, Faculty of Philosophy, Science and Letters at Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP 14040-901, Brazil.
| | - Erick C Castelli
- Molecular Genetics and Bioinformatics Laboratory, Experimental Research Unit - Unipex, School of Medicine, São Paulo State University - UNESP, Botucatu, São Paulo, Brazil; São Paulo State University (UNESP), Department of Pathology, School of Medicine, Botucatu, São Paulo, Brazil.
| |
Collapse
|
7
|
Zhou W, Mumm C, Gan Y, Switzenberg JA, Wang J, De Oliveira P, Kathuria K, Losh SJ, McDonald TL, Bessell B, Van Deynze K, McConnell MJ, Boyle AP, Mills RE. A personalized multi-platform assessment of somatic mosaicism in the human frontal cortex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.18.629274. [PMID: 39763954 PMCID: PMC11702624 DOI: 10.1101/2024.12.18.629274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/16/2025]
Abstract
Somatic mutations in individual cells lead to genomic mosaicism, contributing to the intricate regulatory landscape of genetic disorders and cancers. To evaluate and refine the detection of somatic mosaicism across different technologies with personalized donor-specific assembly (DSA), we obtained tissue from the dorsolateral prefrontal cortex (DLPFC) of a post-mortem neurotypical 31-year-old individual. We sequenced bulk DLPFC tissue using Oxford Nanopore Technologies (~60X), NovaSeq (~30X), and linked-read sequencing (~28X). Additionally, we applied Cas9 capture methodology coupled with long-read sequencing (TEnCATS), targeting active transposable elements. We also isolated and amplified DNA from flow-sorted single DLPFC neurons using MALBAC, sequencing 115 of these MALBAC libraries on Nanopore and 94 on NovaSeq. We constructed a haplotype-resolved assembly with a total length of 5.77 Gb and a phase block length of 2.67 Mb (N50) to facilitate cross-platform analysis of somatic genetic variations. We observed an increase in the phasing rate from 11.6% to 38.0% between short-read and long-read technologies. By generating a catalog of phased germline SNVs, CNVs, and TEs from the assembled genome, we applied standard approaches to recall these variants across sequencing technologies. We achieved aggregated recall rates from 97.3% to 99.4% based on long-read bulk tissue data, setting an upper bound for detection limits. Moreover, utilizing haplotype-based analysis from DSA, we achieved a remarkable reduction in false positive somatic calls in bulk tissue, ranging from 14.9% to 72.4%. We developed pipelines leveraging DSA information to enhance somatic large genetic variant calling in long-read single cells. By examining somatic variation using long-reads in 115 individual neurons, we identified 468 candidate somatic heterozygous large deletions (1.5Mb - 20Mb), 137 of which intersected with short-read single-cell data. Additionally, we identified 61 putative somatic TEs (60 Alus, one LINE-1) in the single-cell data. Collectively, our analysis spans personalized assembly to single-cell somatic variant calling, providing a comprehensive ab initio ad finem approach and resource in real human tissue.
Collapse
Affiliation(s)
- Weichen Zhou
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Camille Mumm
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Yanming Gan
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jessica A. Switzenberg
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jinhao Wang
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | | | - Kunal Kathuria
- Lieber Institute for Brain Development, Baltimore, MD, USA
| | - Steven J. Losh
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Torrin L. McDonald
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Brandt Bessell
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Kinsey Van Deynze
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | | | - Alan P. Boyle
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Ryan E. Mills
- Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
8
|
Zhao L, Teng J, Ning C, Zhang Q. Genome-Wide Association Study of Insertions and Deletions Identified Novel Loci Associated with Milk Production Traits in Dairy Cattle. Animals (Basel) 2024; 14:3556. [PMID: 39765460 PMCID: PMC11672399 DOI: 10.3390/ani14243556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 11/25/2024] [Accepted: 12/04/2024] [Indexed: 01/11/2025] Open
Abstract
Genome-wide association study (GWAS) have identified a large number of SNPs associated with milk production traits in dairy cattle. Behind SNPs, INDELs are the second most abundant genetic polymorphisms in the genome, which may exhibit an independent association with complex traits in humans and other species. However, there are no reports on GWASs of INDELs for milk production traits in dairy cattle. In this study, using imputed sequence data, we performed INDEL-based and SNP-based GWASs for milk production traits in a Holstein cattle population. We identified 58 unique significant INDELs for one or multiple traits. The majority of these INDELs are in considerable LD with nearby significant SNPs. However, through conditional association analysis, we identified nine INDELs which showed independent associations. Genomic annotations of these INDELs indicated some novel associated genes, i.e., TRNAG-CCC, EPPK1, PPM1K, PTDSS1, and mir-10163, which were not reported in previous SNP-based GWASs. Our findings suggest that INDEL-based GWASs could be valuable complement to SNP-based GWASs for milk production traits.
Collapse
Affiliation(s)
| | | | | | - Qin Zhang
- Shandong Provincial Key Laboratory for Livestock Germplasm Innovation & Utilization, College of Animal Science and Technology, Shandong Agricultural University, Tai’an 271018, China; (L.Z.); (J.T.); (C.N.)
| |
Collapse
|
9
|
Koko M, Fabian L, Popov I, Eberhardt RY, Zakharov G, Huang QQ, Wade EE, Azad R, Danecek P, Ho K, Hough A, Huang W, Lindsay SJ, Malawsky DS, Bonfanti D, Mason D, Plowman D, Quail MA, Ring SM, Shireby G, Widaa S, Fitzsimons E, Iyer V, Bann D, Timpson NJ, Wright J, Hurles ME, Martin HC. Exome sequencing of UK birth cohorts. Wellcome Open Res 2024; 9:390. [PMID: 39839975 PMCID: PMC11747307 DOI: 10.12688/wellcomeopenres.22697.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/26/2024] [Indexed: 01/23/2025] Open
Abstract
Birth cohort studies involve repeated surveys of large numbers of individuals from birth and throughout their lives. They collect information useful for a wide range of life course research domains, and biological samples which can be used to derive data from an increasing collection of omic technologies. This rich source of longitudinal data, when combined with genomic data, offers the scientific community valuable insights ranging from population genetics to applications across the social sciences. Here we present quality-controlled whole exome sequencing data from three UK birth cohorts: the Avon Longitudinal Study of Parents and Children (8,436 children and 3,215 parents), the Millenium Cohort Study (7,667 children and 6,925 parents) and Born in Bradford (8,784 children and 2,875 parents). The overall objective of this coordinated effort is to make the resulting high-quality data widely accessible to the global research community in a timely manner. We describe how the datasets were generated and subjected to quality control at the sample, variant and genotype level. We then present some preliminary analyses to illustrate the quality of the datasets and probe potential sources of bias. We introduce measures of ultra-rare variant burden to the variables available for researchers working on these cohorts, and show that the exome-wide burden of deleterious protein-truncating variants, S het burden, is associated with educational attainment and cognitive test scores. The whole exome sequence data from these birth cohorts (CRAM & VCF files) are available through the European Genome-Phenome Archive, and here we provide guidance for their use.
Collapse
Affiliation(s)
- Mahmoud Koko
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Laurie Fabian
- Population Health Sciences, University of Bristol Medical School, Bristol, England, BS8 2BN, UK
| | - Iaroslav Popov
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Ruth Y. Eberhardt
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Gennadii Zakharov
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Qin Qin Huang
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Emma E. Wade
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Rafaq Azad
- Bradford Institute for Health Research, Bradford Royal Infirmary, Bradford, England, BD9 6RJ, UK
| | - Petr Danecek
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Karen Ho
- Population Health Sciences, University of Bristol Medical School, Bristol, England, BS8 2BN, UK
| | - Amy Hough
- Bradford Institute for Health Research, Bradford Royal Infirmary, Bradford, England, BD9 6RJ, UK
| | - Wei Huang
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Sarah J. Lindsay
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Daniel S. Malawsky
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Davide Bonfanti
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Dan Mason
- Bradford Institute for Health Research, Bradford Royal Infirmary, Bradford, England, BD9 6RJ, UK
| | - Deborah Plowman
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Michael A. Quail
- Sequencing R&D, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Susan M. Ring
- Population Health Sciences, University of Bristol Medical School, Bristol, England, BS8 2BN, UK
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, England, BS8 2BN, UK
| | - Gemma Shireby
- Centre for Longitudinal Studies, University College London Institute of Education, London, England, WC1H 0NU, UK
| | - Sara Widaa
- Sequencing R&D, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Emla Fitzsimons
- Centre for Longitudinal Studies, University College London Institute of Education, London, England, WC1H 0NU, UK
| | - Vivek Iyer
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - David Bann
- Centre for Longitudinal Studies, University College London Institute of Education, London, England, WC1H 0NU, UK
| | - Nicholas J. Timpson
- Population Health Sciences, University of Bristol Medical School, Bristol, England, BS8 2BN, UK
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, England, BS8 2BN, UK
| | - John Wright
- Bradford Institute for Health Research, Bradford Royal Infirmary, Bradford, England, BD9 6RJ, UK
| | - Matthew E. Hurles
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| | - Hilary C. Martin
- Human Genetics, Wellcome Sanger Institute, Hinxton, England, CB10 1SA, UK
| |
Collapse
|
10
|
Ji Y, Chen C, Lu P, Wang Z, Chen H, Sun L, Fei S, Ju X, Tan R, Gu M. Nuclear factor of activated T cell cytoplasmic 1 (NFATc1) insertion gene polymorphism as a possible trigger in acute T cell-mediated rejection (aTCMR) after kidney transplantation. Transpl Immunol 2024; 87:102139. [PMID: 39461381 DOI: 10.1016/j.trim.2024.102139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 10/04/2024] [Accepted: 10/20/2024] [Indexed: 10/29/2024]
Abstract
BACKGROUND To investigate the potential regulatory role of gene insertion or deletion (in/del) polymorphism in the occurrence of acute T cell-mediated rejection (aTCMR) after kidney transplantation. METHODS We retrospectively analyzed the 5-year follow-up data of 133 recipients who underwent renal transplantation at the First Affiliated Hospital of Nanjing Medical University between February 1, 2010, and December 1, 2015. With target sequencing based on next-generation sequencing (NGS), tagger in/dels selection involved calculating the Hardy-Weinberg equilibrium (HWE), Minor Allele Frequency (MAF), and the linkage disequilibrium (LD) blocks. Significant in/dels associated with aTCMR were identified by intersecting the results obtained through analysis of covariance (ANCOVA) of clinical cofounders and model analysis in Rstudio using the "SNPassoc" package. Additionally, logistic models were employed to assess the associations between genotypes and the aTCMR occurrence in 5 years after surgery. RESULTS NFATc1 rs55741427 insertion was identified to be significantly associated with the post-surgery aTCMR(OR = 2.66, P < 0.001). We constructed a conclusive model containing the occurrence of delayed graft function (DGF) and the insertion polymorphism of rs55741427, showing a favorable predictive ability (AUC = 0.766) for aTCMR after surgery. Based on the receiver operating characteristic (ROC) curve, all cases were stratified into aTCMR high-risk and low-risk groups. Kaplan-Meier curves for two groups revealed that the aTCMR high-risk group exhibited a more unfavorable graft survival outcome (P = 0.0048). CONCLUSION Insertion mutation of rs55741427 was found to be statistically correlated with the post-surgery aTCMR during 5 years of follow-up. Our model identified DGF and insertion of rs55741427 as two crucial aTCMR-related hazards, and aTCMR high-risk group showed a worse graft prognosis.
Collapse
Affiliation(s)
- Yisheng Ji
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China; The First Clinical Medical College, Nanjing Medical University, Nanjing, China
| | - Congcong Chen
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China; The First Clinical Medical College, Nanjing Medical University, Nanjing, China
| | - Pei Lu
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Zijie Wang
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Hao Chen
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Li Sun
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Shuang Fei
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Xiaobing Ju
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
| | - Ruoyun Tan
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
| | - Min Gu
- Deparment of Urology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China; Department of Urology, the Second Affiliated Hospital of Nanjing Medical University, Nanjing, China.
| |
Collapse
|
11
|
Yang C, Liu C, Lun M, Chen X, Xu Q, Liu X, He M, Ye L, He G, Wang M, Liu C. Dissecting the genetic admixture and forensic signatures of ethnolinguistically diverse Chinese populations via a 114-plex NGS InDel panel. BMC Genomics 2024; 25:1137. [PMID: 39587470 PMCID: PMC11587575 DOI: 10.1186/s12864-024-10894-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Accepted: 10/11/2024] [Indexed: 11/27/2024] Open
Abstract
Comprehensive characterizations of genetic diversity and demographic models of ethnolinguistically diverse Chinese populations are essential for elucidating their forensic characteristics and evolutionary past. We developed a 114-plex NGS InDel panel to genotype 114 genome-wide markers and investigated the genetic structures of Zhuang, Hui, Miao, Li, Tibetan, Yi, and Mongolian populations, encompassing five language families. This panel demonstrated robust performance, with exceptional potential for forensic individual identification and paternity testing, evidenced by the combined power of discrimination for 77 autosomal InDels (ranged from 1-3.6400 × 10-30 to 1-3.5713 × 10-32) and the combined power of exclusion (ranged from 1-2.1863 × 10-6 to 1-2.1261 × 10-7). The cumulative mean exclusion chance for 32 X-chromosomal InDels varied between 0.99996 and 0.99999 for trios and 0.99760 to 0.99898 for duos. We also analyzed genetic similarities and differences between these populations and 27 global populations, revealing distinct clusters among African, South Asian, East Asian, and European groups, with a close genetic affinity to East Asians. Notably, we identified geography-related genetic substructures: Inner Mongolia Mongolians and Gansu Huis formed a northern branch, Tibetans and Yis from Sichuan constituted a highland branch, and Guangxi Zhuangs exhibited close ties with Hainan Lis and Guangxi Miaos in the southern branch. Additionally, many InDels proved to be ancestry-informative markers for biogeographic ancestry inference. Collectively, these findings underscore the utility of the 114-plex NGS InDel panel as a complementary tool for forensic investigations and as a source of insights into the genetic architecture of ethnolinguistically distinct Chinese populations.
Collapse
Affiliation(s)
- Chengliang Yang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Changhui Liu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- Guangzhou Forensic Science Institute, Guangzhou, China
| | - Miaoqiang Lun
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Xiaohui Chen
- Guangzhou Forensic Science Institute, Guangzhou, China
| | - Quyi Xu
- Guangzhou Forensic Science Institute, Guangzhou, China
| | - Xueyuan Liu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Meiyun He
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Linying Ye
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Guanglin He
- Center for Archaeological Science, Sichuan University, Chengdu, China.
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, China.
| | - Mengge Wang
- Center for Archaeological Science, Sichuan University, Chengdu, China.
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, China.
| | - Chao Liu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China.
- National Anti-Drug Laboratory Guangdong Regional Center, Guangzhou, China.
| |
Collapse
|
12
|
Rodriguez-Flores JL, Khalid S, Parikshak N, Rasheed A, Ye B, Kapoor M, Backman J, Sepehrband F, Gioia SAD, Gelfman S, De T, Banerjee N, Sharma D, Martinez H, Castaneda S, D'Ambrosio D, Zhang XA, Xun P, Tsai E, Tsai IC, Khan MZ, Jahanzaib M, Mian MR, Liaqat MB, Mahmood K, Salam TU, Hussain M, Iqbal J, Aslam F, Cantor MN, Tzoneva G, Overton J, Marchini J, Reid JG, Baras A, Verweij N, Lotta LA, Coppola G, Karalis K, Economides A, Fazio S, Liedtke W, Danesh J, Kamal A, Frossard P, Coleman T, Shuldiner AR, Saleheen D. NOTCH3 p.Arg1231Cys is markedly enriched in South Asians and associated with stroke. Nat Commun 2024; 15:8029. [PMID: 39271666 PMCID: PMC11399414 DOI: 10.1038/s41467-024-51819-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 08/19/2024] [Indexed: 09/15/2024] Open
Abstract
The genetic factors of stroke in South Asians are largely unexplored. Exome-wide sequencing and association analysis (ExWAS) in 75 K Pakistanis identified NM_000435.3(NOTCH3):c.3691 C > T, encoding the missense amino acid substitution p.Arg1231Cys, enriched in South Asians (alternate allele frequency = 0.58% compared to 0.019% in Western Europeans), and associated with subcortical hemorrhagic stroke [odds ratio (OR) = 3.39, 95% confidence interval (CI) = [2.26, 5.10], p = 3.87 × 10-9), and all strokes (OR [CI] = 2.30 [1.77, 3.01], p = 7.79 × 10-10). NOTCH3 p.Arg231Cys was strongly associated with white matter hyperintensity on MRI in United Kingdom Biobank (UKB) participants (effect [95% CI] in SD units = 1.1 [0.61, 1.5], p = 3.0 × 10-6). The variant is attributable for approximately 2.0% of hemorrhagic strokes and 1.1% of all strokes in South Asians. These findings highlight the value of diversity in genetic studies and have major implications for genomic medicine and therapeutic development in South Asian populations.
Collapse
Affiliation(s)
| | - Shareef Khalid
- Columbia University, New York, NY, USA
- Center for Non-Communicable Diseases, Karachi, Pakistan
| | | | - Asif Rasheed
- Center for Non-Communicable Diseases, Karachi, Pakistan
| | - Bin Ye
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | - Tanima De
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | - Ellen Tsai
- University of California at Los Angeles, Los Angeles, CA, USA
| | - I-Chun Tsai
- Regeneron Pharmaceuticals Inc, Tarrytown, NY, USA
| | | | | | | | | | - Khalid Mahmood
- Dow University of Health Sciences and Civil Hospital, Karachi, Pakistan
| | | | | | - Javed Iqbal
- Department of Neurology, Allied Hospital, Faisalabad, Pakistan
| | - Faizan Aslam
- Department of Neurology, Aziz Fatima Hospital, Faisalabad, Pakistan
| | | | | | | | | | | | - Aris Baras
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | - Sergio Fazio
- Regeneron Pharmaceuticals Inc, Tarrytown, NY, USA
| | | | - John Danesh
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Ayeesha Kamal
- Section of Neurology, Department of Medicine, Aga Khan University, Karachi, Pakistan
| | | | | | | | - Danish Saleheen
- Columbia University, New York, NY, USA.
- Center for Non-Communicable Diseases, Karachi, Pakistan.
| |
Collapse
|
13
|
Redelings BD, Holmes I, Lunter G, Pupko T, Anisimova M. Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications. Mol Biol Evol 2024; 41:msae177. [PMID: 39172750 PMCID: PMC11385596 DOI: 10.1093/molbev/msae177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 07/02/2024] [Accepted: 07/09/2024] [Indexed: 08/24/2024] Open
Abstract
Insertions and deletions constitute the second most important source of natural genomic variation. Insertions and deletions make up to 25% of genomic variants in humans and are involved in complex evolutionary processes including genomic rearrangements, adaptation, and speciation. Recent advances in long-read sequencing technologies allow detailed inference of insertions and deletion variation in species and populations. Yet, despite their importance, evolutionary studies have traditionally ignored or mishandled insertions and deletions due to a lack of comprehensive methodologies and statistical models of insertions and deletion dynamics. Here, we discuss methods for describing insertions and deletion variation and modeling insertions and deletions over evolutionary time. We provide practical advice for tackling insertions and deletions in genomic sequences and illustrate our discussion with examples of insertions and deletion-induced effects in human and other natural populations and their contribution to evolutionary processes. We outline promising directions for future developments in statistical methodologies that would allow researchers to analyze insertions and deletion variation and their effects in large genomic data sets and to incorporate insertions and deletions in evolutionary inference.
Collapse
Affiliation(s)
| | - Ian Holmes
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
- Calico Life Sciences LLC, South San Francisco, CA 94080, USA
| | - Gerton Lunter
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen 9713 GZ, The Netherlands
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Maria Anisimova
- Institute of Computational Life Sciences, Zurich University of Applied Sciences, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
14
|
Matt GY, Sioson E, Shelton K, Wang J, Lu C, Zaldivar Peraza A, Gangwani K, Paul R, Reilly C, Acić A, Liu Q, Sandor SR, McLeod C, Patel J, Wang F, Im C, Wang Z, Sapkota Y, Wilson CL, Bhakta N, Ness KK, Armstrong GT, Hudson MM, Robison LL, Zhang J, Yasui Y, Zhou X. St. Jude Survivorship Portal: Sharing and Analyzing Large Clinical and Genomic Datasets from Pediatric Cancer Survivors. Cancer Discov 2024; 14:1403-1417. [PMID: 38593228 PMCID: PMC11294819 DOI: 10.1158/2159-8290.cd-23-1441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/08/2024] [Accepted: 03/26/2024] [Indexed: 04/11/2024]
Abstract
Childhood cancer survivorship studies generate comprehensive datasets comprising demographic, diagnosis, treatment, outcome, and genomic data from survivors. To broadly share this data, we created the St. Jude Survivorship Portal (https://survivorship.stjude.cloud), the first data portal for sharing, analyzing, and visualizing pediatric cancer survivorship data. More than 1,600 phenotypic variables and 400 million genetic variants from more than 7,700 childhood cancer survivors can be explored on this free, open-access portal. Summary statistics of variables are computed on-the-fly and visualized through interactive and customizable charts. Survivor cohorts can be customized and/or divided into groups for comparative analysis. Users can also seamlessly perform cumulative incidence and regression analyses on the stored survivorship data. Using the portal, we explored the ototoxic effects of platinum-based chemotherapy, uncovered a novel association between mental health, age, and limb amputation, and discovered a novel haplotype in MAGI3 strongly associated with cardiomyopathy specifically in survivors of African ancestry. Significance: The St. Jude Survivorship Portal is the first data portal designed to share and explore clinical and genetic data from childhood cancer survivors. The portal provides both open- and controlled-access features and will fulfill a wide range of data sharing needs of the survivorship research community and beyond. See co-corresponding author Xin Zhou discuss this research article, published simultaneously at the AACR Annual Meeting 2024: https://vimeo.com/932617204/7d99fa4958.
Collapse
Affiliation(s)
- Gavriel Y. Matt
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Edgar Sioson
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Kyla Shelton
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Jian Wang
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Congyu Lu
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Airen Zaldivar Peraza
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Karishma Gangwani
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Robin Paul
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Colleen Reilly
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Aleksandar Acić
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Qi Liu
- School of Public Health, University of Alberta, Edmonton, Canada.
| | - Stephanie R. Sandor
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Clay McLeod
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Jaimin Patel
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Fan Wang
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Cindy Im
- Department of Pediatrics, University of Minnesota Medical School, Minneapolis, Minnesota.
| | - Zhaoming Wang
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Yadav Sapkota
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Carmen L. Wilson
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Nickhill Bhakta
- Department of Global Pediatric Medicine, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Kirsten K. Ness
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Gregory T. Armstrong
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Melissa M. Hudson
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
- Department of Oncology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Leslie L. Robison
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Jinghui Zhang
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| | - Yutaka Yasui
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, Tennessee.
- School of Public Health, University of Alberta, Edmonton, Canada.
| | - Xin Zhou
- Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee.
| |
Collapse
|
15
|
Nikjoo H, Rahmanian S, Taleei R. Modelling DNA damage-repair and beyond. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2024; 190:1-18. [PMID: 38754703 DOI: 10.1016/j.pbiomolbio.2024.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 03/27/2024] [Accepted: 05/10/2024] [Indexed: 05/18/2024]
Abstract
The paper presents a review of mechanistic modelling studies of DNA damage and DNA repair, and consequences to follow in mammalian cell nucleus. We hypothesize DNA deletions are consequences of repair of double strand breaks leading to the modifications of genome that play crucial role in long term development of genetic inheritance and diseases. The aim of the paper is to review formation mechanisms underlying naturally occurring DNA deletions in the human genome and their potential relevance for bridging the gap between induced DNA double strand breaks and deletions in damaged human genome from endogenous and exogenous events. The model of the cell nucleus presented enables simulation of DNA damage at molecular level identifying the spectrum of damage induced in all chromosomal territories and loops. Our mechanistic modelling of DNA repair for double stand breaks (DSB), single strand breaks (SSB) and base damage (BD), shows the complexity of DNA damage is responsible for the longer repair times and the reason for the biphasic feature of mammalian cells repair curves. In the absence of experimentally determined data, the mechanistic model of repair predicts the in vivo rate constants for the proteins involved in the repair of DSB, SSB, and of BD.
Collapse
Affiliation(s)
- Hooshang Nikjoo
- Department of Physiology, Anatomy and Genetics (DPAG), Oxford University, Oxford, OX1 3PT, UK.
| | | | - Reza Taleei
- Medical Physics Division, Department of Radiation Oncology Sidney Kimmel Medical College at Thomas Jefferson University, Philadelphia, PA, 19107, USA.
| |
Collapse
|
16
|
Bauer BA, Schmidt CM, Ruddy KJ, Olson JE, Meydan C, Schmidt JC, Smith SY, Couch FJ, Earls JC, Price ND, Dudley JT, Mason CE, Zhang B, Phipps SM, Schmidt MA. A Multiomics, Molecular Atlas of Breast Cancer Survivors. Metabolites 2024; 14:396. [PMID: 39057719 PMCID: PMC11279123 DOI: 10.3390/metabo14070396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Revised: 07/09/2024] [Accepted: 07/15/2024] [Indexed: 07/28/2024] Open
Abstract
Breast cancer imposes a significant burden globally. While the survival rate is steadily improving, much remains to be elucidated. This observational, single time point, multiomic study utilizing genomics, proteomics, targeted and untargeted metabolomics, and metagenomics in a breast cancer survivor (BCS) and age-matched healthy control cohort (N = 100) provides deep molecular phenotyping of breast cancer survivors. In this study, the BCS cohort had significantly higher polygenic risk scores for breast cancer than the control group. Carnitine and hexanoyl carnitine were significantly different. Several bile acid and fatty acid metabolites were significantly dissimilar, most notably the Omega-3 Index (O3I) (significantly lower in BCS). Proteomic and metagenomic analyses identified group and pathway differences, which warrant further investigation. The database built from this study contributes a wealth of data on breast cancer survivorship where there has been a paucity, affording the ability to identify patterns and novel insights that can drive new hypotheses and inform future research. Expansion of this database in the treatment-naïve, newly diagnosed, controlling for treatment confounders, and through the disease progression, can be leveraged to profile and contextualize breast cancer and breast cancer survivorship, potentially leading to the development of new strategies to combat this disease and improve the quality of life for its victims.
Collapse
Affiliation(s)
| | - Caleb M. Schmidt
- Sovaris Aerospace, Boulder, CO 80302, USA
- Advanced Pattern Analysis and Human Performance Group, Boulder, CO 80302, USA
| | | | | | - Cem Meydan
- Thorne Research, Inc., Summerville, SC 29483, USA
| | - Julian C. Schmidt
- Sovaris Aerospace, Boulder, CO 80302, USA
- Advanced Pattern Analysis and Human Performance Group, Boulder, CO 80302, USA
| | | | | | | | - Nathan D. Price
- Thorne Research, Inc., Summerville, SC 29483, USA
- Buck Institute for Research on Aging, Novato, CA 94945, USA
| | | | | | - Bodi Zhang
- Thorne Research, Inc., Summerville, SC 29483, USA
| | | | - Michael A. Schmidt
- Sovaris Aerospace, Boulder, CO 80302, USA
- Advanced Pattern Analysis and Human Performance Group, Boulder, CO 80302, USA
| |
Collapse
|
17
|
Avellaneda LL, Johnson DT, Gutierrez RM, Thompson L, Sturm SA, Sage KA, Houston RM, LaRue BL. Development of a novel five dye insertion/deletion (INDEL) panel for ancestry determination. Int J Legal Med 2024; 138:1233-1244. [PMID: 38369682 DOI: 10.1007/s00414-024-03196-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 02/09/2024] [Indexed: 02/20/2024]
Abstract
The use of genetic markers, specifically Short Tandem Repeats (STRs), has been a valuable tool for identifying persons of interest. However, the ability to analyze additional markers including Single Nucleotide Polymorphisms (SNPs) and Insertion/Deletion (INDELs) polymorphisms allows laboratories to explore other investigative leads. INDELs were chosen in this study because large panels can be differentiated by size, allowing them to be genotyped by capillary electrophoresis. Moreover, these markers do not produce stutter and are smaller in size than STRs, facilitating the recovery of genetic information from degraded samples. The INDEL Ancestry Informative Markers (AIMs) in this study were selected from the 1000 Genomes Project based on a fixation index (FST) greater than 0.50, high allele frequency divergence, and genetic distance. A total of 25 INDEL-AIMs were optimized and validated according to SWGDAM guidelines in a five-dye multiplex. To validate the panel, genotyping was performed on 155 unrelated individuals from four ancestral groups (Caucasian, African, Hispanic, and East Asian). Bayesian clustering and principal component analysis (PCA) were performed revealing clear separation among three groups, with some observed overlap within the Hispanic group. Additionally, the PCA results were compared against a training set of 793 samples from the 1000 Genomes Project, demonstrating consistent results. Validation studies showed the assay to be reproducible, tolerant to common inhibitors, robust with challenging casework type samples, and sensitive down to 125 pg. In conclusion, our results demonstrated the robustness and effectiveness of a 25 loci INDEL system for ancestry inference of four ancestries commonly found in the United States.
Collapse
Affiliation(s)
- Lucio L Avellaneda
- Department of Forensic Science, Sam Houston State University, 1003 Bowers Blvd., Huntsville, TX, 77340, USA.
| | - Damani T Johnson
- Department of Forensic Science, Sam Houston State University, 1003 Bowers Blvd., Huntsville, TX, 77340, USA
| | - Ryan M Gutierrez
- Department of Forensic Science, Sam Houston State University, 1003 Bowers Blvd., Huntsville, TX, 77340, USA
| | - Lindsey Thompson
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| | - Sarah A Sturm
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| | - Kelly A Sage
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| | - Rachel M Houston
- Department of Forensic Science, Sam Houston State University, 1003 Bowers Blvd., Huntsville, TX, 77340, USA
| | - Bobby L LaRue
- Department of Forensic Science, Sam Houston State University, 1003 Bowers Blvd., Huntsville, TX, 77340, USA
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| |
Collapse
|
18
|
Zhang H, Yang M, Zhang H, Ren Z, Wang Q, Liu Y, Jin X, Ji J, Feng Y, Cai C, Ran Q, Li C, Huang J. Forensic features and phylogenetic structure survey of four populations from southwest China via the autosomal insertion/deletion markers. Forensic Sci Res 2024; 9:owad052. [PMID: 38765700 PMCID: PMC11102079 DOI: 10.1093/fsr/owad052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 05/09/2023] [Indexed: 05/22/2024] Open
Abstract
Insertion/Deletion (InDel) polymorphisms, characterized by their smaller amplicons, reduced mutation rates, and compatibility with the prevalent capillary electrophoresis (CE) platforms in forensic laboratories, significantly contribute to the advancement and application of genetic analysis. Guizhou province in China serves as an important region for investigating the genetic structure, ethnic group origins, and human evolution. However, DNA data and the sampling of present-day populations are lacking, especially about the InDel markers. Here, we reported data on 47 autosomal InDels from 592 individuals from four populations in Guizhou (Han, Dong, Yi, and Chuanqing). Genotyping was performed with the AGCU InDel 50 kit to evaluate their utility for forensic purposes and to explore the population genetic structure. Our findings showed no significant deviations from Hardy-Weinberg and linkage equilibriums. The combined power of discrimination (CPD) and the combined power of exclusion (CPE) for each population demonstrated that the kit could be applied to forensic individual identification and was an effective supplement for parentage testing. Genetic structure analyses, including principal component analysis, multidimensional scaling, genetic distance calculation, STRUCTURE, and phylogenetic analysis, highlighted that the genetic proximity of the studied populations correlates with linguistic, geographical, and cultural factors. The observed genetic variances within four research populations were less pronounced than those discerned between populations across different regions. Notably, the Guizhou Han, Dong, and Chuanqing populations showed closer genetic affiliations with linguistically similar groups than the Guizhou Yi. These results underscore the potential of InDel markers in forensic science and provide insights into the genetic landscape and human evolution in multi-ethnic regions like Guizhou. Key points InDel markers show promise for forensic individual identification and parentage testing via the AGCU InDel 50 kit.Genetic analysis of Guizhou populations reveals correlations with linguistic, geographical, and cultural factors.Guizhou Han, Dong, and Chuanqing populations showed closer genetic affiliations with linguistically similar groups than the Guizhou Yi.
Collapse
Affiliation(s)
- Han Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
- Institute of Forensic Science, Fudan University, Shanghai, China
| | - Meiqing Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Hongling Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Zheng Ren
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Qiyan Wang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yubo Liu
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Xiaoye Jin
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Jingyan Ji
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yuhang Feng
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Changsheng Cai
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Qianchong Ran
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Chengtao Li
- Institute of Forensic Science, Fudan University, Shanghai, China
| | - Jiang Huang
- The Key Laboratory of Environmental Pollution Monitoring and Disease Control, Ministry of Education, Guizhou Medical University, Guiyang, China
| |
Collapse
|
19
|
Filoglu G, Duvenci A, Tas S, Karadayi H, Asicioglu F, Bulbul O. Development of a multiplex panel with 36 insertion/deletion markers (InDel) for individual identification. Forensic Sci Med Pathol 2024; 20:400-411. [PMID: 37266863 DOI: 10.1007/s12024-023-00652-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/10/2023] [Indexed: 06/03/2023]
Abstract
In recent years, the insertion/deletion (InDel) polymorphism has become a preferred genetic marker in forensic genetics due to its low mutation rates and small amplicon sizes. In this study, a 36-InDelplex identification panel, consisting of autosomal 34 InDel loci, 1 Y InDel locus, and amelogenin, was developed, and gene frequencies in the Turkish population were determined. The loci of the InDel panel with global minimum allele frequencies (MAF) ≥ 0.4 were selected from the 1000 Genomes Project Phase 3 data. The amplicon sizes of the loci were designed in the range of 69-252 bp. In the validation study of the developed panel, analysis threshold, dynamic range, sensitivity, stochastic threshold, inhibitor tolerance, and reproducibility parameters were studied by following the Scientific Working Group on DNA Analysis Methods (SWGDAM) guidelines. The sensitivity studies indicated that complete and reliable InDel profiles could be obtained with 0.25 ng of DNA. A population study was evaluated using 250 samples from Turkey. The mean observed heterozygosity ratio (Ho) of all loci was 0.48. The combined discrimination power (CPD) is 0.999999999990867 and the combined exclusion probability (CPE) was 0.9930. The population comparison was also made using Turkish and the five major populations from the 1000 Genomes Phase 3 populations' data (Africa, Europe, East Asia, South Asia, and America). In conclusion, the results showed that the 36-InDelplex panel is a reliable, sensitive, and accurate system that is suitable for human identification and population genetics purposes.
Collapse
Affiliation(s)
- Gonul Filoglu
- Institute of Forensic Science and Legal Medicine, Istanbul University- Cerrahpasa, 34500, Istanbul, Turkey
| | - Arzu Duvenci
- Institute of Forensic Science and Legal Medicine, Istanbul University- Cerrahpasa, 34500, Istanbul, Turkey
| | - Sebahat Tas
- Institute of Forensic Science and Legal Medicine, Istanbul University- Cerrahpasa, 34500, Istanbul, Turkey
| | - Huseyin Karadayi
- Institute of Forensic Science and Legal Medicine, Istanbul University- Cerrahpasa, 34500, Istanbul, Turkey
| | - Faruk Asicioglu
- Institute of Forensic Science and Legal Medicine, Istanbul University- Cerrahpasa, 34500, Istanbul, Turkey
| | - Ozlem Bulbul
- Institute of Forensic Science and Legal Medicine, Istanbul University- Cerrahpasa, 34500, Istanbul, Turkey.
| |
Collapse
|
20
|
Krishna Murthy SB, Yang S, Bheda S, Tomar N, Li H, Yaghoobi A, Khan A, Kiryluk K, Motelow JE, Ren N, Gharavi AG, Milo Rasouly H. Assisting the analysis of insertions and deletions using regional allele frequencies. Funct Integr Genomics 2024; 24:104. [PMID: 38764005 PMCID: PMC11414712 DOI: 10.1007/s10142-024-01358-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 04/02/2024] [Accepted: 04/12/2024] [Indexed: 05/21/2024]
Abstract
Accurate estimation of population allele frequency (AF) is crucial for gene discovery and genetic diagnostics. However, determining AF for frameshift-inducing small insertions and deletions (indels) faces challenges due to discrepancies in mapping and variant calling methods. Here, we propose an innovative approach to assess indel AF. We developed CRAFTS-indels (Calculating Regional Allele Frequency Targeting Small indels), an algorithm that combines AF of distinct indels within a given region and provides "regional AF" (rAF). We tested and validated CRAFTS-indels using three independent datasets: gnomAD v2 (n=125,748 samples), an internal dataset (IGM; n=39,367), and the UK BioBank (UKBB; n=469,835). By comparing rAF against standard AF, we identified rare indels with rAF exceeding standard AF (sAF≤10-4 and rAF>10-4) as "rAF-hi" indels. Notably, a high percentage of rare indels were "rAF-hi", with a higher proportion in gnomAD v2 (11-20%) and IGM (11-22%) compared to the UKBB (5-9% depending on the CRAFTS-indels' parameters). Analysis of the overlap of regions based on their rAF with low complexity regions and with ClinVar classification supported the pertinence of rAF. Using the internal dataset, we illustrated the utility of CRAFTS-indel in the analysis of de novo variants and the potential negative impact of rAF-hi indels in gene discovery. In summary, annotation of indels with cohort specific rAF can be used to handle some of the limitations of current annotation pipelines and facilitate detection of novel gene disease associations. CRAFTS-indels offers a user-friendly approach to providing rAF annotation. It can be integrated into public databases such as gnomAD, UKBB and used by ClinVar to revise indel classifications.
Collapse
Affiliation(s)
- Sarath Babu Krishna Murthy
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Sandy Yang
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Shiraz Bheda
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Nikita Tomar
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Haiyue Li
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Amir Yaghoobi
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA
| | - Joshua E Motelow
- Division of Critical Care and Hospital Medicine, Department of Pediatrics, Columbia University Irving Medical Center, New York-Presbyterian Morgan Stanley Children's Hospital, New York, New York, USA
| | - Nick Ren
- Institute for Genomic Medicine, Columbia University, New York, NY, USA
| | - Ali G Gharavi
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA
| | - Hila Milo Rasouly
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA.
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA.
| |
Collapse
|
21
|
Avellaneda LL, Johnson DT, Gutierrez R, Thompson L, Sage KA, Sturm SA, Houston RM, LaRue BL. Development of a novel five-dye panel for human identification insertion/deletion (INDEL) polymorphisms. J Forensic Sci 2024; 69:814-824. [PMID: 38291825 DOI: 10.1111/1556-4029.15475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 01/15/2024] [Accepted: 01/18/2024] [Indexed: 02/01/2024]
Abstract
DNA analysis of forensic case samples relies on short tandem repeats (STRs), a key component of the combined DNA index system (CODIS) used to identify individuals. However, limitations arise when dealing with challenging samples, prompting the exploration of alternative markers such as single nucleotide polymorphisms (SNPs) and insertion/deletion (INDELs) polymorphisms. Unlike SNPs, INDELs can be differentiated easily by size, making them compatible with electrophoresis methods. It is possible to design small INDEL amplicons (<200 bp) to enhance recovery from degraded samples. To this end, a set of INDEL Human Identification Markers (HID) was curated from the 1000 Genomes Project, employing criteria including a fixation index (FST) ≤ 0.06, minor allele frequency (MAF) >0.2, and high allele frequency divergence. A panel of 33 INDEL-HIDs was optimized and validated following the Scientific Working Group on DNA Analysis Methods (SWGDAM) guidelines, utilizing a five-dye multiplex electrophoresis system. A small sample set (n = 79 unrelated individuals) was genotyped to assess the assay's performance. The validation studies exhibited reproducibility, inhibition tolerance, ability to detect a two-person mixture from a 4:1 to 1:6 ratio, robustness with challenging samples, and sensitivity down to 125 pg of DNA. In summary, the 33-loci INDEL-HID panel exhibited robust recovery with low-template and degraded samples and proved effective for individualization within a small sample set.
Collapse
Affiliation(s)
- Lucio L Avellaneda
- Department of Forensic Science, Sam Houston State University, Huntsville, Texas, USA
| | - Damani T Johnson
- Department of Forensic Science, Sam Houston State University, Huntsville, Texas, USA
| | - Ryan Gutierrez
- Department of Forensic Science, Sam Houston State University, Huntsville, Texas, USA
| | - Lindsey Thompson
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Kelly A Sage
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Sarah A Sturm
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Rachel M Houston
- Department of Forensic Science, Sam Houston State University, Huntsville, Texas, USA
| | - Bobby L LaRue
- Department of Forensic Science, Sam Houston State University, Huntsville, Texas, USA
- Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, Texas, USA
| |
Collapse
|
22
|
Shadrina M, Kalay Ö, Demirkaya-Budak S, LeDuc CA, Chung WK, Turgut D, Budak G, Arslan E, Semenyuk V, Davis-Dusenbery B, Seidman CE, Yost HJ, Jain A, Gelb BD. Automated Identification of Germline de novo Mutations in Family Trios: A Consensus-Based Informatic Approach. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.08.584100. [PMID: 38559260 PMCID: PMC10979888 DOI: 10.1101/2024.03.08.584100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Accurate identification of germline de novo variants (DNVs) remains a challenging problem despite rapid advances in sequencing technologies as well as methods for the analysis of the data they generate, with putative solutions often involving ad hoc filters and visual inspection of identified variants. Here, we present a purely informatic method for the identification of DNVs by analyzing short-read genome sequencing data from proband-parent trios. Our method evaluates variant calls generated by three genome sequence analysis pipelines utilizing different algorithms-GATK HaplotypeCaller, DeepTrio and Velsera GRAF-exploring the assumption that a requirement of consensus can serve as an effective filter for high-quality DNVs. We assessed the efficacy of our method by testing DNVs identified using a previously established, highly accurate classification procedure that partially relied on manual inspection and used Sanger sequencing to validate a DNV subset comprising less confident calls. The results show that our method is highly precise and that applying a force-calling procedure to putative variants further removes false-positive calls, increasing precision of the workflow to 99.6%. Our method also identified novel DNVs, 87% of which were validated, indicating it offers a higher recall rate without compromising accuracy. We have implemented this method as an automated bioinformatics workflow suitable for large-scale analyses without need for manual intervention.
Collapse
Affiliation(s)
- Mariya Shadrina
- Mindich Child Health and Development Institute and the Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York, NY, USA
| | - Özem Kalay
- Velsera Inc, 529 Main St, Suite 6610, Charlestown, MA, USA
| | | | - Charles A. LeDuc
- Department of Pediatrics, Columbia University, New York, NY, USA
| | - Wendy K. Chung
- Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Deniz Turgut
- Velsera Inc, 529 Main St, Suite 6610, Charlestown, MA, USA
| | - Gungor Budak
- Velsera Inc, 529 Main St, Suite 6610, Charlestown, MA, USA
| | - Elif Arslan
- Velsera Inc, 529 Main St, Suite 6610, Charlestown, MA, USA
| | | | | | - Christine E. Seidman
- Division of Cardiovascular Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - H. Joseph Yost
- Molecular Medicine Program, University of Utah, Salt Lake City, UT, USA
| | - Amit Jain
- Velsera Inc, 529 Main St, Suite 6610, Charlestown, MA, USA
| | - Bruce D. Gelb
- Mindich Child Health and Development Institute and the Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York, NY, USA
- Department of Pediatrics, Icahn School of Medicine, New York, NY, USA
| |
Collapse
|
23
|
Yao Y, Ji Q, Li Z, Zhou Z, Qian J, Ji L, Sun K, Xie J. Development of a 39 MM-InDel multiplex assay for the forensic application. Electrophoresis 2024; 45:505-516. [PMID: 38037287 DOI: 10.1002/elps.202300181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 10/23/2023] [Accepted: 11/18/2023] [Indexed: 12/02/2023]
Abstract
Insertion/deletion polymorphisms (InDels) are a category of highly prevalent markers in the human genome, characterized by their distinctive attributes, including short amplicon sizes and low mutation rates, which have shown great potential in forensic applications. Multi-allelic InDel and multi-InDel markers, collectively abbreviated as MM-InDels, were developed to enhance polymorphism by the introduction of novel alleles. Nevertheless, the relatively low mutation rates of InDels, coupled with the founder effect, result in distinct allele frequency distributions among populations. The divergent characteristics of InDels in different populations also pose challenges to the establishment of universally efficient InDel multiplex assays. To enhance the system efficiency of the InDel assay and its applicability across diverse populations, 39 MM-InDels with high polymorphism in five different ancestry superpopulations were selected from the 1000 Genomes Project dataset and combined with an amelogenin gender marker to construct a multiplex assay (named MMIDplex). The combined power of discrimination and the cumulative probability of exclusion of 39 MM-InDels were 1 - 1.3 × 10-23 and 1 - 9.83 × 10-6 in the Chinese Han population, and larger than 1-10-19 and 1-10-4 in the reference populations, relatively. These results demonstrate that the MMIDplex assay has the potential to obtain sufficient power for individual identification and paternity test in global populations.
Collapse
Affiliation(s)
- Yining Yao
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| | - Qiqi Ji
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| | - Zhimin Li
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| | - Zhihan Zhou
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| | - Jinglei Qian
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| | - Lili Ji
- Department of Hematology, Zhongshan Hospital, Fudan University, Shanghai, P. R. China
| | - Kuan Sun
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| | - Jianhui Xie
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai, P. R. China
| |
Collapse
|
24
|
Zhao M, Cui W, Xu H, Zhang Y, Zhu B. Unveiling the genetic landscape of high-altitude adaptive ethnic groups with polymorphic markers: Implications of comprehensive forensic appraisals and population genetic investigations. Heliyon 2023; 9:e21229. [PMID: 38027587 PMCID: PMC10656254 DOI: 10.1016/j.heliyon.2023.e21229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 09/11/2023] [Accepted: 10/18/2023] [Indexed: 12/01/2023] Open
Abstract
Background Deletion/insertion polymorphisms (DIPs), a novel class of biomarker, have been widely utilized in forensic areas for individual identification, paternity tests, and ancestral origin inference due to its applicability to degraded samples and low mutation rates. Despite the availability of a well-established commercial kit, the Investigator® DIPplex kit (Qiagen), certain loci exhibit limited levels of polymorphisms in East Asian populations, particularly in Chinese populations. Objective This dissertation seeks to undertake a comprehensive evaluation about the forensic efficiency of a self-developed multiplex amplification system in high-altitude adaptive ethnic groups of China. Healthy unrelated Tibetan individuals residing in Tibet Autonomous Region and Qinghai Province were genotyped using previously reported 43 deletion/insertion polymorphism loci. Forensic statistical analyses including allele frequencies and forensic parameters were conducted in the two Tibetan groups, and the genetic relatedness of the studied groups with reference populations from the 1000 Genomes Project Phase 3 were investigated. Results Forensic statistical results showed that the polymorphism information content values of the 43 deletion/insertion polymorphism loci in the two Tibetan groups exceeded 0.35. Moreover, the combined power of discrimination using the 43 deletion/insertion polymorphism loci was calculated to be 0.9999999999999999984 in the Qinghai Tibetan group and 0.9999999999999999921 in the Tibet Tibetan group. The cumulative power of exclusion using the 43 deletion/insertion polymorphism loci was calculated to be 0.999782512 in the Qinghai Tibetan group and 0.999886205 in the Tibet Tibetan group. Analysis of population genetics demonstrated that the two studied Tibetan groups shared close genetic relationships with East Asia populations. Conclusion The set of 43 deletion/insertion polymorphism loci exhibited remarkable forensic efficacy, rendering it a promising tool for forensic practice. Population genetic analyses indicated that the two Tibetan groups had closer genetic affinities to East Asian populations.
Collapse
Affiliation(s)
- Ming Zhao
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515, Guangzhou, Guangdong, China
| | - Wei Cui
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515, Guangzhou, Guangdong, China
| | - Hui Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515, Guangzhou, Guangdong, China
| | - Yunying Zhang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515, Guangzhou, Guangdong, China
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515, Guangzhou, Guangdong, China
| |
Collapse
|
25
|
Ziyatdinov A, Torres J, Alegre-Díaz J, Backman J, Mbatchou J, Turner M, Gaynor SM, Joseph T, Zou Y, Liu D, Wade R, Staples J, Panea R, Popov A, Bai X, Balasubramanian S, Habegger L, Lanche R, Lopez A, Maxwell E, Jones M, García-Ortiz H, Ramirez-Reyes R, Santacruz-Benítez R, Nag A, Smith KR, Damask A, Lin N, Paulding C, Reppell M, Zöllner S, Jorgenson E, Salerno W, Petrovski S, Overton J, Reid J, Thornton TA, Abecasis G, Berumen J, Orozco-Orozco L, Collins R, Baras A, Hill MR, Emberson JR, Marchini J, Kuri-Morales P, Tapia-Conyer R. Genotyping, sequencing and analysis of 140,000 adults from Mexico City. Nature 2023; 622:784-793. [PMID: 37821707 PMCID: PMC10600010 DOI: 10.1038/s41586-023-06595-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 08/31/2023] [Indexed: 10/13/2023]
Abstract
The Mexico City Prospective Study is a prospective cohort of more than 150,000 adults recruited two decades ago from the urban districts of Coyoacán and Iztapalapa in Mexico City1. Here we generated genotype and exome-sequencing data for all individuals and whole-genome sequencing data for 9,950 selected individuals. We describe high levels of relatedness and substantial heterogeneity in ancestry composition across individuals. Most sequenced individuals had admixed Indigenous American, European and African ancestry, with extensive admixture from Indigenous populations in central, southern and southeastern Mexico. Indigenous Mexican segments of the genome had lower levels of coding variation but an excess of homozygous loss-of-function variants compared with segments of African and European origin. We estimated ancestry-specific allele frequencies at 142 million genomic variants, with an effective sample size of 91,856 for Indigenous Mexican ancestry at exome variants, all available through a public browser. Using whole-genome sequencing, we developed an imputation reference panel that outperforms existing panels at common variants in individuals with high proportions of central, southern and southeastern Indigenous Mexican ancestry. Our work illustrates the value of genetic studies in diverse populations and provides foundational imputation and allele frequency resources for future genetic studies in Mexico and in the United States, where the Hispanic/Latino population is predominantly of Mexican descent.
Collapse
Affiliation(s)
| | - Jason Torres
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK.
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK.
| | - Jesús Alegre-Díaz
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | | | | | - Michael Turner
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- Oxford Kidney Unit, Churchill Hospital, Oxford, UK
| | | | | | - Yuxin Zou
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Daren Liu
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Rachel Wade
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | | | | | - Alex Popov
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | - Alex Lopez
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | - Raul Ramirez-Reyes
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | - Rogelio Santacruz-Benítez
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | - Abhishek Nag
- Centre for Genomics Research, Discovery Sciences, Research and Development Biopharmaceuticals, AstraZeneca, Cambridge, UK
| | - Katherine R Smith
- Centre for Genomics Research, Discovery Sciences, Research and Development Biopharmaceuticals, AstraZeneca, Cambridge, UK
| | - Amy Damask
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Nan Lin
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | - Sebastian Zöllner
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | | | | | - Slavé Petrovski
- Centre for Genomics Research, Discovery Sciences, Research and Development Biopharmaceuticals, AstraZeneca, Cambridge, UK
| | | | | | | | | | - Jaime Berumen
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM), Mexico City, Mexico
| | | | - Rory Collins
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Aris Baras
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Michael R Hill
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Jonathan R Emberson
- Clinical Trial Service Unit and Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | | | - Pablo Kuri-Morales
- Instituto Tecnológico y de Estudios Superiores de Monterrey, Monterrey, Mexico
- Faculty of Medicine, National Autonomous University of Mexico, Mexico City, Mexico
| | - Roberto Tapia-Conyer
- Faculty of Medicine, National Autonomous University of Mexico, Mexico City, Mexico.
| |
Collapse
|
26
|
Shim J, Hong SY, Han JH, Yu Y, Yoo E, Sung J, Chin JH, Lee ON. A Genomic Evaluation of Six Selected Inbred Lines of the Naturalized Plants of Milk Thistle ( Silybum marianum L. Gaertn.) in Korea. PLANTS (BASEL, SWITZERLAND) 2023; 12:2702. [PMID: 37514316 PMCID: PMC10384142 DOI: 10.3390/plants12142702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 07/18/2023] [Accepted: 07/19/2023] [Indexed: 07/30/2023]
Abstract
Milk thistle (Silybum marianum) belongs to the Asteraceae family and is a medicinal plant native to the Mediterranean Basin. Silymarin in achene is a widely used herbal product for chronic liver disease. There is growing interest in natural medicine using milk thistle in Korea, but the raw material completely relies on imports. Despite its economic importance, phenotypic evaluations of native resources of milk thistle in Korea have not been carried out. In addition, genomic research and molecular marker development are very limited in milk thistle. In this study, we evaluated 220 milk thistle resources consisting of 172 accessions collected from the domestic market, and 48 accessions isolated from 6 accessions distributed by the National Agrobiodiversity Center in Korea. Six plant characteristics (height, seed weight, number of flowers, seed weight per flower, spine length, and color at harvest) were measured, and six samples (M01-M06) were selected to represent the genetic diversity of the population for genomic research. To develop PCR-based and co-dominant insertion/deletion (InDel) markers, we performed genome-wide InDel detection by comparing the whole-genome resequencing data of the six selected accessions with the reference genome sequence (GCA_001541825). As a result, 177 InDel markers with high distinguishability and reproducibility were selected from the 30,845 InDel variants. Unknowingly imported alien plant resources could easily be genetically mixed, and jeopardized seed purity can cause continuous difficulties in the development of high value-added agricultural platforms utilizing natural products. The selected plant materials and 177 validated InDel markers developed via whole-genome resequencing analysis could be valuable resources for breeding, conservation, and ecological studies of natives to Korea, along with acceleration of Silybum marianum industrialization.
Collapse
Affiliation(s)
- Jeehyoung Shim
- Department of Food and Nutrition, Chung Ang University, Seodong-daero 4726, Daedeok-myeon, Anseong 17546, Republic of Korea
- EL&I Co., Ltd., Hwaseong 18278, Republic of Korea
| | - Su Young Hong
- Genomics Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeonju 54874, Republic of Korea
| | - Jae-Hyuk Han
- Food Crops Molecular Breeding Laboratory, Department of Integrative Biological Sciences and Industry, Sejong University, Seoul 05006, Republic of Korea
| | - Yeisoo Yu
- DNACare Co., Ltd., Seoul 06730, Republic of Korea
| | - Eunae Yoo
- National Agrobiodiversity Center, National Institute of Agricultural Sciences, Rural Development Administration, Jeonju 54874, Republic of Korea
| | - Jungsook Sung
- National Agrobiodiversity Center, National Institute of Agricultural Sciences, Rural Development Administration, Jeonju 54874, Republic of Korea
| | - Joong Hyoun Chin
- Food Crops Molecular Breeding Laboratory, Department of Integrative Biological Sciences and Industry, Sejong University, Seoul 05006, Republic of Korea
- Convergence Research Center for Natural Products, Sejong University, Seoul 05006, Republic of Korea
| | - O New Lee
- Department of Bioindustry and Bioresource Engineering, Sejong University, Seoul 05006, Republic of Korea
| |
Collapse
|
27
|
Jabeen S, Saif R, Haq R, Hayat A, Naz S. Whole-genome sequencing and variant discovery of Citrus reticulata "Kinnow" from Pakistan. Funct Integr Genomics 2023; 23:227. [PMID: 37422603 DOI: 10.1007/s10142-023-01153-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 07/10/2023]
Abstract
Citrus is a source of nutritional and medicinal advantages, cultivated worldwide with major groups of sweet oranges, mandarins, grapefruits, kumquats, lemons and limes. Pakistan produces all major citrus groups with mandarin (Citrus reticulata) being the prominent group that includes local commercial cultivars Feutral's Early, Dancy, Honey, and Kinnow. The present study designed to understand the genetic architecture of this unique variety of Citrus reticulata 'Kinnow.' The whole-genome resequencing and variant calling was performed to map the genomic variability that might be responsible for its particular characteristics like taste, seedlessness, juice content, thickness of peel, and shelf-life. A total of 139,436,350 raw sequence reads were generated with 20.9 Gb data in Fastq format having 98% effectiveness and 0.2% base call error rate. Overall, 3,503,033 SNPs, 176,949 MNPs, 323,287 INS, and 333,083 DEL were identified using the GATK4 variant calling pipeline against Citrus clementina. Furthermore, g:Profiler was applied for annotating the newly found variants, harbor genes/transcripts and their involved pathways. A total of 73,864 transcripts harbors 4,336,352 variants, most of the observed variants were predicted in non-coding regions and 1009 transcripts were found well annotated by different databases. Out of total aforementioned transcripts, 588 involved in biological processes, 234 in molecular functions and 167 transcripts in cellular components. In a nutshell, 18,153 high impact variants and 216 genic variants found in the current study, which may be used after its functional validation for marker-assisted breeding programs of "Kinnow" to propagate its valued traits for the improvement of contemporary citrus varieties in the region.
Collapse
Affiliation(s)
- Sadia Jabeen
- Department of Biotechnology, Lahore College for Women University, Lahore, Pakistan
| | - Rashid Saif
- Department of Biotechnology, Qarshi University, Lahore, Pakistan
- Decode Genomics, Punjab University Employees Housing Scheme, Lahore, Pakistan
| | - Rukhama Haq
- Department of Biotechnology, Lahore College for Women University, Lahore, Pakistan
| | - Akbar Hayat
- Citrus Research Institute, Sargodha, Pakistan
| | - Shagufta Naz
- Department of Biotechnology, Lahore College for Women University, Lahore, Pakistan.
| |
Collapse
|
28
|
Mei S, Yi S, Cai M, Zhang Y, Cui W, Xu H, Lan Q, Zhu B. Exploring the forensic effectiveness and population genetic differentiation by self-constructed 41 multi-InDel panel in Yunnan Zhuang group. Gene 2023; 860:147180. [PMID: 36669579 DOI: 10.1016/j.gene.2023.147180] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 12/21/2022] [Accepted: 01/04/2023] [Indexed: 01/19/2023]
Abstract
Yunnan is one of the main residences of the Zhuang group which is one of the 55 ethnic minorities in China. At present, there are relatively few researches on population genetics and forensic science of the Yunnan Zhuang group. Therefore, this study used a self-constructed panel containing 41 multi-InDel markers to analyze the genetic polymorphisms of 173 individuals from Yunnan Zhuang group. The results indicated that these 41 multi-InDels in Yunnan Zhuang group were highly polymorphic markers expect for three markers. The cumulative match probability and combined exclusion probability values of the 40 multi-InDels (MI38 marker was excluded) were 8.0671E-26 and 0.9999995959, respectively. In addition, population genetic analyses were performed on genotyping data of 41 multi-InDel markers among the Yunnan Zhuang and 26 reference populations, revealing that the Yunnan Zhuang group was genetically close to the five populations in East Asia. According to the STRUCTURE analysis, the Yunnan Zhuang group presented similar ancestral compositions to the five populations from East Asia, and when the K value was three, the five intercontinental populations showed their different genetic structures. In conclusion, the 41 multi-InDel markers could be used as an effective tool for individual identification and paternity testing of the Zhuang group in Yunnan province, as well as for their ancestry information inference studies.
Collapse
Affiliation(s)
- Shuyan Mei
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China
| | - Shaohua Yi
- Department of Forensic Medicine, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Meiming Cai
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China
| | - Yunying Zhang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China
| | - Wei Cui
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China; Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an 710004, China
| | - Hui Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China
| | - Qiong Lan
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China.
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital; Southern Medical University, Guangzhou 510515, China.
| |
Collapse
|
29
|
Lemire BD, Uppuluri P. Coding Sequence Insertions in Fungal Genomes are Intrinsically Disordered and can Impart Functionally-Important Properties on the Host Protein. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.06.535715. [PMID: 37066283 PMCID: PMC10104129 DOI: 10.1101/2023.04.06.535715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Insertion and deletion mutations (indels) are important mechanisms of generating protein diversity. Indels in coding sequences are under considerable selective pressure to maintain reading frames and to preserve protein function, but once generated, indels provide raw material for the acquisition of new protein properties and functions. We reported recently that coding sequence insertions in the Candida albicans NDU1 protein, a mitochondrial protein involved in the assembly of the NADH:ubiquinone oxidoreductase are imperative for respiration, biofilm formation and pathogenesis. NDU1 inserts are specific to CTG-clade fungi, absent in human ortholog and successfully harnessed as drug targets. Here, we present the first comprehensive report investigating indels and clade-defining insertions (CDIs) in fungal proteomes. We investigated 80 ascomycete proteomes encompassing CTG clade species, the Saccharomycetaceae family, the Aspergillaceae family and the Herpotrichiellaceae (black yeasts) family. We identified over 30,000 insertions, 4,000 CDIs and 2,500 clade-defining deletions (CDDs). Insert sizes range from 1 to over 1,000 residues in length, while maximum deletion length is 19 residues. Inserts are strikingly over-represented in protein kinases, and excluded from structural domains and transmembrane segments. Inserts are predicted to be highly disordered. The amino acid compositions of the inserts are highly depleted in hydrophobic residues and enriched in polar residues. An indel in the Saccharomyces cerevisiae Sth1 protein, the catalytic subunit of the RSC (Remodel the Structure of Chromatin) complex is predicted to be disordered until it forms a ß-strand upon interaction. This interaction performs a vital role in RSC-mediated transcriptional regulation, thereby expanding protein function.
Collapse
Affiliation(s)
- Bernard D. Lemire
- Department of Biochemistry, University of Alberta, Edmonton, Canada (retired)
| | - Priya Uppuluri
- Institute for Infection and Immunity, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, USA
- David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| |
Collapse
|
30
|
Valverde-Hernández JC, Flores-Cruz A, Chavarría-Soley G, Silva de la Fuente S, Campos-Sánchez R. Frequencies of variants in genes associated with dyslipidemias identified in Costa Rican genomes. Front Genet 2023; 14:1114774. [PMID: 37065472 PMCID: PMC10098023 DOI: 10.3389/fgene.2023.1114774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 03/14/2023] [Indexed: 04/18/2023] Open
Abstract
Dyslipidemias are risk factors in diseases of significant importance to public health, such as atherosclerosis, a condition that contributes to the development of cardiovascular disease. Unhealthy lifestyles, the pre-existence of diseases, and the accumulation of genetic variants in some loci contribute to the development of dyslipidemia. The genetic causality behind these diseases has been studied primarily on populations with extensive European ancestry. Only some studies have explored this topic in Costa Rica, and none have focused on identifying variants that can alter blood lipid levels and quantifying their frequency. To fill this gap, this study focused on identifying variants in 69 genes involved in lipid metabolism using genomes from two studies in Costa Rica. We contrasted the allelic frequencies with those of groups reported in the 1000 Genomes Project and gnomAD and identified potential variants that could influence the development of dyslipidemias. In total, we detected 2,600 variants in the evaluated regions. However, after various filtering steps, we obtained 18 variants that have the potential to alter the function of 16 genes, nine variants have pharmacogenomic or protective implications, eight have high risk in Variant Effect Predictor, and eight were found in other Latin American genetic studies of lipid alterations and the development of dyslipidemia. Some of these variants have been linked to changes in blood lipid levels in other global studies and databases. In future studies, we propose to confirm at least 40 variants of interest from 23 genes in a larger cohort from Costa Rica and Latin American populations to determine their relevance regarding the genetic burden for dyslipidemia. Additionally, more complex studies should arise that include diverse clinical, environmental, and genetic data from patients and controls and functional validation of the variants.
Collapse
Affiliation(s)
| | - Andrés Flores-Cruz
- Centro de Investigación en Biología Celular y Molecular, University of Costa Rica, San José, Costa Rica
| | - Gabriela Chavarría-Soley
- Centro de Investigación en Biología Celular y Molecular, University of Costa Rica, San José, Costa Rica
- Escuela de Biología, University of Costa Rica, San José, Costa Rica
| | - Sandra Silva de la Fuente
- Centro de Investigación en Biología Celular y Molecular, University of Costa Rica, San José, Costa Rica
| | - Rebeca Campos-Sánchez
- Centro de Investigación en Biología Celular y Molecular, University of Costa Rica, San José, Costa Rica
| |
Collapse
|
31
|
Lan Q, Cai M, Lei F, Shen C, Zhu B. Systematically exploring the performance of a self-developed Multi-InDel system in forensic identification, ancestry inference and genetic structure analysis of Chinese Manchu and Mongolian groups. Forensic Sci Int 2023; 346:111637. [PMID: 36934684 DOI: 10.1016/j.forsciint.2023.111637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Revised: 02/25/2023] [Accepted: 03/01/2023] [Indexed: 03/14/2023]
Abstract
The insertion/deletion (InDel) polymorphism has promising applications in forensic DNA analysis. However, the insufficient forensic efficiencies of the present InDel-based systems restrict their applications in parentage testing, due to the lower genetic polymorphism of the biallelic InDel locus and the limited number of InDel loci in a multiplex amplification system. Here, we introduced an in-house developed system which contained 41 polymorphic Multi-InDel markers (equivalent to 82 InDels in total), to serve as an efficient and reliable tool for different forensic applications in the Manchu and Mongolian groups. We demonstrated that the new system exhibited potential efficiencies for personal identification, parentage testing, two-person DNA mixture interpretation and ancestry inference of intercontinental populations. Meanwhile, we explored the genetic backgrounds of the Manchu and Mongolian groups by conducting a series of population genetic analyses. We showed that the Manchu and Mongolian groups shared closer genetic relationships with East Asian populations, especially Han Chinese populations in northern China. Moreover, more similar genetic compositions were detected between the Manchu group and the northern Han populations in this study, suggesting that the Manchu group had higher genetic affinities with northern Han populations than the Mongolian group. Overall. this study provided the necessary evidence that these Multi-InDel genetic markers could play an important role in forensic applications.
Collapse
Affiliation(s)
- Qiong Lan
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515 Guangzhou, China; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Meiming Cai
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515 Guangzhou, China
| | - Fanzhang Lei
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515 Guangzhou, China
| | - Chunmei Shen
- Department of Laboratory Medicine, Nanfang Hospital, Southern Medical University, Guangzhou, China.
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 510515 Guangzhou, China; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China; Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, China.
| |
Collapse
|
32
|
Ozyer S, Uslu ZS, Unsal Sapan T, Bulbul O, Filoglu G. The development of the novel 22 X- indel multiplex system for forensic genetics. Leg Med (Tokyo) 2023; 62:102224. [PMID: 36898282 DOI: 10.1016/j.legalmed.2023.102224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 01/28/2023] [Accepted: 02/23/2023] [Indexed: 03/06/2023]
Abstract
InDel (Insertions/deletion) markers have been used as an alternative, or as a complementary marker system, to STR markers in human identification due to their advantages such as low mutation rates, no stutter, and potential small amplicon sizes. In forensic sciences, sex chromosomes are widely used in forensic genetics for specific cases. For example, the relationship between father and daughter can be determined by using X-InDels. In this study, we developed a novel 22 X-InDel multiplex system that was identified by two separate assays with fluorescence amplification and capillary electrophoresis detection technology. We chose 22 X-InDel markers based on the following criteria: mean heterozygosity over 30% in Europeans; minimum of 250 Kb differences between each InDel loci; and an amplicon length that was less than 300 bp. We performed an optimization and validation study of 22 X-InDel systems under the following parameters: analytical threshold, sensitivity, precision and accuracy, stochastic threshold, repeatability, and reproducibility. We evaluated the allele frequency of this multiplex system in the Turkish population, and then the population comparisons were carried out on data from 1000 Genome populations (Europe, Africa, America, South Asia, and East Asia). The sensitivity test showed a complete genotyping profile with DNA concentrations as low as 0.5 ng. The heterozygosity ratio of 22 X-InDel loci was determined as 0.4690 and the discrimination power was defined as 0.99. The results show that the new 22 X-InDel multiplex system provides high polymorphism information, and it is a reproducible, accurate, sensitive, and robust system that could be used as an additional tool for kinship testing.
Collapse
Affiliation(s)
- Selen Ozyer
- Istanbul University- Cerrahpasa, Institute of Forensic Science and Legal Medicine, Istanbul, Turkey; Dogus University, Advanced Vocational School, Program of Autopsy Assistant, Istanbul, Turkey
| | - Zulal Seval Uslu
- Istanbul University- Cerrahpasa, Institute of Forensic Science and Legal Medicine, Istanbul, Turkey
| | - Tugba Unsal Sapan
- Uskudar University, Institute of Addiction and Forensic Science, Istanbul, Turkey
| | - Ozlem Bulbul
- Istanbul University- Cerrahpasa, Institute of Forensic Science and Legal Medicine, Istanbul, Turkey
| | - Gonul Filoglu
- Istanbul University- Cerrahpasa, Institute of Forensic Science and Legal Medicine, Istanbul, Turkey
| |
Collapse
|
33
|
Bordbar M, Saadat M. Association between 15 insertion/deletion genetic polymorphisms and risk of schizophrenia using pooled samples. EXCLI JOURNAL 2023; 22:310-314. [PMID: 37223083 PMCID: PMC10201014 DOI: 10.17179/excli2022-5734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Accepted: 02/21/2023] [Indexed: 05/25/2023]
Abstract
Schizophrenia is a psychiatric syndrome that affects approximately 1 % of the world population and is among the top 10 reasons for disability. In this case-control study, we investigated the association between 15 insertion/deletion (Indel) polymorphisms and schizophrenia risk using pooled samples. In the present case-control study, 361 individuals with schizophrenia and 360 healthy individuals were included in the study. We examined the insertion/deletion polymorphisms in APOB, ADRA2B, PDCD6IP, LRPAP1, TLR2, DHFR, VEGF, HLA-G, TPA, DBH, UCP2, FADS2, MDM2, TP53 and SLC6A4 genes. Our results revealed that the Del allele of the HLA-G 14bp Indel polymorphism increased the risk of schizophrenia (OR=1.23, 95 % CI=1.01-1.52, p=0.045) and the Alu- allele of the TPA Alu+/Alu- polymorphism negatively associated with the schizophrenia risk (OR=0.67, 95 % CI=0.54-0.82, p<0.001).
Collapse
Affiliation(s)
- Maedeh Bordbar
- Department of Biology, College of Sciences, Shiraz University, Shiraz 71467-13565, Iran
| | - Mostafa Saadat
- Department of Biology, College of Sciences, Shiraz University, Shiraz 71467-13565, Iran
| |
Collapse
|
34
|
Saleh AA, Xue L, Zhao Y. Screening Indels from the whole genome to identify the candidates and their association with economic traits in several goat breeds. Funct Integr Genomics 2023; 23:58. [PMID: 36757519 DOI: 10.1007/s10142-023-00981-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 01/20/2023] [Accepted: 01/25/2023] [Indexed: 02/10/2023]
Abstract
In the present study, the re-sequencing of our previous whole-genome sequencing (WGS) for selected individuals of Dazu-black goat (DBG) and Inner-Mongolia Cashmere goat (IMCG) breeds was used to detect and compare the differentiation in Indels depending on the reference genome of goat. Then, three selected candidate Indels rs668795676, rs657996810, and rs669452874 of the three genes SUFU, SYCP2L and GLIPR1L1, respectively, have been chosen, based on the results of prior GWAS across the genome, and examined for their association with body weight and dimensions (body height, body length, heart girth, chest width, cannon circumference, and chest depth) by kompetitive allele specific PCR assay for 342 goats from the three studied goat breeds (DBG, n = 203; ♂99, ♀104), IMCG (n = 65; 15♂, 50♀), and Hechuan white goat (HWG, n = 74; 34♂, 40♀) breeds. The analysis of 192.747 Gb WGS revealed an average 334,151 Indels in the whole genome of DBG and IMCG breeds. Chromosome 1 had a maximum number of mutations (Indels) of 58,497 and 55,527 for IMCG and DBG, respectively, while chromosome 25 had the least number of mutations of 15,680 and 16,103 for IMCG and DBG, respectively. The majority of Indels were either Ins or Del of short fragments of 1-5 bp, which covered 79.06 and 71.78% of the total number of Indels mutations in IMCG and DBG, respectively. Comparing the differences of Indels between the studied goat breeds revealed 100 and 110 unique Indels for IMCG and DBG, respectively. The Indels loci in the intron region were unique for both studied goat breeds which were related to 30 and 38 candidate genes in IMCG and DBG, respectively, including SUFU, SYCP2L, and GLIPR1L1 genes. Concerning rs669452874 locus, body height and body length of Del/Del genotype in DBG were significantly higher (P < 0.05) than that of Ins/Del genotype, while body height and body length of Del/Del genotype in IMCG were significantly higher (P < 0.01) than those of Ins/Ins genotype, whereas body height and body length and heart girth of Del/Del genotype in HWG were significantly higher (P < 0.01) than those of the Ins/Del and Ins/Ins genotypes. Thus, Del/Del genotype of rs669452874 locus can be used as a candidate molecular marker related to the body dimensions in the studied goat breeds.
Collapse
Affiliation(s)
- Ahmed A Saleh
- Animal and Fish Production Department, Faculty of Agriculture (Alshatby), Alexandria University, Alexandria City, 11865, Egypt.
| | - Lei Xue
- College of Animal Science and Technology, Chongqing Key Laboratory of Forage & Herbivore, Chongqing Engineering Research Centre for Herbivores Resource Protection and Utilization, Southwest University, Chongqing, 400715, People's Republic of China
| | - Yongju Zhao
- College of Animal Science and Technology, Chongqing Key Laboratory of Forage & Herbivore, Chongqing Engineering Research Centre for Herbivores Resource Protection and Utilization, Southwest University, Chongqing, 400715, People's Republic of China
| |
Collapse
|
35
|
Liu Z, Simayijiang H, Wang Q, Yang J, Sun H, Wu R, Yan J. DNA and protein analyses of hair in forensic genetics. Int J Legal Med 2023; 137:613-633. [PMID: 36732435 DOI: 10.1007/s00414-023-02955-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/20/2023] [Indexed: 02/04/2023]
Abstract
Hair is one of the most common pieces of biological evidence found at a crime scene and plays an essential role in forensic investigation. Hairs, especially non-follicular hairs, are usually found at various crime scenes, either by natural shedding or by forcible shedding. However, the genetic material in hairs is usually highly degraded, which makes forensic analysis difficult. As a result, the value of hair has not been fully exploited in forensic investigations and trials. In recent years, with advances in molecular biology, forensic analysis of hair has achieved remarkable strides and provided crucial clues in numerous cases. This article reviews recent developments in DNA and protein analysis of hair and attempts to provide a comprehensive solution to improve forensic hair analysis.
Collapse
Affiliation(s)
- Zhiyong Liu
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Halimureti Simayijiang
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi, 030600, People's Republic of China
| | - Qiangwei Wang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Jingyi Yang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Hongyu Sun
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China.,Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Riga Wu
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China. .,Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China.
| | - Jiangwei Yan
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi, 030600, People's Republic of China.
| |
Collapse
|
36
|
Fan X, Pan H, Tian A, Chung WK, Shen Y. SHINE: protein language model-based pathogenicity prediction for short inframe insertion and deletion variants. Brief Bioinform 2023; 24:bbac584. [PMID: 36575831 PMCID: PMC9851320 DOI: 10.1093/bib/bbac584] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 11/04/2022] [Accepted: 11/29/2022] [Indexed: 12/29/2022] Open
Abstract
Accurate variant pathogenicity predictions are important in genetic studies of human diseases. Inframe insertion and deletion variants (indels) alter protein sequence and length, but not as deleterious as frameshift indels. Inframe indel Interpretation is challenging due to limitations in the available number of known pathogenic variants for training. Existing prediction methods largely use manually encoded features including conservation, protein structure and function, and allele frequency to infer variant pathogenicity. Recent advances in deep learning modeling of protein sequences and structures provide an opportunity to improve the representation of salient features based on large numbers of protein sequences. We developed a new pathogenicity predictor for SHort Inframe iNsertion and dEletion (SHINE). SHINE uses pretrained protein language models to construct a latent representation of an indel and its protein context from protein sequences and multiple protein sequence alignments, and feeds the latent representation into supervised machine learning models for pathogenicity prediction. We curated training data from ClinVar and gnomAD, and created two test datasets from different sources. SHINE achieved better prediction performance than existing methods for both deletion and insertion variants in these two test datasets. Our work suggests that unsupervised protein language models can provide valuable information about proteins, and new methods based on these models can improve variant interpretation in genetic analyses.
Collapse
Affiliation(s)
- Xiao Fan
- Department of Pediatrics, Columbia University, New York, NY, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Hongbing Pan
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Alan Tian
- Lynbrook High School, San Jose, CA, USA
| | - Wendy K Chung
- Department of Pediatrics, Columbia University, New York, NY, USA
- Department of Medicine, Columbia University, New York, NY, USA
| | - Yufeng Shen
- Department of Systems Biology, Columbia University, New York, NY, USA
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
- JP Sulzberger Columbia Genome Center, Columbia University, New York, NY, USA
| |
Collapse
|
37
|
Guanglin H, Lan-Hai W, Mengge W. Editorial: Forensic investigative genetic genealogy and fine-scale structure of human populations. Front Genet 2023; 13:1067865. [PMID: 36685813 PMCID: PMC9849385 DOI: 10.3389/fgene.2022.1067865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 12/22/2022] [Indexed: 01/06/2023] Open
Affiliation(s)
- He Guanglin
- 1Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, China,*Correspondence: He Guanglin, ; Wang Mengge,
| | - Wei Lan-Hai
- 2School of Ethnology and Anthropology, Inner Mongolia Normal University, Hohhot, China
| | - Wang Mengge
- 3Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, China,*Correspondence: He Guanglin, ; Wang Mengge,
| |
Collapse
|
38
|
Fang Y, Liu Y, Xu H, Zhu B. Performance evaluation of an in-house panel containing 59 autosomal InDels for forensic identification in Chinese Hui and Mongolian groups. Genomics 2023; 115:110552. [PMID: 36565793 DOI: 10.1016/j.ygeno.2022.110552] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/31/2022] [Accepted: 12/20/2022] [Indexed: 12/24/2022]
Abstract
In recent years, a novel multiplex system containing two mini-short tandem repeats, 59 autosomal InDels, two Y-chromosomal InDels, and the Amelogenin gene with all amplicons less than 200 bp has been constructed and validated by ourselves for forensic degration sample, and its forensic application efficiency has been studied in Chinese some populations. Herein, the population genetic polymorphisms of these loci were investigated in Chinese Hui (n = 249) and Mongolian (n = 222) ethnic groups using direct multiplex amplification and capillary electrophoresis platform. The forensic identification efficiencies of this self-developed system were further evaluated in these two groups. And the results showed that the values of the combined power of discrimination were 0.9999999999999999999999999999006 (Hui) and 0.999999999999999999999999999738 (Mongolian), respectively. Moreover, the combined power of exclusion values were 0.99999817 (Hui) and 0.99999779 (Mongolian). The 59 autosomal InDels used in this study exhibited high forensic identification efficiencies in 10 East Asian populations, which was also expected to be a new powerful tool for identifying degraded biological materials in East Asian populations.
Collapse
Affiliation(s)
- Yating Fang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China; School of Basic Medical Sciences, Anhui Medical University, Anhui 230031, China
| | - Yanfang Liu
- Laboratory of Fundamental Nursing Research, School of Nursing, Guangdong Medical University, Dongguan, China
| | - Hui Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China; College of Forensic Medicine, Xi'an Jiaotong University Health Science Center, Xi'an 710061, China.
| |
Collapse
|
39
|
Zhou Z, Li Z, Yao Y, Qian J, Ji Q, Shao C, Xie J. Validation of phylogenetic informative Y-InDels in Y-chromosomal haplogroup O-M175. Front Genet 2023; 14:1182028. [PMID: 37205119 PMCID: PMC10185902 DOI: 10.3389/fgene.2023.1182028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 04/17/2023] [Indexed: 05/21/2023] Open
Abstract
The Y-chromosomal haplogroup tree, which consists of a group of Y-chromosomal loci with phylogenetic information, has been widely applied in anthropology, archaeology and population genetics. With the continuous updating of the phylogenetic structure, Y-chromosomal haplogroup tree provides more information for recalling the biogeographical origin of Y chromosomes. Generally, Y-chromosomal insertion-deletion polymorphisms (Y-InDels) are genetically stable as Y-chromosomal single nucleotide polymorphisms (Y-SNPs), and therefore carry mutations that can accumulate over generations. In this study, potential phylogenetic informative Y-InDels were filtered out in haplogroup O-M175, which is dominant in East Asia, based on population data retrieved from the 1000 Genomes Project. A group of 22 phylogenetic informative Y-InDels were identified and then assigned to their corresponding subclades of haplogroup O-M175, which provided a supplement for the update and application of Y-chromosomal markers. Especially, four Y-InDels were introduced to define subclades determined using a single Y-SNP.
Collapse
|
40
|
Lan J, Zhang X, Cui W, Mei S, Xu J, Zhu B. Genetic polymorphisms and population genetic analyses of 57 autosomal InDel loci in Hubei Tujia group. Front Genet 2023; 14:1066655. [PMID: 36936428 PMCID: PMC10022698 DOI: 10.3389/fgene.2023.1066655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 01/31/2023] [Indexed: 03/06/2023] Open
Abstract
Introduction: The Tujia is the eighth most populous population in China, but its genetic structure has not been fully studied. Methods: In this study, we utilized 57 autosomal Insertion/deletion (InDel) loci to evaluate the genetic polymorphisms and efficiency of forensic applications in the Chinese Hubei Tujia group, and analyzed the genetic structure variances among the studied group and other 26 different reference populations from five continents in 1000 Genomes Project (1KG). Results: The results showed that 57 InDels have no significant deviations from Hardy-Weinberg equilibrium and linkage equilibrium. The combined power of discrimination (CPD) and the combined probability of exclusion (CPE) values for 57 InDels were 0.99999999999999999999999699822 and 0.999975177214539 in the Hubei Tujia group, respectively. In addition, the results of genetic structure analyses indicated that the Hubei Tujia group has close genetic relationships with the Chinese Han population and other East Asian populations. Discussion: These 57 autosomal InDels can be used as reliable tools for forensic individual identification and paternity testing, and are more suitable for East Asian populations. Furthermore, three InDels (rs72085595, rs145941537, and rs34529639) are promising for inferring ancestral information.
Collapse
Affiliation(s)
- Jiangwei Lan
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Xingru Zhang
- College of Forensic Medicine, Xi’an Jiaotong University Health Science Center, Xi’an, China
| | - Wei Cui
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Shuyan Mei
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
| | - Jingtao Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- *Correspondence: Jingtao Xu, ; Bofeng Zhu,
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, China
- *Correspondence: Jingtao Xu, ; Bofeng Zhu,
| |
Collapse
|
41
|
Feng Y, Wang T, Yang Y, You J, He K, Zhang H, Wang Q, Yang M, Huang J, Ren Z, Jin X. Genetic features and phylogenetic relationship analyses of Guizhou Han population residing in Southwest China via 38 X-InDels. PeerJ 2023; 11:e14964. [PMID: 36915656 PMCID: PMC10007965 DOI: 10.7717/peerj.14964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 02/06/2023] [Indexed: 03/11/2023] Open
Abstract
Background The insertion/deletion polymorphism (InDel), an ideal forensic genetic marker with a low spontaneous mutation rate and small amplification product fragments, is widely distributed in the genome, combining the advantages of STR and SNP genetic markers. The X-chromosome has high application value in complex paternity testing, and it is an excellent system for evaluating population admixture and studying evolutionary anthropology. However, further research is needed on the population genetics of X-chromosome InDels (X-InDels). Methods In this article, a system composed of 38 X-InDel loci was utilized to analyse and evaluate the forensic parameters of the Guizhou Han population in order to explore its forensic application efficiency. Results The results showed that expected heterozygosities spanned from 0.0189 to 0.5715, and the cumulative power of discrimination of the 32 X-InDels and three linkage blocks was 0.9999999954 and 0.999999999999741 for males and females, respectively. The combined mean exclusion chance of these loci for trios and duos is 0.999999 and 0.999747, respectively. Multiple methods like principal component analysis, Fst genetic distance, and phylogenetic reconstruction were employed for dissecting the genetic structure of the Guizhou Han population by comparing it with previously reported populations. As expected, the studied Han population displayed relatively close genetic affinities with the East Asian populations. At the same time, there were obvious genetic differentiations between the Guizhou Han population and other continental populations that were discerned, especially for the African populations. Conclusions This study further verified the applicability of 38 X-InDels for human personal identification and kinship analyses of Han Chinese, and also showed the application potential of X-InDels in population genetics.
Collapse
Affiliation(s)
- Yuhang Feng
- Shanghai Key Lab of Forensic Medicine, Key Lab of Forensic Science, Ministry of Justice, China, Academy of Forensic Science, Shanghai, China.,Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Ting Wang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Yunteng Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Jiangtao You
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Kun He
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Hongling Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Qiyan Wang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Meiqing Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Jiang Huang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Zheng Ren
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Xiaoye Jin
- Shanghai Key Lab of Forensic Medicine, Key Lab of Forensic Science, Ministry of Justice, China, Academy of Forensic Science, Shanghai, China.,Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| |
Collapse
|
42
|
Yao Y, Sun K, Yang Q, Zhou Z, Qian J, Li Z, Shao C, Qian X, Tang Q, Xie J. Development of a multiplex panel with 31 multi-allelic InDels for forensic DNA typing. Int J Legal Med 2023; 137:1-12. [PMID: 36326889 DOI: 10.1007/s00414-022-02907-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 10/20/2022] [Indexed: 11/06/2022]
Abstract
Insertion/Deletion (InDel) polymorphic genetic markers are abundant in human genomes. Diallelic InDel markers have been widely studied for forensic purposes, yet the low polymorphic information content limits their application and current InDel panels remain to be improved. In this study, multi-allelic InDels located out of low complexity sequence regions were selected in the datasets from East Asian populations, and a multiplex amplification system containing 31 multi-allelic InDel markers and the Amelogenin marker (FA-HID32plex) was constructed and optimized. The preliminary study on sensitivity, species specificity, inhibitor tolerance, mixture resolution, and the detection of degraded samples demonstrates that the FA-HID32plex is highly sensitive, specific, and robust for traces and degraded samples. The combined power of discrimination (CPD) of 31 multi-allelic InDel markers was 0.999 999 999 999 999 999 85, and the cumulative probability of exclusion (CPE) was 0.999 920 in a Chinese Han population, which indicates a high discrimination power. Altogether, the FA-HID32plex panel could provide reliable supplements or stand-alone information in individual identification and paternity testing, especially for challenging samples.
Collapse
Affiliation(s)
- Yining Yao
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Kuan Sun
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China.,Department of Fetal Medicine and Prenatal Diagnosis Center, Shanghai First Maternity and Infant Hospital, Tongji University School of Medicine, 2699 West Gaoke Rd, 201204, Shanghai, China.,Shanghai Key Laboratory of Maternal Fetal Medicine, Shanghai First Maternity and Infant Hospital, School of Medicine, Tongji University, Shanghai, 200092, China
| | - Qinrui Yang
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Zhihan Zhou
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Jinglei Qian
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Zhimin Li
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Chengchen Shao
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Xiaoqin Qian
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Qiqun Tang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fudan University, Shanghai, 200032, China
| | - Jianhui Xie
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China.
| |
Collapse
|
43
|
Ren T, Lin W, Yang X, Zhang Z, He S, Li W, Li Z, Zhang X. QPCTL Affects the Daily Weight Gain of the F2 Population and Regulates Myogenic Cell Proliferation and Differentiation in Chickens. Animals (Basel) 2022; 12:ani12243535. [PMID: 36552455 PMCID: PMC9774964 DOI: 10.3390/ani12243535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 12/10/2022] [Accepted: 12/11/2022] [Indexed: 12/16/2022] Open
Abstract
Molecular breeding can accelerate the process of animal breeding and improve the breeding efficiency. To date, many Indel molecular markers have been identified in livestock and poultry, but how Indels affect economic traits is not well understood. For molecular breeding, it is crucial to reveal the mechanism of action of Indels and to provide more accurate information. The purpose of this study was to investigate how the 52/224-bp multiallelic Indels of the chicken QPCTL promoter area affect the daily weight gain of chickens and the potential regulatory mechanism of the QPCTL gene. The analysis was conducted by association analysis, qPCR, dual-fluorescence assay and Western blotting. The results showed that Indels in the QPCTL promoter region were significantly associated with the daily weight gain in chickens and that QPCTL expression showed a decreasing trend in embryonic breast muscle tissues. Furthermore, QPCTL expression was significantly higher in breast muscle tissues of the AC genotype than in those of the AB and BB genotypes. Based on the transcriptional activity results, the pGL3-C vector produced more luciferase activity than pGL3-A and pGL3-B. In addition, overexpression of QPCTL promoted chicken primary myoblast (CPM) proliferation and inhibited differentiation. The results of this study suggest that Indels in the promoter region of the QPCTL gene may regulate the proliferation and differentiation of CPMs by affecting the expression of QPCTL, which ultimately affects the growth rate of chickens. These Indels have important value for the molecular breeding of chickens, and QPCTL can be used as a candidate gene to regulate and improve chicken growth and development.
Collapse
Affiliation(s)
- Tuanhui Ren
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China
| | - Wujian Lin
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
| | - Xiuxian Yang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
| | - Zihao Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
| | - Shizi He
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
| | - Wangyu Li
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
| | - Zhuanjian Li
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China
- Correspondence: (Z.L.); (X.Z.)
| | - Xiquan Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, China
- Correspondence: (Z.L.); (X.Z.)
| |
Collapse
|
44
|
The Identification of Large Rearrangements Involving Intron 2 of the CDH1 Gene in BRCA1/2 Negative and Breast Cancer Susceptibility. Genes (Basel) 2022; 13:genes13122213. [PMID: 36553480 PMCID: PMC9778491 DOI: 10.3390/genes13122213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 11/03/2022] [Accepted: 11/14/2022] [Indexed: 11/29/2022] Open
Abstract
E-cadherin, a CDH1 gene product, is a calcium-dependent cell-cell adhesion molecule playing a critical role in the establishment of epithelial architecture, maintenance of cell polarity, and differentiation. Germline pathogenic variants in the CDH1 gene are associated with hereditary diffuse gastric cancer (HDGC), and large rearrangements in the CDH1 gene are now being reported as well. Because CDH1 pathogenic variants could be associated with breast cancer (BC) susceptibility, CDH1 rearrangements could also impact it. The aim of our study is to identify rearrangements in the CDH1 gene in 148 BC cases with no BRCA1 and BRCA2 pathogenic variants. To do so, a zoom-in CGH array, covering the exonic, intronic, and flanking regions of the CDH1 gene, was used to screen our cohort. Intron 2 of the CDH1 gene was specifically targeted because it is largely reported to include several regulatory regions. As results, we detected one large rearrangement causing a premature stop in exon 3 of the CDH1 gene in a proband with a bilateral lobular breast carcinoma and a gastric carcinoma (GC). Two large rearrangements in the intron 2, a deletion and a duplication, were also reported only with BC cases without any familial history of GC. No germline rearrangements in the CDH1 coding region were detected in those families without GC and with a broad range of BC susceptibility. This study confirms the diversity of large rearrangements in the CDH1 gene. The rearrangements identified in intron 2 highlight the putative role of this intron in CDH1 regulation and alternative transcripts. Recurrent duplication copy number variations (CNV) are found in this region, and the deletion encompasses an alternative CDH1 transcript. Screening for large rearrangements in the CDH1 gene could be important for genetic testing of BC.
Collapse
|
45
|
Heide T, Househam J, Cresswell GD, Spiteri I, Lynn C, Mossner M, Kimberley C, Fernandez-Mateos J, Chen B, Zapata L, James C, Barozzi I, Chkhaidze K, Nichol D, Gunasri V, Berner A, Schmidt M, Lakatos E, Baker AM, Costa H, Mitchinson M, Piazza R, Jansen M, Caravagna G, Ramazzotti D, Shibata D, Bridgewater J, Rodriguez-Justo M, Magnani L, Graham TA, Sottoriva A. The co-evolution of the genome and epigenome in colorectal cancer. Nature 2022; 611:733-743. [PMID: 36289335 PMCID: PMC9684080 DOI: 10.1038/s41586-022-05202-1] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 08/05/2022] [Indexed: 12/13/2022]
Abstract
Colorectal malignancies are a leading cause of cancer-related death1 and have undergone extensive genomic study2,3. However, DNA mutations alone do not fully explain malignant transformation4-7. Here we investigate the co-evolution of the genome and epigenome of colorectal tumours at single-clone resolution using spatial multi-omic profiling of individual glands. We collected 1,370 samples from 30 primary cancers and 8 concomitant adenomas and generated 1,207 chromatin accessibility profiles, 527 whole genomes and 297 whole transcriptomes. We found positive selection for DNA mutations in chromatin modifier genes and recurrent somatic chromatin accessibility alterations, including in regulatory regions of cancer driver genes that were otherwise devoid of genetic mutations. Genome-wide alterations in accessibility for transcription factor binding involved CTCF, downregulation of interferon and increased accessibility for SOX and HOX transcription factor families, suggesting the involvement of developmental genes during tumourigenesis. Somatic chromatin accessibility alterations were heritable and distinguished adenomas from cancers. Mutational signature analysis showed that the epigenome in turn influences the accumulation of DNA mutations. This study provides a map of genetic and epigenetic tumour heterogeneity, with fundamental implications for understanding colorectal cancer biology.
Collapse
Affiliation(s)
- Timon Heide
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Computational Biology Research Centre, Human Technopole, Milan, Italy
| | - Jacob Househam
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - George D Cresswell
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Inmaculada Spiteri
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Claire Lynn
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Maximilian Mossner
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - Chris Kimberley
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | | | - Bingjie Chen
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Luis Zapata
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Chela James
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Iros Barozzi
- Department of Surgery and Cancer, Imperial College London, London, UK
- Centre for Cancer Research, Medical University of Vienna, Vienna, Austria
| | - Ketevan Chkhaidze
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Daniel Nichol
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Vinaya Gunasri
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - Alison Berner
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - Melissa Schmidt
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - Eszter Lakatos
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - Ann-Marie Baker
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
| | - Helena Costa
- Department of Pathology, UCL Cancer Institute, University College London, London, UK
| | - Miriam Mitchinson
- Department of Pathology, UCL Cancer Institute, University College London, London, UK
| | - Rocco Piazza
- Department of Medicine and Surgery, University of Milano-Bicocca, Milan, Italy
| | - Marnix Jansen
- Department of Pathology, UCL Cancer Institute, University College London, London, UK
| | - Giulio Caravagna
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
- Department of Mathematics and Geosciences, University of Triest, Triest, Italy
| | - Daniele Ramazzotti
- Department of Medicine and Surgery, University of Milano-Bicocca, Milan, Italy
| | - Darryl Shibata
- Department of Pathology, University of Southern California Keck School of Medicine, Los Angeles, CA, USA
| | | | | | - Luca Magnani
- Department of Surgery and Cancer, Imperial College London, London, UK
| | - Trevor A Graham
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK.
- Evolution and Cancer Lab, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK.
| | - Andrea Sottoriva
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK.
- Computational Biology Research Centre, Human Technopole, Milan, Italy.
| |
Collapse
|
46
|
Zou L, Bao W, Gao Y, Chen M, Wu Y, Wang S, Li C, Zhang J, Zhang D, Wang Q, Zhu A. Integrated Analysis of Transcriptome and microRNA Profile Reveals the Toxicity of Euphorbia Factors toward Human Colon Adenocarcinoma Cell Line Caco-2. Molecules 2022; 27:molecules27206931. [PMID: 36296525 PMCID: PMC9608949 DOI: 10.3390/molecules27206931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 11/22/2022] Open
Abstract
Euphorbia factors, lathyrane-type diterpenoids isolated from the medical herb Euphorbia lathyris L. (Euphorbiaceae), have been associated with intestinal irritation toxicity, but the mechanisms underlying this phenomenon are still unknown. The objective of this study was to evaluate the transcriptome and miRNA profiles of human colon adenocarcinoma Caco-2 cells in response to Euphorbia factors L1 (EFL1) and EFL2. Whole transcriptomes of mRNA and microRNA (miRNA) were obtained using second generation high-throughput sequencing technology in response to 200 μM EFL treatment for 72 h, and the differentially expressed genes and metabolism pathway were enriched. Gene structure changes were analyzed by comparing them with reference genome sequences. After 72 h of treatment, 16 miRNAs and 154 mRNAs were differently expressed between the EFL1 group and the control group, and 47 miRNAs and 1101 mRNAs were differentially expressed between the EFL2 group and the control. Using clusters of orthologous protein enrichment, the sequenced mRNAs were shown to be mainly involved in transcription, post-translational modification, protein turnover, chaperones, signal transduction mechanisms, intracellular trafficking, secretion, vesicular transport, and the cytoskeleton. The differentially expressed mRNA functions and pathways were enriched in transmembrane transport, T cell extravasation, the IL-17 signaling pathway, apoptosis, and the cell cycle. The differentially expressed miRNA EFLs caused changes in the structure of the gene, including alternative splicing, insertion and deletion, and single nucleotide polymorphisms. This study reveals the underlying mechanism responsible for the toxicity of EFLs in intestinal cells based on transcriptome and miRNA profiles of gene expression and structure.
Collapse
Affiliation(s)
- Lingyue Zou
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Wenqiang Bao
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Yadong Gao
- Department of Toxicology, School of Public Health, Peking University, Beijing 100191, China
- Fujian Provincial Key Laboratory of Zoonosis Research, Fujian Center for Disease Control and Prevention, Fuzhou 350001, China
| | - Mengting Chen
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Yajiao Wu
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Shuo Wang
- Department of Toxicology, School of Public Health, Peking University, Beijing 100191, China
| | - Chutao Li
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Jian Zhang
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Dongcheng Zhang
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
| | - Qi Wang
- Department of Toxicology, School of Public Health, Peking University, Beijing 100191, China
- Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing 100191, China
- Beijing Key Laboratory of Toxicological Research and Risk Assessment for Food Safety, Beijing 100191, China
- Correspondence: (Q.W.); (A.Z.)
| | - An Zhu
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350108, China
- Correspondence: (Q.W.); (A.Z.)
| |
Collapse
|
47
|
Population Genetic Data of 30 Insertion-Deletion Markers in the Polish Population. Genes (Basel) 2022; 13:genes13101683. [PMID: 36292568 PMCID: PMC9601377 DOI: 10.3390/genes13101683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 09/16/2022] [Indexed: 11/17/2022] Open
Abstract
(1) Background: Insertion-deletion (InDel) markers show the advantages of both short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) and are considered alternative markers in forensic genetics. (2) Methods: Allelic frequencies and corresponding forensic efficiency parameters of 30 autosomal polymorphic InDel loci included in the Investigator DIPplex kit (Qiagen) were obtained in a sample of 631 unrelated Polish individuals. Allelic frequency data were compared with those reported for selected populations (3) Results: All the loci conformed with Hardy-Weinberg equilibrium after applying a Bonferroni correction and no pair-wise significant linkage disequilibrium was detected. (4) Conclusions: DIPplex Kit differences were high among populations worldwide. The InDel markers are highly discriminating for human identification purposes in the Polish population.
Collapse
|
48
|
Karczewski KJ, Solomonson M, Chao KR, Goodrich JK, Tiao G, Lu W, Riley-Gillis BM, Tsai EA, Kim HI, Zheng X, Rahimov F, Esmaeeli S, Grundstad AJ, Reppell M, Waring J, Jacob H, Sexton D, Bronson PG, Chen X, Hu X, Goldstein JI, King D, Vittal C, Poterba T, Palmer DS, Churchhouse C, Howrigan DP, Zhou W, Watts NA, Nguyen K, Nguyen H, Mason C, Farnham C, Tolonen C, Gauthier LD, Gupta N, MacArthur DG, Rehm HL, Seed C, Philippakis AA, Daly MJ, Davis JW, Runz H, Miller MR, Neale BM. Systematic single-variant and gene-based association testing of thousands of phenotypes in 394,841 UK Biobank exomes. CELL GENOMICS 2022; 2:100168. [PMID: 36778668 PMCID: PMC9903662 DOI: 10.1016/j.xgen.2022.100168] [Citation(s) in RCA: 122] [Impact Index Per Article: 40.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 03/20/2022] [Accepted: 07/16/2022] [Indexed: 01/20/2023]
Abstract
Genome-wide association studies have successfully discovered thousands of common variants associated with human diseases and traits, but the landscape of rare variations in human disease has not been explored at scale. Exome-sequencing studies of population biobanks provide an opportunity to systematically evaluate the impact of rare coding variations across a wide range of phenotypes to discover genes and allelic series relevant to human health and disease. Here, we present results from systematic association analyses of 4,529 phenotypes using single-variant and gene tests of 394,841 individuals in the UK Biobank with exome-sequence data. We find that the discovery of genetic associations is tightly linked to frequency and is correlated with metrics of deleteriousness and natural selection. We highlight biological findings elucidated by these data and release the dataset as a public resource alongside the Genebass browser for rapidly exploring rare-variant association results.
Collapse
Affiliation(s)
- Konrad J. Karczewski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Matthew Solomonson
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Katherine R. Chao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Julia K. Goodrich
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Grace Tiao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Wenhan Lu
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | | | - Hye In Kim
- Worldwide Research Development and Medical, Pfizer, Inc., Cambridge, MA 02139, USA
| | - Xiuwen Zheng
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | - Fedik Rahimov
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | - Sahar Esmaeeli
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | | | - Mark Reppell
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | - Jeff Waring
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | - Howard Jacob
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | | | | | - Xing Chen
- Worldwide Research Development and Medical, Pfizer, Inc., Cambridge, MA 02139, USA
| | - Xinli Hu
- Worldwide Research Development and Medical, Pfizer, Inc., Cambridge, MA 02139, USA
| | - Jacqueline I. Goldstein
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Daniel King
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Christopher Vittal
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Timothy Poterba
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Duncan S. Palmer
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Claire Churchhouse
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Daniel P. Howrigan
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Wei Zhou
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Nicholas A. Watts
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Kevin Nguyen
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Huy Nguyen
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Cara Mason
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Christopher Farnham
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Charlotte Tolonen
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Laura D. Gauthier
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Namrata Gupta
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Daniel G. MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Heidi L. Rehm
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Cotton Seed
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Mark J. Daly
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Institute for Molecular Medicine Finland, Helsinki, Finland
| | - J. Wade Davis
- Genomics Research Center, AbbVie, North Chicago, IL 60064, USA
| | | | - Melissa R. Miller
- Worldwide Research Development and Medical, Pfizer, Inc., Cambridge, MA 02139, USA
| | - Benjamin M. Neale
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| |
Collapse
|
49
|
Forensic Applications of Markers Present on the X Chromosome. Genes (Basel) 2022; 13:genes13091597. [PMID: 36140765 PMCID: PMC9498577 DOI: 10.3390/genes13091597] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 08/23/2022] [Accepted: 09/02/2022] [Indexed: 11/17/2022] Open
Abstract
Microsatellite genetic markers are the gold standard for human genetic identification. Forensic analyses around the world are carried out through protocols using the analysis of STR markers in autosomal chromosomes and in the Y chromosome to solve crimes. However, these analyses do not allow for the resolution of all cases, such as rape situations with suspicion of incest, paternity without a maternal sample for comparison, and biological traces with DNA mixture where the profile sought is female, among other situations. In these complex cases, the study of X-chromosome STR markers significantly increases the probability of identification by complementing the data obtained for autosomal and Y-chromosome markers, due to the unique structure of the X chromosome and its exclusive method of inheritance. However, there are currently no validated Brazilian protocols for this purpose, nor are there any population data necessary for statistical analyses that must be included in the issuance of expert reports. Thus, the aim of this article is to provide a literary review of the applications of X-chromosomal markers in population genetics.
Collapse
|
50
|
Dai SS, Sulaiman X, Isakova J, Xu WF, Abdulloevich NT, Afanasevna ME, Ibrohimovich KB, Chen X, Yang WK, Wang MS, Shen QK, Yang XY, Yao YG, Aldashev AA, Saidov A, Chen W, Cheng LF, Peng MS, Zhang YP. The genetic echo of the Tarim mummies in modern Central Asians. Mol Biol Evol 2022; 39:6675590. [PMID: 36006373 PMCID: PMC9469894 DOI: 10.1093/molbev/msac179] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The diversity of Central Asians has been shaped by multiple migrations and cultural diffusion. Although ancient DNA studies have revealed the demographic changes of the Central Asian since the Bronze Age, the contribution of the ancient populations to the modern Central Asian remains opaque. Herein, we performed high-coverage sequencing of 131 whole genomes of Indo-European-speaking Tajik and Turkic-speaking Kyrgyz populations to explore their genomic diversity and admixture history. By integrating the ancient DNA data, we revealed more details of the origins and admixture history of Central Asians. We found that the major ancestry of present-day Tajik populations can be traced back to the admixture of the Bronze Age Bactria–Margiana Archaeological Complex and Andronovo-related populations. Highland Tajik populations further received additional gene flow from the Tarim mummies, an isolated ancient North Eurasian–related population. The West Eurasian ancestry of Kyrgyz is mainly derived from Historical Era populations in Xinjiang of China. Furthermore, the recent admixture signals detected in both Tajik and Kyrgyz are ascribed to the expansions of Eastern Steppe nomadic pastoralists during the Historical Era.
Collapse
Affiliation(s)
- Shan Shan Dai
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Xierzhatijiang Sulaiman
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Department of Pharmacology, School of Pharmacy, Xinjiang Medical University, Urumqi 830054, China
| | - Jainagul Isakova
- Institute of Molecular Biology and Medicine, Bishkek 720040, Kyrgyzstan
| | - Wei Fang Xu
- Shenzhen Hospital of Guangzhou University of Chinese Medicine, Shenzhen 518034, China
| | - Najmudinov Tojiddin Abdulloevich
- E.N. Pavlovsky Institute of Zoology and Parasitology, Academy of Sciences of Republic of Tajikistan, Dushanbe 734025, Tajikistan
| | - Manilova Elena Afanasevna
- E.N. Pavlovsky Institute of Zoology and Parasitology, Academy of Sciences of Republic of Tajikistan, Dushanbe 734025, Tajikistan
| | - Khudoidodov Behruz Ibrohimovich
- E.N. Pavlovsky Institute of Zoology and Parasitology, Academy of Sciences of Republic of Tajikistan, Dushanbe 734025, Tajikistan
| | - Xi Chen
- Research Center for Ecology and Environment of Central Asia, Chinese Academy of Sciences, Urumqi 830011, China.,State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China
| | - Wei Kang Yang
- State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China
| | - Ming Shan Wang
- Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Quan Kuan Shen
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Xing Yan Yang
- Key Laboratory of Chemistry in Ethnic Medicinal Resource, Yunnan Minzu University, Kunming 650504, China.,School of Chemistry and Environment, Yunnan Minzu University, Kunming 650504, China
| | - Yong Gang Yao
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,KIZ/CUHK Joint Laboratory of Bio-resources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Almaz A Aldashev
- Institute of Molecular Biology and Medicine, Bishkek 720040, Kyrgyzstan
| | - Abdusattor Saidov
- E.N. Pavlovsky Institute of Zoology and Parasitology, Academy of Sciences of Republic of Tajikistan, Dushanbe 734025, Tajikistan
| | - Wei Chen
- College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming 650224, China.,State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming 650224, China
| | - Lu Feng Cheng
- Department of Pharmacology, School of Pharmacy, Xinjiang Medical University, Urumqi 830054, China
| | - Min Sheng Peng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,KIZ/CUHK Joint Laboratory of Bio-resources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Ya Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,KIZ/CUHK Joint Laboratory of Bio-resources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China.,State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, School of Life Sciences, Yunnan University, Kunming 650091, China
| |
Collapse
|