Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 3.799
Filter
1.
Mol Genet Genomics ; 299(1): 65, 2024 Jul 07.
Article in English | MEDLINE | ID: mdl-38972030

ABSTRACT

BACKGROUND: A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations. RESULTS: Our analysis revealed high levels of genetic variants in CMRGs, with 68.73% exhibiting copy number variations and 65.20% containing short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring the highest number of copy number variants and short variants compared to samples from other continents. Notably, 15.79% to 33.96% of short variants were exclusively detectable through long-read sequencing. While the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution. CONCLUSION: Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.


Subject(s)
DNA Copy Number Variations , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , DNA Copy Number Variations/genetics , High-Throughput Nucleotide Sequencing/methods , Genome, Human/genetics , Polymorphism, Single Nucleotide/genetics , Genetic Variation/genetics , Genetic Predisposition to Disease , Genetics, Population/methods , INDEL Mutation
2.
Nat Commun ; 15(1): 5613, 2024 Jul 04.
Article in English | MEDLINE | ID: mdl-38965236

ABSTRACT

Advancements in CRISPR technology, particularly the development of base editors, revolutionize genetic variant research. When combined with model organisms like zebrafish, base editors significantly accelerate and refine in vivo analysis of genetic variations. However, base editors are restricted by protospacer adjacent motif (PAM) sequences and specific editing windows, hindering their applicability to a broad spectrum of genetic variants. Additionally, base editors can introduce unintended mutations and often exhibit reduced efficiency in living organisms compared to cultured cell lines. Here, we engineer a suite of adenine base editors (ABEs) called ABE-Ultramax (Umax), demonstrating high editing efficiency and low rates of insertions and deletions (indels) in zebrafish. The ABE-Umax suite of editors includes ABEs with shifted, narrowed, or broadened editing windows, reduced bystander mutation frequency, and highly flexible PAM sequence requirements. These advancements have the potential to address previous challenges in disease modeling and advance gene therapy applications.


Subject(s)
Adenine , CRISPR-Cas Systems , Gene Editing , INDEL Mutation , Zebrafish , Zebrafish/genetics , Animals , Gene Editing/methods , Adenine/metabolism , RNA, Guide, CRISPR-Cas Systems/genetics , RNA, Guide, CRISPR-Cas Systems/metabolism , Animals, Genetically Modified , Alleles
3.
Hum Genomics ; 18(1): 79, 2024 Jul 15.
Article in English | MEDLINE | ID: mdl-39010135

ABSTRACT

The analysis of genomic variations in offspring after implantation has been infrequently studied. In this study, we aim to investigate the extent of de novo mutations in humans from developing fetus to birth. Using high-depth whole-genome sequencing, 443 parent-offspring trios were studied to compare the results of de novo mutations (DNMs) between different groups. The focus was on fetuses and newborns, with DNA samples obtained from the families' blood and the aspirated embryonic tissues subjected to deep sequencing. It was observed that the average number of total DNMs in the newborns group was 56.26 (54.17-58.35), which appeared to be lower than that the multifetal reduction group, which was 76.05 (69.70-82.40) (F = 2.42, P = 0.12). However, after adjusting for parental age and maternal pre-pregnancy body mass index (BMI), significant differences were found between the two groups. The analysis was further divided into single nucleotide variants (SNVs) and insertion/deletion of a small number of bases (indels), and it was discovered that the average number of de novo SNVs associated with the multifetal reduction group and the newborn group was 49.89 (45.59-54.20) and 51.09 (49.22-52.96), respectively. No significant differences were noted between the groups (F = 1.01, P = 0.32). However, a significant difference was observed for de novo indels, with a higher average number found in the multifetal reduction group compared to the newborn group (F = 194.17, P < 0.001). The average number of de novo indels among the multifetal reduction group and the newborn group was 26.26 (23.27-29.05) and 5.17 (4.82-5.52), respectively. To conclude, it has been observed that the quantity of de novo indels in the newborns experiences a significant decrease when compared to that in the aspirated embryonic tissues (7-9 weeks). This phenomenon is evident across all genomic regions, highlighting the adverse effects of de novo indels on the fetus and emphasizing the significance of embryonic implantation and intrauterine growth in human genetic selection mechanisms.


Subject(s)
Fetus , Humans , Female , Pregnancy , Infant, Newborn , Male , Adult , Polymorphism, Single Nucleotide/genetics , Embryo Implantation/genetics , Genome, Human/genetics , INDEL Mutation/genetics , Genomics , Whole Genome Sequencing , High-Throughput Nucleotide Sequencing , Mutation/genetics , Fetal Development/genetics
4.
Curr Microbiol ; 81(9): 275, 2024 Jul 17.
Article in English | MEDLINE | ID: mdl-39020143

ABSTRACT

In this study, the toxigenic characteristics of 14 strains of Microcystis were analyzed, and single nucleotide polymorphism (SNP) and insertion/deletion (InDel) loci in microcystin synthetase (mcy) gene clusters were screened. Based on SNP and InDel loci associated with the toxigenic characteristics, primers and TaqMan or Cycling fluorescent probes were designed to develop duplex real-time fluorescent quantitative PCR (FQ-PCR) assays. After evaluating specificity and sensitivity, these assays were applied to detect the toxigenic Microcystis genotypes in a shrimp pond where Microcystis blooms occurred. The results showed a total of 2155 SNP loci and 66 InDel loci were obtained, of which 12 SNP loci and 5 InDel loci were associated with the toxigenic characteristics. Three duplex real-time FQ-PCR assays were developed, each of which could quantify two genotypes of toxigenic Microcystis. These FQ-PCR assays were highly specific, and two Cycling assays were more sensitive than TaqMan assay. In the shrimp pond, six genotypes of toxigenic Microcystis were detected using the developed FQ-PCR assays, indicating that above genotyping assays have the potential for quantitative analysis of the toxigenic Microcystis genotypes in natural water.


Subject(s)
Genotype , Microcystis , Multigene Family , Polymorphism, Single Nucleotide , Real-Time Polymerase Chain Reaction , Microcystis/genetics , Microcystis/classification , Real-Time Polymerase Chain Reaction/methods , Microcystins/genetics , INDEL Mutation , Bacterial Proteins/genetics , Sensitivity and Specificity , Ponds/microbiology , Peptide Synthases/genetics
5.
Gigascience ; 132024 Jan 02.
Article in English | MEDLINE | ID: mdl-39028586

ABSTRACT

BACKGROUND: The use of sex-specific molecular markers has become a prominent method in enhancing fish production and economic value, as well as providing a foundation for understanding the complex molecular mechanisms involved in fish sex determination. Over the past decades, research on male and female sex identification has predominantly employed molecular biology methodologies such as restriction fragment length polymorphism, random amplification of polymorphic DNA, simple sequence repeat, and amplified fragment length polymorphism. The emergence of high-throughput sequencing technologies, particularly Illumina, has led to the utilization of single nucleotide polymorphism and insertion/deletion variants as significant molecular markers for investigating sex identification in fish. The advancement of sex-controlled breeding encounters numerous challenges, including the inefficiency of current methods, intricate experimental protocols, high costs of development, elevated rates of false positives, marker instability, and cumbersome field-testing procedures. Nevertheless, the emergence and swift progress of PacBio high-throughput sequencing technology, characterized by its long-read output capabilities, offers novel opportunities to overcome these obstacles. FINDINGS: Utilizing male/female assembled genome information in conjunction with short-read sequencing data survey and long-read PacBio sequencing data, a catalog of large-segment (>100 bp) insertion/deletion genetic variants was generated through a genome-wide variant site-scanning approach with bidirectional comparisons. The sequence tagging sites were ranked based on the long-read depth of the insertion/deletion site, with markers exhibiting lower long-read depth being considered more effective for large-segment deletion variants. Subsequently, a catalog of bulk primers and simulated PCR for the male/female variant loci was developed, incorporating primer design for the target region and electronic PCR (e-PCR) technology. The Japanese parrotfish (Oplegnathus fasciatus), belonging to the Oplegnathidae family within the Centrarchiformes order, holds significant economic value as a rocky reef fish indigenous to East Asia. The criteria for rapid identification of male and female differences in Japanese parrotfish were established through agarose gel electrophoresis, which revealed 2 amplified bands for males and 1 amplified band for females. A high-throughput identification catalog of sex-specific markers was then constructed using this method, resulting in the identification of 3,639 (2,786 INS/853 DEL, ♀ as reference) and 3,672 (2,876 INS/833 DEL, ♂ as reference) markers in conjunction with 1,021 and 894 high-quality genetic sex identification markers, respectively. Sixteen differential loci were randomly chosen from the catalog for validation, with 11 of them meeting the criteria for male/female distinctions. The implementation of cost-effective and efficient technological processes would facilitate the rapid advancement of genetic breeding through expediting the high-throughput development of sex genetic markers for various species. CONCLUSIONS: Our study utilized assembled genome information from male and female individuals obtained from PacBio, in addition to data from short-read sequencing data survey and long-read PacBio sequencing data. We extensively employed genome-wide variant site scanning and identification, high-throughput primer design of target regions, and e-PCR batch amplification, along with statistical analysis and ranking of the long-read depth of the variant sites. Through this integrated approach, we successfully compiled a catalog of large insertion/deletion sites (>100 bp) in both male and female Japanese parrotfish.


Subject(s)
High-Throughput Nucleotide Sequencing , Animals , Female , Male , High-Throughput Nucleotide Sequencing/methods , Genetic Markers , Perciformes/genetics , INDEL Mutation , Polymorphism, Single Nucleotide , Sex Determination Analysis/methods , Fishes/genetics , East Asian People
6.
Sci Rep ; 14(1): 13068, 2024 06 06.
Article in English | MEDLINE | ID: mdl-38844495

ABSTRACT

Diabetic nephropathy represents one of the main long-term complications in T2DM patients. Cigarette smoking represents one of modifiable renal risk factors to kidney damage due to lead (Pb) exposure in these patients. Our goal is to investigate serum copeptin and Kidney injury molecule-1 (KIM-1) and urinary lead (UPb) in type 2 diabetes mellitus (T2DM) patients even smokers and non-smokers groups and compared to corresponding health controls and assess its associations with Angiotensin-Converting enzyme Insertion/Deletion polymorphism [ACE (I/D)] polymorphism in diabetic nephropathy progression in those patients. In present study, 106 T2DM patients and 102 healthy control individuals were enrolled. Serum glucose, copeptin, KIM-1, total cholesterol (TChol), triglycerides (TG), estimated glomerular filtration rate (eGFR) and UPb levels and ACE (I/D) polymorphisms were assessed in both groups. Results mentioned to significant variations in all parameters compared to in T2DM group compared to control group. Serum copeptin and UPb demonstrated significant difference in diabetic smokers (DS) and diabetic non-smokers (DNS) groups while KIM-1 exhibited significant change between DNS and healthy control non-smokers (CNS) groups. Positive relation was recorded between serum glucose and KIM-1 while negative one was found between serum copeptin and TChol. D allele was associated with significant variation in most parameters in T2DM, especially insertion/deletion (ID) polymorphism. ROC curve analysis (AUC) for serum copeptin was 0.8, p < 0.044 and for Kim-1 was 0.54, p = 0.13 while for uPb was 0.71, p < 0.033. Serum copeptin and UPb might be a prognostic biomarker for renal function decline in smoker T2DM patients while KIM-1 was potent marker in non-smoker T2DM with association with D allele of ACE I/D gene polymorphism.


Subject(s)
Diabetes Mellitus, Type 2 , Glycopeptides , Hepatitis A Virus Cellular Receptor 1 , Peptidyl-Dipeptidase A , Polymorphism, Genetic , Humans , Male , Peptidyl-Dipeptidase A/genetics , Peptidyl-Dipeptidase A/blood , Female , Diabetes Mellitus, Type 2/blood , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/complications , Glycopeptides/blood , Middle Aged , Hepatitis A Virus Cellular Receptor 1/genetics , Diabetic Nephropathies/blood , Diabetic Nephropathies/genetics , Diabetic Nephropathies/etiology , INDEL Mutation , Smokers , Case-Control Studies , Adult , Genetic Predisposition to Disease , Glomerular Filtration Rate , Biomarkers/blood , ROC Curve
7.
Genes (Basel) ; 15(6)2024 Jun 12.
Article in English | MEDLINE | ID: mdl-38927706

ABSTRACT

Deficiencies in DNA mismatch repair (MMRd) leave characteristic footprints of microsatellite instability (MSI) in cancer genomes. We used data from the Cancer Genome Atlas and International Cancer Genome Consortium to conduct a comprehensive analysis of MSI-associated cancers, focusing on indel mutational signatures. We classified MSI-high genomes into two subtypes based on their indel profiles: deletion-dominant (MMRd-del) and insertion-dominant (MMRd-ins). Compared with MMRd-del genomes, MMRd-ins genomes exhibit distinct mutational and transcriptomic features, including a higher prevalence of T>C substitutions and related mutation signatures. Short insertions and deletions in MMRd-ins and MMRd-del genomes target different sets of genes, resulting in distinct indel profiles between the two subtypes. In addition, indels in the MMRd-ins genomes are enriched with subclonal alterations that provide clues about a distinct evolutionary relationship between the MMRd-ins and MMRd-del genomes. Notably, the transcriptome analysis indicated that MMRd-ins cancers upregulate immune-related genes, show a high level of immune cell infiltration, and display an elevated neoantigen burden. The genomic and transcriptomic distinctions between the two types of MMRd genomes highlight the heterogeneity of genetic mechanisms and resulting genomic footprints and transcriptomic changes in cancers, which has potential clinical implications.


Subject(s)
DNA Mismatch Repair , INDEL Mutation , Microsatellite Instability , Neoplasms , Humans , Neoplasms/genetics , Neoplasms/immunology , DNA Mismatch Repair/genetics , Genome, Human , Transcriptome/genetics
8.
Nat Commun ; 15(1): 5096, 2024 Jun 14.
Article in English | MEDLINE | ID: mdl-38877047

ABSTRACT

CRISPR/Cas9 is widely used for precise mutagenesis through targeted DNA double-strand breaks (DSBs) induction followed by error-prone repair. A better understanding of this process requires measuring the rates of cutting, error-prone, and precise repair, which have remained elusive so far. Here, we present a molecular and computational toolkit for multiplexed quantification of DSB intermediates and repair products by single-molecule sequencing. Using this approach, we characterize the dynamics of DSB induction, processing and repair at endogenous loci along a 72 h time-course in tomato protoplasts. Combining this data with kinetic modeling reveals that indel accumulation is determined by the combined effect of the rates of DSB induction processing of broken ends, and precise versus error repair. In this study, 64-88% of the molecules were cleaved in the three targets analyzed, while indels ranged between 15-41%. Precise repair accounts for most of the gap between cleavage and error repair, representing up to 70% of all repair events. Altogether, this system exposes flux in the DSB repair process, decoupling induction and repair dynamics, and suggesting an essential role of high-fidelity repair in limiting the efficiency of CRISPR-mediated mutagenesis.


Subject(s)
CRISPR-Cas Systems , DNA Breaks, Double-Stranded , DNA Repair , Solanum lycopersicum , Solanum lycopersicum/genetics , Solanum lycopersicum/metabolism , Gene Editing/methods , Protoplasts/metabolism , INDEL Mutation , Kinetics
9.
Mol Biol Evol ; 41(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38842253

ABSTRACT

Despite having important biological implications, insertion, and deletion (indel) events are often disregarded or mishandled during phylogenetic inference. In multiple sequence alignment, indels are represented as gaps and are estimated without considering the distinct evolutionary history of insertions and deletions. Consequently, indels are usually excluded from subsequent inference steps, such as ancestral sequence reconstruction and phylogenetic tree search. Here, we introduce indel-aware parsimony (indelMaP), a novel way to treat gaps under the parsimony criterion by considering insertions and deletions as separate evolutionary events and accounting for long indels. By identifying the precise location of an evolutionary event on the tree, we can separate overlapping indel events and use affine gap penalties for long indel modeling. Our indel-aware approach harnesses the phylogenetic signal from indels, including them into all inference stages. Validation and comparison to state-of-the-art inference tools on simulated data show that indelMaP is most suitable for densely sampled datasets with closely to moderately related sequences, where it can reach alignment quality comparable to probabilistic methods and accurately infer ancestral sequences, including indel patterns. Due to its remarkable speed, our method is well suited for epidemiological datasets, eliminating the need for downsampling and enabling the exploitation of the additional information provided by dense taxonomic sampling. Moreover, indelMaP offers new insights into the indel patterns of biologically significant sequences and advances our understanding of genetic variability by considering gaps as crucial evolutionary signals rather than mere artefacts.


Subject(s)
INDEL Mutation , Phylogeny , Sequence Alignment , Sequence Alignment/methods , Evolution, Molecular , Models, Genetic , Humans
10.
Mol Biol Evol ; 41(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38869090

ABSTRACT

Sequence alignment is an essential method in bioinformatics and the basis of many analyses, including phylogenetic inference, ancestral sequence reconstruction, and gene annotation. Sequencing artifacts and errors made during genome assembly, such as abiological frameshifts and incorrect early stop codons, can impact downstream analyses leading to erroneous conclusions in comparative and functional genomic studies. More significantly, while indels can occur both within and between codons in natural sequences, most amino-acid- and codon-based aligners assume that indels only occur between codons. This mismatch between biology and alignment algorithms produces suboptimal alignments and errors in downstream analyses. To address these issues, we present COATi, a statistical, codon-aware pairwise aligner that supports complex insertion-deletion models and can handle artifacts present in genomic data. COATi allows users to reduce the amount of discarded data while generating more accurate sequence alignments. COATi can infer indels both within and between codons, leading to improved sequence alignments. We applied COATi to a dataset containing orthologous protein-coding sequences from humans and gorillas and conclude that 41% of indels occurred between codons, agreeing with previous work in other species. We also applied COATi to semiempirical benchmark alignments and find that it outperforms several popular alignment programs on several measures of alignment quality and accuracy.


Subject(s)
INDEL Mutation , Sequence Alignment , Sequence Alignment/methods , Humans , Animals , Software , Algorithms , Codon , Gorilla gorilla/genetics , Computational Biology/methods , Open Reading Frames , Phylogeny
11.
Mol Biol (Mosk) ; 58(1): 157-159, 2024.
Article in Russian | MEDLINE | ID: mdl-38943587

ABSTRACT

Streptococcus pyogenes Cas9 (SpCas9) is the most popular tool in gene editing; however, off-target mutagenesis is one of the biggest impediments in its application. In our previous study, we proposed the HH theory, which states that sgRNA/DNA hybrid (hybrid) extrusion-induced enhancement of hydrophobic interactions between the hybrid and REC3/HNH is a key factor in cleavage initiation. Based on the HH theory, we analyzed the interactions between the REC3 domain and hybrid and obtained 8 mutant sites. We designed 8 SpCas9 variants (V1-V8), used digital droplet PCR to assess SpCas9-induced DNA indels in human cells, and developed high-fidelity variants. Thus, the HH theory may be employed to further optimize SpCas9-mediated genome editing systems, and the resultant V3, V6, V7, and V8 SpCas9 variants may be valuable for applications requiring high-precision genome editing.


Subject(s)
CRISPR-Associated Protein 9 , CRISPR-Cas Systems , Gene Editing , Streptococcus pyogenes , Humans , Gene Editing/methods , CRISPR-Associated Protein 9/genetics , CRISPR-Associated Protein 9/metabolism , Streptococcus pyogenes/genetics , Streptococcus pyogenes/enzymology , HEK293 Cells , INDEL Mutation , RNA, Guide, CRISPR-Cas Systems/genetics , RNA, Guide, CRISPR-Cas Systems/metabolism , DNA/genetics , DNA/metabolism , DNA/chemistry
12.
Nat Commun ; 15(1): 5014, 2024 Jun 12.
Article in English | MEDLINE | ID: mdl-38866774

ABSTRACT

Genetic testing is crucial for precision cancer medicine. However, detecting multiple same-site insertions or deletions (indels) is challenging. Here, we introduce CoHIT (Cas12a-based One-for-all High-speed Isothermal Test), a one-pot CRISPR-based assay for indel detection. Leveraging an engineered AsCas12a protein variant with high mismatch tolerance and broad PAM scope, CoHIT can use a single crRNA to detect multiple NPM1 gene c.863_864 4-bp insertions in acute myeloid leukemia (AML). After optimizing multiple parameters, CoHIT achieves a detection limit of 0.01% and rapid results within 30 minutes, without wild-type cross-reactivity. It successfully identifies NPM1 mutations in 30 out of 108 AML patients and demonstrates potential in monitoring minimal residual disease (MRD) through continuous sample analysis from three patients. The CoHIT method is also competent for detecting indels of KIT, BRAF, and EGFR genes. Integration with lateral flow test strips and microfluidic chips highlights CoHIT's adaptability and multiplexing capability, promising significant advancements in clinical cancer diagnostics.


Subject(s)
CRISPR-Cas Systems , INDEL Mutation , Leukemia, Myeloid, Acute , Nucleophosmin , Humans , Leukemia, Myeloid, Acute/genetics , Leukemia, Myeloid, Acute/diagnosis , Neoplasm, Residual/genetics , Neoplasm, Residual/diagnosis , Nuclear Proteins/genetics , Proto-Oncogene Proteins B-raf/genetics , Genetic Testing/methods , ErbB Receptors/genetics , Bacterial Proteins , Endodeoxyribonucleases , CRISPR-Associated Proteins
13.
Bioinformatics ; 40(Supplement_1): i277-i286, 2024 Jun 28.
Article in English | MEDLINE | ID: mdl-38940131

ABSTRACT

MOTIVATION: Insertions and deletions (indels) influence the genetic code in fundamentally distinct ways from substitutions, significantly impacting gene product structure and function. Despite their influence, the evolutionary history of indels is often neglected in phylogenetic tree inference and ancestral sequence reconstruction, hindering efforts to comprehend biological diversity determinants and engineer variants for medical and industrial applications. RESULTS: We frame determining the optimal history of indel events as a single Mixed-Integer Programming (MIP) problem, across all branch points in a phylogenetic tree adhering to topological constraints, and all sites implied by a given set of aligned, extant sequences. By disentangling the impact on ancestral sequences at each branch point, this approach identifies the minimal indel events that jointly explain the diversity in sequences mapped to the tips of that tree. MIP can recover alternate optimal indel histories, if available. We evaluated MIP for indel inference on a dataset comprising 15 real phylogenetic trees associated with protein families ranging from 165 to 2000 extant sequences, and on 60 synthetic trees at comparable scales of data and reflecting realistic rates of mutation. Across relevant metrics, MIP outperformed alternative parsimony-based approaches and reported the fewest indel events, on par or below their occurrence in synthetic datasets. MIP offers a rational justification for indel patterns in extant sequences; importantly, it uniquely identifies global optima on complex protein data sets without making unrealistic assumptions of independence or evolutionary underpinnings, promising a deeper understanding of molecular evolution and aiding novel protein design. AVAILABILITY AND IMPLEMENTATION: The implementation is available via GitHub at https://github.com/santule/indelmip.


Subject(s)
INDEL Mutation , Phylogeny , Evolution, Molecular , Algorithms , Computational Biology/methods
14.
Sci Rep ; 14(1): 13171, 2024 06 07.
Article in English | MEDLINE | ID: mdl-38849492

ABSTRACT

Angiotensin-converting enzyme (ACE) is closely related to cardiometabolic risk factors and atherosclerosis. This study aims to investigate whether the insertion/deletion (I/D) variant of ACE gene impacts cardiometabolic risk factors, premature coronary artery disease (PCAD), and severity of coronary lesions. PubMed, Cochrane Library, Central, CINAHL, and ClinicalTrials.gov were searched until December 22, 2023. 94,270 individuals were included for the analysis. Carriers of DD genotype had higher levels of triglycerides (TG), total cholesterol (TC), low-density lipoprotein cholesterol (LDL-C), systolic blood pressure (SBP), diastolic blood pressure (DBP), body mass index (BMI), and waist circumference (WC) than carriers of II or ID genotypes. In addition, carriers of DD genotype were at high risk of PCAD and multiple vessel lesions. The impacts of ACE I/D variant on lipid levels were significant in American individuals but stronger in male individuals. In contrast, the impacts of ACE I/D variant on PCAD and severity of coronary lesions were primarily significant in Caucasian individuals. This study indicates that the ACE I/D variant has a slight but significant impact on cardiometabolic risk factors, PCAD, and severity of coronary lesions. Angiotensin-converting enzyme inhibitors (ACEI) may benefit high-risk populations with ACE DD genotype to prevent PCAD and multiple vessel lesions.PROSPERO registration number: CRD42023426732.


Subject(s)
Cardiometabolic Risk Factors , Coronary Artery Disease , INDEL Mutation , Peptidyl-Dipeptidase A , Humans , Peptidyl-Dipeptidase A/genetics , Coronary Artery Disease/genetics , Male , Female , Blood Pressure , Genetic Predisposition to Disease , Severity of Illness Index , Middle Aged , Genotype , Body Mass Index , Risk Factors , Triglycerides/blood , Adult
15.
Genes (Basel) ; 15(5)2024 05 07.
Article in English | MEDLINE | ID: mdl-38790221

ABSTRACT

Early-onset breast cancer (EoBC), defined by a diagnosis <40 years of age, is associated with poor prognosis. This study investigated the mutational landscape of non-metastatic EoBC and the prognostic relevance of mutational signatures using 100 tumour samples from Alberta, Canada. The MutationalPatterns package in R/Bioconductor was used to extract de novo single-base substitution (SBS) and insertion-deletion (indel) mutational signatures and to fit COSMIC SBS and indel signatures. We assessed associations between these signatures and clinical characteristics of disease, in addition to recurrence-free (RFS) and overall survival (OS). Five SBS and two indel signatures were extracted. The SBS13-like signature had higher relative contributions in the HER2-enriched subtype. Patients with higher than median contribution tended to have better RFS after adjustment for other prognostic factors (HR = 0.29; 95% CI: 0.08-1.06). An unsupervised clustering algorithm based on absolute contribution revealed three clusters of fitted COSMIC SBS signatures, but cluster membership was not associated with clinical variables or survival outcomes. The results of this exploratory study reveal various SBS and indel signatures may be associated with clinical features of disease and prognosis. Future studies with larger samples are required to better understand the mechanistic underpinnings of disease progression and treatment response in EoBC.


Subject(s)
Breast Neoplasms , Humans , Female , Breast Neoplasms/genetics , Breast Neoplasms/pathology , Breast Neoplasms/mortality , Adult , Prognosis , Age of Onset , Mutation , INDEL Mutation , Biomarkers, Tumor/genetics , Alberta/epidemiology , Middle Aged
16.
Nucleic Acids Res ; 52(W1): W102-W107, 2024 Jul 05.
Article in English | MEDLINE | ID: mdl-38709886

ABSTRACT

Over the past decade, mtDNA-Server established itself as one of the most widely used variant calling web-services for human mitochondrial genomes. The service accepts sequencing data in BAM format and returns an annotated variant analysis report for both homoplasmic and heteroplasmic variants. In this work we present mtDNA-Server 2, which includes several new features highly requested by the community. Most importantly, it includes (a) the integration of a novel variant calling mode that accurately call insertions, deletions and single nucleotide variants at once, (b) the integration of additional quality control and input validation modules, (c) a method to estimate the required coverage to minimize false positives and (d) an interactive analytics dashboard. Furthermore, we migrated the complete analysis workflow to the Nextflow workflow manager for improved parallelization, reproducibility and local execution. Recognizing the importance of insertions and deletions as well as offering novel quality control, validation and reporting features, mtDNA-Server 2 provides researchers and clinicians a new state-of-the-art analysis platform for interpreting mitochondrial genomes. mtDNA-Server 2 is available via mitoverse, our analysis platform that offers a centralized place for mtDNA analysis in the cloud. The web-service, source code and its documentation are freely accessible at https://mitoverse.i-med.ac.at.


Subject(s)
DNA, Mitochondrial , Software , DNA, Mitochondrial/genetics , Humans , Sequence Analysis, DNA/methods , Genome, Mitochondrial , Workflow , High-Throughput Nucleotide Sequencing/methods , Internet , Reproducibility of Results , INDEL Mutation
17.
Theor Appl Genet ; 137(6): 136, 2024 May 20.
Article in English | MEDLINE | ID: mdl-38764078

ABSTRACT

KEY MESSAGE: Different kinship and resistance to cotton leaf curl disease (CLCuD) and heat were found between upland cotton cultivars from China and Pakistan. 175 SNPs and 82 InDels loci related to yield, fiber quality, CLCuD, and heat resistance were identified. Elite alleles found in Pakistani accessions aided local adaptation to climatic condition of two countries. Adaptation of upland cotton (Gossypium hirsutum) beyond its center of origin is expected to be driven by tailoring of the genome and genes to enhance yield and quality in new ecological niches. Here, resequencing of 456 upland cotton accessions revealed two distinct kinships according to the associated country. Fiber quality and lint percentage were consistent across kinships, but resistance to cotton leaf curl disease (CLCuD) and heat was distinctly exhibited by accessions from Pakistan, illustrating highly local adaption. A total of 175 SNP and 82 InDel loci related to yield, fiber quality, CLCuD and heat resistance were identified; among them, only two overlapped between Pakistani and Chinese accessions underscoring the divergent domestication and improvement targets in each country. Loci associated with resistance alleles to leaf curl disease and high temperature were largely found in Pakistani accessions to counter these stresses prevalent in Pakistan. These results revealed that breeding activities led to the accumulation of unique alleles and helped upland cotton become adapted to the respective climatic conditions, which will contribute to elucidating the genetic mechanisms that underlie resilience traits and help develop climate-resilient cotton cultivars for use worldwide.


Subject(s)
Gossypium , Polymorphism, Single Nucleotide , Gossypium/genetics , Pakistan , China , Disease Resistance/genetics , Plant Diseases/genetics , INDEL Mutation , Adaptation, Physiological/genetics , Genome, Plant , Alleles , Plant Breeding , Cotton Fiber , Phenotype
18.
Cancer Invest ; 42(5): 390-399, 2024 May.
Article in English | MEDLINE | ID: mdl-38773925

ABSTRACT

Evaluation of the test performance of the Target enhanced whole-genome sequencing (TE-WGS) assay for comprehensive oncology genomic profiling. The analytical validation of the assay included sensitivity and specificity for single nucleotide variants (SNVs), insertions/deletions (indels), and structural variants (SVs), revealing a revealed a sensitivity of 99.8% for SNVs and 99.2% for indels. The positive predictive value (PPV) was 99.3% SNVs and 98.7% indels. Clinical validation was benchmarked against established orthogonal methods and demonstrated high concordance with reference methods. TE-WGS provides insights beyond targeted panels by comprehensive analysis of key biomarkers and the entire genome encompassing both germline and somatic findings.


Subject(s)
Genomics , INDEL Mutation , Whole Genome Sequencing , Humans , Whole Genome Sequencing/methods , Genomics/methods , Polymorphism, Single Nucleotide , Neoplasms/genetics , Female , Male , Genome, Human , Middle Aged , Sensitivity and Specificity , High-Throughput Nucleotide Sequencing/methods , Aged , Adult , Reproducibility of Results
19.
Microbiol Spectr ; 12(7): e0425923, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38757975

ABSTRACT

Currently, tuberculosis immunoprophylaxis is based solely on Bacillus Calmette-Guérin (BCG) vaccination, and some of the new potential tuberculosis vaccines are based on the BCG genome. Therefore, it is reasonable to analyze the genomes of individual BCG substrains. The aim of this study was the genetic characterization of the BCG-Moreau Polish (PL) strain used for the production of the BCG vaccine in Poland since 1955. Sequencing of different BCG lots showed that the strain was stable over a period of 59 years. As a result of comparison, BCG-Moreau PL with BCG-Moreau Rio de Janeiro (RDJ) 143 single nucleotide polymorphisms (SNPs) and 32 insertion/deletion mutations (INDELs) were identified. However, the verification of these mutations showed that the most significant were accumulated in the BCG-Moreau RDJ genome. The mutations unique to the Polish strain genome are 1 SNP and 2 INDEL. The strategy of combining short-read sequencing with long-read sequencing is currently the most optimal approach for sequencing bacterial genomes. With this approach, the only available genomic sequence of BCG-Moreau PL was obtained. This sequence will primarily be a reference point in the genetic control of the stability of the vaccine strain in the future. The results enrich knowledge about the microevolution and attenuation of the BCG vaccine substrains. IMPORTANCE: The whole genome sequence obtained is the only genomic sequence of the strain that has been used for vaccine production in Poland since 1955. Sequencing of different BCG lots showed that the strain was stable over a period of 59 years. The comprehensive genomic analysis performed not only enriches knowledge about the microevolution and attenuation of the BCG vaccine substrains but also enables the utilization of identified markers as a reference point in the genetic control and identity tests of the stability of the vaccine strain in the future.


Subject(s)
BCG Vaccine , Genome, Bacterial , Mycobacterium bovis , Polymorphism, Single Nucleotide , Whole Genome Sequencing , BCG Vaccine/genetics , BCG Vaccine/immunology , Mycobacterium bovis/genetics , Mycobacterium bovis/classification , Poland , Humans , Tuberculosis/prevention & control , Tuberculosis/microbiology , INDEL Mutation , Mutation
20.
Genome Biol Evol ; 16(5)2024 05 02.
Article in English | MEDLINE | ID: mdl-38735759

ABSTRACT

A fundamental goal in evolutionary biology and population genetics is to understand how selection shapes the fate of new mutations. Here, we test the null hypothesis that insertion-deletion (indel) events in protein-coding regions occur randomly with respect to secondary structures. We identified indels across 11,444 sequence alignments in mouse, rat, human, chimp, and dog genomes and then quantified their overlap with four different types of secondary structure-alpha helices, beta strands, protein bends, and protein turns-predicted by deep-learning methods of AlphaFold2. Indels overlapped secondary structures 54% as much as expected and were especially underrepresented over beta strands, which tend to form internal, stable regions of proteins. In contrast, indels were enriched by 155% over regions without any predicted secondary structures. These skews were stronger in the rodent lineages compared to the primate lineages, consistent with population genetic theory predicting that natural selection will be more efficient in species with larger effective population sizes. Nonsynonymous substitutions were also less common in regions of protein secondary structure, although not as strongly reduced as in indels. In a complementary analysis of thousands of human genomes, we showed that indels overlapping secondary structure segregated at significantly lower frequency than indels outside of secondary structure. Taken together, our study shows that indels are selected against if they overlap secondary structure, presumably because they disrupt the tertiary structure and function of a protein.


Subject(s)
INDEL Mutation , Protein Structure, Secondary , Humans , Animals , Mice , Rats , Evolution, Molecular , Proteins/genetics , Proteins/chemistry , Dogs , Selection, Genetic , Genome
SELECTION OF CITATIONS
SEARCH DETAIL