ABSTRACT
Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions <2 kbp form by twin-priming during L1 retrotransposition; 80% of the larger inversions are balanced and affect twice as many nucleotides as CNVs. Balanced inversions show an excess of common variants, and 72% are flanked by segmental duplications (SDs) or retrotransposons. Since flanking repeats promote non-allelic homologous recombination, we developed complementary approaches to identify recurrent inversion formation. We describe 40 recurrent inversions encompassing 0.6% of the genome, showing inversion rates up to 2.7 × 10-4 per locus per generation. Recurrent inversions exhibit a sex-chromosomal bias and co-localize with genomic disorder critical regions. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes specific haplotypes to disease-causing CNVs.
Subject(s)
Chromosome Inversion , Segmental Duplications, Genomic , Chromosome Inversion/genetics , DNA Copy Number Variations/genetics , Genome, Human , Genomics , HumansABSTRACT
De novo copy number variants (dnCNVs) arising at multiple loci in a personal genome have usually been considered to reflect cancer somatic genomic instabilities. We describe a multiple dnCNV (MdnCNV) phenomenon in which individuals with genomic disorders carry five to ten constitutional dnCNVs. These CNVs originate from independent formation incidences, are predominantly tandem duplications or complex gains, exhibit breakpoint junction features reminiscent of replicative repair, and show increased de novo point mutations flanking the rearrangement junctions. The active CNV mutation shower appears to be restricted to a transient perizygotic period. We propose that a defect in the CNV formation process is responsible for the "CNV-mutator state," and this state is dampened after early embryogenesis. The constitutional MdnCNV phenomenon resembles chromosomal instability in various cancers. Investigations of this phenomenon may provide unique access to understanding genomic disorders, structural variant mutagenesis, human evolution, and cancer biology.
Subject(s)
Chromosome Aberrations , DNA Copy Number Variations , Genetic Diseases, Inborn/embryology , Genetic Diseases, Inborn/genetics , Genomic Instability , Mutation , Chromosome Breakpoints , Chromosome Duplication , DNA Replication , Embryonic Development , Female , Gametogenesis , Humans , MaleABSTRACT
Copy number variants (CNVs) are significant contributors to the pathogenicity of rare genetic diseases and, with new innovative methods, can now reliably be identified from exome sequencing. Challenges still remain in accurate classification of CNV pathogenicity. CNV calling using GATK-gCNV was performed on exomes from a cohort of 6,633 families (15,759 individuals) with heterogeneous phenotypes and variable prior genetic testing collected at the Broad Institute Center for Mendelian Genomics of the Genomics Research to Elucidate the Genetics of Rare Diseases consortium and analyzed using the seqr platform. The addition of CNV detection to exome analysis identified causal CNVs for 171 families (2.6%). The estimated sizes of CNVs ranged from 293 bp to 80 Mb. The causal CNVs consisted of 140 deletions, 15 duplications, 3 suspected complex structural variants (SVs), 3 insertions, and 10 complex SVs, the latter two groups being identified by orthogonal confirmation methods. To classify CNV variant pathogenicity, we used the 2020 American College of Medical Genetics and Genomics/ClinGen CNV interpretation standards and developed additional criteria to evaluate allelic and functional data as well as variants on the X chromosome to further advance the framework. We interpreted 151 CNVs as likely pathogenic/pathogenic and 20 CNVs as high-interest variants of uncertain significance. Calling CNVs from existing exome data increases the diagnostic yield for individuals undiagnosed after standard testing approaches, providing a higher-resolution alternative to arrays at a fraction of the cost of genome sequencing. Our improvements to the classification approach advances the systematic framework to assess the pathogenicity of CNVs.
Subject(s)
DNA Copy Number Variations , Exome Sequencing , Exome , Rare Diseases , Humans , DNA Copy Number Variations/genetics , Rare Diseases/genetics , Rare Diseases/diagnosis , Exome/genetics , Male , Female , Cohort Studies , Genetic Testing/methodsABSTRACT
Recurrent genomic rearrangements at 16p11.2 BP4-5 represent one of the most common causes of genomic disorders. Originally associated with increased risk for autism spectrum disorder, schizophrenia, and intellectual disability, as well as adiposity and head circumference, these CNVs have since been associated with a plethora of phenotypic alterations, albeit with high variability in expressivity and incomplete penetrance. Here, we comprehensively review the pleiotropy associated with 16p11.2 BP4-5 rearrangements to shine light on its full phenotypic spectrum. Illustrating this phenotypic heterogeneity, we expose many parallels between findings gathered from clinical versus population-based cohorts, which often point to the same physiological systems, and emphasize the role of the CNV beyond neuropsychiatric and anthropometric traits. Revealing the complex and variable clinical manifestations of this CNV is crucial for accurate diagnosis and personalized treatment strategies for carrier individuals. Furthermore, we discuss areas of research that will be key to identifying factors contributing to phenotypic heterogeneity and gaining mechanistic insights into the molecular pathways underlying observed associations, while demonstrating how diversity in affected individuals, cohorts, experimental models, and analytical approaches can catalyze discoveries.
ABSTRACT
While extensively studied in clinical cohorts, the phenotypic consequences of 22q11.2 copy-number variants (CNVs) in the general population remain understudied. To address this gap, we performed a phenome-wide association scan in 405,324 unrelated UK Biobank (UKBB) participants by using CNV calls from genotyping array. We mapped 236 Human Phenotype Ontology terms linked to any of the 90 genes encompassed by the region to 170 UKBB traits and assessed the association between these traits and the copy-number state of 504 genotyping array probes in the region. We found significant associations for eight continuous and nine binary traits associated under different models (duplication-only, deletion-only, U-shape, and mirror models). The causal effect of the expression level of 22q11.2 genes on associated traits was assessed through transcriptome-wide Mendelian randomization (TWMR), revealing that increased expression of ARVCF increased BMI. Similarly, increased DGCR6 expression causally reduced mean platelet volume, in line with the corresponding CNV effect. Furthermore, cross-trait multivariable Mendelian randomization (MVMR) suggested a predominant role of genuine (horizontal) pleiotropy in the CNV region. Our findings show that within the general population, 22q11.2 CNVs are associated with traits previously linked to genes in the region, and duplications and deletions act upon traits in different fashions. We also showed that gain or loss of distinct segments within 22q11.2 may impact a trait under different association models. Our results have provided new insights to help further the understanding of the complex 22q11.2 region.
Subject(s)
DNA Copy Number Variations , Phenomics , Humans , DNA Copy Number Variations/genetics , Phenotype , Chromosomes, Human, Pair 22ABSTRACT
Copy number variations (CNVs) play pivotal roles in disease susceptibility and have been intensively investigated in human disease studies. Long-read sequencing technologies offer opportunities for comprehensive structural variation (SV) detection, and numerous methodologies have been developed recently. Consequently, there is a pressing need to assess these methods and aid researchers in selecting appropriate techniques for CNV detection using long-read sequencing. Hence, we conducted an evaluation of eight CNV calling methods across 22 datasets from nine publicly available samples and 15 simulated datasets, covering multiple sequencing platforms. The overall performance of CNV callers varied substantially and was influenced by the input dataset type, sequencing depth, and CNV type, among others. Specifically, the PacBio CCS sequencing platform outperformed PacBio CLR and Nanopore platforms regarding CNV detection recall rates. A sequencing depth of 10x demonstrated the capability to identify 85% of the CNVs detected in a 50x dataset. Moreover, deletions were more generally detectable than duplications. Among the eight benchmarked methods, cuteSV, Delly, pbsv, and Sniffles2 demonstrated superior accuracy, while SVIM exhibited high recall rates.
Subject(s)
Algorithms , DNA Copy Number Variations , High-Throughput Nucleotide Sequencing , Humans , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Computational Biology/methods , Genome, HumanABSTRACT
The human microbiota affects critical cellular functions, although the responsible mechanism(s) is still poorly understood. In this regard, we previously showed that Mycoplasma fermentans DnaK, an HSP70 chaperone protein, hampers the activity of important cellular proteins responsible for DNA integrity. Here, we describe a novel DnaK knock-in mouse model generated in our laboratory to study the effect of M. fermentans DnaK expression in vivo. By using an array-based comparative genomic hybridization assay, we demonstrate that exposure to DnaK was associated with a higher number of DNA copy number variants (CNVs) indicative of unbalanced chromosomal alterations, together with reduced fertility and a high rate of fetal abnormalities. Consistent with their implication in genetic disorders, one of these CNVs caused a homozygous Grid2 deletion, resulting in an aberrant ataxic phenotype that recapitulates the extensive biallelic deletion in the Grid2 gene classified in humans as autosomal recessive spinocerebellar ataxia 18. Our data highlight a connection between components of the human urogenital tract microbiota, namely Mycoplasmas, and genetic abnormalities in the form of DNA CNVs, with obvious relevant medical, diagnostic, and therapeutic implications.
Subject(s)
DNA Copy Number Variations , Mycoplasma Infections , Mycoplasma fermentans/genetics , Homozygote , Mycoplasma Infections/genetics , Mycoplasma Infections/metabolism , Animals , Mice , Mice, Inbred C57BLABSTRACT
Recent studies suggest that time estimation relies on bodily rhythms and interoceptive signals. We provide the first direct electrophysiological evidence suggesting an association between the brain's processing of heartbeat and duration judgment. We examined heartbeat-evoked potential (HEP) and contingent negative variation (CNV) during an auditory duration-reproduction task and a control reaction-time task spanning 4, 8, and 12â s intervals, in both male and female participants. Interoceptive awareness was assessed with the Self-Awareness Questionnaire (SAQ) and interoceptive accuracy through the heartbeat-counting task (HCT). Results revealed that SAQ scores, but not the HCT, correlated with mean reproduced durations with higher SAQ scores associating with longer and more accurate duration reproductions. Notably, the HEP amplitude changes during the encoding phase of the timing task, particularly within 130-270â ms (HEP1) and 470-520â ms (HEP2) after the R-peak, demonstrated interval-specific modulations that did not emerge in the control task. A significant ramp-like increase in HEP2 amplitudes occurred during the duration-encoding phase of the timing but not during the control task. This increase within the reproduction phase of the timing task correlated significantly with the reproduced durations for the 8â s and the 4â s intervals. The larger the increase in HEP2, the greater the under-reproduction of the estimated duration. CNV components during the encoding phase of the timing task were more negative than those in the reaction-time task, suggesting greater executive resources orientation toward time. We conclude that interoceptive awareness (SAQ) and cortical responses to heartbeats (HEP) predict duration reproductions, emphasizing the embodied nature of time.
Subject(s)
Brain , Electroencephalography , Heart Rate , Interoception , Time Perception , Humans , Male , Female , Time Perception/physiology , Heart Rate/physiology , Young Adult , Adult , Interoception/physiology , Brain/physiology , Awareness/physiology , Evoked Potentials/physiology , Reaction Time/physiology , Contingent Negative Variation/physiologyABSTRACT
Structural variation (SV) is a large difference (typically >100 bp) in the genomic structure of two genomes and includes both copy number variation and variation that does not change copy number of a genomic region, such as an inversion. Improved reference genomes, combined with widespread genome sequencing using short-read sequencing technology, and increasingly using long-read sequencing, have reignited interest in SV. Recent large-scale studies and functional focused analyses have highlighted the role of SV in human evolution. In this review, we highlight human-specific SVs involved in changes in the brain, population-specific SVs that affect response to the environment, including adaptation to diet and infectious diseases, and summarise the contribution of archaic hominin admixture to present-day human SV.
Subject(s)
DNA Copy Number Variations , Hominidae , Animals , DNA Copy Number Variations/genetics , Genome , Genome, Human/genetics , Genomic Structural Variation/genetics , Genomics , Hominidae/genetics , Humans , Sequence Analysis, DNAABSTRACT
The human genome contains tens of thousands of large tandem repeats and hundreds of genes that show common and highly variable copy-number changes. Due to their large size and repetitive nature, these variable number tandem repeats (VNTRs) and multicopy genes are generally recalcitrant to standard genotyping approaches and, as a result, this class of variation is poorly characterized. However, several recent studies have demonstrated that copy-number variation of VNTRs can modify local gene expression, epigenetics, and human traits, indicating that many have a functional role. Here, using read depth from whole-genome sequencing to profile copy number, we report results of a phenome-wide association study (PheWAS) of VNTRs and multicopy genes in a discovery cohort of â¼35,000 samples, identifying 32 traits associated with copy number of 38 VNTRs and multicopy genes at 1% FDR. We replicated many of these signals in an independent cohort and observed that VNTRs showing trait associations were significantly enriched for expression QTLs with nearby genes, providing strong support for our results. Fine-mapping studies indicated that in the majority (â¼90%) of cases, the VNTRs and multicopy genes we identified represent the causal variants underlying the observed associations. Furthermore, several lie in regions where prior SNV-based GWASs have failed to identify any significant associations with these traits. Our study indicates that copy number of VNTRs and multicopy genes contributes to diverse human traits and suggests that complex structural variants potentially explain some of the so-called "missing heritability" of SNV-based GWASs.
Subject(s)
DNA Copy Number Variations , Minisatellite Repeats , DNA Copy Number Variations/genetics , Genome, Human , Genome-Wide Association Study , Humans , Minisatellite Repeats/genetics , PhenotypeABSTRACT
The impact of copy-number variations (CNVs) on complex human traits remains understudied. We called CNVs in 331,522 UK Biobank participants and performed genome-wide association studies (GWASs) between the copy number of CNV-proxy probes and 57 continuous traits, revealing 131 signals spanning 47 phenotypes. Our analysis recapitulated well-known associations (e.g., 1q21 and height), revealed the pleiotropy of recurrent CNVs (e.g., 26 and 16 traits for 16p11.2-BP4-BP5 and 22q11.21, respectively), and suggested gene functionalities (e.g., MARF1 in female reproduction). Forty-eight CNV signals (38%) overlapped with single-nucleotide polymorphism (SNP)-GWASs signals for the same trait. For instance, deletion of PDZK1, which encodes a urate transporter scaffold protein, decreased serum urate levels, while deletion of RHD, which encodes the Rhesus blood group D antigen, associated with hematological traits. Other signals overlapped Mendelian disorder regions, suggesting variable expressivity and broad impact of these loci, as illustrated by signals mapping to Rotor syndrome (SLCO1B1/3), renal cysts and diabetes syndrome (HNF1B), or Charcot-Marie-Tooth (PMP22) loci. Total CNV burden negatively impacted 35 traits, leading to increased adiposity, liver/kidney damage, and decreased intelligence and physical capacity. Thirty traits remained burden associated after correcting for CNV-GWAS signals, pointing to a polygenic CNV architecture. The burden negatively correlated with socio-economic indicators, parental lifespan, and age (survivorship proxy), suggesting a contribution to decreased longevity. Together, our results showcase how studying CNVs can expand biological insights, emphasizing the critical role of this mutational class in shaping human traits and arguing in favor of a continuum between Mendelian and complex diseases.
Subject(s)
DNA Copy Number Variations , Genome-Wide Association Study , DNA Copy Number Variations/genetics , Female , Humans , Liver-Specific Organic Anion Transporter 1 , Multifactorial Inheritance , Phenotype , Polymorphism, Single Nucleotide/geneticsABSTRACT
The leucine-rich glioma-inactivated (LGI) family consists of four highly conserved paralogous genes, LGI1-4, that are highly expressed in mammalian central and/or peripheral nervous systems. LGI1 antibodies are detected in subjects with autoimmune limbic encephalitis and peripheral nerve hyperexcitability syndromes (PNHSs) such as Isaacs and Morvan syndromes. Pathogenic variations of LGI1 and LGI4 are associated with neurological disorders as disease traits including familial temporal lobe epilepsy and neurogenic arthrogryposis multiplex congenita 1 with myelin defects, respectively. No human disease has been reported associated with either LGI2 or LGI3. We implemented exome sequencing and family-based genomics to identify individuals with deleterious variants in LGI3 and utilized GeneMatcher to connect practitioners and researchers worldwide to investigate the clinical and electrophysiological phenotype in affected subjects. We also generated Lgi3-null mice and performed peripheral nerve dissection and immunohistochemistry to examine the juxtaparanode LGI3 microarchitecture. As a result, we identified 16 individuals from eight unrelated families with loss-of-function (LoF) bi-allelic variants in LGI3. Deep phenotypic characterization showed LGI3 LoF causes a potentially clinically recognizable PNHS trait characterized by global developmental delay, intellectual disability, distal deformities with diminished reflexes, visible facial myokymia, and distinctive electromyographic features suggestive of motor nerve instability. Lgi3-null mice showed reduced and mis-localized Kv1 channel complexes in myelinated peripheral axons. Our data demonstrate bi-allelic LoF variants in LGI3 cause a clinically distinguishable disease trait of PNHS, most likely caused by disturbed Kv1 channel distribution in the absence of LGI3.
Subject(s)
Myokymia , Nerve Tissue Proteins , Animals , Autoantibodies , Axons , Genomics , Humans , Intracellular Signaling Peptides and Proteins/genetics , Mammals/genetics , Mice , Nerve Tissue Proteins/genetics , Phenotype , Reverse GeneticsABSTRACT
Overexpression of vesicular stomatitis virus G protein (VSV-G) elevates the secretion of EVs known as gectosomes, which contain VSV-G. Such vesicles can be engineered to deliver therapeutic macromolecules. We investigated viral glycoproteins from several viruses for their potential in gectosome production and intracellular cargo delivery. Expression of the viral glycoprotein (viral glycoprotein from the Chandipura virus [CNV-G]) from the human neurotropic pathogen Chandipura virus in 293T cells significantly augments the production of CNV-G-containing gectosomes. In comparison with VSV-G gectosomes, CNV-G gectosomes exhibit heightened selectivity toward specific cell types, including primary cells and tumor cell lines. Consistent with the differential tropism between CNV-G and VSV-G gectosomes, cellular entry of CNV-G gectosome is independent of the Low-density lipoprotein receptor, which is essential for VSV-G entry, and shows varying sensitivity to pharmacological modulators. CNV-G gectosomes efficiently deliver diverse intracellular cargos for genomic modification or responses to stimuli in vitro and in the brain of mice in vivo utilizing a split GFP and chemical-induced dimerization system. Pharmacokinetics and biodistribution analyses support CNV-G gectosomes as a versatile platform for delivering macromolecular therapeutics intracellularly.
Subject(s)
Vesiculovirus , Animals , Humans , Mice , Vesiculovirus/genetics , Vesiculovirus/metabolism , Extracellular Vesicles/metabolism , Viral Envelope Proteins/metabolism , Viral Envelope Proteins/genetics , Glycoproteins/metabolism , Glycoproteins/genetics , HEK293 Cells , Viral Proteins/metabolism , Viral Proteins/genetics , Membrane Glycoproteins/metabolism , Membrane Glycoproteins/genetics , Drug Delivery Systems/methods , Cell Line, TumorABSTRACT
BACKGROUND: White clover (Trifolium repens) is a globally important perennial forage legume. This species also serves as an eco-evolutionary model system for studying within-species chemical defense variation; it features a well-studied polymorphism for cyanogenesis (HCN release following tissue damage), with higher frequencies of cyanogenic plants favored in warmer locations worldwide. Using a newly generated haplotype-resolved genome and two other long-read assemblies, we tested the hypothesis that copy number variants (CNVs) at cyanogenesis genes play a role in the ability of white clover to rapidly adapt to local environments. We also examined questions on subgenome evolution in this recently evolved allotetraploid species and on chromosomal rearrangements in the broader IRLC legume clade. RESULTS: Integration of PacBio HiFi, Omni-C, Illumina, and linkage map data yielded a completely de novo genome assembly for white clover (created without a priori sequence assignment to subgenomes). We find that white clover has undergone extensive transposon diversification since its origin but otherwise shows highly conserved genome organization and composition with its diploid progenitors. Unlike some other clover species, its chromosomal structure is conserved with other IRLC legumes. We further find extensive evidence of CNVs at the major cyanogenesis loci; these contribute to quantitative variation in the cyanogenic phenotype and to local adaptation across wild North American populations. CONCLUSIONS: This work provides a case study documenting the role of CNVs in local adaptation in a plant species, and it highlights the value of pan-genome data for identifying contributions of structural variants to adaptation in nature.
Subject(s)
DNA Copy Number Variations , Genome, Plant , Trifolium , Adaptation, Physiological/genetics , Trifolium/geneticsABSTRACT
Choroidal neovascularization (CNV) represents a hallmark of neovascular fundus diseases, including age-related macular degeneration and diabetic retinopathy. Traditional eyedrops have encountered formidable challenges in treating CNV, primarily due to their extremely poor intraocular bioavailability and potential adverse off-target effects. Herein, an ocular-permeable supramolecular prodrug eyedrop (Di-DAS/P-PCD) has been developed for the on-demand delivery of antiangiogenic agents in the oxidative microenvironment of CNV. The eyedrop nanoformulation is composed of cell-penetrating peptide-modified PEGylated cyclodextrin (P-PCD) and reactive oxygen species (ROS)-sensitive antiangiogenic dasatinib prodrug Di-DAS. In a laser-induced CNV mouse model, daily instillation of Di-DAS/P-PCD has achieved remarkable penetration into the choroid and significantly suppressed CNV growth while exhibiting a good biocompatibility profile. Our results highlight the potential of the supramolecular prodrug eyedrops as a versatile approach for the targeted treatment of CNV and other neovascular eye disorders.
ABSTRACT
Recent evidence challenges the belief that individuals who are Duffy-negative are resistant to Plasmodium vivax due to lacking the Duffy antigen receptor for chemokines. Erythrocyte-binding protein (EBP/DBP2) has shown moderate binding to Duffy-negative erythrocytes in vitro. Reticulocyte-binding protein 2b (RBP2b) interactions with transferrin receptor 1 suggest involvement in Duffy-negative infections. Gene copy number variations in PvDBP1, PvEBP/DBP2, and PvRBP2b were investigated in Duffy-positive and Duffy-negative P vivax infections from Ethiopia. Among Duffy-positive samples, 34% displayed PvDBP1 duplications (Cambodian type). In Duffy-negative infections, 30% showed duplications, mostly Cambodian type. For PvEBP/DBP2 and PvRBP2b, Duffy-positive samples exhibited higher duplication rates (1-8 copies for PvEBP/DBP2, 46%; 1-5 copies for PvRBP2b, 43%) as compared with Duffy-negative samples (20.8% and 26%, respectively). The range of copy number variations was lower in Duffy-negative infections. Demographic and clinical factors associated with gene multiplications in both Duffy types were explored, enhancing understanding of P vivax evolution in Africans who are Duffy negative.
Subject(s)
DNA Copy Number Variations , Duffy Blood-Group System , Malaria, Vivax , Plasmodium vivax , Protozoan Proteins , Receptors, Cell Surface , Humans , Plasmodium vivax/genetics , Duffy Blood-Group System/genetics , Malaria, Vivax/parasitology , Malaria, Vivax/genetics , Protozoan Proteins/genetics , Ethiopia/epidemiology , Receptors, Cell Surface/genetics , Female , Male , Adult , Adolescent , Young Adult , Middle Aged , Child , Antigens, Protozoan/genetics , Child, Preschool , East African People , Membrane ProteinsABSTRACT
BACKGROUND: Structural variations play an important role in bacterial genomes. They can mediate genome adaptation quickly in response to the external environment and thus can also play a role in antibiotic resistance. The detection of structural variations in bacteria is challenging, and the recognition of even small rearrangements can be important. Even though most detection tools are aimed at and benchmarked on eukaryotic genomes, they can also be used on prokaryotic genomes. The key features of detection are the ability to detect small rearrangements and support haploid genomes. Because of the limiting performance of a single detection tool, combining the detection abilities of multiple tools can lead to more robust results. There are already available workflows for structural variation detection for long-reads technologies and for the detection of single-nucleotide variation and indels, both aimed at bacteria. Yet we are unaware of structural variations detection workflows for the short-reads sequencing platform. Motivated by this gap we created our workflow. Further, we were interested in increasing the detection performance and providing more robust results. RESULTS: We developed an open-source bioinformatics pipeline, ProcaryaSV, for the detection of structural variations in bacterial isolates from paired-end short sequencing reads. Multiple tools, starting with quality control and trimming of sequencing data, alignment to the reference genome, and multiple structural variation detection tools, are integrated. All the partial results are then processed and merged with an in-house merging algorithm. Compared with a single detection approach, ProcaryaSV has improved detection performance and is a reproducible easy-to-use tool. CONCLUSIONS: The ProcaryaSV pipeline provides an integrative approach to structural variation detection from paired-end next-generation sequencing of bacterial samples. It can be easily installed and used on Linux machines. It is publicly available on GitHub at https://github.com/robinjugas/ProcaryaSV .
Subject(s)
Genome, Bacterial , High-Throughput Nucleotide Sequencing , Software , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Bacteria/geneticsABSTRACT
BACKGROUND: The majority of CAKUT-associated CNVs overlap at least one miRNA gene, thus affecting the cellular levels of the corresponding miRNA. We aimed to investigate the potency of restitution of CNV-affected miRNA levels to remediate the dysregulated expression of target genes involved in kidney physiology and development in vitro. METHODS: Heterozygous MIR484 knockout HEK293 and homozygous MIR185 knockout HEK293 cell lines were used as models depicting the deletion of the frequently affected miRNA genes by CAKUT-associated CNVs. After treatment with the corresponding miRNA mimics, the levels of the target genes have been compared to the non-targeting control treatment. For both investigated miRNAs, MDM2 and PKD1 were evaluated as common targets, while additional 3 genes were investigated as targets of each individual miRNA (NOTCH3, FIS1 and APAF1 as hsa-miR-484 targets and RHOA, ATF6 and CDC42 as hsa-miR-185-5p targets). RESULTS: Restitution of the corresponding miRNA levels in both knockout cell lines has induced a change in the mRNA levels of certain candidate target genes, thus confirming the potential to alleviate the CNV effect on miRNA expression. Intriguingly, HEK293 WT treatment with investigated miRNA mimics has triggered a more pronounced effect, thus suggesting the importance of miRNA interplay in different genomic contexts. CONCLUSIONS: Dysregulation of multiple mRNA targets mediated by CNV-affected miRNAs could represent the underlying mechanism behind the unresolved CAKUT occurrence and phenotypic variability observed in CAKUT patients. Characterizing miRNAs located in CNVs and their potential to become molecular targets could eventually help in understanding and improving the management of CAKUT.
Subject(s)
MicroRNAs , Urogenital Abnormalities , Vesico-Ureteral Reflux , Humans , Down-Regulation , HEK293 Cells , MicroRNAs/genetics , MicroRNAs/metabolism , RNA, MessengerABSTRACT
Conifers are long-lived and slow-evolving, thus requiring effective defences against their fast-evolving insect natural enemies. The copy number variation (CNV) of two key acetophenone biosynthesis genes Ugt5/Ugt5b and ßglu-1 may provide a plausible mechanism underlying the constitutively variable defence in white spruce (Picea glauca) against its primary defoliator, spruce budworm. This study develops a long-insert sequence capture probe set (Picea_hung_p1.0) for quantifying copy number of ßglu-1-like, Ugt5-like genes and single-copy genes on 38 Norway spruce (Picea abies) and 40 P. glauca individuals from eight and nine provenances across Europe and North America respectively. We developed local assemblies (Piabi_c1.0 and Pigla_c.1.0), full-length transcriptomes (PIAB_v1 and PIGL_v1), and gene models to characterise the diversity of ßglu-1 and Ugt5 genes. We observed very large copy numbers of ßglu-1, with up to 381 copies in a single P. glauca individual. We observed among-provenance CNV of ßglu-1 in P. glauca but not P. abies. Ugt5b was predominantly single-copy in both species. This study generates critical hypotheses for testing the emergence and mechanism of extreme CNV, the dosage effect on phenotype, and the varying copy number of genes with the same pathway. We demonstrate new approaches to overcome experimental challenges in genomic research in conifer defences.
Subject(s)
Picea , Humans , Picea/genetics , Picea/metabolism , DNA Copy Number Variations , beta-Glucosidase/genetics , Genomics , TranscriptomeABSTRACT
Large-scale copy number variants (CNVs) are structural alterations in the genome that involve the duplication or deletion of DNA segments, contributing to genetic diversity and playing a crucial role in the evolution and development of various diseases and disorders, as they can lead to the dosage imbalance of one or more genes. Massively parallel sequencing (MPS) has revolutionized the field of genetic analysis and contributed significantly to routine clinical diagnosis and screening. It offers a precise method for detecting CNVs with exceptional accuracy. In this context, a non-invasive prenatal test (NIPT) based on the sequencing of cell-free DNA (cfDNA) from pregnant women's plasma using a low-coverage whole genome MPS (WGS) approach represents a valuable source for population studies. Here, we analyzed genomic data of 12,732 pregnant women from the Slovak (9,230), Czech (1,583), and Hungarian (1,919) populations. We identified 5,062 CNVs ranging from 200 kbp and described their basic characteristics and differences between the subject populations. Our results suggest that re-analysis of sequencing data from routine WGS assays has the potential to obtain large-scale CNV population frequencies, which are not well known and may provide valuable information to support the classification and interpretation of this type of genetic variation. Furthermore, this could contribute to expanding knowledge about the central European genome without investing in additional laboratory work, as NIPTs are a relatively widely used screening method.