Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 740
Filter
1.
Life Sci Alliance ; 7(11)2024 Nov.
Article in English | MEDLINE | ID: mdl-39117454

ABSTRACT

Structural variants (SVs) of unknown significance are great challenges for prenatal risk assessment, especially when involving dose-sensitive genes such as DMD The pathogenicities of 5'-terminal DMD duplications in the database remain controversial. Four prenatal cases with Xp21.1 duplications were identified by routine prenatal genomic testing, encompassing the 5'-UTR to exons 1-2 in family 1 and family 2, and to exons 1-9 in family 3. The duplication in family 4 was non-contiguous covering the 5'-UTR to exon 1 and exons 3-7. All were traced to unaffected males in the family pedigrees. A new genome-wide approach of optical genome mapping was performed in families 1, 2, and 3 to delineate the breakpoints and orientation of the duplicated fragments. The extra copies were tandemly inserted into the upstream of DMD, preserving the integrity of ORF from the second copy. The pathogenicities were thus reclassified as likely benign. Our data highlight the importance of structural delineation by optical genome mapping in prenatal risk assessment of incidentally identified SVs involving DMD and other similar large dose-sensitive genes.


Subject(s)
Dystrophin , Pedigree , Humans , Female , Male , Risk Assessment/methods , Pregnancy , Dystrophin/genetics , Prenatal Diagnosis/methods , Chromosome Mapping/methods , Chromosomes, Human, X/genetics , Muscular Dystrophy, Duchenne/genetics , Muscular Dystrophy, Duchenne/diagnosis , Chromosome Duplication/genetics , Exons/genetics , Gene Duplication/genetics , Adult
2.
Clin Epigenetics ; 16(1): 101, 2024 Aug 02.
Article in English | MEDLINE | ID: mdl-39095842

ABSTRACT

Adaptive nanopore sequencing as a diagnostic method for imprinting disorders and episignature analysis revealed an intragenic duplication of Exon 6 and 7 in UBE3A (NM_000462.5) in a patient with relatively mild Angelman-like syndrome. In an all-in-one nanopore sequencing analysis DNA hypomethylation of the SNURF:TSS-DMR, known contributing deletions on the maternal allele and point mutations in UBE3A could be ruled out as disease drivers. In contrast, breakpoints and orientation of the tandem duplication could clearly be defined. Segregation analysis in the family showed that the duplication derived de novo in the maternal grandfather. Our study shows the benefits of an all-in-one nanopore sequencing approach for the diagnostics of Angelman syndrome and other imprinting disorders.


Subject(s)
Angelman Syndrome , DNA Methylation , Gene Duplication , Nanopore Sequencing , Ubiquitin-Protein Ligases , Humans , Angelman Syndrome/genetics , Angelman Syndrome/diagnosis , Ubiquitin-Protein Ligases/genetics , Nanopore Sequencing/methods , DNA Methylation/genetics , Female , Gene Duplication/genetics , Male , Exons/genetics , Pedigree , Genomic Imprinting/genetics
3.
Cell Genom ; 4(7): 100590, 2024 Jul 10.
Article in English | MEDLINE | ID: mdl-38908378

ABSTRACT

The duplication-triplication/inverted-duplication (DUP-TRP/INV-DUP) structure is a complex genomic rearrangement (CGR). Although it has been identified as an important pathogenic DNA mutation signature in genomic disorders and cancer genomes, its architecture remains unresolved. Here, we studied the genomic architecture of DUP-TRP/INV-DUP by investigating the DNA of 24 patients identified by array comparative genomic hybridization (aCGH) on whom we found evidence for the existence of 4 out of 4 predicted structural variant (SV) haplotypes. Using a combination of short-read genome sequencing (GS), long-read GS, optical genome mapping, and single-cell DNA template strand sequencing (strand-seq), the haplotype structure was resolved in 18 samples. The point of template switching in 4 samples was shown to be a segment of ∼2.2-5.5 kb of 100% nucleotide similarity within inverted repeat pairs. These data provide experimental evidence that inverted low-copy repeats act as recombinant substrates. This type of CGR can result in multiple conformers generating diverse SV haplotypes in susceptible dosage-sensitive loci.


Subject(s)
Haplotypes , Humans , Haplotypes/genetics , Comparative Genomic Hybridization , Genomic Structural Variation/genetics , Genome, Human/genetics , Gene Duplication/genetics
4.
Trends Genet ; 40(7): 621-631, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38637269

ABSTRACT

Whole-genome duplications (WGDs) are widespread genomic events in eukaryotes that are hypothesized to contribute to the evolutionary success of many lineages, including flowering plants, Saccharomyces yeast, and vertebrates. WGDs generally can be classified into autopolyploids (ploidy increase descended from one species) or allopolyploids (ploidy increase descended from multiple species). Assignment of allopolyploid progenitor species (called subgenomes in the polyploid) is important to understanding the biology and evolution of polyploids, including the asymmetric subgenome evolution following hybridization (biased fractionation). Here, I review the different methodologies used to identify the ancestors of allopolyploid subgenomes, discuss the advantages and disadvantages of these methods, and outline the implications of how these methods affect the subsequent evolutionary analysis of these genomes.


Subject(s)
Evolution, Molecular , Polyploidy , Phylogeny , Animals , Genome/genetics , Genomics/methods , Gene Duplication/genetics
5.
Int J Mol Sci ; 24(13)2023 Jul 04.
Article in English | MEDLINE | ID: mdl-37446254

ABSTRACT

Glutathione peroxidase-like enzyme is an important enzymatic antioxidant in plants. It is involved in scavenging reactive oxygen species, which can effectively prevent oxidative damage and improve resistance. GPXL has been studied in many plants but has not been reported in potatoes, the world's fourth-largest food crop. This study identified eight StGPXL genes in potatoes for the first time through genome-wide bioinformatics analysis and further studied the expression patterns of these genes using qRT-PCR. The results showed that the expression of StGPXL1 was significantly upregulated under high-temperature stress, indicating its involvement in potato defense against high-temperature stress, while the expression levels of StGPXL4 and StGPXL5 were significantly downregulated. The expression of StGPXL1, StGPXL2, StGPXL3, and StGPXL6 was significantly upregulated under drought stress, indicating their involvement in potato defense against drought stress. After MeJA hormone treatment, the expression level of StGPXL6 was significantly upregulated, indicating its involvement in the chemical defense mechanism of potatoes. The expression of all StGPXL genes is inhibited under biotic stress, which indicates that GPXL is a multifunctional gene family, which may endow plants with resistance to various stresses. This study will help deepen the understanding of the function of the potato GPXL gene family, provide comprehensive information for the further analysis of the molecular function of the potato GPXL gene family as well as a theoretical basis for potato molecular breeding.


Subject(s)
Gene Expression Regulation, Plant , Genome-Wide Association Study , Glutathione Peroxidase , Plant Proteins , Solanum tuberosum , Gene Expression Profiling , Glutathione Peroxidase/genetics , Glutathione Peroxidase/metabolism , Plant Proteins/genetics , Plant Proteins/metabolism , Solanum tuberosum/classification , Solanum tuberosum/enzymology , Solanum tuberosum/genetics , Stress, Physiological/genetics , Gene Duplication/genetics , Conserved Sequence/genetics , Amino Acid Motifs/genetics , Arabidopsis Proteins/genetics , Gene Ontology
6.
Blood Adv ; 6(14): 4236-4240, 2022 07 26.
Article in English | MEDLINE | ID: mdl-35584376

ABSTRACT

KMT2A partial tandem duplication (KMT2A-PTD) is an adverse risk factor in acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS), a potential therapeutic target, and an attractive marker of measurable residual disease. High initial KMT2A-PTD RNA levels have been linked to poor prognosis, but mechanisms regulating KMT2A-PTD expression are not well understood. Although KMT2A-PTD has been reported to affect only a single allele, it has been theorized but not proven that genomic gains of a monoallelic KMT2A-PTD may occur, thereby potentially driving high expression and disease progression. In this study, we identified 94 patients with KMT2A-PTDs using targeted DNA next-generation sequencing (NGS) and found that 16% (15/94) had complex secondary events, including copy-neutral loss of heterozygosity and selective gain involving the KMT2A-PTD allele. High copy numbers indicating complexity were significantly enriched in AML vs MDS and correlated with higher RNA expression. Moreover, in serial samples, complexity was associated with relapse and secondary transformation. Taken together, we provide approaches to integrate quantitative and allelic assessment of KMT2A-PTDs into targeted DNA NGS and demonstrate that secondary genetic events occur in KMT2A-PTD by multiple mechanisms that may be linked to myeloid disease progression by driving increased expression from the affected allele.


Subject(s)
Histone-Lysine N-Methyltransferase , Leukemia, Myeloid, Acute , Myelodysplastic Syndromes , Myeloid-Lymphoid Leukemia Protein , Alleles , Disease Progression , Gene Duplication/genetics , Histone-Lysine N-Methyltransferase/genetics , Humans , Leukemia, Myeloid, Acute/genetics , Myelodysplastic Syndromes/genetics , Myeloid-Lymphoid Leukemia Protein/genetics , RNA
7.
Article in English | MEDLINE | ID: mdl-35232817

ABSTRACT

Li-Fraumeni syndrome (LFS) is one of the most common cancer predisposition syndromes that affects both children and adults. Individuals with LFS are at an increased risk of developing various types of cancer over their lifetime including soft tissue sarcomas, osteosarcomas, breast cancer, leukemia, brain tumors, and adrenocortical carcinoma. Heterozygous germline pathogenic variants in the tumor suppressor gene TP53 are the known causal genetic defect for LFS. Single-nucleotide variants (SNVs) including missense substitutions that occur in the highly conserved DNA binding domain of the protein are the most common alterations, followed by nonsense and splice site variants. Gross copy-number changes in TP53 are rare and account for <1% of all variants. Using next-generation sequencing (NGS) panels, we identified a paternally inherited germline intragenic duplication of TP53 in a child with metastatic osteosarcoma who later developed acute myeloid leukemia (AML). Transcriptome sequencing (RNA-seq) demonstrated the duplication was tandem, encompassing exons 2-6 and 28 nt of the untranslated region (UTR) upstream of the start codon in exon 2. The inclusion of the 28 nt is expected to result in a frameshift with a stop codon 18 codons downstream from the exon 6, leading to a loss-of-function allele. This case highlights the significance of simultaneous identification of both significant copy-number variants as well as SNVs/indels using NGS panels.


Subject(s)
Adrenal Cortex Neoplasms , Breast Neoplasms , Li-Fraumeni Syndrome , Tumor Suppressor Protein p53 , Adult , Breast Neoplasms/genetics , Child , Female , Gene Duplication/genetics , Genetic Predisposition to Disease , Germ-Line Mutation/genetics , Humans , Li-Fraumeni Syndrome/genetics , Tumor Suppressor Protein p53/genetics
8.
Int J Mol Sci ; 23(2)2022 Jan 06.
Article in English | MEDLINE | ID: mdl-35054810

ABSTRACT

Brassica napus and its diploid progenitors (B. rapa and B. oleracea) are suitable for studying the problems associated with polyploidization. As an important anti-stress protein, RCI2 proteins widely exist in various tissues of plants, and are crucial to plant growth, development, and stress response. In this study, the RCI2 gene family was comprehensively identified and analyzed, and 9, 9, and 24 RCI2 genes were identified in B. rapa, B. oleracea, and B. napus, respectively. Phylogenetic analysis showed that all of the identified RCI2 genes were divided into two groups, and further divided into three subgroups. Ka/Ks analysis showed that most of the identified RCI2 genes underwent a purifying selection after the duplication events. Moreover, gene structure analysis showed that the structure of RCI2 genes is largely conserved during polyploidization. The promoters of the RCI2 genes in B. napus contained more cis-acting elements, which were mainly involved in plant development and growth, plant hormone response, and stress responses. Thus, B. napus might have potential advantages in some biological aspects. In addition, the changes of RCI2 genes during polyploidization were also discussed from the aspects of gene number, gene structure, gene relative location, and gene expression, which can provide reference for future polyploidization analysis.


Subject(s)
Brassica napus/genetics , Diploidy , Genome, Plant , Multigene Family , Polyploidy , Base Sequence , Chromosomes, Plant/genetics , Exons/genetics , Gene Duplication/genetics , Gene Expression Regulation, Plant , Genes, Plant , Introns/genetics , Phylogeny , Plant Proteins/genetics , Plant Proteins/metabolism , Promoter Regions, Genetic , Subcellular Fractions/metabolism , Synteny/genetics
9.
PLoS Comput Biol ; 17(12): e1009638, 2021 12.
Article in English | MEDLINE | ID: mdl-34871317

ABSTRACT

This work studies the effects of the two rounds of Whole Genome Duplication (WGD) at the origin of the vertebrate lineage on the architecture of the human gene regulatory networks. We integrate information on transcriptional regulation, miRNA regulation, and protein-protein interactions to comparatively analyse the role of WGD and Small Scale Duplications (SSD) in the structural properties of the resulting multilayer network. We show that complex network motifs, such as combinations of feed-forward loops and bifan arrays, deriving from WGD events are specifically enriched in the network. Pairs of WGD-derived proteins display a strong tendency to interact both with each other and with common partners and WGD-derived transcription factors play a prominent role in the retention of a strong regulatory redundancy. Combinatorial regulation and synergy between different regulatory layers are in general enhanced by duplication events, but the two types of duplications contribute in different ways. Overall, our findings suggest that the two WGD events played a substantial role in increasing the multi-layer complexity of the vertebrate regulatory network by enhancing its combinatorial organization, with potential consequences on its overall robustness and ability to perform high-level functions like signal integration and noise control. Lastly, we discuss in detail the RAR/RXR pathway as an illustrative example of the evolutionary impact of WGD duplications in human.


Subject(s)
Evolution, Molecular , Gene Duplication/genetics , Gene Regulatory Networks/genetics , Genome, Human/genetics , Animals , Genomics , Humans , Models, Genetic , Vertebrates/genetics
10.
BMC Plant Biol ; 21(1): 535, 2021 Nov 13.
Article in English | MEDLINE | ID: mdl-34773988

ABSTRACT

BACKGROUNDS: Populus and Salix belong to Salicaceae and are used as models to investigate woody plant physiology. The variation of karyotype and nuclear DNA content can partly reflect the evolutionary history of the whole genome, and can provide critical information for understanding, predicting, and potentially ameliorating the woody plant traits. Therefore, it is essential to study the chromosome number (CN) and genome size in detail to provide information for revealing the evolutionary process of Salicaceae. RESULTS: In this study, we report the somatic CNs of seventeen species from eight genera in Salicaceae. Of these, CNs for twelve species and for five genera are reported for the first time. Among the three subfamilies of Salicaceae, the available data indicate CN in Samydoideae is n = 21, 22, 42. The only two genera, Dianyuea and Scyphostegia, in Scyphostegioideae respectively have n = 9 and 18. In Salicoideae, Populus, Salix and five genera closely related to them (Bennettiodendron, Idesia, Carrierea, Poliothyrsis, Itoa) are based on relatively high CNs from n = 19, 20, 21, 22 to n = 95 in Salix. However, the other genera of Salicoideae are mainly based on relatively low CNs of n = 9, 10, 11. The genome sizes of 35 taxa belonging to 14 genera of Salicaceae were estimated. Of these, the genome sizes of 12 genera and all taxa except Populus euphratica are first reported. Except for Dianyuea, Idesia and Bennettiodendron, all examined species have relatively small genome sizes of less than 1 pg, although polyploidization exists. CONCLUSIONS: The variation of CN and genome size across Salicaceae indicates frequent ploidy changes and a widespread sharing of the salicoid whole genome duplication (WGD) by the relatives of Populus and Salix. The shrinkage of genome size after WGD indicates massive loss of genomic components. The phylogenetic asymmetry in clade of Populus, Salix, and their close relatives suggests that there is a lag-time for the subsequent radiations after the salicoid WGD event. Our results provide useful data for studying the evolutionary events of Salicaceae.


Subject(s)
Populus/metabolism , Salicaceae/metabolism , Salix/metabolism , Gene Duplication/genetics , Gene Duplication/physiology , Genome, Plant/genetics , Phylogeny , Populus/genetics , Salicaceae/genetics , Salix/genetics , Whole Genome Sequencing
11.
PLoS One ; 16(11): e0258657, 2021.
Article in English | MEDLINE | ID: mdl-34735479

ABSTRACT

Mitogen activated protein kinase kinase kinase (MAPKKK) form the upstream component of MAPK cascade. It is well characterized in several plants such as Arabidopsis and rice however the knowledge about MAPKKKs in tea plant is largely unknown. In the present study, MAPKKK genes of tea were obtained through a genome wide search using Arabidopsis thaliana as the reference genome. Among 59 candidate MAPKKK genes in tea, 17 genes were MEKK-like, 31 genes were Raf-like and 11 genes were ZIK- like. Additionally, phylogenetic relationships were established along with structural analysis, which includes gene structure, its location as well as conserved motifs, cis-acting regulatory elements and functional domain signatures that were systematically examined. Also, on the basis of one orthologous gene found between tea and Arabidopsis, functional interaction was carried out in C. sinensis based on an Arabidopsis association model. The expressional profiles indicated major involvement of MAPKKK genes from tea in response to various abiotic stress factors. Taken together, this study provides the targets for additional inclusive identification, functional study, and provides comprehensive knowledge for a better understanding of the MAPKKK cascade regulatory network in C. sinensis.


Subject(s)
Camellia sinensis/genetics , Genome, Plant/genetics , MAP Kinase Kinase Kinases/genetics , Phylogeny , Arabidopsis/genetics , Chromosomes, Plant/genetics , Conserved Sequence/genetics , Gene Duplication/genetics , Gene Expression Regulation, Plant/genetics , MAP Kinase Kinase Kinases/classification , MAP Kinase Signaling System/genetics , Multigene Family/genetics , Oryza/genetics , Sequence Alignment , Stress, Physiological/genetics
12.
Int J Mol Sci ; 22(19)2021 Sep 23.
Article in English | MEDLINE | ID: mdl-34638605

ABSTRACT

Cystatins, as reversible inhibitors of papain-like and legumain proteases, have been identified in several plant species. Although the cystatin family plays crucial roles in plant development and defense responses to various stresses, this family in wheat (Triticum aestivum L.) is still poorly understood. In this study, 55 wheat cystatins (TaCystatins) were identified. All TaCystatins were divided into three groups and both the conserved gene structures and peptide motifs were relatively conserved within each group. Homoeolog analysis suggested that both homoeolog retention percentage and gene duplications contributed to the abundance of the TaCystatin family. Analysis of duplication events confirmed that segmental duplications played an important role in the duplication patterns. The results of codon usage pattern analysis showed that TaCystatins had evident codon usage bias, which was mainly affected by mutation pressure. TaCystatins may be regulated by cis-acting elements, especially abscisic acid and methyl jasmonate responsive elements. In addition, the expression of all selected TaCystatins was significantly changed following viral infection and cold stress, suggesting potential roles in response to biotic and abiotic challenges. Overall, our work provides new insights into TaCystatins during wheat evolution and will help further research to decipher the roles of TaCystatins under diverse stress conditions.


Subject(s)
Cystatins/genetics , Genes, Plant/genetics , Genome, Plant/genetics , Multigene Family/genetics , Plant Proteins/genetics , Triticum/genetics , Abscisic Acid/metabolism , Bread , Gene Duplication/genetics , Gene Expression Profiling/methods , Gene Expression Regulation, Plant/genetics , Genome-Wide Association Study/methods , Mutation , Phylogeny , Stress, Physiological/genetics
13.
Genes (Basel) ; 12(10)2021 10 16.
Article in English | MEDLINE | ID: mdl-34681022

ABSTRACT

Gene duplications generate new genes that can contribute to expression changes and the evolution of new functions. Genomes often consist of gene families that undergo expansions, some of which occur in specific lineages that reflect recent adaptive diversification. In this study, lineage-specific genes and gene family expansions were studied across five dictyostelid species to determine when and how they are expressed during multicellular development. Lineage-specific genes were found to be enriched among genes with biased expression (predominant expression in one developmental stage) in each species and at most developmental time points, suggesting independent functional innovations of new genes throughout the phylogeny. Biased duplicate genes had greater expression divergence than their orthologs and paralogs, consistent with subfunctionalization or neofunctionalization. Lineage-specific expansions in particular had biased genes with both molecular signals of positive selection and high expression, suggesting adaptive genetic and transcriptional diversification following duplication. Our results present insights into the potential contributions of lineage-specific genes and families in generating species-specific phenotypes during multicellular development in dictyostelids.


Subject(s)
Dictyostelium/genetics , Evolution, Molecular , Phylogeny , Dictyostelium/growth & development , Gene Duplication/genetics , Gene Expression Regulation, Developmental/genetics , Genome/genetics , Species Specificity
14.
Eur J Med Genet ; 64(12): 104367, 2021 Dec.
Article in English | MEDLINE | ID: mdl-34678473

ABSTRACT

MECP2 duplication syndrome (MDS) is caused by copy number variation (CNV) spanning the MECP2 gene at Xq28 and is a major cause of intellectual disability (ID) in males. Herein, we describe two unrelated males harboring non-recurrent complex Xq28 rearrangements associated with MDS. Copy number gains were initially detected by quantitative real-time polymerase chain reaction and further delineated by high-resolution array comparative genomic hybridization, familial segregation, expression analysis and X-chromosome inactivation (XCI) evaluation in a carrier mother. SNVs within the rearrangements and/or fluorescent in situ hybridization (FISH) were used to assess the parental origin of the rearrangements. Patient 1 exhibited an intrachromosomal rearrangement, whose structure is consistent with a triplicated segment presumably embedded in an inverted orientation between two duplicated sequences (DUP-TRP/INV-DUP). The rearrangement was inherited from the carrier mother, who exhibits extreme XCI skewing and subtle psychiatric symptoms. Patient 2 presented a de novo (X;Y) unbalanced translocation resulting in duplication of Xq28 and deletion of Yp, originated in the paternal gametogenesis. Neurodevelopmental trajectory and non-neurological symptoms were consistent with previous reports, with the exception of cerebellar vermis hypoplasia in patient 2. Although both patients share the core MDS phenotype, patient 1 showed MECP2 transcript levels in blood similar to controls. Understanding the molecular mechanisms related to MDS is essential for designing targeted therapeutic strategies.


Subject(s)
Chromosome Duplication/genetics , Gene Duplication/genetics , Gene Rearrangement/genetics , Methyl-CpG-Binding Protein 2/genetics , Adolescent , Adult , Child , Child, Preschool , Chromosome Aberrations , Chromosomes, Human, X/genetics , Comparative Genomic Hybridization/methods , DNA Copy Number Variations/genetics , Genomics/methods , Humans , Infant , Intellectual Disability/genetics , Male , Mental Retardation, X-Linked/genetics , Middle Aged , Translocation, Genetic/genetics , X Chromosome Inactivation/genetics , Young Adult
15.
Int J Biol Macromol ; 190: 487-498, 2021 Nov 01.
Article in English | MEDLINE | ID: mdl-34508718

ABSTRACT

Nuclear factor Y (NF-Y) is a heterotrimeric transcription factor playing crucial roles in various biological process in plant. However, thorough research on NF-Y gene family of Tartary buckwheat (Fagopyrum tataricum) is little. In this study, 38 FtNF-Y genes (12 FtNF-YAs, 17 FtNF-YBs, and 9 FtNF-YCs) were identified and renamed on the basis of their subfamily and chromosomal location. Their gene structure, genomic mapping, motif composition, conserved domain, phylogenetic relationships, cis-acting elements and gene expression were investigated. Illustration of gene structures and conserved domains of FtNF-Ys revealed their functional conservation and specificity. Construction of phylogenetic trees of NF-Ys in Tartary buckwheat, Arabidopsis, tomato, rice and banana, allowed us to predict functional similarities among NF-Ys from different species. Gene expression analysis displayed that twenty-four FtNF-Ys were expressed in all the tissues and the transcript levels of them were different, suggesting their function varieties. Moreover, expression profiles of twenty FtNF-Ys along five different fruit development stages acquired by real-time quantitative PCR (RT-qPCR) demonstrated distinct abundance diversity at different stages, providing some clues of potential fruit development regulators. Our study could provide helpful reference information for further function characterization of FtNF-Ys and for the fruit quality enhancement of Tartary buckwheat.


Subject(s)
CCAAT-Binding Factor/genetics , Fagopyrum/genetics , Fruit/growth & development , Fruit/genetics , Genome, Plant , Multigene Family , Plant Proteins/genetics , Amino Acid Motifs , Amino Acid Sequence , CCAAT-Binding Factor/chemistry , Chromosomes, Plant/genetics , Conserved Sequence , Evolution, Molecular , Gene Duplication/genetics , Gene Expression Profiling , Gene Expression Regulation, Plant , Organ Specificity/genetics , Phylogeny , Plant Proteins/chemistry , Promoter Regions, Genetic/genetics
16.
Sci Rep ; 11(1): 18258, 2021 09 14.
Article in English | MEDLINE | ID: mdl-34521941

ABSTRACT

Genomic rearrangements cause congenital disorders, cancer, and complex diseases in human. Yet, they are still understudied in rare diseases because their detection is challenging, despite the advent of whole genome sequencing (WGS) technologies. Short-read (srWGS) and long-read WGS approaches are regularly compared, and the latter is commonly recommended in studies focusing on genomic rearrangements. However, srWGS is currently the most economical, accurate, and widely supported technology. In Caenorhabditis elegans (C. elegans), such variants, induced by various mutagenesis processes, have been used for decades to balance large genomic regions by preventing chromosomal crossover events and allowing the maintenance of lethal mutations. Interestingly, those chromosomal rearrangements have rarely been characterized on a molecular level. To evaluate the ability of srWGS to detect various types of complex genomic rearrangements, we sequenced three balancer strains using short-read Illumina technology. As we experimentally validated the breakpoints uncovered by srWGS, we showed that, by combining several types of analyses, srWGS enables the detection of a reciprocal translocation (eT1), a free duplication (sDp3), a large deletion (sC4), and chromoanagenesis events. Thus, applying srWGS to decipher real complex genomic rearrangements in model organisms may help designing efficient bioinformatics pipelines with systematic detection of complex rearrangements in human genomes.


Subject(s)
Caenorhabditis elegans/genetics , Gene Rearrangement/genetics , Whole Genome Sequencing/methods , Animals , Crossing Over, Genetic/genetics , DNA Copy Number Variations/genetics , Gene Duplication/genetics , Genome, Helminth/genetics , Heterozygote , Homozygote , Mutagenesis/genetics
17.
Mol Genet Genomics ; 296(6): 1263-1278, 2021 Nov.
Article in English | MEDLINE | ID: mdl-34453201

ABSTRACT

Nascent ribosomal 60S subunits undergo the last maturation steps in the cytoplasm. The last one involves removing the anti-association factor eIF6 from the 60S ribosomal surface by the joint action of the Elongation Factor-like 1 (EFL1) GTPase and the SBDS protein. Herein, we studied the evolutionary relationship of the EFL1 and EF-2 protein families and the functional conservation within EFL1 orthologues. Phylogenetic analysis demonstrated that the EFL1 proteins are exclusive of eukaryotes and share an evolutionary origin with the EF-2 and EF-G protein families. EFL1 proteins originated by gene duplication from the EF-2 proteins and specialized in ribosome maturation while the latter retained their function in translation. Some organisms have more than one EFL1 protein resulting from alternative splicing, while others are encoded in different genes originated by gene duplication. However, the function of these alternative EFL1 proteins is still unknown. We performed GTPase activity and complementation assays to study the functional conservation of EFL1 homologs alone and together with their SBDS counterparts. None of the orthologues or cross-species combinations could replace the function of the corresponding yeast EFL1•SBDS binomial. The complementation of SBDS interspecies chimeras indicates that domain 2 is vital for its function together with EFL1 and the 60S subunit. The results suggest a functional species-specificity and possible co-evolution between EFL1, SBDS, and the 60S ribosomal subunit. These findings set the basis for further studies directed to understand the molecular evolution of these proteins and their impact on ribosome biogenesis and disease.


Subject(s)
Peptide Elongation Factor 2/metabolism , Peptide Elongation Factors/genetics , Proteins/genetics , Ribonucleoprotein, U5 Small Nuclear/genetics , Ribosome Subunits, Large, Eukaryotic/metabolism , Ribosomes/metabolism , Alternative Splicing/genetics , Amino Acid Sequence/genetics , Eukaryota/genetics , Evolution, Molecular , Gene Duplication/genetics , Humans , Peptide Elongation Factor 2/genetics , Phylogeny , Sequence Alignment
18.
Mol Genet Genomics ; 296(6): 1235-1247, 2021 Nov.
Article in English | MEDLINE | ID: mdl-34363105

ABSTRACT

Lineage-specific genes (LSGs) are the genes that have no recognizable homology to any sequences in other species, which are important drivers for the generation of new functions, phenotypic changes, and facilitating species adaptation to environment. Aegiceras corniculatum is one of major mangrove plant species adapted to waterlogging and saline conditions, and the exploration of aegiceras-specific genes (ASGs) is important to reveal its adaptation to the harsh environment. Here, we performed a systematic analysis on ASGs, focusing on their sequence characterization, origination and expression patterns. Our results reveal that there are 4823 ASGs in the genome, approximately 11.84% of all protein-coding genes. High proportion (45.78%) of ASGs originate from gene duplication, and the time of gene duplication of ASGs is consistent with the timing of two genome-wide replication (WGD) events that occurred in A. corniculatum, and also coincides with a short period of global warming during the Paleocene-Eocene Maximum (PETM, 55.5 million years ago). Gene structure analysis showed that ASGs have shorter protein lengths, fewer exons, and higher isoelectric point. Expression patterns analysis showed that ASGs had low levels of expression and more tissue-specific expression. Weighted gene co-expression network analysis (WGCNA) revealed that 86 ASGs co-expressed gene modules were primarily involved in pathways related to adversity stress, including plant hormone signal transduction, phenylpropanoid biosynthesis, photosynthesis, peroxisome and pentose phosphate pathway. This study provides a comprehensive analysis of the characteristics and potential functions of ASGs and identifies key candidate genes, which will contribute to the subsequent further investigation of the adaptation of A. corniculatum to intertidal coastal wetland habitats.


Subject(s)
Adaptation, Physiological/genetics , Cell Lineage/genetics , Gene Duplication/genetics , Primulaceae/genetics , Primulaceae/metabolism , Gene Expression Profiling , Genome, Plant/genetics , Transcriptome/genetics , Wetlands
19.
Nucleic Acids Res ; 49(14): 8232-8246, 2021 08 20.
Article in English | MEDLINE | ID: mdl-34302486

ABSTRACT

Most coding genes in the human genome are annotated with multiple alternative transcripts. However, clear evidence for the functional relevance of the protein isoforms produced by these alternative transcripts is often hard to find. Alternative isoforms generated from tandem exon duplication-derived substitutions are an exception. These splice events are rare, but have important functional consequences. Here, we have catalogued the 236 tandem exon duplication-derived substitutions annotated in the GENCODE human reference set. We find that more than 90% of the events have a last common ancestor in teleost fish, so are at least 425 million years old, and twenty-one can be traced back to the Bilateria clade. Alternative isoforms generated from tandem exon duplication-derived substitutions also have significantly more clinical impact than other alternative isoforms. Tandem exon duplication-derived substitutions have >25 times as many pathogenic and likely pathogenic mutations as other alternative events. Tandem exon duplication-derived substitutions appear to have vital functional roles in the cell and may have played a prominent part in metazoan evolution.


Subject(s)
Evolution, Molecular , Fishes/genetics , Genome, Human/genetics , Protein Isoforms/genetics , Alternative Splicing/genetics , Animals , Exons/genetics , Gene Duplication/genetics , Humans , Molecular Sequence Annotation , Sequence Alignment
20.
PLoS Genet ; 17(7): e1009654, 2021 07.
Article in English | MEDLINE | ID: mdl-34242211

ABSTRACT

It is a conventionally held dogma that the genetic basis underlying development is conserved in a long evolutionary time scale. Ample experiments based on mutational, biochemical, functional, and complementary knockdown/knockout approaches have revealed the unexpectedly important role of recently evolved new genes in the development of Drosophila. The recent progress in the genome-wide experimental testing of gene effects and improvements in the computational identification of new genes (< 40 million years ago, Mya) open the door to investigate the evolution of gene essentiality with a phylogenetically high resolution. These advancements also raised interesting issues in techniques and concepts related to phenotypic effect analyses of genes, particularly of those that recently originated. Here we reported our analyses of these issues, including reproducibility and efficiency of knockdown experiment and difference between RNAi libraries in the knockdown efficiency and testing of phenotypic effects. We further analyzed a large data from knockdowns of 11,354 genes (~75% of the Drosophila melanogaster total genes), including 702 new genes (~66% of the species total new genes that aged < 40 Mya), revealing a similarly high proportion (~32.2%) of essential genes that originated in various Sophophora subgenus lineages and distant ancestors beyond the Drosophila genus. The transcriptional compensation effect from CRISPR knockout were detected for highly similar duplicate copies. Knockout of a few young genes detected analogous essentiality in various functions in development. Taken together, our experimental and computational analyses provide valuable data for detection of phenotypic effects of genes in general and further strong evidence for the concept that new genes in Drosophila quickly evolved essential functions in viability during development.


Subject(s)
Evolution, Molecular , Gene Duplication/genetics , Genes, Essential/genetics , Animals , Biological Evolution , Drosophila Proteins/genetics , Drosophila melanogaster/genetics , Gene Knockdown Techniques/methods , Genomics , Genotype , Models, Genetic , Mutation , Phenotype , Phylogeny , Reproducibility of Results
SELECTION OF CITATIONS
SEARCH DETAIL