Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 34.706
Filter
1.
Sci Rep ; 14(1): 17786, 2024 08 01.
Article in English | MEDLINE | ID: mdl-39090226

ABSTRACT

A long-standing question concerns the role of Z-DNA in transcription. Here we use a deep learning approach DeepZ that predicts Z-flipons based on DNA sequence, structural properties of nucleotides and omics data. We examined Z-flipons that are conserved between human and mouse genomes after generating whole-genome Z-flipon maps and then validated them by orthogonal approaches based on high resolution chemical mapping of Z-DNA and the transformer algorithm Z-DNABERT. For human and mouse, we revealed similar pattern of transcription factors, chromatin remodelers, and histone marks associated with conserved Z-flipons. We found significant enrichment of Z-flipons in alternative and bidirectional promoters associated with neurogenesis genes. We show that conserved Z-flipons are associated with increased experimentally determined transcription reinitiation rates compared to promoters without Z-flipons, but without affecting elongation or pausing. Our findings support a model where Z-flipons engage Transcription Factor E and impact phenotype by enabling the reset of preinitiation complexes when active, and the suppression of gene expression when engaged by repressive chromatin complexes.


Subject(s)
DNA , Promoter Regions, Genetic , Animals , Humans , Mice , DNA/genetics , DNA/metabolism , Transcription, Genetic , Transcription Factors/metabolism , Transcription Factors/genetics , Chromatin Assembly and Disassembly , Transcription Initiation, Genetic , Chromatin/genetics , Chromatin/metabolism , Deep Learning , Conserved Sequence
2.
Open Biol ; 14(7): 240092, 2024 Jul.
Article in English | MEDLINE | ID: mdl-39043226

ABSTRACT

Magnetoreceptive biology as a field remains relatively obscure; compared with the breadth of species believed to sense magnetic fields, it remains under-studied. Here, we present grounds for the expansion of magnetoreception studies among teleosts. We begin with the electromagnetic perceptive gene (EPG) from Kryptopterus vitreolus and expand to identify 72 teleosts with homologous proteins containing a conserved three-phenylalanine (3F) motif. Phylogenetic analysis provides insight as to how EPG may have evolved over time and indicates that certain clades may have experienced a loss of function driven by different fitness pressures. One potential factor is water type with freshwater fish significantly more likely to possess the functional motif version (FFF), and saltwater fish to have the non-functional variant (FXF). It was also revealed that when the 3F motif from the homologue of Brachyhypopomus gauderio (B.g.) is inserted into EPG-EPG(B.g.)-the response (as indicated by increased intracellular calcium) is faster. This indicates that EPG has the potential to be engineered to improve upon its response and increase its utility to be used as a controller for specific outcomes.


Subject(s)
Amino Acid Motifs , Fishes , Phenylalanine , Phylogeny , Animals , Phenylalanine/genetics , Phenylalanine/metabolism , Phenylalanine/chemistry , Fishes/genetics , Conserved Sequence , Fish Proteins/genetics , Fish Proteins/metabolism , Fish Proteins/chemistry , Amino Acid Sequence , Electromagnetic Fields
3.
Mol Biol Evol ; 41(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-39058500

ABSTRACT

Ultraconserved elements were discovered two decades ago, arbitrarily defined as sequences that are identical over a length ≥ 200 bp in the human, mouse, and rat genomes. The definition was subsequently extended to sequences ≥ 100 bp identical in at least three of five mammalian genomes (including dog and cow), and shown to have undergone rapid expansion from ancestors in fish and strong negative selection in birds and mammals. Since then, many more genomes have become available, allowing better definition and more thorough examination of ultraconserved element distribution and evolutionary history. We developed a fast and flexible analytical pipeline for identifying ultraconserved elements in multiple genomes, dedUCE, which allows manipulation of minimum length, sequence identity, and number of species with a detectable ultraconserved element according to specified parameters. We suggest an updated definition of ultraconserved elements as sequences ≥ 100 bp and ≥97% sequence identity in ≥50% of placental mammal orders (12,813 ultraconserved elements). By mapping ultraconserved elements to ∼200 species, we find that placental ultraconserved elements appeared early in vertebrate evolution, well before land colonization, suggesting that the evolutionary pressures driving ultraconserved element selection were present in aquatic environments in the Cambrian-Devonian periods. Most (>90%) ultraconserved elements likely appeared after the divergence of gnathostomes from jawless predecessors, were largely established in sequence identity by early Sarcopterygii evolution-before the divergence of lobe-finned fishes from tetrapods-and became near fixed in the amniotes. Ultraconserved elements are mainly located in the introns of protein-coding and noncoding genes involved in neurological and skeletomuscular development, enriched in regulatory elements, and dynamically expressed throughout embryonic development.


Subject(s)
Conserved Sequence , Evolution, Molecular , Vertebrates , Animals , Humans , Vertebrates/genetics , Genome , Phylogeny
4.
Nature ; 631(8022): 850-856, 2024 Jul.
Article in English | MEDLINE | ID: mdl-39020165

ABSTRACT

Several immune pathways in humans conjugate ubiquitin-like proteins to virus and host molecules as a means of antiviral defence1-5. Here we studied an antiphage defence system in bacteria, comprising a ubiquitin-like protein, ubiquitin-conjugating enzymes E1 and E2, and a deubiquitinase. We show that during phage infection, this system specifically conjugates the ubiquitin-like protein to the phage central tail fibre, a protein at the tip of the tail that is essential for tail assembly as well as for recognition of the target host receptor. Following infection, cells encoding this defence system release a mixture of partially assembled, tailless phage particles and fully assembled phages in which the central tail fibre is obstructed by the covalently attached ubiquitin-like protein. These phages show severely impaired infectivity, explaining how the defence system protects the bacterial population from the spread of phage infection. Our findings demonstrate that conjugation of ubiquitin-like proteins is an antiviral strategy conserved across the tree of life.


Subject(s)
Bacterial Proteins , Bacteriophages , Deubiquitinating Enzymes , Escherichia coli , Ubiquitin-Conjugating Enzymes , Ubiquitins , Virus Assembly , Bacteriophages/chemistry , Bacteriophages/metabolism , Bacteriophages/pathogenicity , Bacteriophages/physiology , Deubiquitinating Enzymes/metabolism , Escherichia coli/enzymology , Escherichia coli/metabolism , Escherichia coli/virology , Ubiquitin-Activating Enzymes/metabolism , Ubiquitin-Conjugating Enzymes/metabolism , Ubiquitins/metabolism , Viral Tail Proteins/metabolism , Viral Tail Proteins/chemistry , Bacterial Proteins/metabolism , Evolution, Molecular , Conserved Sequence
5.
Genes (Basel) ; 15(7)2024 Jul 13.
Article in English | MEDLINE | ID: mdl-39062693

ABSTRACT

Cytosolic sulfotransferases (SULTs) are Phase 2 drug-metabolizing enzymes that catalyze the conjugation of sulfonate to endogenous and xenobiotic compounds, increasing their hydrophilicity and excretion from cells. To date, 13 human SULTs have been identified and classified into five families. SULT4A1 mRNA encodes two variants: (1) the wild type, encoding a 284 amino acid, ~33 kDa protein, and (2) an alternative spliced variant resulting from a 126 bp insert between exon 6 and 7, which introduces a premature stop codon that enhances nonsense-mediated decay. SULT4A1 is classified as an SULT based on sequence and structural similarities, including PAPS-domains, active-site His, and the dimerization domain; however, the catalytic pocket lid 'Loop 3' size is not conserved. SULT4A1 is uniquely expressed in the brain and localized in the cytosol and mitochondria. SULT4A1 is highly conserved, with rare intronic polymorphisms that have no outward manifestations. However, the SULT4A1 haplotype is correlated with Phelan-McDermid syndrome and schizophrenia. SULT4A1 knockdown revealed potential SULT4A1 functions in photoreceptor signaling and knockout mice display hampered neuronal development and behavior. Mouse and yeast models revealed that SULT4A1 protects the mitochondria from endogenously and exogenously induced oxidative stress and stimulates cell division, promoting dendritic spines' formation and synaptic transmission. To date, no physiological enzymatic activity has been associated with SULT4A1.


Subject(s)
Sulfotransferases , Animals , Humans , Sulfotransferases/genetics , Sulfotransferases/metabolism , Sulfotransferases/chemistry , Mice , Vertebrates/genetics , Conserved Sequence
6.
Sci Rep ; 14(1): 15145, 2024 07 02.
Article in English | MEDLINE | ID: mdl-38956134

ABSTRACT

Hepatitis C virus (HCV) is a plus-stranded RNA virus that often chronically infects liver hepatocytes and causes liver cirrhosis and cancer. These viruses replicate their genomes employing error-prone replicases. Thereby, they routinely generate a large 'cloud' of RNA genomes (quasispecies) which-by trial and error-comprehensively explore the sequence space available for functional RNA genomes that maintain the ability for efficient replication and immune escape. In this context, it is important to identify which RNA secondary structures in the sequence space of the HCV genome are conserved, likely due to functional requirements. Here, we provide the first genome-wide multiple sequence alignment (MSA) with the prediction of RNA secondary structures throughout all representative full-length HCV genomes. We selected 57 representative genomes by clustering all complete HCV genomes from the BV-BRC database based on k-mer distributions and dimension reduction and adding RefSeq sequences. We include annotations of previously recognized features for easy comparison to other studies. Our results indicate that mainly the core coding region, the C-terminal NS5A region, and the NS5B region contain secondary structure elements that are conserved beyond coding sequence requirements, indicating functionality on the RNA level. In contrast, the genome regions in between contain less highly conserved structures. The results provide a complete description of all conserved RNA secondary structures and make clear that functionally important RNA secondary structures are present in certain HCV genome regions but are largely absent from other regions. Full-genome alignments of all branches of Hepacivirus C are provided in the supplement.


Subject(s)
Conserved Sequence , Genome, Viral , Hepacivirus , Nucleic Acid Conformation , RNA, Viral , Hepacivirus/genetics , RNA, Viral/genetics , RNA, Viral/chemistry , Humans , Sequence Alignment , Hepatitis C/virology , Hepatitis C/genetics
7.
Nat Commun ; 15(1): 6464, 2024 Jul 31.
Article in English | MEDLINE | ID: mdl-39085231

ABSTRACT

Gene regulatory elements drive complex biological phenomena and their mutations are associated with common human diseases. The impacts of human regulatory variants are often tested using model organisms such as mice. However, mapping human enhancers to conserved elements in mice remains a challenge, due to both rapid enhancer evolution and limitations of current computational methods. We analyze distal enhancers across 45 matched human/mouse cell/tissue pairs from a comprehensive dataset of DNase-seq experiments, and show that while cell-specific regulatory vocabulary is conserved, enhancers evolve more rapidly than promoters and CTCF binding sites. Enhancer conservation rates vary across cell types, in part explainable by tissue specific transposable element activity. We present an improved genome alignment algorithm using gapped-kmer features, called gkm-align, and make genome wide predictions for 1,401,803 orthologous regulatory elements. We show that gkm-align discovers 23,660 novel human/mouse conserved enhancers missed by previous algorithms, with strong evidence of conserved functional activity.


Subject(s)
Algorithms , Conserved Sequence , Enhancer Elements, Genetic , Animals , Enhancer Elements, Genetic/genetics , Humans , Mice , Evolution, Molecular , Binding Sites/genetics , Mammals/genetics , Promoter Regions, Genetic/genetics , Computational Biology/methods , CCCTC-Binding Factor/metabolism , CCCTC-Binding Factor/genetics
8.
Cells ; 13(13)2024 Jun 28.
Article in English | MEDLINE | ID: mdl-38994973

ABSTRACT

Throughout embryonic development, the shaping of the functional and morphological characteristics of embryos is orchestrated by an intricate interaction between transcription factors and cis-regulatory elements. In this study, we conducted a comprehensive analysis of deuterostome cis-regulatory landscapes during gastrulation, focusing on four paradigmatic species: the echinoderm Strongylocentrotus purpuratus, the cephalochordate Branchiostoma lanceolatum, the urochordate Ciona intestinalis, and the vertebrate Danio rerio. Our approach involved comparative computational analysis of ATAC-seq datasets to explore the genome-wide blueprint of conserved transcription factor binding motifs underlying gastrulation. We identified a core set of conserved DNA binding motifs associated with 62 known transcription factors, indicating the remarkable conservation of the gastrulation regulatory landscape across deuterostomes. Our findings offer valuable insights into the evolutionary molecular dynamics of embryonic development, shedding light on conserved regulatory subprograms and providing a comprehensive perspective on the conservation and divergence of gene regulation underlying the gastrulation process.


Subject(s)
Ciona intestinalis , Gastrulation , Gene Expression Regulation, Developmental , Animals , Gastrulation/genetics , Ciona intestinalis/genetics , Ciona intestinalis/embryology , Zebrafish/genetics , Zebrafish/embryology , Transcription Factors/metabolism , Transcription Factors/genetics , Strongylocentrotus purpuratus/genetics , Strongylocentrotus purpuratus/embryology , Conserved Sequence/genetics , Regulatory Sequences, Nucleic Acid/genetics , Lancelets/genetics , Lancelets/embryology , Evolution, Molecular
9.
Int J Mol Sci ; 25(13)2024 Jun 21.
Article in English | MEDLINE | ID: mdl-38999960

ABSTRACT

The initial adoption of penicillin as an antibiotic marked the start of exploring other compounds essential for pharmaceuticals, yet resistance to penicillins and their side effects has compromised their efficacy. The N-terminal nucleophile (Ntn) amide-hydrolases S45 family plays a key role in catalyzing amide bond hydrolysis in various compounds, including antibiotics like penicillin and cephalosporin. This study comprehensively analyzes the structural and functional traits of the bacterial N-terminal nucleophile (Ntn) amide-hydrolases S45 family, covering penicillin G acylases, cephalosporin acylases, and D-succinylase. Utilizing structural bioinformatics tools and sequence analysis, the investigation delineates structurally conserved regions (SCRs) and substrate binding site variations among these enzymes. Notably, sixteen SCRs crucial for substrate interaction are identified solely through sequence analysis, emphasizing the significance of sequence data in characterizing functionally relevant regions. These findings introduce a novel approach for identifying targets to enhance the biocatalytic properties of N-terminal nucleophile (Ntn) amide-hydrolases, while facilitating the development of more accurate three-dimensional models, particularly for enzymes lacking structural data. Overall, this research advances our understanding of structure-function relationships in bacterial N-terminal nucleophile (Ntn) amide-hydrolases, providing insights into strategies for optimizing their enzymatic capabilities.


Subject(s)
Amidohydrolases , Amidohydrolases/chemistry , Amidohydrolases/metabolism , Amidohydrolases/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/metabolism , Bacterial Proteins/genetics , Binding Sites , Structure-Activity Relationship , Conserved Sequence , Bacteria/enzymology , Amino Acid Sequence , Models, Molecular , Substrate Specificity
10.
Appl Microbiol Biotechnol ; 108(1): 415, 2024 Jul 11.
Article in English | MEDLINE | ID: mdl-38990377

ABSTRACT

Currently, the main α-amylase family GH13 has been divided into 47 subfamilies in CAZy, with new subfamilies regularly emerging. The present in silico study was performed to highlight the groups, represented by the maltogenic amylase from Thermotoga neapolitana and the α-amylase from Haloarcula japonica, which are worth of creating their own new GH13 subfamilies. This enlarges functional annotation and thus allows more precise prediction of the function of putative proteins. Interestingly, those two share certain sequence features, e.g. the highly conserved cysteine in the second conserved sequence region (CSR-II) directly preceding the catalytic nucleophile, or the well-preserved GQ character of the end of CSR-VII. On the other hand, the two groups bear also specific and highly conserved positions that distinguish them not only from each other but also from representatives of remaining GH13 subfamilies established so far. For the T. neapolitana maltogenic amylase group, it is the stretch of residues at the end of CSR-V highly conserved as L-[DN]. The H. japonica α-amylase group can be characterized by a highly conserved [WY]-[GA] sequence at the end of CSR-II. Other specific sequence features include an almost fully conserved aspartic acid located directly preceding the general acid/base in CSR-III or well-preserved glutamic acid in CSR-IV. The assumption that these two groups represent two mutually related, but simultaneously independent GH13 subfamilies has been supported by phylogenetic analysis as well as by comparison of tertiary structures. The main α-amylase family GH13 has thus been expanded by two novel subfamilies GH13_48 and GH13_49. KEY POINTS: • In silico analysis of two groups of family GH13 members with characterized representatives • Identification of certain common, but also some specific sequence features in seven CSRs • Creation of two novel subfamilies-GH13_48 and GH13_49 within the CAZy database.


Subject(s)
Phylogeny , alpha-Amylases , alpha-Amylases/genetics , alpha-Amylases/metabolism , alpha-Amylases/chemistry , Amino Acid Sequence , Conserved Sequence , Sequence Alignment
11.
Front Immunol ; 15: 1424307, 2024.
Article in English | MEDLINE | ID: mdl-39011043

ABSTRACT

Introduction: Bluetongue (BT) poses a significant threat to the livestock industry, affecting various animal species and resulting in substantial economic losses. The existence of numerous BT virus (BTV) serotypes has hindered control efforts, highlighting the need for broad-spectrum vaccines. Methodology: In this study, we evaluated the conserved amino acid sequences within key non-structural (NS) proteins of BTV and identified numerous highly conserved murine- and bovine-specific MHC class I-restricted (MHC-I) CD8+ and MHC-II-restricted CD4+ epitopes. We then screened these conserved epitopes for antigenicity, allergenicity, toxicity, and solubility. Using these epitopes, we developed in silico-based broad-spectrum multiepitope vaccines with Toll-like receptor (TLR-4) agonists. The predicted proinflammatory cytokine response was assessed in silico using the C-IMMSIM server. Structural modeling and refinement were achieved using Robetta and GalaxyWEB servers. Finally, we assessed the stability of the docking complexes through extensive 100-nanosecond molecular dynamics simulations before considering the vaccines for codon optimization and in silico cloning. Results: We found many epitopes that meet these criteria within NS1 and NS2 proteins and developed in silico broad-spectrum vaccines. The immune simulation studies revealed that these vaccines induce high levels of IFN-γ and IL-2 in the vaccinated groups. Protein-protein docking analysis demonstrated promising epitopes with strong binding affinities to TLR-4. The docked complexes were stable, with minimal Root Mean Square Deviation and Root Mean Square Fluctuation values. Finally, the in silico-cloned plasmids have high % of GC content with > 0.8 codon adaptation index, suggesting they are suitable for expressing the protein vaccines in prokaryotic system. Discussion: These next-generation vaccine designs are promising and warrant further investigation in wet lab experiments to assess their immunogenicity, safety, and efficacy for practical application in livestock. Our findings offer a robust framework for developing a comprehensive, broad-spectrum vaccine, potentially revolutionizing BT control and prevention strategies in the livestock industry.


Subject(s)
Bluetongue virus , Computational Biology , Epitopes, T-Lymphocyte , Viral Nonstructural Proteins , Viral Vaccines , Animals , Bluetongue virus/immunology , Epitopes, T-Lymphocyte/immunology , Viral Vaccines/immunology , Viral Nonstructural Proteins/immunology , Viral Nonstructural Proteins/genetics , Mice , Computational Biology/methods , Serogroup , Cattle , Bluetongue/prevention & control , Bluetongue/immunology , Bluetongue/virology , Conserved Sequence
12.
J Virol ; 98(8): e0100024, 2024 Aug 20.
Article in English | MEDLINE | ID: mdl-39078391

ABSTRACT

Kaposi's sarcoma herpesvirus (KSHV) ORF34 plays a significant role as a component of the viral pre-initiation complex (vPIC), which is indispensable for late gene expression across beta- and gammaherpesviruses. Although the key role of ORF34 within the vPIC and its function as a hub protein have been recognized, further clarification regarding its specific contribution to vPIC functionality and interactions with other components is required. This study employed a deep learning algorithm-assisted structural model of ORF34, revealing highly conserved amino acid residues across human beta- and gammaherpesviruses localized in structured domains. Thus, we engineered ORF34 alanine-scanning mutants by substituting conserved residues with alanine. These mutants were evaluated for their ability to interact with other vPIC factors and restore viral production in cells harboring the ORF34-deficient KSHV-BAC. Our experimental results highlight the crucial role of the four cysteine residues conserved in ORF34: a tetrahedral arrangement consisting of a pair of C-Xn-C consensus motifs. This suggests the potential incorporation of metal cations in interacting with ORF24 and ORF66 vPIC components, facilitating late gene transcription, and promoting overall virus production by capturing metal cations. In summary, our findings underline the essential role of conserved cysteines in KSHV ORF34 for effective vPIC assembly and viral replication, thereby enhancing our understanding of the complex interplay between the vPIC components. IMPORTANCE: The initiation of late gene transcription is universally conserved across the beta- and gammaherpesvirus families. This process employs a viral pre-initiation complex (vPIC), which is analogous to a cellular PIC. Although KSHV ORF34 is a critical factor for viral replication and is a component of the vPIC, the specifics of vPIC formation and the essential domains crucial for its function remain unclear. Structural predictions suggest that the four conserved cysteines (C170, C175, C256, and C259) form a tetrahedron that coordinates the metal cation. We investigated the role of these conserved amino acids in interactions with other vPIC components, late gene expression, and virus production to demonstrate for the first time that these cysteines are pivotal for such functions. This discovery not only deepens our comprehensive understanding of ORF34 and vPIC dynamics but also lays the groundwork for more detailed studies on herpesvirus replication mechanisms in future research.


Subject(s)
Cysteine , Herpesvirus 8, Human , Viral Proteins , Virus Replication , Herpesvirus 8, Human/genetics , Herpesvirus 8, Human/metabolism , Humans , Viral Proteins/metabolism , Viral Proteins/genetics , Viral Proteins/chemistry , Cysteine/metabolism , Cysteine/genetics , Conserved Sequence , Gene Expression Regulation, Viral , HEK293 Cells , Amino Acid Sequence
13.
Viruses ; 16(7)2024 Jun 23.
Article in English | MEDLINE | ID: mdl-39066172

ABSTRACT

Enterovirus genomic replication initiates at a predicted RNA cloverleaf (5'CL) at the 5' end of the RNA genome. The 5'CL contains one stem (SA) and three stem-loops (SLB, SLC, SLD). Here, we present an analysis of 5'CL conservation and divergence for 209 human health-related serotypes from the enterovirus genus, including enterovirus and rhinovirus species. Phylogenetic analysis indicates six distinct 5'CL serotypes that only partially correlate with the species definition. Additional findings include that 5'CL sequence conservation is higher between the EV species than between the RV species, the 5'CL of EVA and EVB are nearly identical, and RVC has the lowest 5'CL conservation. Regions of high conservation throughout all species include SA and the loop and nearby bases of SLB, which is consistent with known protein interactions at these sites. In addition to the known protein binding site for the Poly-C binding protein in the loop of SLB, other conserved consecutive cytosines in the stems of SLB and SLC provide additional potential interaction sites that have not yet been explored. Other sites of conservation, including the predicted bulge of SLD and other conserved stem, loop, and junction regions, are more difficult to explain and suggest additional interactions or structural requirements that are not yet fully understood. This more intricate understanding of sequence and structure conservation and variability in the 5'CL may assist in the development of broad-spectrum antivirals against a wide range of enteroviruses, while better defining the range of virus isotypes expected to be affected by a particular antiviral.


Subject(s)
Antiviral Agents , Enterovirus , Phylogeny , RNA, Viral , Virus Replication , Virus Replication/drug effects , Antiviral Agents/pharmacology , Enterovirus/genetics , Enterovirus/drug effects , Enterovirus/classification , Enterovirus/physiology , Humans , RNA, Viral/genetics , Nucleic Acid Conformation , Conserved Sequence , 5' Untranslated Regions , Genome, Viral
14.
Viruses ; 16(7)2024 Jun 26.
Article in English | MEDLINE | ID: mdl-39066193

ABSTRACT

Puumala orthohantavirus (PUUV) is an emerging zoonotic virus endemic to Europe and Russia that causes nephropathia epidemica, a mild form of hemorrhagic fever with renal syndrome (HFRS). There are limited options for treatment and diagnosis of orthohantavirus infection, making the search for potential immunogenic candidates crucial. In the present work, various bioinformatics tools were employed to design conserved immunogenic peptides containing multiple epitopes of PUUV nucleocapsid protein. Eleven conserved peptides (90% conservancy) of the PUUV nucleocapsid protein were identified. Three conserved peptides containing multiple T and B cell epitopes were selected using a consensus epitope prediction algorithm. Molecular docking using the HPEP dock server demonstrated strong binding interactions between the epitopes and HLA molecules (ten alleles for each class I and II HLA). Moreover, an analysis of population coverage using the IEDB database revealed that the identified peptides have over 90% average population coverage across six continents. Molecular docking and simulation analysis reveal a stable interaction with peptide constructs of chosen immunogenic peptides and Toll-like receptor-4. These computational analyses demonstrate selected peptides' immunogenic potential, which needs to be validated in different experimental systems.


Subject(s)
Molecular Docking Simulation , Nucleocapsid Proteins , Peptides , Puumala virus , Puumala virus/immunology , Puumala virus/genetics , Peptides/immunology , Peptides/chemistry , Humans , Nucleocapsid Proteins/immunology , Nucleocapsid Proteins/chemistry , Nucleocapsid Proteins/genetics , Epitopes, T-Lymphocyte/immunology , Epitopes, T-Lymphocyte/chemistry , Epitopes, T-Lymphocyte/genetics , Epitopes, B-Lymphocyte/immunology , Epitopes, B-Lymphocyte/chemistry , Hemorrhagic Fever with Renal Syndrome/immunology , Hemorrhagic Fever with Renal Syndrome/virology , Toll-Like Receptor 4/metabolism , Toll-Like Receptor 4/immunology , Toll-Like Receptor 4/chemistry , Computational Biology , Conserved Sequence , Amino Acid Sequence , Protein Binding
15.
Viruses ; 16(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-39066234

ABSTRACT

Coronavirus infectious disease 2019 (COVID-19), caused by severe acute respiratory virus type 2 (SARS-CoV-2), has caused a global public health crisis. As an RNA virus, the high gene mutability of SARS-CoV-2 poses significant challenges to the development of broad-spectrum vaccines and antiviral therapeutics. There remains a lack of specific therapeutics directly targeting SARS-CoV-2. With the ability to efficiently inhibit the expression of target genes in a sequence-specific way, small interfering RNA (siRNA) therapy has exhibited significant potential in antiviral and other disease treatments. In this work, we presented a highly effective self-assembled siRNA nanoparticle targeting multiple highly conserved regions of SARS-CoV-2. The siRNA sequences targeting viral conserved regions were first screened and evaluated by their thermodynamic features, off-target effects, and secondary structure toxicities. RNA motifs including siRNA sequences were then designed and self-assembled into siRNA nanoparticles. These siRNA nanoparticles demonstrated remarkable uniformity and stability and efficiently entered cells directly through cellular endocytic pathways. Moreover, these nanoparticles effectively inhibited the replication of SARS-CoV-2, exhibiting a superior inhibitory effect compared to free siRNA. These results demonstrated that these self-assembled siRNA nanoparticles targeting highly conserved regions of SARS-CoV-2 represent highly effective antiviral candidates for the treatment of infections, and are promisingly effective against current and future viral variants.


Subject(s)
Nanoparticles , RNA, Small Interfering , SARS-CoV-2 , Virus Replication , RNA, Small Interfering/genetics , SARS-CoV-2/drug effects , SARS-CoV-2/genetics , SARS-CoV-2/physiology , Virus Replication/drug effects , Nanoparticles/chemistry , Humans , Antiviral Agents/pharmacology , Antiviral Agents/chemistry , Conserved Sequence , COVID-19/virology , RNA, Viral/genetics , RNA, Viral/metabolism , Animals , Chlorocebus aethiops , Vero Cells
16.
J Membr Biol ; 257(3-4): 215-230, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38970681

ABSTRACT

Progesterone (P4) acts as a key conserved signalling molecule in vertebrate reproduction. P4 is especially important for mature sperm physiology and subsequent reproductive success. "CatSpermasome", a multi-unit molecular complex, has been suggested to be the main if not the only P4-responsive atypical Ca2+-ion channel present in mature sperm. Altogether, here we analyse the protein sequences of CatSper1-4 from more than 500 vertebrates ranging from early fishes to humans. CatSper1 becomes longer in mammals due to sequence gain mainly at the N-terminus. Overall the conservation of full-length CatSper1-4 as well as the individual TM regions remain low. The lipid-water-interface residues (i.e. a 5 amino acid stretch sequence present on both sides of each TM region) also remain highly diverged. No specific patterns of amino acid distributions were observed. The total frequency of positively charged, negatively charged or their ratios do not follow in any specific pattern. Similarly, the frequency of total hydrophobic, total hydrophilic residues or even their ratios remain random and do not follow any specific pattern. We noted that the CatSper1-4 genes are missing in amphibians and the CatSper1 gene is missing in birds. The high variability of CatSper1-4 and gene-loss in certain clades indicate that the "CatSpermasome" is not the only P4-responsive ion channel. Data indicate that the molecular evolution of CatSper is mostly guided by diverse hydrophobic ligands rather than only P4. The comparative data also suggest possibilities of other Ca2+-channel/s in vertebrate sperm that can also respond to P4.


Subject(s)
Calcium Channels , Progesterone , Spermatozoa , Male , Animals , Spermatozoa/metabolism , Calcium Channels/metabolism , Calcium Channels/genetics , Calcium Channels/chemistry , Progesterone/metabolism , Humans , Vertebrates/genetics , Vertebrates/metabolism , Amino Acid Sequence , Conserved Sequence
17.
Nucleic Acids Res ; 52(14): 8112-8126, 2024 Aug 12.
Article in English | MEDLINE | ID: mdl-38953162

ABSTRACT

Ribosome profiling experiments support the translation of a range of novel human open reading frames. By contrast, most peptides from large-scale proteomics experiments derive from just one source, 5' untranslated regions. Across the human genome we find evidence for 192 translated upstream regions, most of which would produce protein isoforms with extended N-terminal ends. Almost all of these N-terminal extensions are from highly abundant genes, which suggests that the novel regions we detect are just the tip of the iceberg. These upstream regions have characteristics that are not typical of coding exons. Their GC-content is remarkably high, even higher than 5' regions in other genes, and a large majority have non-canonical start codons. Although some novel upstream regions have cross-species conservation - five have orthologues in invertebrates for example - the reading frames of two thirds are not conserved beyond simians. These non-conserved regions also have no evidence of purifying selection, which suggests that much of this translation is not functional. In addition, non-conserved upstream regions have significantly more peptides in cancer cell lines than would be expected, a strong indication that an aberrant or noisy translation initiation process may play an important role in translation from upstream regions.


Subject(s)
5' Untranslated Regions , Protein Biosynthesis , Humans , Codon, Initiator/genetics , Base Composition , Genome, Human , Animals , Open Reading Frames/genetics , Conserved Sequence , Peptides/genetics , Peptides/metabolism
18.
Nature ; 632(8023): 166-173, 2024 Aug.
Article in English | MEDLINE | ID: mdl-39020176

ABSTRACT

Gene expression in Arabidopsis is regulated by more than 1,900 transcription factors (TFs), which have been identified genome-wide by the presence of well-conserved DNA-binding domains. Activator TFs contain activation domains (ADs) that recruit coactivator complexes; however, for nearly all Arabidopsis TFs, we lack knowledge about the presence, location and transcriptional strength of their ADs1. To address this gap, here we use a yeast library approach to experimentally identify Arabidopsis ADs on a proteome-wide scale, and find that more than half of the Arabidopsis TFs contain an AD. We annotate 1,553 ADs, the vast majority of which are, to our knowledge, previously unknown. Using the dataset generated, we develop a neural network to accurately predict ADs and to identify sequence features that are necessary to recruit coactivator complexes. We uncover six distinct combinations of sequence features that result in activation activity, providing a framework to interrogate the subfunctionalization of ADs. Furthermore, we identify ADs in the ancient AUXIN RESPONSE FACTOR family of TFs, revealing that AD positioning is conserved in distinct clades. Our findings provide a deep resource for understanding transcriptional activation, a framework for examining function in intrinsically disordered regions and a predictive model of ADs.


Subject(s)
Arabidopsis Proteins , Arabidopsis , Gene Expression Regulation, Plant , Protein Domains , Transcription Factors , Transcriptional Activation , Arabidopsis/chemistry , Arabidopsis/genetics , Arabidopsis/metabolism , Arabidopsis Proteins/chemistry , Arabidopsis Proteins/classification , Arabidopsis Proteins/metabolism , Conserved Sequence/genetics , Datasets as Topic , Gene Expression Regulation, Plant/genetics , Indoleacetic Acids/metabolism , Intrinsically Disordered Proteins , Molecular Sequence Annotation , Neural Networks, Computer , Proteome/chemistry , Proteome/metabolism , Transcription Factors/chemistry , Transcription Factors/classification , Transcription Factors/metabolism , Transcriptional Activation/genetics
19.
Biochemistry ; 63(14): 1795-1807, 2024 Jul 16.
Article in English | MEDLINE | ID: mdl-38951132

ABSTRACT

Many bacteria have hemerythrin (Hr) proteins that bind O2, including Pseudomonas aeruginosa, in which microoxia-induced Hr (Mhr) provide fitness advantages under microoxic conditions. Mhr has a 23 amino-acid extension at its C-terminus relative to a well-characterized Hr from Methylococcus capsulatus, and similar extensions are also found in Hrs from other bacteria. The last 11 amino acids of this extended, C-terminal tail are highly conserved in gammaproteobacteria and predicted to form a helix with positively charged and hydrophobic faces. In cellular fractionation assays, wild-type (WT) Mhr was found in both membrane and cytosolic fractions, while a MhrW143* variant lacking the last 11 residues was largely in the cytosol and did not complement Mhr function in competition assays. MhrL112Y, a variant that has a much longer-lived O2-bound form, was fully functional and had a similar localization pattern to that of WT Mhr. Both MhrW143* and MhrL112Y had secondary structures, stabilities, and O2-binding kinetics similar to those of WT Mhr. Fluorescence studies revealed that the C-terminal tail, and particularly the fragment corresponding to its last 11 residues, was sufficient and necessary for association with lipid vesicles. Molecular dynamics simulations and subsequent cellular analysis of Mhr variants have demonstrated that conserved, positively charged residues in the tail are important for Mhr interactions with negatively charged membranes and the contribution of this protein to competitive fitness. Together, these data suggest that peripheral interactions of Mhr with membranes are guided by the C-terminal tail and are independent of O2-binding.


Subject(s)
Cell Membrane , Hemerythrin , Pseudomonas aeruginosa , Pseudomonas aeruginosa/metabolism , Pseudomonas aeruginosa/genetics , Hemerythrin/metabolism , Hemerythrin/chemistry , Hemerythrin/genetics , Cell Membrane/metabolism , Bacterial Proteins/metabolism , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Amino Acid Sequence , Conserved Sequence , Oxygen/metabolism
20.
New Phytol ; 243(6): 2235-2250, 2024 Sep.
Article in English | MEDLINE | ID: mdl-39049570

ABSTRACT

Cytoplasmic ribonucleoprotein (RNP) granules are membraneless structures composed of various RNAs and proteins that play important roles in post-transcriptional regulation. While RNP granules are known to regulate the meiotic entry in some organisms, little is known about their roles in plants. In this study, we observed the cytoplasmic granular structures of rice RNA-binding protein MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2), which contributes to the control of meiotic entry timing, in leaf protoplasts and spore mother cells. We performed colocalization analysis with known cytoplasmic RNP factors, and domain deletion analysis to assess their impact on granule formation and meiosis progression. Conservation of MEL2 domains across plant species was also explored. Our results indicated that MEL2 granules colocalized with processing body and stress granule factors. The maintenance of granule properties modulated by LOTUS domain and the intrinsically disordered region (IDR) is essential for proper MEL2 function in meiosis progression. MEL2-like proteins widely found in plant kingdom conserved LOTUS domain followed by the IDR despite their diverse domain structures, suggesting the functional conservation of these domains among plant species. This study highlights the role of MEL2 granule dynamics and its impact on meiotic transition and progression.


Subject(s)
Cytoplasmic Granules , Meiosis , Oryza , Plant Proteins , Protein Domains , Ribonucleoproteins , Oryza/genetics , Oryza/metabolism , Plant Proteins/metabolism , Plant Proteins/genetics , Plant Proteins/chemistry , Ribonucleoproteins/metabolism , Cytoplasmic Granules/metabolism , Conserved Sequence
SELECTION OF CITATIONS
SEARCH DETAIL