Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 39
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
bioRxiv ; 2024 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-38659920

RESUMO

Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. Additionally, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from non-coding ones in otherwise ambiguous cases.

2.
iScience ; 26(11): 108080, 2023 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-37860693

RESUMO

The SARS-CoV-2 genome encodes a multitude of accessory proteins. Using comparative genomic approaches, an additional accessory protein, ORF3c, has been predicted to be encoded within the ORF3a sgmRNA. Expression of ORF3c during infection has been confirmed independently by ribosome profiling. Despite ORF3c also being present in the 2002-2003 SARS-CoV, its function has remained unexplored. Here we show that ORF3c localizes to mitochondria, where it inhibits innate immunity by restricting IFN-ß production, but not NF-κB activation or JAK-STAT signaling downstream of type I IFN stimulation. We find that ORF3c is inhibitory after stimulation with cytoplasmic RNA helicases RIG-I or MDA5 or adaptor protein MAVS, but not after TRIF, TBK1 or phospho-IRF3 stimulation. ORF3c co-immunoprecipitates with the antiviral proteins MAVS and PGAM5 and induces MAVS cleavage by caspase-3. Together, these data provide insight into an uncharacterized mechanism of innate immune evasion by this important human pathogen.

3.
Nucleic Acids Res ; 51(D1): D942-D949, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36420896

RESUMO

GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.


Assuntos
Biologia Computacional , Genoma Humano , Humanos , Animais , Camundongos , Anotação de Sequência Molecular , Biologia Computacional/métodos , Genoma Humano/genética , Transcriptoma/genética , Perfilação da Expressão Gênica , Bases de Dados Genéticas
6.
Nature ; 602(7897): 487-495, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34942634

RESUMO

The emergence of SARS-CoV-2 variants of concern suggests viral adaptation to enhance human-to-human transmission1,2. Although much effort has focused on the characterization of changes in the spike protein in variants of concern, mutations outside of spike are likely to contribute to adaptation. Here, using unbiased abundance proteomics, phosphoproteomics, RNA sequencing and viral replication assays, we show that isolates of the Alpha (B.1.1.7) variant3 suppress innate immune responses in airway epithelial cells more effectively than first-wave isolates. We found that the Alpha variant has markedly increased subgenomic RNA and protein levels of the nucleocapsid protein (N), Orf9b and Orf6-all known innate immune antagonists. Expression of Orf9b alone suppressed the innate immune response through interaction with TOM70, a mitochondrial protein that is required for activation of the RNA-sensing adaptor MAVS. Moreover, the activity of Orf9b and its association with TOM70 was regulated by phosphorylation. We propose that more effective innate immune suppression, through enhanced expression of specific viral antagonist proteins, increases the likelihood of successful transmission of the Alpha variant, and may increase in vivo replication and duration of infection4. The importance of mutations outside the spike coding region in the adaptation of SARS-CoV-2 to humans is underscored by the observation that similar mutations exist in the N and Orf9b regulatory regions of the Delta and Omicron variants.


Assuntos
COVID-19/imunologia , COVID-19/virologia , Evolução Molecular , Evasão da Resposta Imune , Imunidade Inata/imunologia , SARS-CoV-2/genética , SARS-CoV-2/imunologia , COVID-19/transmissão , Proteínas do Nucleocapsídeo de Coronavírus/química , Proteínas do Nucleocapsídeo de Coronavírus/metabolismo , Humanos , Imunidade Inata/genética , Interferons/imunologia , Proteínas do Complexo de Importação de Proteína Precursora Mitocondrial/metabolismo , Fosfoproteínas/química , Fosfoproteínas/metabolismo , Fosforilação , Proteômica , RNA Viral/genética , RNA-Seq , SARS-CoV-2/classificação , SARS-CoV-2/crescimento & desenvolvimento
7.
Nat Biotechnol ; 40(2): 209-217, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34663921

RESUMO

Tumor-associated epitopes presented on MHC-I that can activate the immune system against cancer cells are typically identified from annotated protein-coding regions of the genome, but whether peptides originating from novel or unannotated open reading frames (nuORFs) can contribute to antitumor immune responses remains unclear. Here we show that peptides originating from nuORFs detected by ribosome profiling of malignant and healthy samples can be displayed on MHC-I of cancer cells, acting as additional sources of cancer antigens. We constructed a high-confidence database of translated nuORFs across tissues (nuORFdb) and used it to detect 3,555 translated nuORFs from MHC-I immunopeptidome mass spectrometry analysis, including peptides that result from somatic mutations in nuORFs of cancer samples as well as tumor-specific nuORFs translated in melanoma, chronic lymphocytic leukemia and glioblastoma. NuORFs are an unexplored pool of MHC-I-presented, tumor-specific peptides with potential as immunotherapy targets.


Assuntos
Imunoterapia , Melanoma , Antígenos de Neoplasias , Antígenos de Histocompatibilidade Classe I/genética , Antígenos de Histocompatibilidade Classe I/metabolismo , Humanos , Imunoterapia/métodos , Espectrometria de Massas , Melanoma/genética , Peptídeos
8.
Nutrients ; 13(6)2021 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-34064175

RESUMO

CONTEXT: Calcifediol has been proposed as a potential treatment for COVID-19 patients. OBJECTIVE: To compare the administration or not of oral calcifediol on mortality risk of patients hospitalized because of COVID-19. DESIGN: Retrospective, multicenter, open, non-randomized cohort study. SETTINGS: Hospitalized care. PATIENTS: Patients with laboratory-confirmed COVID-19 between 5 February and 5 May 2020 in five hospitals in the South of Spain. INTERVENTION: Patients received calcifediol (25-hydroxyvitamin D3) treatment (0.266 mg/capsule, 2 capsules on entry and then one capsule on day 3, 7, 14, 21, and 28) or not. MAIN OUTCOME MEASURE: In-hospital mortality during the first 30 days after admission. RESULTS: A total of 537 patients were hospitalized with COVID-19 (317 males (59%), median age, 70 years), and 79 (14.7%) received calcifediol treatment. Overall, in-hospital mortality during the first 30 days was 17.5%. The OR of death for patients receiving calcifediol (mortality rate of 5%) was 0.22 (95% CI, 0.08 to 0.61) compared to patients not receiving such treatment (mortality rate of 20%; p < 0.01). Patients who received calcifediol after admission were more likely than those not receiving treatment to have comorbidity and a lower rate of CURB-65 score for pneumonia severity ≥ 3 (one point for each of confusion, urea > 7 mmol/L, respiratory rate ≥ 30/min, systolic blood pressure < 90 mm Hg or diastolic blood pressure ≤ 60 mm Hg, and age ≥ 65 years), acute respiratory distress syndrome (moderate or severe), c-reactive protein, chronic kidney disease, and blood urea nitrogen. In a multivariable logistic regression model, adjusting for confounders, there were significant differences in mortality for patients receiving calcifediol compared with patients not receiving it (OR = 0.16 (95% CI 0.03 to 0.80). CONCLUSION: Among patients hospitalized with COVID-19, treatment with calcifediol, compared with those not receiving calcifediol, was significantly associated with lower in-hospital mortality during the first 30 days. The observational design and sample size may limit the interpretation of these findings.


Assuntos
Tratamento Farmacológico da COVID-19 , Calcifediol/administração & dosagem , Mortalidade Hospitalar , SARS-CoV-2/metabolismo , Idoso , Idoso de 80 Anos ou mais , COVID-19/sangue , COVID-19/mortalidade , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos , Índice de Gravidade de Doença
9.
Nat Commun ; 12(1): 2642, 2021 05 11.
Artigo em Inglês | MEDLINE | ID: mdl-33976134

RESUMO

Despite its clinical importance, the SARS-CoV-2 gene set remains unresolved, hindering dissection of COVID-19 biology. We use comparative genomics to provide a high-confidence protein-coding gene set, characterize evolutionary constraint, and prioritize functional mutations. We select 44 Sarbecovirus genomes at ideally-suited evolutionary distances, and quantify protein-coding evolutionary signatures and overlapping constraint. We find strong protein-coding signatures for ORFs 3a, 6, 7a, 7b, 8, 9b, and a novel alternate-frame gene, ORF3c, whereas ORFs 2b, 3d/3d-2, 3b, 9c, and 10 lack protein-coding signatures or convincing experimental evidence of protein-coding function. Furthermore, we show no other conserved protein-coding genes remain to be discovered. Mutation analysis suggests ORF8 contributes to within-individual fitness but not person-to-person transmission. Cross-strain and within-strain evolutionary pressures agree, except for fewer-than-expected within-strain mutations in nsp3 and S1, and more-than-expected in nucleocapsid, which shows a cluster of mutations in a predicted B-cell epitope, suggesting immune-avoidance selection. Evolutionary histories of residues disrupted by spike-protein substitutions D614G, N501Y, E484K, and K417N/T provide clues about their biology, and we catalog likely-functional co-inherited mutations. Previously reported RNA-modification sites show no enrichment for conservation. Here we report a high-confidence gene set and evolutionary-history annotations providing valuable resources and insights on SARS-CoV-2 biology, mutations, and evolution.


Assuntos
COVID-19/virologia , Genoma Viral/genética , Mutação , SARS-CoV-2/genética , Betacoronavirus/classificação , Betacoronavirus/genética , Códon , Evolução Molecular , Genes Virais , Aptidão Genética , Variação Genética , Fases de Leitura Aberta , Filogenia , Glicoproteína da Espícula de Coronavírus/genética , Proteínas Virais/genética
11.
Virology ; 558: 145-151, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33774510

RESUMO

At least six small alternative-frame open reading frames (ORFs) overlapping well-characterized SARS-CoV-2 genes have been hypothesized to encode accessory proteins. Researchers have used different names for the same ORF or the same name for different ORFs, resulting in erroneous homological and functional inferences. We propose standard names for these ORFs and their shorter isoforms, developed in consultation with the Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. We recommend calling the 39 codon Spike-overlapping ORF ORF2b; the 41, 57, and 22 codon ORF3a-overlapping ORFs ORF3c, ORF3d, and ORF3b; the 33 codon ORF3d isoform ORF3d-2; and the 97 and 73 codon Nucleocapsid-overlapping ORFs ORF9b and ORF9c. Finally, we document conflicting usage of the name ORF3b in 32 studies, and consequent erroneous inferences, stressing the importance of reserving identical names for homologs. We recommend that authors referring to these ORFs provide lengths and coordinates to minimize ambiguity caused by prior usage of alternative names.


Assuntos
Fases de Leitura Aberta , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus , Terminologia como Assunto , SARS-CoV-2/imunologia , Glicoproteína da Espícula de Coronavírus/classificação , Glicoproteína da Espícula de Coronavírus/genética
12.
Mol Biol Evol ; 38(2): 486-501, 2021 01 23.
Artigo em Inglês | MEDLINE | ID: mdl-32946576

RESUMO

Bumblebees are a diverse group of globally important pollinators in natural ecosystems and for agricultural food production. With both eusocial and solitary life-cycle phases, and some social parasite species, they are especially interesting models to understand social evolution, behavior, and ecology. Reports of many species in decline point to pathogen transmission, habitat loss, pesticide usage, and global climate change, as interconnected causes. These threats to bumblebee diversity make our reliance on a handful of well-studied species for agricultural pollination particularly precarious. To broadly sample bumblebee genomic and phenotypic diversity, we de novo sequenced and assembled the genomes of 17 species, representing all 15 subgenera, producing the first genus-wide quantification of genetic and genomic variation potentially underlying key ecological and behavioral traits. The species phylogeny resolves subgenera relationships, whereas incomplete lineage sorting likely drives high levels of gene tree discordance. Five chromosome-level assemblies show a stable 18-chromosome karyotype, with major rearrangements creating 25 chromosomes in social parasites. Differential transposable element activity drives changes in genome sizes, with putative domestications of repetitive sequences influencing gene coding and regulatory potential. Dynamically evolving gene families and signatures of positive selection point to genus-wide variation in processes linked to foraging, diet and metabolism, immunity and detoxification, as well as adaptations for life at high altitudes. Our study reveals how bumblebee genes and genomes have evolved across the Bombus phylogeny and identifies variations potentially linked to key ecological and behavioral traits of these important pollinators.


Assuntos
Adaptação Biológica/genética , Abelhas/genética , Evolução Biológica , Genoma de Inseto , Animais , Uso do Códon , Elementos de DNA Transponíveis , Dieta , Comportamento Alimentar , Componentes do Gene , Tamanho do Genoma , Seleção Genética
13.
Nucleic Acids Res ; 49(D1): D916-D923, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33270111

RESUMO

The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.


Assuntos
COVID-19/prevenção & controle , Biologia Computacional/métodos , Bases de Dados Genéticas , Genômica/métodos , Anotação de Sequência Molecular/métodos , SARS-CoV-2/genética , Animais , COVID-19/epidemiologia , COVID-19/virologia , Epidemias , Humanos , Internet , Camundongos , Pseudogenes/genética , RNA Longo não Codificante/genética , SARS-CoV-2/metabolismo , SARS-CoV-2/fisiologia , Transcrição Gênica/genética
14.
Mol Cell ; 80(6): 1078-1091.e6, 2020 12 17.
Artigo em Inglês | MEDLINE | ID: mdl-33290746

RESUMO

We report that the SARS-CoV-2 nucleocapsid protein (N-protein) undergoes liquid-liquid phase separation (LLPS) with viral RNA. N-protein condenses with specific RNA genomic elements under physiological buffer conditions and condensation is enhanced at human body temperatures (33°C and 37°C) and reduced at room temperature (22°C). RNA sequence and structure in specific genomic regions regulate N-protein condensation while other genomic regions promote condensate dissolution, potentially preventing aggregation of the large genome. At low concentrations, N-protein preferentially crosslinks to specific regions characterized by single-stranded RNA flanked by structured elements and these features specify the location, number, and strength of N-protein binding sites (valency). Liquid-like N-protein condensates form in mammalian cells in a concentration-dependent manner and can be altered by small molecules. Condensation of N-protein is RNA sequence and structure specific, sensitive to human body temperature, and manipulatable with small molecules, and therefore presents a screenable process for identifying antiviral compounds effective against SARS-CoV-2.


Assuntos
COVID-19/metabolismo , Proteínas do Nucleocapsídeo de Coronavírus/metabolismo , Genoma Viral , Nucleocapsídeo/metabolismo , RNA Viral/metabolismo , SARS-CoV-2/metabolismo , Animais , Antivirais/farmacologia , COVID-19/genética , Chlorocebus aethiops , Proteínas do Nucleocapsídeo de Coronavírus/genética , Avaliação Pré-Clínica de Medicamentos , Células HEK293 , Humanos , Nucleocapsídeo/genética , Fosfoproteínas/genética , Fosfoproteínas/metabolismo , SARS-CoV-2/genética , Células Vero , Tratamento Farmacológico da COVID-19
15.
Res Sq ; 2020 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-33024961

RESUMO

Despite its overwhelming clinical importance, the SARS-CoV-2 gene set remains unresolved, hindering dissection of COVID-19 biology. Here, we use comparative genomics to provide a high-confidence protein-coding gene set, characterize protein-level and nucleotide-level evolutionary constraint, and prioritize functional mutations from the ongoing COVID-19 pandemic. We select 44 complete Sarbecovirus genomes at evolutionary distances ideally-suited for protein-coding and non-coding element identification, create whole-genome alignments, and quantify protein-coding evolutionary signatures and overlapping constraint. We find strong protein-coding signatures for all named genes and for 3a, 6, 7a, 7b, 8, 9b, and also ORF3c, a novel alternate-frame gene. By contrast, ORF10, and overlapping-ORFs 9c, 3b, and 3d lack protein-coding signatures or convincing experimental evidence and are not protein-coding. Furthermore, we show no other protein-coding genes remain to be discovered. Cross-strain and within-strain evolutionary pressures largely agree at the gene, amino-acid, and nucleotide levels, with some notable exceptions, including fewer-than-expected mutations in nsp3 and Spike subunit S1, and more-than-expected mutations in Nucleocapsid. The latter also shows a cluster of amino-acid-changing variants in otherwise-conserved residues in a predicted B-cell epitope, which may indicate positive selection for immune avoidance. Several Spike-protein mutations, including D614G, which has been associated with increased transmission, disrupt otherwise-perfectly-conserved amino acids, and could be novel adaptations to human hosts. The resulting high-confidence gene set and evolutionary-history annotations provide valuable resources and insights on COVID-19 biology, mutations, and evolution.

16.
Nat Genet ; 52(11): 1145-1150, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-33046855

RESUMO

The influence of genetic background on driver mutations is well established; however, the mechanisms by which the background interacts with Mendelian loci remain unclear. We performed a systematic secondary-variant burden analysis of two independent cohorts of patients with Bardet-Biedl syndrome (BBS) with known recessive biallelic pathogenic mutations in one of 17 BBS genes for each individual. We observed a significant enrichment of trans-acting rare nonsynonymous secondary variants in patients with BBS compared with either population controls or a cohort of individuals with a non-BBS diagnosis and recessive variants in the same gene set. Strikingly, we found a significant over-representation of secondary alleles in chaperonin-encoding genes-a finding corroborated by the observation of epistatic interactions involving this complex in vivo. These data indicate a complex genetic architecture for BBS that informs the biological properties of disease modules and presents a model for secondary-variant burden analysis in recessive disorders.


Assuntos
Síndrome de Bardet-Biedl/genética , Variação Genética , Alelos , Estudos de Coortes , Exoma , Humanos
17.
Cell Syst ; 11(2): 145-160.e5, 2020 08 26.
Artigo em Inglês | MEDLINE | ID: mdl-32710835

RESUMO

Genomic analyses in budding yeast have helped define the foundational principles of eukaryotic gene expression. However, in the absence of empirical methods for defining coding regions, these analyses have historically excluded specific classes of possible coding regions, such as those initiating at non-AUG start codons. Here, we applied an experimental approach to globally annotate translation initiation sites in yeast and identified 149 genes with alternative N-terminally extended protein isoforms initiating from near-cognate codons upstream of annotated AUG start codons. These isoforms are produced in concert with canonical isoforms and translated with high specificity, resulting from initiation at only a small subset of possible start codons. The non-AUG initiation driving their production is enriched during meiosis and induced by low eIF5A, which is seen in this context. These findings reveal widespread production of non-canonical protein isoforms and unexpected complexity to the rules by which even a simple eukaryotic genome is decoded.


Assuntos
Códon/metabolismo , Iniciação Traducional da Cadeia Peptídica/genética , Biossíntese de Proteínas/genética , Isoformas de Proteínas/metabolismo , Saccharomyces cerevisiae/genética
18.
bioRxiv ; 2020 Sep 02.
Artigo em Inglês | MEDLINE | ID: mdl-32577641

RESUMO

Despite its overwhelming clinical importance, the SARS-CoV-2 gene set remains unresolved, hindering dissection of COVID-19 biology. Here, we use comparative genomics to provide a high-confidence protein-coding gene set, characterize protein-level and nucleotide-level evolutionary constraint, and prioritize functional mutations from the ongoing COVID-19 pandemic. We select 44 complete Sarbecovirus genomes at evolutionary distances ideally-suited for protein-coding and non-coding element identification, create whole-genome alignments, and quantify protein-coding evolutionary signatures and overlapping constraint. We find strong protein-coding signatures for all named genes and for 3a, 6, 7a, 7b, 8, 9b, and also ORF3c, a novel alternate-frame gene. By contrast, ORF10, and overlapping-ORFs 9c, 3b, and 3d lack protein-coding signatures or convincing experimental evidence and are not protein-coding. Furthermore, we show no other protein-coding genes remain to be discovered. Cross-strain and within-strain evolutionary pressures largely agree at the gene, amino-acid, and nucleotide levels, with some notable exceptions, including fewer-than-expected mutations in nsp3 and Spike subunit S1, and more-than-expected mutations in Nucleocapsid. The latter also shows a cluster of amino-acid-changing variants in otherwise-conserved residues in a predicted B-cell epitope, which may indicate positive selection for immune avoidance. Several Spike-protein mutations, including D614G, which has been associated with increased transmission, disrupt otherwise-perfectly-conserved amino acids, and could be novel adaptations to human hosts. The resulting high-confidence gene set and evolutionary-history annotations provide valuable resources and insights on COVID-19 biology, mutations, and evolution.

19.
bioRxiv ; 2020 Jun 12.
Artigo em Inglês | MEDLINE | ID: mdl-32587965

RESUMO

A mechanistic understanding of the SARS-CoV-2 viral replication cycle is essential to develop new therapies for the COVID-19 global health crisis. In this study, we show that the SARS-CoV-2 nucleocapsid protein (N-protein) undergoes liquid-liquid phase separation (LLPS) with the viral genome, and propose a model of viral packaging through LLPS. N-protein condenses with specific RNA sequences in the first 1000 nts (5'-End) under physiological conditions and is enhanced at human upper airway temperatures. N-protein condensates exclude non-packaged RNA sequences. We comprehensively map sites bound by N-protein in the 5'-End and find preferences for single-stranded RNA flanked by stable structured elements. Liquid-like N-protein condensates form in mammalian cells in a concentration-dependent manner and can be altered by small molecules. Condensation of N-protein is sequence and structure specific, sensitive to human body temperature, and manipulatable with small molecules thus presenting screenable processes for identifying antiviral compounds effective against SARS-CoV-2.

20.
BMC Genet ; 21(1): 25, 2020 03 06.
Artigo em Inglês | MEDLINE | ID: mdl-32138667

RESUMO

BACKGROUND: POLG, located on nuclear chromosome 15, encodes the DNA polymerase γ(Pol γ). Pol γ is responsible for the replication and repair of mitochondrial DNA (mtDNA). Pol γ is the only DNA polymerase found in mitochondria for most animal cells. Mutations in POLG are the most common single-gene cause of diseases of mitochondria and have been mapped over the coding region of the POLG ORF. RESULTS: Using PhyloCSF to survey alternative reading frames, we found a conserved coding signature in an alternative frame in exons 2 and 3 of POLG, herein referred to as ORF-Y that arose de novo in placental mammals. Using the synplot2 program, synonymous site conservation was found among mammals in the region of the POLG ORF that is overlapped by ORF-Y. Ribosome profiling data revealed that ORF-Y is translated and that initiation likely occurs at a CUG codon. Inspection of an alignment of mammalian sequences containing ORF-Y revealed that the CUG codon has a strong initiation context and that a well-conserved predicted RNA stem-loop begins 14 nucleotides downstream. Such features are associated with enhanced initiation at near-cognate non-AUG codons. Reanalysis of the Kim et al. (2014) draft human proteome dataset yielded two unique peptides that map unambiguously to ORF-Y. An additional conserved uORF, herein referred to as ORF-Z, was also found in exon 2 of POLG. Lastly, we surveyed Clinvar variants that are synonymous with respect to the POLG ORF and found that most of these variants cause amino acid changes in ORF-Y or ORF-Z. CONCLUSIONS: We provide evidence for a novel coding sequence, ORF-Y, that overlaps the POLG ORF. Ribosome profiling and mass spectrometry data show that ORF-Y is expressed. PhyloCSF and synplot2 analysis show that ORF-Y is subject to strong purifying selection. An abundance of disease-correlated mutations that map to exons 2 and 3 of POLG but also affect ORF-Y provides potential clinical significance to this finding.


Assuntos
Códon de Iniciação/genética , DNA Polimerase gama/genética , Mitocôndrias/genética , Ribossomos/genética , Sequência de Aminoácidos , DNA Mitocondrial/genética , Éxons/genética , Humanos , Conformação de Ácido Nucleico , Fases de Leitura Aberta/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...