Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 2.067
Filtrar
1.
Trends Genet ; 40(10): 819-821, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39278786

RESUMO

Tandem-repeat DNA sequences appear to be singularly capable of yielding abundant repeat-number mutations with a potentially advantageous distribution of fitness effects. Although knowing the rates and relative proportions of deleterious, neutral and beneficial mutations is fundamental for understanding evolvability, analysis of adaptation routinely overlooks small-effect mutations arising in tandem repeats.


Assuntos
Evolução Molecular , Mutação , Sequências de Repetição em Tandem/genética , Aptidão Genética , Humanos , Seleção Genética/genética , Modelos Genéticos
2.
Genes Chromosomes Cancer ; 63(9): e23266, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39248534

RESUMO

Classical and mixed congenital mesoblastic nephroma (CMN) are characterized by an internal tandem duplication (ITD) of the EGFR gene, in contrast to cellular CMN that usually harbors an ETV6::NTRK3 gene fusion. This same fusion occurs in infantile fibrosarcoma, and this tumor can be considered as the soft tissue equivalent of cellular CMN. A soft tissue equivalent of classic/mixed CMN remains undefined at the genetic level. Since classical CMN resembles fibromatosis of soft tissue histologically, we asked whether fibromatosis in children might show EGFR ITD. ITD was investigated using the polymerase chain reaction and primers for exons 18 and 25 of the EGFR gene. Seven of the eight cases of classical or mixed CMN were positive by this approach, but none of the five cellular CMNs. Of 11 cases of fibromatosis (six plantar, two digital, and three desmoid), none were positive for EGFR ITD. Within the limits of this small study, we conclude that pediatric fibromatosis is likely not characterized by EGFR ITD. There are isolated reports of pediatric soft tissue tumors that harbor EGFR ITD, but these have the appearance of infantile fibrosarcoma or mixed CMN rather than fibromatosis. We did not find any such cases, since all 14 cases of infantile fibrosarcoma in our study had an ETV6::NTRK3 fusion. The soft tissue tumors with EGFR ITD are not a morphologic match for the low-grade histology of classical CMN. Whether they have a similar favorable biology or behave more like fibrosarcoma with an ETV6::NTRK3 fusion or an alternative fusion involving other kinases remains to be determined.


Assuntos
Receptores ErbB , Nefroma Mesoblástico , Humanos , Nefroma Mesoblástico/genética , Nefroma Mesoblástico/patologia , Feminino , Receptores ErbB/genética , Lactente , Masculino , Pré-Escolar , Criança , Neoplasias Renais/genética , Neoplasias Renais/patologia , Sequências de Repetição em Tandem/genética , Duplicação Gênica , Proteínas de Fusão Oncogênica/genética
3.
Bioinformatics ; 40(8)2024 08 02.
Artigo em Inglês | MEDLINE | ID: mdl-39110522

RESUMO

MOTIVATION: A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps requires high computational overhead and can suffer from decreasing accuracy. RESULTS: In this work, we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mb genome of Arabidopsis thaliana in under 5 min on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes. AVAILABILITY AND IMPLEMENTATION: ModDotPlot is available at https://github.com/marbl/ModDotPlot.


Assuntos
Arabidopsis , Software , Sequências de Repetição em Tandem , Arabidopsis/genética , Sequências de Repetição em Tandem/genética , Genoma de Planta , Interface Usuário-Computador , Genômica/métodos
4.
Nat Commun ; 15(1): 7119, 2024 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-39164244

RESUMO

The insulin-linked polymorphic region is a variable number of tandem repeats region of DNA in the promoter of the insulin gene that regulates transcription of insulin. This region is known to form the alternative DNA structures, i-motifs and G-quadruplexes. Individuals have different sequence variants of tandem repeats and although previous work investigated the effects of some variants on G-quadruplex formation, there is not a clear picture of the relationship between the sequence diversity, the DNA structures formed, and the functional effects on insulin gene expression. Here we show that different sequence variants of the insulin linked polymorphic region form different DNA structures in vitro. Additionally, reporter genes in cellulo indicate that insulin expression may change depending on which DNA structures form. We report the crystal structure and dynamics of an intramolecular i-motif, which reveal sequences within the loop regions forming additional stabilising interactions that are critical to formation of stable i-motif structures. The outcomes of this work reveal the detail in formation of stable i-motif DNA structures, with potential for rational based drug design for compounds to target i-motif DNA.


Assuntos
DNA , Quadruplex G , Insulina , Regiões Promotoras Genéticas , Insulina/química , Insulina/genética , DNA/química , DNA/genética , Humanos , Conformação de Ácido Nucleico , Motivos de Nucleotídeos , Cristalografia por Raios X , Polimorfismo Genético , Sequências de Repetição em Tandem/genética , Sequência de Bases , Modelos Moleculares , Animais , Genes Reporter
5.
Nat Commun ; 15(1): 7291, 2024 Aug 24.
Artigo em Inglês | MEDLINE | ID: mdl-39181885

RESUMO

Tandem repeats (TRs) are genomic regions that tandemly change in repeat number, which are often multiallelic. Their characteristics and contributions to gene expression and quantitative traits in rice are largely unknown. Here, we survey rice TR variations based on 231 genome assemblies and the rice pan-genome graph. We identify 227,391 multiallelic TR loci, including 54,416 TR variations that are absent from the Nipponbare reference genome. Only 1/3 TR variations show strong linkage with nearby bi-allelic variants (SNPs, Indels and PAVs). Using 193 panicle and 202 leaf transcriptomic data, we reveal 485 and 511 TRs act as QTLs independently of other bi-allelic variations to nearby gene expression, respectively. Using plant height and grain width as examples, we identify and validate TRs contributions to rice agronomic trait variations. These findings would enhance our understanding of the functions of multiallelic variants and facilitate rice molecular breeding.


Assuntos
Alelos , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Oryza , Locos de Características Quantitativas , Sequências de Repetição em Tandem , Oryza/genética , Oryza/crescimento & desenvolvimento , Oryza/metabolismo , Sequências de Repetição em Tandem/genética , Mapeamento Cromossômico , Polimorfismo de Nucleotídeo Único , Fenótipo , Variação Genética
6.
PeerJ ; 12: e17748, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39076774

RESUMO

Background: Tandem duplication (TD) is a common and important type of structural variation in the human genome. TDs have been shown to play an essential role in many diseases, including cancer. However, it is difficult to accurately detect TDs due to the uneven distribution of reads and the inherent complexity of next-generation sequencing (NGS) data. Methods: This article proposes a method called DTDHM (detection of tandem duplications based on hybrid methods), which utilizes NGS data to detect TDs in a single sample. DTDHM builds a pipeline that integrates read depth (RD), split read (SR), and paired-end mapping (PEM) signals. To solve the problem of uneven distribution of normal and abnormal samples, DTDHM uses the K-nearest neighbor (KNN) algorithm for multi-feature classification prediction. Then, the qualified split reads and discordant reads are extracted and analyzed to achieve accurate localization of variation sites. This article compares DTDHM with three other methods on 450 simulated datasets and five real datasets. Results: In 450 simulated data samples, DTDHM consistently maintained the highest F1-score. The average F1-score of DTDHM, SVIM, TARDIS, and TIDDIT were 80.0%, 56.2%, 43.4%, and 67.1%, respectively. The F1-score of DTDHM had a small variation range and its detection effect was the most stable and 1.2 times that of the suboptimal method. Most of the boundary biases of DTDHM fluctuated around 20 bp, and its boundary deviation detection ability was better than TARDIS and TIDDIT. In real data experiments, five real sequencing samples (NA19238, NA19239, NA19240, HG00266, and NA12891) were used to test DTDHM. The results showed that DTDHM had the highest overlap density score (ODS) and F1-score of the four methods. Conclusions: Compared with the other three methods, DTDHM achieved excellent results in terms of sensitivity, precision, F1-score, and boundary bias. These results indicate that DTDHM can be used as a reliable tool for detecting TDs from NGS data, especially in the case of low coverage depth and tumor purity samples.


Assuntos
Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Genoma Humano/genética , Sequências de Repetição em Tandem/genética
7.
Gene ; 928: 148774, 2024 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-39025338

RESUMO

Repetitive elements in DNA sequences are a hallmark of Apicomplexan protozoa. A genome-wide screening for Tandem Repeats was conducted in Toxoplasma gondii and related Coccidian parasites with a novel strategy to assess compositional bias. A conserved pattern of GC skew and purine-pyrimidine bias was observed. Compositional bias was also present at the protein level. Glutamic acid was the most abundant amino acid in the purine (GA) rich cluster, while Serine prevailed in pyrimidine (CT) rich cluster. Purine rich repeats, and consequently glutamic acid abundance, correlated with high scores for intrinsically disordered protein regions/domains. Finally, variability was established for repetitive regions within a well-known rhoptry antigen (ROP1) and an uncharacterized hypothetical protein with similar features. The approach we present could be useful to identify potential antigens bearing repetitive elements.


Assuntos
Proteínas de Protozoários , Sequências de Repetição em Tandem , Toxoplasma , Toxoplasma/genética , Sequências de Repetição em Tandem/genética , Proteínas de Protozoários/genética , Proteínas de Protozoários/química , Proteínas de Protozoários/metabolismo , Genoma de Protozoário , Composição de Bases
8.
Mod Pathol ; 37(9): 100534, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-38852814

RESUMO

The ability to detect low-level disease is key to our understanding of clonal heterogeneity in acute myeloid leukemia (AML) and residual disease that elude conventional assays and seed relapse. We developed a high-sensitivity next-generation sequencing (HS-NGS) clinical assay, able to reliably detect low levels (1 × 10-5) of FLT3-ITD, a frequent, therapeutically targetable and prognostically relevant mutation in AML. By applying this assay to 289 longitudinal samples from 62 patients at initial diagnosis and/or clinical follow-up (mean follow-up of 22 months), we reveal the frequent occurrence of FLT3-ITD subclones at diagnosis and demonstrate a significantly decreased relapse risk when FLT3-ITD is cleared after induction or thereafter. We perform pairwise sequencing of diagnosis and relapse samples from 23 patients to uncover more detailed patterns of FLT3-ITD clonal evolution at relapse than is detectable by less-sensitive assays. Finally, we show that rising ITD level during consecutive biopsies is a harbinger of impending relapse. Our findings corroborate the emerging clinical utility of high-sensitivity FLT3-ITD testing and expands our understanding of clonal dynamics in FLT3-ITD-positive AML.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Leucemia Mieloide Aguda , Tirosina Quinase 3 Semelhante a fms , Humanos , Tirosina Quinase 3 Semelhante a fms/genética , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/patologia , Pessoa de Meia-Idade , Masculino , Feminino , Adulto , Idoso , Sequências de Repetição em Tandem/genética , Recidiva , Evolução Clonal , Mutação , Duplicação Gênica
9.
Artigo em Inglês | MEDLINE | ID: mdl-38862430

RESUMO

Tandem duplication (TD) is a major type of structural variations (SVs) that plays an important role in novel gene formation and human diseases. However, TDs are often missed or incorrectly classified as insertions by most modern SV detection methods due to the lack of specialized operation on TD-related mutational signals. Herein, we developed a TD detection module for the Pindel tool, referred to as Pindel-TD, based on a TD-specific pattern growth approach. Pindel-TD is capable of detecting TDs with a wide size range at single nucleotide resolution. Using simulated and real read data from HG002, we demonstrated that Pindel-TD outperforms other leading methods in terms of precision, recall, F1-score, and robustness. Furthermore, by applying Pindel-TD to data generated from the K562 cancer cell line, we identified a TD located at the seventh exon of SAGE1, providing an explanation for its high expression. Pindel-TD is available for non-commercial use at https://github.com/xjtu-omics/pindel.


Assuntos
Software , Humanos , Células K562 , Duplicação Gênica , Sequências de Repetição em Tandem/genética , Algoritmos
10.
Gene ; 926: 148644, 2024 Oct 30.
Artigo em Inglês | MEDLINE | ID: mdl-38851366

RESUMO

The non-coding regions of the mitochondrial DNAs (mtDNAs) of hares, rabbits, and pikas (Lagomorpha) contain short (∼20 bp) and long (130-160 bp) tandem repeats, absent in related mammalian orders. In the presented study, we provide in-depth analysis for mountain hare (Lepus timidus) and brown hare (L. europaeus) mtDNA non-coding regions, together with a species- and population-level analysis of tandem repeat variation. Mountain hare short tandem repeats (SRs) as well as other analyzed hare species consist of two conserved 10 bp motifs, with only brown hares exhibiting a single, more variable motif. Long tandem repeats (LRs) also differ in sequence and copy number between species. Mountain hares have four to seven LRs, median value five, while brown hares exhibit five to nine LRs, median value six. Interestingly, introgressed mountain hare mtDNA in brown hares obtained an intermediate LR length distribution, with median copy number being the same as with conspecific brown hare mtDNA. In contrast, transfer of brown hare mtDNA into cultured mtDNA-less mountain hare cells maintained the original LR number, whereas the reciprocal transfer caused copy number instability, suggesting that cellular environment rather than the nuclear genomic background plays a role in the LR maintenance. Due to their dynamic nature and separation from other known conserved sequence elements on the non-coding region of hare mitochondrial genomes, the tandem repeat elements likely to represent signatures of ancient genetic rearrangements. clarifying the nature and dynamics of these rearrangements may shed light on the possible role of NCR repeated elements in mitochondria and in species evolution.


Assuntos
DNA Mitocondrial , Evolução Molecular , Genoma Mitocondrial , Lebres , Polimorfismo Genético , Especificidade da Espécie , Sequências de Repetição em Tandem , Animais , Lebres/genética , Sequências de Repetição em Tandem/genética , DNA Mitocondrial/genética , Filogenia
11.
Fish Shellfish Immunol ; 151: 109721, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-38917950

RESUMO

C-type lectins (CTLs) are an important class of pattern recognition receptors (PRRs) that exhibit structural and functional diversity in invertebrates. Repetitive DNA sequences are ubiquitous in eukaryotic genomes, representing distinct modes of genome evolution and promoting new gene generation. Our study revealed a new CTL that is composed of two long tandem repeats, abundant threonine, and one carbohydrate recognition domain (CRD) in Exopalaemon carinicauda and has been designated EcTR-CTL. The full-length cDNA of EcTR-CTL was 1242 bp long and had an open reading frame (ORF) of 999 bp that encoded a protein of 332 amino acids. The genome structure of EcTR-CTL contains 4 exons and 3 introns. The length of each repeat unit in EcTR-CTL was 198 bp, which is different from the short tandem repeats reported previously in prawns and crayfish. EcTR-CTL was abundantly expressed in the intestine and hemocytes. After Vibrio parahaemolyticus and white spot syndrome virus (WSSV) challenge, the expression level of EcTR-CTL in the intestine was upregulated. Knockdown of EcTR-CTL downregulated the expression of anti-lipopolysaccharide factor, crustin, and lysozyme during Vibrio infection. The recombinant CRD of EcTR-CTL (rCRD) could bind to bacteria, lipopolysaccharides, and peptidoglycans. Additionally, rCRD can directly bind to WSSV. These findings indicate that 1) CTLs with tandem repeats may be ubiquitous in crustaceans, 2) EcTR-CTL may act as a PRR to participate in the innate immune defense against bacteria via nonself-recognition and antimicrobial peptide regulation, and 3) EcTR-CTL may play a positive or negative role in the process of WSSV infection by capturing virions.


Assuntos
Sequência de Aminoácidos , Proteínas de Artrópodes , Imunidade Inata , Lectinas Tipo C , Palaemonidae , Filogenia , Vibrio parahaemolyticus , Vírus da Síndrome da Mancha Branca 1 , Animais , Palaemonidae/imunologia , Palaemonidae/genética , Vibrio parahaemolyticus/fisiologia , Vírus da Síndrome da Mancha Branca 1/fisiologia , Proteínas de Artrópodes/genética , Proteínas de Artrópodes/imunologia , Proteínas de Artrópodes/química , Imunidade Inata/genética , Lectinas Tipo C/genética , Lectinas Tipo C/imunologia , Lectinas Tipo C/química , Regulação da Expressão Gênica/imunologia , Perfilação da Expressão Gênica , Alinhamento de Sequência , Sequência de Bases , Sequências de Repetição em Tandem/genética
12.
J Hum Genet ; 69(9): 433-440, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-38866925

RESUMO

BACKGROUND: Intronic GAA repeat expansion ([GAA] ≥250) in FGF14 is associated with the late-onset neurodegenerative disorder, spinocerebellar ataxia 27B (SCA27B, GAA-FGF14 ataxia). We aim to determine the prevalence of the GAA repeat expansion in FGF14 in Chinese populations presenting late-onset cerebellar ataxia (LOCA) and evaluate the characteristics of tandem repeat inheritance, radiological features and sympathetic nerve involvement. METHODS: GAA-FGF14 repeat expansion was screened in an undiagnosed LOCA cohort (n = 664) and variations in repeat-length were analyzed in families of confirmed GAA-FGF14 ataxia patients. Brain magnetic resonance imaging (MRI) was used to evaluate the radiological feature in GAA-FGF14 ataxia patients. Clinical examinations and sympathetic skin response (SSR) recordings in GAA-FGF14 patients (n = 16) were used to quantify sympathetic nerve involvement. RESULTS: Two unrelated probands (2/664) were identified. Genetic screening for GAA-FGF14 repeat expansion was performed in 39 family members, 16 of whom were genetically diagnosed with GAA-FGF14 ataxia. Familial screening revealed expansion of GAA repeats in maternal transmissions, but contraction upon paternal transmission. Brain MRI showed slight to moderate cerebellar atrophy. SSR amplitude was lower in GAA-FGF14 patients in pre-symptomatic stage compared to healthy controls, and further decreased in the symptomatic stage. CONCLUSIONS: GAA-FGF14 ataxia was rare among Chinese LOCA cases. Parental gender appears to affect variability in GAA repeat number between generations. Reduced SSR amplitude is a prominent feature in GAA-FGF14 patients, even in the pre-symptomatic stage.


Assuntos
Fatores de Crescimento de Fibroblastos , Humanos , Masculino , Feminino , Fatores de Crescimento de Fibroblastos/genética , Pessoa de Meia-Idade , Adulto , Imageamento por Ressonância Magnética , Sistema Nervoso Simpático/fisiopatologia , Sistema Nervoso Simpático/patologia , Idoso , Linhagem , Expansão das Repetições de Trinucleotídeos/genética , Sequências de Repetição em Tandem/genética , Degenerações Espinocerebelares
14.
Pathobiology ; 91(5): 370-374, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38718771

RESUMO

INTRODUCTION: Undifferentiated small round-cell sarcomas with BCL6 corepressor (BCOR) alterations, such as an internal tandem duplication (ITD) within exon 15, are typically described as a pediatric group of Ewing-like small round-cell sarcomas. CASE PRESENTATION: In contrast to this notion, we report the case of a 71-year-old woman with a nasosinusal sarcoma featuring a BCOR ITD. To the best of our knowledge, this presence had not been previously documented in a sarcoma of the nasal and sinus cavities in an elderly patient. The identified duplication shares a similar minimal critical region as described in clear-cell sarcomas of the kidney in children. This alteration, located within the PCGF1 binding domain, is believed to disrupt the activity of PRC1.1. CONCLUSION: This case underscores the need for in-depth research into the molecular biology of these rare tumors and explores potential alternative treatment options. The patient achieved remission after two cycles of doxorubicin and cyclophosphamide chemotherapy, highlighting the promise of potential therapeutic options for BCOR ITD sarcomas.


Assuntos
Proteínas Proto-Oncogênicas , Proteínas Repressoras , Humanos , Proteínas Repressoras/genética , Feminino , Proteínas Proto-Oncogênicas/genética , Idoso , Sequências de Repetição em Tandem/genética , Sarcoma/genética , Sarcoma/patologia , Sarcoma/tratamento farmacológico , Sarcoma/diagnóstico , Duplicação Gênica , Doxorrubicina/uso terapêutico , Ciclofosfamida/uso terapêutico , Sarcoma de Células Pequenas/genética , Sarcoma de Células Pequenas/patologia , Sarcoma de Células Pequenas/tratamento farmacológico , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapêutico
17.
PLoS Genet ; 20(5): e1011296, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38814980

RESUMO

Exceptions to Mendelian inheritance often highlight novel chromosomal behaviors. The maize Pl1-Rhoades allele conferring plant pigmentation can display inheritance patterns deviating from Mendelian expectations in a behavior known as paramutation. However, the chromosome features mediating such exceptions remain unknown. Here we show that small RNA production reflecting RNA polymerase IV function within a distal downstream set of five tandem repeats is coincident with meiotically-heritable repression of the Pl1-Rhoades transcription unit. A related pl1 haplotype with three, but not one with two, repeat units also displays the trans-homolog silencing typifying paramutations. 4C interactions, CHD3a-dependent small RNA profiles, nuclease sensitivity, and polyadenylated RNA levels highlight a repeat subregion having regulatory potential. Our comparative and mutant analyses show that transcriptional repression of Pl1-Rhoades correlates with 24-nucleotide RNA production and cytosine methylation at this subregion indicating the action of a specific DNA-dependent RNA polymerase complex. These findings support a working model in which pl1 paramutation depends on trans-chromosomal RNA-directed DNA methylation operating at a discrete cis-linked and copy-number-dependent transcriptional regulatory element.


Assuntos
Regulação da Expressão Gênica de Plantas , Sequências de Repetição em Tandem , Zea mays , Alelos , Metilação de DNA , Haplótipos , Mutação , Pigmentação/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Sequências de Repetição em Tandem/genética , Zea mays/genética
18.
Virus Res ; 345: 199390, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38710287

RESUMO

Cnaphalocrocis medinalis granulovirus (CnmeGV), belonging to Betabaculovirus cnamedinalis, can infect the rice pest, the rice leaf roller. In 1979, a CnmeGV isolate, CnmeGV-EP, was collected from Enping County, China. In 2014, we collected another CnmeGV isolate, CnmeGV-EPDH3, at the same location and obtained the complete virus genome sequence using Illumina and ONT sequencing technologies. By combining these two virus isolates, we updated the genome annotation of CnmeGV and conducted an in-depth analysis of its genome features. CnmeGV genome contains abundant tandem repeat sequences, and the repeating units in the homologous regions (hrs) exhibit overlapping and nested patterns. The genetic variations within EPDH3 population show the high stability of CnmeGV genome, and tandem repeats are the only region of high genetic variation in CnmeGV genome replication. Some defective viral genomes formed by recombination were found within the population. Comparison analysis of the two virus isolates collected from Enping showed that the proteins encoded by the CnmeGV-specific genes were less conserved relative to the baculovirus core genes. At the genomic level, there are a large number of SNPs and InDels between the two virus isolates, especially in and around the bro genes and hrs. Additionally, we discovered that CnmeGV acquired a segment of non-ORF sequence from its host, which does not provide any new proteins but rather serves as redundant genetic material integrated into the viral genome. Furthermore, we observed that the host's transposon piggyBac has inserted into some virus genes. Together, dsDNA viruses could acquire non-coding genetic material from their hosts to expand the size of their genomes. These findings provide new insights into the evolution of dsDNA viruses.


Assuntos
Variação Genética , Genoma Viral , Animais , Filogenia , China , Granulovirus/genética , Granulovirus/classificação , Granulovirus/isolamento & purificação , Sequenciamento Completo do Genoma , Oryza/virologia , Sequências de Repetição em Tandem/genética , Doenças das Plantas/virologia , Recombinação Genética
19.
Cell ; 187(9): 2336-2341.e5, 2024 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-38582080

RESUMO

The Genome Aggregation Database (gnomAD), widely recognized as the gold-standard reference map of human genetic variation, has largely overlooked tandem repeat (TR) expansions, despite the fact that TRs constitute ∼6% of our genome and are linked to over 50 human diseases. Here, we introduce the TR-gnomAD (https://wlcb.oit.uci.edu/TRgnomAD), a biobank-scale reference of 0.86 million TRs derived from 338,963 whole-genome sequencing (WGS) samples of diverse ancestries (39.5% non-European samples). TR-gnomAD offers critical insights into ancestry-specific disease prevalence using disparities in TR unit number frequencies among ancestries. Moreover, TR-gnomAD is able to differentiate between common, presumably benign TR expansions, which are prevalent in TR-gnomAD, from those potentially pathogenic TR expansions, which are found more frequently in disease groups than within TR-gnomAD. Together, TR-gnomAD is an invaluable resource for researchers and physicians to interpret TR expansions in individuals with genetic diseases.


Assuntos
Genoma Humano , Sequências de Repetição em Tandem , Humanos , Sequências de Repetição em Tandem/genética , Sequenciamento Completo do Genoma , Bases de Dados Genéticas , Expansão das Repetições de DNA/genética , Estudo de Associação Genômica Ampla
20.
Cancer Sci ; 115(6): 1851-1865, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38581120

RESUMO

Aberrant expression of forkhead box transcription factor 1 (FOXM1) plays critical roles in a variety of human malignancies and predicts poor prognosis. However, little is known about the crosstalk between FOXM1 and long noncoding RNAs (lncRNAs) in tumorigenesis. The present study identifies a previously uncharacterized lncRNA XLOC_008672 in gastric cancer (GC), which is regulated by FOXM1 and possesses multiple copies of tandem repetitive sequences. LncRNA microarrays are used to screen differentially expressed lncRNAs in FOXM1 knockdown GC cells, and then the highest fold downregulation lncRNA XLOC_008672 is screened out. Sequence analysis reveals that the new lncRNA contains 62 copies of 37-bp tandem repeats. It is transcriptionally activated by FOXM1 and functions as a downstream effector of FOXM1 in GC cells through in vitro and in vivo functional assays. Elevated expression of XLOC_008672 is found in GC tissues and indicates worse prognosis. Mechanistically, XLOC_008672 can bind to small nuclear ribonucleoprotein polypeptide A (SNRPA), thereby enhancing mRNA stability of Ras-GTPase-activating protein SH3 domain-binding protein 1 (G3BP1) and, consequently, facilitating GC cell proliferation and migration. Our study discovers a new uncharacterized lncRNA XLOC_008672 involved in GC carcinogenesis and progression. Targeting FOXM1/XLOC_008672/SNRPA/G3BP1 signaling axis might be a promising therapeutic strategy for GC.


Assuntos
Carcinogênese , Proliferação de Células , Proteína Forkhead Box M1 , Regulação Neoplásica da Expressão Gênica , RNA Longo não Codificante , Neoplasias Gástricas , Animais , Feminino , Humanos , Masculino , Camundongos , Linhagem Celular Tumoral , Movimento Celular/genética , Proliferação de Células/genética , DNA Helicases , Proteína Forkhead Box M1/genética , Proteína Forkhead Box M1/metabolismo , Camundongos Nus , Proteínas de Ligação a Poli-ADP-Ribose/genética , Proteínas de Ligação a Poli-ADP-Ribose/metabolismo , Prognóstico , RNA Helicases , Proteínas com Motivo de Reconhecimento de RNA/genética , Proteínas com Motivo de Reconhecimento de RNA/metabolismo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Neoplasias Gástricas/genética , Neoplasias Gástricas/patologia , Neoplasias Gástricas/metabolismo , Sequências de Repetição em Tandem/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...