Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 6.290
Filtrar
1.
Cell ; 187(9): 2336-2341.e5, 2024 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-38582080

RESUMO

The Genome Aggregation Database (gnomAD), widely recognized as the gold-standard reference map of human genetic variation, has largely overlooked tandem repeat (TR) expansions, despite the fact that TRs constitute ∼6% of our genome and are linked to over 50 human diseases. Here, we introduce the TR-gnomAD (https://wlcb.oit.uci.edu/TRgnomAD), a biobank-scale reference of 0.86 million TRs derived from 338,963 whole-genome sequencing (WGS) samples of diverse ancestries (39.5% non-European samples). TR-gnomAD offers critical insights into ancestry-specific disease prevalence using disparities in TR unit number frequencies among ancestries. Moreover, TR-gnomAD is able to differentiate between common, presumably benign TR expansions, which are prevalent in TR-gnomAD, from those potentially pathogenic TR expansions, which are found more frequently in disease groups than within TR-gnomAD. Together, TR-gnomAD is an invaluable resource for researchers and physicians to interpret TR expansions in individuals with genetic diseases.


Assuntos
Genoma Humano , Sequências de Repetição em Tandem , Humanos , Sequências de Repetição em Tandem/genética , Sequenciamento Completo do Genoma , Bases de Dados Genéticas , Expansão das Repetições de DNA/genética , Estudo de Associação Genômica Ampla
2.
Exp Hematol ; 133: 104207, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38522505

RESUMO

Myelodysplastic/myeloproliferative diseases of childhood cause a relevant disease burden, and many of these diseases may have a fatal course. The use of next-generation sequencing (NGS) has led to the identification of novel genetic variants in patients with these diseases, advancing our understanding of the underlying pathophysiology. However, novel mutations can often only be interpreted as variants of unknown significance (VUS), hindering adequate diagnosis and the use of a targeted therapy. To improve variant interpretation and test targeted therapies in a preclinical setting, we are using a rapid zebrafish embryo model that allows functional evaluation of the novel variant and possible therapeutic approaches within days. Thereby, we accelerate the translation from genetic findings to treatment options. Here, we establish this workflow on a novel in-frame tandem duplication in NRAS (c.192_227dup; p.G75_E76insDS65_G75) identified by Sanger sequencing in a 2.5-year-old patient with an unclassifiable myelodysplastic/myeloproliferative neoplasm (MDS/MPN-U). We show that this variant results in a myeloproliferative phenotype in zebrafish embryos with expansion of immature myeloid cells in the caudal hematopoietic tissue, which can be reversed by MEK inhibition. Thus, we could reclassify the variant from likely pathogenic to pathogenic using the American College of Medical Genetics (ACMG) criteria.


Assuntos
GTP Fosfo-Hidrolases , Proteínas de Membrana , Peixe-Zebra , Humanos , Animais , Peixe-Zebra/genética , GTP Fosfo-Hidrolases/genética , Proteínas de Membrana/genética , Pré-Escolar , Doenças Mieloproliferativas-Mielodisplásicas/genética , Doenças Mieloproliferativas-Mielodisplásicas/patologia , Duplicação Gênica , Masculino , Sequências de Repetição em Tandem , Feminino , Sequenciamento de Nucleotídeos em Larga Escala
3.
mBio ; 15(4): e0047624, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38501870

RESUMO

Ehrlichia species are obligatory intracellular bacteria that cause a potentially fatal disease, human ehrlichiosis. The biomolecular mechanisms of tick acquisition of Ehrlichia and transmission between ticks and mammals are poorly understood. Ehrlichia japonica infection of mice recapitulates the full spectrum of human ehrlichiosis. We compared the pathogenicity and host acquisition of wild-type E. japonica with an isogenic transposon mutant of E. japonica that lacks tandem repeat protein 120 (TRP120) (ΔTRP120). Both wild-type and ΔTRP120 E. japonica proliferated similarly in cultures of mammalian and tick cells. Upon inoculation into mice, both wild-type and ΔTRP120 E. japonica multiplied to high levels in various tissues, with similar clinical chemistry and hematologic changes, proinflammatory cytokine induction, and fatal disease. However, the blood levels of ΔTRP120 E. japonica were almost undetectable within 24 h, whereas the levels of the wild type increased exponentially. Greater than 90% of TRP120 was released from infected cells into the culture medium. Mouse blood monocytes exposed to native TRP120 from culture supernatants showed significantly reduced cell surface expression of the transmigration-related markers Ly6C and CD11b. Larval ticks attached to mice infected with either wild-type or ΔTRP120 E. japonica imbibed similar amounts of blood and subsequently molted to nymphs at similar rates. However, unlike wild-type E. japonica, the ΔTRP120 mutant was minimally acquired by larval ticks and subsequent molted nymphs and, thus, failed to transmit to naïve mice. Thus, TRP120 is required for bacteremia but not disease. These findings suggest a novel mechanism whereby an obligatory intracellular bacterium manipulates infected blood monocytes to sustain the tick-mammal transmission cycle. IMPORTANCE: Effective prevention of tick-borne diseases such as human ehrlichiosis requires an understanding of how disease-causing organisms are acquired. Ehrlichia species are intracellular bacteria that require infection of both mammals and ticks, involving cycles of transmission between them. Mouse models of ehrlichiosis and tick-mouse transmission can advance our fundamental understanding of the pathogenesis and prevention of ehrlichiosis. Herein, a mutant of Ehrlichia japonica was used to investigate the role of a single Ehrlichia factor, named tandem repeat protein 120 (TRP120), in infection of mammalian and tick cells in culture, infection and disease progression in mice, and tick acquisition of E. japonica from infected mice. Our results suggest that TRP120 is necessary only for Ehrlichia proliferation in circulating mouse blood and ongoing bacteremia to permit Ehrlichia acquisition by ticks. This study provides new insights into the importance of bacterial factors in regulating bacteremia, which may facilitate tick acquisition of pathogens.


Assuntos
Bacteriemia , Ehrlichiose , Carrapatos , Humanos , Animais , Camundongos , Ehrlichia/genética , Ehrlichiose/microbiologia , Mamíferos , Sequências de Repetição em Tandem
4.
Genome Res ; 34(2): 161-178, 2024 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-38485193

RESUMO

Centromeres are essential regions of eukaryotic chromosomes responsible for the formation of kinetochore complexes, which connect to spindle microtubules during cell division. Notably, although centromeres maintain a conserved function in chromosome segregation, the underlying DNA sequences are diverse both within and between species and are predominantly repetitive in nature. The repeat content of centromeres includes high-copy tandem repeats (satellites), and/or specific families of transposons. The functional region of the centromere is defined by loading of a specific histone 3 variant (CENH3), which nucleates the kinetochore and shows dynamic regulation. In many plants, the centromeres are composed of satellite repeat arrays that are densely DNA methylated and invaded by centrophilic retrotransposons. In some cases, the retrotransposons become the sites of CENH3 loading. We review the structure of plant centromeres, including monocentric, holocentric, and metapolycentric architectures, which vary in the number and distribution of kinetochore attachment sites along chromosomes. We discuss how variation in CENH3 loading can drive genome elimination during early cell divisions of plant embryogenesis. We review how epigenetic state may influence centromere identity and discuss evolutionary models that seek to explain the paradoxically rapid change of centromere sequences observed across species, including the potential roles of recombination. We outline putative modes of selection that could act within the centromeres, as well as the role of repeats in driving cycles of centromere evolution. Although our primary focus is on plant genomes, we draw comparisons with animal and fungal centromeres to derive a eukaryote-wide perspective of centromere structure and function.


Assuntos
Centrômero , Retroelementos , Animais , Retroelementos/genética , Centrômero/genética , Cinetocoros , Plantas/genética , Sequências de Repetição em Tandem
5.
Int J Mol Sci ; 25(5)2024 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-38474241

RESUMO

Tandem repeats (TRs) in protein sequences are consecutive, highly similar sequence motifs. Some types of TRs fold into structural units that pack together in ensembles, forming either an (open) elongated domain or a (closed) propeller, where the last unit of the ensemble packs against the first one. Here, we examine TR proteins (TRPs) to see how their sequence, structure, and evolutionary properties favor them for a function as mediators of protein interactions. Our observations suggest that TRPs bind other proteins using large, structured surfaces like globular domains; in particular, open-structured TR ensembles are favored by flexible termini and the possibility to tightly coil against their targets. While, intuitively, open ensembles of TRs seem prone to evolve due to their potential to accommodate insertions and deletions of units, these evolutionary events are unexpectedly rare, suggesting that they are advantageous for the emergence of the ancestral sequence but are early fixed. We hypothesize that their flexibility makes it easier for further proteins to adapt to interact with them, which would explain their large number of protein interactions. We provide insight into the properties of open TR ensembles, which make them scaffolds for alternative protein complexes to organize genes, RNA and proteins.


Assuntos
Proteínas , Sequências de Repetição em Tandem , Proteínas/química , Sequência de Aminoácidos
6.
EBioMedicine ; 101: 105027, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38418263

RESUMO

BACKGROUND: Cardiomyopathy is a clinically and genetically heterogeneous heart condition that can lead to heart failure and sudden cardiac death in childhood. While it has a strong genetic basis, the genetic aetiology for over 50% of cardiomyopathy cases remains unknown. METHODS: In this study, we analyse the characteristics of tandem repeats from genome sequence data of unrelated individuals diagnosed with cardiomyopathy from Canada and the United Kingdom (n = 1216) and compare them to those found in the general population. We perform burden analysis to identify genomic and epigenomic features that are impacted by rare tandem repeat expansions (TREs), and enrichment analysis to identify functional pathways that are involved in the TRE-associated genes in cardiomyopathy. We use Oxford Nanopore targeted long-read sequencing to validate repeat size and methylation status of one of the most recurrent TREs. We also compare the TRE-associated genes to those that are dysregulated in the heart tissues of individuals with cardiomyopathy. FINDINGS: We demonstrate that tandem repeats that are rarely expanded in the general population are predominantly expanded in cardiomyopathy. We find that rare TREs are disproportionately present in constrained genes near transcriptional start sites, have high GC content, and frequently overlap active enhancer H3K27ac marks, where expansion-related DNA methylation may reduce gene expression. We demonstrate the gene silencing effect of expanded CGG tandem repeats in DIP2B through promoter hypermethylation. We show that the enhancer-associated loci are found in genes that are highly expressed in human cardiomyocytes and are differentially expressed in the left ventricle of the heart in individuals with cardiomyopathy. INTERPRETATION: Our findings highlight the underrecognized contribution of rare tandem repeat expansions to the risk of cardiomyopathy and suggest that rare TREs contribute to ∼4% of cardiomyopathy risk. FUNDING: Government of Ontario (RKCY), The Canadian Institutes of Health Research PJT 175329 (RKCY), The Azrieli Foundation (RKCY), SickKids Catalyst Scholar in Genetics (RKCY), The University of Toronto McLaughlin Centre (RKCY, SM), Ted Rogers Centre for Heart Research (SM), Data Sciences Institute at the University of Toronto (SM), The Canadian Institutes of Health Research PJT 175034 (SM), The Canadian Institutes of Health Research ENP 161429 under the frame of ERA PerMed (SM, RL), Heart and Stroke Foundation of Ontario & Robert M Freedom Chair in Cardiovascular Science (SM), Bitove Family Professorship of Adult Congenital Heart Disease (EO), Canada Foundation for Innovation (SWS, JR), Canada Research Chair (PS), Genome Canada (PS, JR), The Canadian Institutes of Health Research (PS).


Assuntos
Cardiomiopatias , Cardiopatias Congênitas , Humanos , Adulto , Cardiopatias Congênitas/genética , Sequências de Repetição em Tandem/genética , Metilação de DNA , Cardiomiopatias/genética , Ontário , Proteínas do Tecido Nervoso/genética
7.
Genome Biol Evol ; 16(3)2024 Mar 02.
Artigo em Inglês | MEDLINE | ID: mdl-38340334

RESUMO

Fishes of the genus Carassius are useful experimental vertebrate models for the study of evolutionary biology and cytogenetics. Carassius demonstrates diverse biological characteristics, such as variation in ploidy levels and chromosome numbers, and presence of microchromosomes. Those Carassius polyploids with ≥150 chromosomes have microchromosomes, but the origin of microchromosomes, especially in European populations, is unknown. We used cytogenetics to study evolution of tandem repeats (U1 and U2 small nuclear DNAs and H3 histone) and microchromosomes in Carassius from the Czech Republic. We tested the hypotheses whether the number of tandem repeats was affected by polyploidization or divergence between species and what mechanism drives evolution of microchromosomes. Tandem repeats were found in tetraploid and hexaploid Carassius gibelio, and tetraploid Carassius auratus and Carassius carassius in conserved numbers, with the exception of U1 small nuclear DNA in C. auratus. This conservation indicates reduction and/or loss in the number of copies per locus in hexaploids and may have occurred by divergence rather than polyploidization. To study the evolution of microchromosomes, we used the whole microchromosome painting probe from hexaploid C. gibelio and hybridized it to tetraploid and hexaploid C. gibelio, and tetraploid C. auratus and C. carassius. Our results revealed variation in the number of microchromosomes in hexaploids and indicated that the evolution of the Carassius karyotype is governed by macrochromosome fissions followed by segmental duplication in pericentromeric areas. These are potential mechanisms responsible for the presence of microchromosomes in Carassius hexaploids. Differential efficacy of one or both of these mechanisms in different tetraploids could ensure variability in chromosome number in polyploids in general.


Assuntos
Cyprinidae , Duplicações Segmentares Genômicas , Animais , Tetraploidia , Análise Citogenética , Sequências de Repetição em Tandem , Poliploidia
8.
Bioinformatics ; 40(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38269623

RESUMO

MOTIVATION: In diploid organisms, phasing is the problem of assigning the alleles at heterozygous variants to one of two haplotypes. Reads from PacBio HiFi sequencing provide long, accurate observations that can be used as the basis for both calling and phasing variants. HiFi reads also excel at calling larger classes of variation, such as structural or tandem repeat variants. However, current phasing tools typically only phase small variants, leaving larger variants unphased. RESULTS: We developed HiPhase, a tool that jointly phases SNVs, indels, structural, and tandem repeat variants. The main benefits of HiPhase are (i) dual mode allele assignment for detecting large variants, (ii) a novel application of the A*-algorithm to phasing, and (iii) logic allowing phase blocks to span breaks caused by alignment issues around reference gaps and homozygous deletions. In our assessment, HiPhase produced an average phase block NG50 of 480 kb with 929 switchflip errors and fully phased 93.8% of genes, improving over the current state of the art. Additionally, HiPhase jointly phases SNVs, indels, structural, and tandem repeat variants and includes innate multi-threading, statistics gathering, and concurrent phased alignment output generation. AVAILABILITY AND IMPLEMENTATION: HiPhase is available as source code and a pre-compiled Linux binary with a user guide at https://github.com/PacificBiosciences/HiPhase.


Assuntos
Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de DNA , Algoritmos , Haplótipos , Sequências de Repetição em Tandem
9.
Genome Biol ; 25(1): 39, 2024 Jan 31.
Artigo em Inglês | MEDLINE | ID: mdl-38297326

RESUMO

Expansions of tandem repeats (TRs) cause approximately 60 monogenic diseases. We expect that the discovery of additional pathogenic repeat expansions will narrow the diagnostic gap in many diseases. A growing number of TR expansions are being identified, and interpreting them is a challenge. We present RExPRT (Repeat EXpansion Pathogenicity pRediction Tool), a machine learning tool for distinguishing pathogenic from benign TR expansions. Our results demonstrate that an ensemble approach classifies TRs with an average precision of 93% and recall of 83%. RExPRT's high precision will be valuable in large-scale discovery studies, which require prioritization of candidate loci for follow-up studies.


Assuntos
Aprendizado de Máquina , Sequências de Repetição em Tandem , Virulência
10.
PLoS One ; 19(1): e0295595, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38271341

RESUMO

Mitochondria are known to play an essential role in the cell. These organelles contain their own DNA, which is divided in a coding and non-coding region (NCR). While much of the NCR's function is unknown, tandem repeats have been observed in several vertebrates, with extreme intra-individual, intraspecific and interspecific variation. Taking advantage of a new complete reference for the mitochondrial genome of the Afro-European Barn Owl (Tyto alba), as well as 172 whole genome-resequencing; we (i) describe the reference mitochondrial genome with a special focus on the repeats in the NCR, (ii) quantify the variation in number of copies between individuals, and (iii) explore the possible factors associated with the variation in the number of repetitions. The reference mitochondrial genome revealed a long (256bp) and a short (80bp) tandem repeat in the NCR region. The re-sequenced genomes showed a great variation in number of copies between individuals, with 4 to 38 copies of the Long and 6 to 135 copies of the short repeat. Among the factors associated with this variation between individuals, the tissue used for extraction was the most significant. The exact mechanisms of the formations of these repeats are still to be discovered and understanding them will help explain the maintenance of the polymorphism in the number of copies, as well as their interactions with the metabolism, the aging and health of the individuals.


Assuntos
Genoma Mitocondrial , Estrigiformes , Animais , Humanos , Variações do Número de Cópias de DNA , Estrigiformes/genética , Sequência de Bases , Sequências de Repetição em Tandem/genética
11.
Biochem Biophys Res Commun ; 692: 149349, 2024 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-38056160

RESUMO

While it is well established that a mere 2% of human DNA nucleotides are involved in protein coding, the remainder of the DNA plays a vital role in the preservation of normal cellular genetic function. A significant proportion of tandem repeats (TRs) are present in non-coding DNA. TRs - specific sequences of nucleotides that entail numerous repetitions of a given fragment. In this study, we employed our novel algorithm grounded in finite automata theory, which we refer to as Dafna, to investigate for the first time the likelihood of these nucleotide sequences forming non-canonical DNA structures (NS). Such structures include G-quadruplexes, i-motifs, hairpins, and triplexes. The tandem repeats under consideration in our research encompassed sequences containing 1 to 6 nucleotides per repeated fragment. For comparison, we employed a set of randomly generated sequences of the same length (60 nucleotides) as a benchmark. The outcomes of our research exposed a disparity between the potential for NS formation in random sequences and tandem repeats. Our findings affirm that the propensity of DNA and RNA to form NS is closely tied to various genetic disorders, including Huntington's disease, Fragile X syndrome, and Friedreich's ataxia. In the concluding discussion, we present a proposal for a new therapeutic mechanism to address these diseases. This novel approach revolves around the ability of specific nucleic acid fragments to form multiple types of NS.


Assuntos
Relevância Clínica , Sequências de Repetição em Tandem , Humanos , Sequências de Repetição em Tandem/genética , DNA/química , Sequência de Bases , Nucleotídeos
12.
J Investig Med ; 72(1): 17-25, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37803493

RESUMO

The most important complication of familial Mediterranean fever (FMF) is secondary amyloidosis. The aim of this study is to investigate the risk of developing FMF-related amyloidosis with macrophage migration inhibitory factor (MIF), interleukin 4 (IL-4), and IL-1 receptor antagonist (IL-1RA) variants. This study included 62 FMF patients with amyloidosis, 110 FMF patients without amyloidosis, and 120 controls. The clinical information of the patient groups was compared. MIF-173G/C, IL-4 variant number tandem repeat (VNTR), and IL-1RA VNTR variants were analyzed for all participants. The use of colchicine, pleurisy, and appendectomy was more common in FMF patients with amyloidosis than in FMF patients without amyloidosis. MIF-173G/C C/C genotype and C allele were higher in both patient groups compared to controls. IL-1RA VNTR A1/A2 and A1/A4 genotypes and A1-A4 alleles were more common in both patient groups than controls. The IL-4 VNTR P1 allele was more common in FMF patients with amyloidosis compared to controls. The MIF-173G/C allele and the IL-1RA VNTR A1-A4 allele are associated with FMF in the Turkish population but not with amyloidosis risk in FMF patients. The IL-4 VNTR P1 allele is more common in FMF patients with amyloidosis than in healthy individuals.


Assuntos
Amiloidose , Febre Familiar do Mediterrâneo , Fatores Inibidores da Migração de Macrófagos , Humanos , Amiloidose/genética , Febre Familiar do Mediterrâneo/complicações , Febre Familiar do Mediterrâneo/genética , Predisposição Genética para Doença , Proteína Antagonista do Receptor de Interleucina 1/genética , Interleucina-4/genética , Oxirredutases Intramoleculares/genética , Fatores Inibidores da Migração de Macrófagos/genética , Polimorfismo de Nucleotídeo Único , Sequências de Repetição em Tandem
13.
Proc Natl Acad Sci U S A ; 120(50): e2309359120, 2023 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-38048463

RESUMO

Tandem DNA repeats are often organized into heterochromatin that is crucial for genome organization and stability. Recent studies revealed that individual repeats within tandem DNA repeats can behave very differently. How DNA repeats are assembled into distinct heterochromatin structures remains poorly understood. Here, we developed a genome-wide genetic screen using a reporter gene at different units in a repeat array. This screen led to identification of a conserved protein Rex1BD required for heterochromatin silencing. Our structural analysis revealed that Rex1BD forms a four-helix bundle structure with a distinct charged electrostatic surface. Mechanistically, Rex1BD facilitates the recruitment of Clr6 histone deacetylase (HDAC) by interacting with histones. Interestingly, Rex1BD also interacts with the 14-3-3 protein Rad25, which is responsible for recruiting the RITS (RNA-induced transcriptional silencing) complex to DNA repeats. Our results suggest that coordinated action of Rex1BD and Rad25 mediates formation of distinct heterochromatin structure at DNA repeats via linking RNAi and HDAC pathways.


Assuntos
Proteínas de Schizosaccharomyces pombe , Schizosaccharomyces , Interferência de RNA , Heterocromatina/genética , Heterocromatina/metabolismo , Proteínas de Schizosaccharomyces pombe/metabolismo , Schizosaccharomyces/metabolismo , Proteínas 14-3-3/genética , Proteínas 14-3-3/metabolismo , Histona Desacetilases/metabolismo , DNA/metabolismo , Sequências de Repetição em Tandem
14.
Transl Psychiatry ; 13(1): 402, 2023 Dec 20.
Artigo em Inglês | MEDLINE | ID: mdl-38123544

RESUMO

Tandem repeats (TRs) are prevalent throughout the genome, constituting at least 3% of the genome, and often highly polymorphic. The high mutation rate of TRs, which can be orders of magnitude higher than single-nucleotide polymorphisms and indels, indicates that they are likely to make significant contributions to phenotypic variation, yet their contribution to schizophrenia has been largely ignored by recent genome-wide association studies (GWAS). Tandem repeat expansions are already known causative factors for over 50 disorders, while common tandem repeat variation is increasingly being identified as significantly associated with complex disease and gene regulation. The current review summarizes key background concepts of tandem repeat variation as pertains to disease risk, elucidating their potential for schizophrenia association. An overview of next-generation sequencing-based methods that may be applied for TR genome-wide identification is provided, and some key methodological challenges in TR analyses are delineated.


Assuntos
Estudo de Associação Genômica Ampla , Esquizofrenia , Humanos , Esquizofrenia/genética , Genoma Humano , Sequências de Repetição em Tandem/genética , Polimorfismo de Nucleotídeo Único
15.
Sci Adv ; 9(47): eadj1261, 2023 11 24.
Artigo em Inglês | MEDLINE | ID: mdl-37992162

RESUMO

The biological role of the repetitive DNA sequences in the human genome remains an outstanding question. Recent long-read human genome assemblies have allowed us to identify a function for one of these repetitive regions. We have uncovered a tandem array of conserved primate-specific retrogenes encoding the protein Elongin A3 (ELOA3), a homolog of the RNA polymerase II (RNAPII) elongation factor Elongin A (ELOA). Our genomic analysis shows that the ELOA3 gene cluster is conserved among primates and the number of ELOA3 gene repeats is variable in the human population and across primate species. Moreover, the gene cluster has undergone concerted evolution and homogenization within primates. Our biochemical studies show that ELOA3 functions as a promoter-associated RNAPII pause-release elongation factor with distinct biochemical and functional features from its ancestral homolog, ELOA. We propose that the ELOA3 gene cluster has evolved to fulfil a transcriptional regulatory function unique to the primate lineage that can be targeted to regulate cellular hyperproliferation.


Assuntos
Fatores de Alongamento de Peptídeos , RNA Polimerase II , Animais , Humanos , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , Fatores de Alongamento de Peptídeos/genética , Primatas/genética , Elonguina/genética , Família Multigênica , Sequências de Repetição em Tandem/genética
16.
Vet Microbiol ; 287: 109909, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37925876

RESUMO

Mycoplasma iowae is a worldwide spread and economically important avian pathogen that mostly infects turkeys. Currently, multi-locus sequence typing (MLST) serves as the gold standard method for strain identification in M. iowae. However, additional robust genotyping methods are required to effectively monitor M. iowae infections and conduct epidemiological investigations. The first aim of this study was to develop genotyping assays with high resolution, that specifically target M. iowae, namely a multiple-locus variable number of tandem-repeats analysis (MLVA) and a core genome multi-locus sequence typing (cgMLST) schema. The second aim was the determination of relationships among a diverse selection of M. iowae strains and clinical isolates with a previous and the newly developed assays. The MLVA was designed based on the analyses of tandem-repeat (TR) regions in the six serotype reference strains (I, J, K, N, Q and R). The cgMLST schema was developed based on the coding sequences (CDSs) common in 95% of the examined 99 isolates. The samples were submitted for a previously published MLST assay for comparison with the developed methods. Out of 94 TR regions identified, 17 alleles were selected for further evaluation by PCR. Finally, seven alleles were chosen to establish the MLVA assay. Additionally, whole genome sequence analyses identified a total of 676 CDSs shared by 95% of the isolates, all of which were included into the developed cgMLST schema. The MLVA discriminated 19 distinct genotypes (GT), while with the cgMLST assay 79 sequence types (ST) could be determined with Simpson's diversity indices of 0.810 (MLVA) and 0.989 (cgMLST). The applied assays consistently identified the same main clusters among the diverse selection of isolates, thereby demonstrating their suitability for various genetic analyses and their ability to yield congruent results.


Assuntos
Mycoplasma iowae , Animais , Tipagem de Sequências Multilocus/métodos , Tipagem de Sequências Multilocus/veterinária , Genótipo , Técnicas de Genotipagem/veterinária , Sequências de Repetição em Tandem , Repetições Minissatélites/genética , Filogenia
17.
Nat Commun ; 14(1): 6746, 2023 10 24.
Artigo em Inglês | MEDLINE | ID: mdl-37875492

RESUMO

De novo protein design methods can create proteins with folds not yet seen in nature. These methods largely focus on optimizing the compatibility between the designed sequence and the intended conformation, without explicit consideration of protein folding pathways. Deeply knotted proteins, whose topologies may introduce substantial barriers to folding, thus represent an interesting test case for protein design. Here we report our attempts to design proteins with trefoil (31) and pentafoil (51) knotted topologies. We extended previously described algorithms for tandem repeat protein design in order to construct deeply knotted backbones and matching designed repeat sequences (N = 3 repeats for the trefoil and N = 5 for the pentafoil). We confirmed the intended conformation for the trefoil design by X ray crystallography, and we report here on this protein's structure, stability, and folding behaviour. The pentafoil design misfolded into an asymmetric structure (despite a 5-fold symmetric sequence); two of the four repeat-repeat units matched the designed backbone while the other two diverged to form local contacts, leading to a trefoil rather than pentafoil knotted topology. Our results also provide insights into the folding of knotted proteins.


Assuntos
Dobramento de Proteína , Proteínas , Conformação Proteica , Proteínas/genética , Proteínas/química , Domínios Proteicos , Sequências de Repetição em Tandem/genética
18.
Emerg Top Life Sci ; 7(3): 339-348, 2023 Dec 14.
Artigo em Inglês | MEDLINE | ID: mdl-37888797

RESUMO

Tandem repeat DNA sequences constitute a significant proportion of the human genome. While previously considered to be functionally inert, these sequences are now broadly accepted as important contributors to genetic diversity. However, the polymorphic nature of these sequences can lead to expansion beyond a gene-specific threshold, causing disease. More than 50 pathogenic repeat expansions have been identified to date, many of which have been discovered in the last decade as a result of advances in sequencing technologies and associated bioinformatic tools. Commonly utilised diagnostic platforms including Sanger sequencing, capillary array electrophoresis, and Southern blot are generally low throughput and are often unable to accurately determine repeat size, composition, and epigenetic signature, which are important when characterising repeat expansions. The rapid advances in bioinformatic tools designed specifically to interrogate short-read sequencing and the development of long-read single molecule sequencing is enabling a new generation of high throughput testing for repeat expansion disorders. In this review, we discuss some of the challenges surrounding the identification and characterisation of disease-causing repeat expansions and the technological advances that are poised to translate the promise of genomic medicine to individuals and families affected by these disorders.


Assuntos
Biologia Computacional , Sequências de Repetição em Tandem , Humanos , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos
19.
Emerg Top Life Sci ; 7(3): 361-381, 2023 Dec 14.
Artigo em Inglês | MEDLINE | ID: mdl-37905568

RESUMO

Long-read sequencing platforms provide unparalleled access to the structure and composition of all classes of tandemly repeated DNA from STRs to satellite arrays. This review summarizes our current understanding of their organization within the human genome, their importance with respect to disease, as well as the advances and challenges in understanding their genetic diversity and functional effects. Novel computational methods are being developed to visualize and associate these complex patterns of human variation with disease, expression, and epigenetic differences. We predict accurate characterization of this repeat-rich form of human variation will become increasingly relevant to both basic and clinical human genetics.


Assuntos
DNA , Sequências de Repetição em Tandem , Humanos , Sequências de Repetição em Tandem/genética , Epigênese Genética
20.
Nat Commun ; 14(1): 6711, 2023 10 23.
Artigo em Inglês | MEDLINE | ID: mdl-37872149

RESUMO

Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3550 diverse individuals from the 1000 Genomes Project and H3Africa cohorts. We develop a method, EnsembleTR, to integrate genotypes from four separate methods resulting in high-quality genotypes at more than 1.7 million TR loci. Our catalog reveals novel sequence features influencing TR heterozygosity, identifies population-specific trinucleotide expansions, and finds hundreds of novel eQTL signals. Finally, we generate a phased haplotype panel which can be used to impute most TRs from nearby single nucleotide polymorphisms (SNPs) with high accuracy. Overall, the TR genotypes and reference haplotype panel generated here will serve as valuable resources for future genome-wide and population-wide studies of TRs and their role in human phenotypes.


Assuntos
Polimorfismo de Nucleotídeo Único , Sequências de Repetição em Tandem , Humanos , Genótipo , Sequenciamento Completo do Genoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA