Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Mol Cell ; 2024 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-38917794

RESUMO

CRISPR-Cas technology has transformed functional genomics, yet understanding of how individual exons differentially shape cellular phenotypes remains limited. Here, we optimized and conducted massively parallel exon deletion and splice-site mutation screens in human cell lines to identify exons that regulate cellular fitness. Fitness-promoting exons are prevalent in essential and highly expressed genes and commonly overlap with protein domains and interaction interfaces. Conversely, fitness-suppressing exons are enriched in nonessential genes, exhibiting lower inclusion levels, and overlap with intrinsically disordered regions and disease-associated mutations. In-depth mechanistic investigation of the screen-hit TAF5 alternative exon-8 revealed that its inclusion is required for assembly of the TFIID general transcription initiation complex, thereby regulating global gene expression output. Collectively, our orthogonal exon perturbation screens established a comprehensive repository of phenotypically important exons and uncovered regulatory mechanisms governing cellular fitness and gene expression.

2.
Genome Res ; 33(6): 907-922, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37433640

RESUMO

Approximately 13% of the human genome at certain motifs have the potential to form noncanonical (non-B) DNA structures (e.g., G-quadruplexes, cruciforms, and Z-DNA), which regulate many cellular processes but also affect the activity of polymerases and helicases. Because sequencing technologies use these enzymes, they might possess increased errors at non-B structures. To evaluate this, we analyzed error rates, read depth, and base quality of Illumina, Pacific Biosciences (PacBio) HiFi, and Oxford Nanopore Technologies (ONT) sequencing at non-B motifs. All technologies showed altered sequencing success for most non-B motif types, although this could be owing to several factors, including structure formation, biased GC content, and the presence of homopolymers. Single-nucleotide mismatch errors had low biases in HiFi and ONT for all non-B motif types but were increased for G-quadruplexes and Z-DNA in all three technologies. Deletion errors were increased for all non-B types but Z-DNA in Illumina and HiFi, as well as only for G-quadruplexes in ONT. Insertion errors for non-B motifs were highly, moderately, and slightly elevated in Illumina, HiFi, and ONT, respectively. Additionally, we developed a probabilistic approach to determine the number of false positives at non-B motifs depending on sample size and variant frequency, and applied it to publicly available data sets (1000 Genomes, Simons Genome Diversity Project, and gnomAD). We conclude that elevated sequencing errors at non-B DNA motifs should be considered in low-read-depth studies (single-cell, ancient DNA, and pooled-sample population sequencing) and in scoring rare variants. Combining technologies should maximize sequencing accuracy in future studies of non-B DNA.


Assuntos
DNA Forma Z , Nanoporos , Humanos , Motivos de Nucleotídeos , Análise de Sequência de DNA , DNA/genética , Composição de Bases , Sequenciamento de Nucleotídeos em Larga Escala
3.
RNA ; 2024 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-38684317

RESUMO

RNA modifications have a substantial impact on tRNA function, with modifications in the anticodon loop contributing to translational fidelity and modifications in the tRNA core impacting structural stability. In bacteria, tRNA modifications are crucial for responding to stress and regulating the expression of virulence factors. Although tRNA modifications are well-characterized in a few model organisms, our knowledge of tRNA modifications in human pathogens, such as Pseudomonas aeruginosa, remains limited. Here we leveraged two orthogonal approaches to build a reference landscape of tRNA modifications in E. coli, which enabled us to identify similar modifications in P. aeruginosa. Our analysis revealed a substantial degree of conservation between the two organisms, while also uncovering potential sites of tRNA modification in P. aeruginosa tRNAs that are not present in E. coli. The mutational signature at one of these sites, position 46 of tRNAGln1(UUG) is dependent on the P. aeruginosa homolog of TapT, the enzyme responsible for the 3-(3-amino-3-carboxypropyl) uridine (acp3U) modification. Identifying which modifications are present on different tRNAs will uncover the pathways impacted by the different tRNA modifying enzymes, some of which play roles in determining virulence and pathogenicity.

4.
Genome Res ; 31(7): 1136-1149, 2021 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-34187812

RESUMO

Approximately 1% of the human genome has the ability to fold into G-quadruplexes (G4s)-noncanonical strand-specific DNA structures forming at G-rich motifs. G4s regulate several key cellular processes (e.g., transcription) and have been hypothesized to participate in others (e.g., firing of replication origins). Moreover, G4s differ in their thermostability, and this may affect their function. Yet, G4s may also hinder replication, transcription, and translation and may increase genome instability and mutation rates. Therefore, depending on their genomic location, thermostability, and functionality, G4 loci might evolve under different selective pressures, which has never been investigated. Here we conducted the first genome-wide analysis of G4 distribution, thermostability, and selection. We found an overrepresentation, high thermostability, and purifying selection for G4s within genic components in which they are expected to be functional-promoters, CpG islands, and 5' and 3' UTRs. A similar pattern was observed for G4s within replication origins, enhancers, eQTLs, and TAD boundary regions, strongly suggesting their functionality. In contrast, G4s on the nontranscribed strand of exons were underrepresented, were unstable, and evolved neutrally. In general, G4s on the nontranscribed strand of genic components had lower density and were less stable than those on the transcribed strand, suggesting that the former are avoided at the RNA level. Across the genome, purifying selection was stronger at stable G4s. Our results suggest that purifying selection preserves the sequences of functional G4s, whereas nonfunctional G4s are too costly to be tolerated in the genome. Thus, G4s are emerging as fundamental, functional genomic elements.

5.
Nucleic Acids Res ; 49(3): 1497-1516, 2021 02 22.
Artigo em Inglês | MEDLINE | ID: mdl-33450015

RESUMO

Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes, Z-DNA, etc.), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis with implications to evolution and genetic diseases.


Assuntos
DNA/química , Variação Genética , Genoma Humano , Animais , Loci Gênicos , Humanos , Taxa de Mutação , Polimorfismo de Nucleotídeo Único , Pongo pygmaeus
6.
Genome Res ; 28(12): 1767-1778, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30401733

RESUMO

DNA conformation may deviate from the classical B-form in ∼13% of the human genome. Non-B DNA regulates many cellular processes; however, its effects on DNA polymerization speed and accuracy have not been investigated genome-wide. Such an inquiry is critical for understanding neurological diseases and cancer genome instability. Here, we present the first simultaneous examination of DNA polymerization kinetics and errors in the human genome sequenced with Single-Molecule Real-Time (SMRT) technology. We show that polymerization speed differs between non-B and B-DNA: It decelerates at G-quadruplexes and fluctuates periodically at disease-causing tandem repeats. Analyzing polymerization kinetics profiles, we predict and validate experimentally non-B DNA formation for a novel motif. We demonstrate that several non-B motifs affect sequencing errors (e.g., G-quadruplexes increase error rates), and that sequencing errors are positively associated with polymerase slowdown. Finally, we show that highly divergent G4 motifs have pronounced polymerization slowdown and high sequencing error rates, suggesting similar mechanisms for sequencing errors and germline mutations.


Assuntos
DNA/química , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Conformação de Ácido Nucleico , Análise de Sequência de DNA , Replicação do DNA , Quadruplex G , Genômica/métodos , Genômica/normas , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Humanos , Cinética , Mutação , Motivos de Nucleotídeos , Reprodutibilidade dos Testes , Análise de Sequência de DNA/métodos
7.
PLoS Genet ; 9(12): e1004023, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24385924

RESUMO

There is great scientific and popular interest in understanding the genetic history of populations in the Americas. We wish to understand when different regions of the continent were inhabited, where settlers came from, and how current inhabitants relate genetically to earlier populations. Recent studies unraveled parts of the genetic history of the continent using genotyping arrays and uniparental markers. The 1000 Genomes Project provides a unique opportunity for improving our understanding of population genetic history by providing over a hundred sequenced low coverage genomes and exomes from Colombian (CLM), Mexican-American (MXL), and Puerto Rican (PUR) populations. Here, we explore the genomic contributions of African, European, and especially Native American ancestry to these populations. Estimated Native American ancestry is 48% in MXL, 25% in CLM, and 13% in PUR. Native American ancestry in PUR is most closely related to populations surrounding the Orinoco River basin, confirming the Southern American ancestry of the Taíno people of the Caribbean. We present new methods to estimate the allele frequencies in the Native American fraction of the populations, and model their distribution using a demographic model for three ancestral Native American populations. These ancestral populations likely split in close succession: the most likely scenario, based on a peopling of the Americas 16 thousand years ago (kya), supports that the MXL Ancestors split 12.2kya, with a subsequent split of the ancestors to CLM and PUR 11.7kya. The model also features effective populations of 62,000 in Mexico, 8,700 in Colombia, and 1,900 in Puerto Rico. Modeling Identity-by-descent (IBD) and ancestry tract length, we show that post-contact populations also differ markedly in their effective sizes and migration patterns, with Puerto Rico showing the smallest effective size and the earlier migration from Europe. Finally, we compare IBD and ancestry assignments to find evidence for relatedness among European founders to the three populations.


Assuntos
Frequência do Gene/genética , Genética Populacional , Migração Humana , Indígenas Norte-Americanos/genética , População Negra/genética , Mapeamento Cromossômico , Exoma , Genoma Humano , Hispânico ou Latino/genética , Projeto Genoma Humano , Humanos , Americanos Mexicanos/genética , México , Porto Rico , Grupos Raciais/genética , População Branca/genética
8.
Kidney Int ; 88(4): 754-63, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25993319

RESUMO

A third of African Americans with sporadic focal segmental glomerulosclerosis (FSGS) or HIV-associated nephropathy (HIVAN) do not carry APOL1 renal risk genotypes. This raises the possibility that other APOL1 variants may contribute to kidney disease. To address this question, we sequenced all APOL1 exons in 1437 Americans of African and European descent, including 464 patients with biopsy-proven FSGS/HIVAN. Testing for association with 33 common and rare variants with FSGS/HIVAN revealed no association independent of strong recessive G1 and G2 effects. Seeking additional variants that might have been under selection by pathogens and could represent candidates for kidney disease risk, we also sequenced an additional 1112 individuals representing 53 global populations. Except for G1 and G2, none of the 7 common codon-altering variants showed evidence of selection or could restore lysis against trypanosomes causing human African trypanosomiasis. Thus, only APOL1 G1 and G2 confer renal risk, and other common and rare APOL1 missense variants, including the archaic G3 haplotype, do not contribute to sporadic FSGS and HIVAN in the US population. Hence, in most potential clinical or screening applications, our study suggests that sequencing APOL1 exons is unlikely to bring additional information compared to genotyping only APOL1 G1 and G2 risk alleles.


Assuntos
Nefropatia Associada a AIDS/genética , Apolipoproteínas/genética , Glomerulosclerose Segmentar e Focal/genética , Lipoproteínas HDL/genética , Polimorfismo de Nucleotídeo Único , Nefropatia Associada a AIDS/diagnóstico , Nefropatia Associada a AIDS/etnologia , Negro ou Afro-Americano/genética , Apolipoproteína L1 , Apolipoproteínas/sangue , Biópsia , Estudos de Casos e Controles , Éxons , Feminino , Frequência do Gene , Estudos de Associação Genética , Predisposição Genética para Doença , Glomerulosclerose Segmentar e Focal/diagnóstico , Glomerulosclerose Segmentar e Focal/etnologia , Haplótipos , Interações Hospedeiro-Parasita , Humanos , Lipoproteínas HDL/sangue , Masculino , Fenótipo , Medição de Risco , Fatores de Risco , Análise de Sequência de DNA , Trypanosoma brucei gambiense/metabolismo , Trypanosoma brucei gambiense/patogenicidade , Trypanosoma brucei rhodesiense/metabolismo , Trypanosoma brucei rhodesiense/patogenicidade , Estados Unidos/epidemiologia , População Branca/genética
9.
bioRxiv ; 2024 Feb 21.
Artigo em Inglês | MEDLINE | ID: mdl-38529508

RESUMO

RNA modifications have a substantial impact on tRNA function, with modifications in the anticodon loop contributing to translational fidelity and modifications in the tRNA core impacting structural stability. In bacteria, tRNA modifications are crucial for responding to stress and regulating the expression of virulence factors. Although tRNA modifications are well-characterized in a few model organisms, our knowledge of tRNA modifications in human pathogens, such as Pseudomonas aeruginosa, remains limited. Here we leveraged two orthogonal approaches to build a reference landscape of tRNA modifications in E. coli, which enabled us to identify similar modifications in P. aeruginosa. Our analysis revealed a substantial degree of conservation between the two organisms, while also uncovering potential sites of tRNA modification in P. aeruginosa tRNAs that are not present in E. coli. The mutational signature at one of these sites, position 46 of tRNAGln1(UUG) is dependent on the P. aeruginosa homolog of TapT, the enzyme responsible for the 3-(3-amino-3-carboxypropyl) uridine (acp3U) modification. Identifying which modifications are present on different tRNAs will uncover the pathways impacted by the different tRNA modifying enzymes, some of which play roles in determining virulence and pathogenicity.

10.
Clin Epigenetics ; 16(1): 50, 2024 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-38561804

RESUMO

BACKGROUND: Nucleosome repositioning in cancer is believed to cause many changes in genome organisation and gene expression. Understanding these changes is important to elucidate fundamental aspects of cancer. It is also important for medical diagnostics based on cell-free DNA (cfDNA), which originates from genomic DNA regions protected from digestion by nucleosomes. RESULTS: We have generated high-resolution nucleosome maps in paired tumour and normal tissues from the same breast cancer patients using MNase-assisted histone H3 ChIP-seq and compared them with the corresponding cfDNA from blood plasma. This analysis has detected single-nucleosome repositioning at key regulatory regions in a patient-specific manner and common cancer-specific patterns across patients. The nucleosomes gained in tumour versus normal tissue were particularly informative of cancer pathways, with ~ 20-fold enrichment at CpG islands, a large fraction of which marked promoters of genes encoding DNA-binding proteins. The tumour tissues were characterised by a 5-10 bp decrease in the average distance between nucleosomes (nucleosome repeat length, NRL), which is qualitatively similar to the differences between pluripotent and differentiated cells. This effect was correlated with gene activity, differential DNA methylation and changes in local occupancy of linker histone variants H1.4 and H1X. CONCLUSIONS: Our study offers a novel resource of high-resolution nucleosome maps in breast cancer patients and reports for the first time the effect of systematic decrease of NRL in paired tumour versus normal breast tissues from the same patient. Our findings provide a new mechanistic understanding of nucleosome repositioning in tumour tissues that can be valuable for patient diagnostics, stratification and monitoring.


Assuntos
Neoplasias da Mama , Ácidos Nucleicos Livres , Humanos , Feminino , Nucleossomos/genética , Neoplasias da Mama/genética , Metilação de DNA , Histonas/genética , Histonas/metabolismo , DNA/metabolismo , Ácidos Nucleicos Livres/metabolismo , Cromatina
11.
Mol Cancer Ther ; 23(4): 464-477, 2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38151817

RESUMO

Histone deacetylase inhibitors (HDACi) are part of a growing class of epigenetic therapies used for the treatment of cancer. Although HDACis are effective in the treatment of T-cell lymphomas, treatment of solid tumors with this class of drugs has not been successful. Overexpression of the multidrug resistance protein P-glycoprotein (P-gp), encoded by ABCB1, is known to confer resistance to the HDACi romidepsin in vitro, yet increased ABCB1 expression has not been associated with resistance in patients, suggesting that other mechanisms of resistance arise in the clinic. To identify alternative mechanisms of resistance to romidepsin, we selected MCF-7 breast cancer cells with romidepsin in the presence of the P-gp inhibitor verapamil to reduce the likelihood of P-gp-mediated resistance. The resulting cell line, MCF-7 DpVp300, does not express P-gp and was found to be selectively resistant to romidepsin but not to other HDACis such as belinostat, panobinostat, or vorinostat. RNA-sequencing analysis revealed upregulation of the mRNA coding for the putative methyltransferase, METTL7A, whose paralog, METTL7B, was previously shown to methylate thiol groups on hydrogen sulfide and captopril. As romidepsin has a thiol as the zinc-binding moiety, we hypothesized that METTL7A could inactivate romidepsin and other thiol-based HDACis via methylation of the thiol group. We demonstrate that expression of METTL7A or METTL7B confers resistance to thiol-based HDACis and that both enzymes are capable of methylating thiol-containing HDACis. We thus propose that METTL7A and METTL7B confer resistance to thiol-based HDACis by methylating and inactivating the zinc-binding thiol.


Assuntos
Inibidores de Histona Desacetilases , Neoplasias , Humanos , Inibidores de Histona Desacetilases/farmacologia , Inibidores de Histona Desacetilases/uso terapêutico , Metiltransferases/metabolismo , Neoplasias/tratamento farmacológico , Panobinostat/farmacologia , Panobinostat/uso terapêutico , Zinco
12.
bioRxiv ; 2023 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-38076910

RESUMO

The precise mechanisms governing sequence-dependent positioning of nucleosomes on DNA remain unknown in detail. Existing algorithms, taking into account the sequence-dependent deformability of DNA and its interactions with the histone globular domains, predict rotational setting of only 65% of human nucleosomes mapped in vivo. To uncover novel factors responsible for the nucleosome positioning, we analyzed potential involvement of the histone N-tails in this process. To this aim, we reconstituted the H2A/H4 N-tailless nucleosomes on human BRCA1 DNA (~100 kb) and compared their positions and sequences with those of the wild-type nucleosomes. In the case of H2A tailless nucleosomes, the AT content of DNA sequences is changed locally at superhelical location (SHL) ±4, while maintaining the same rotational setting as their wild-type counterparts. Conversely, the H4 tailless nucleosomes display widespread changes of the AT content near SHL ±1 and SHL ±2, where the H4 N-tails interact with DNA. Furthermore, a substantial number of H4 tailless nucleosomes exhibit rotational setting opposite to that of the wild-type nucleosomes. Thus, our findings strongly suggest that the histone N-tails are operative in selection of nucleosome positions, which may have wide-ranging implications for epigenetic modulation of chromatin states.

13.
Genome Biol ; 17(1): 180, 2016 08 26.
Artigo em Inglês | MEDLINE | ID: mdl-27566673

RESUMO

Duplex sequencing was originally developed to detect rare nucleotide polymorphisms normally obscured by the noise of high-throughput sequencing. Here we describe a new, streamlined, reference-free approach for the analysis of duplex sequencing data. We show the approach performs well on simulated data and precisely reproduces previously published results and apply it to a newly produced dataset, enabling us to type low-frequency variants in human mitochondrial DNA. Finally, we provide all necessary tools as stand-alone components as well as integrate them into the Galaxy platform. All analyses performed in this manuscript can be repeated exactly as described at http://usegalaxy.org/duplex .


Assuntos
DNA Mitocondrial/genética , Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo de Nucleotídeo Único/genética , Software , Genômica , Humanos , Análise de Sequência de DNA/métodos
14.
Gigascience ; 4: 1, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25838885

RESUMO

BACKGROUND: Adaptive alleles may rise in frequency as a consequence of positive selection, creating a pattern of decreased variation in the neighboring loci, known as a selective sweep. When the region containing this pattern is compared to another population with no history of selection, a rise in variance of allele frequencies between populations is observed. One challenge presented by large genome-wide datasets is the ability to differentiate between patterns that are remnants of natural selection from those expected to arise at random and/or as a consequence of selectively neutral demographic forces acting in the population. FINDINGS: SmileFinder is a simple program that looks for diversity and divergence patterns consistent with selection sweeps by evaluating allele frequencies in windows, including neighboring loci from two or more populations of a diploid species against the genome-wide neutral expectation. The program calculates the mean of heterozygosity and FST in a set of sliding windows of incrementally increasing sizes, and then builds a resampled distribution (the baseline) of random multi-locus sets matched to the sizes of sliding windows, using an unrestricted sampling. Percentiles of the values in the sliding windows are derived from the superimposed resampled distribution. The resampling can easily be scaled from 1 K to 100 M; the higher the number, the more precise the percentiles ascribed to the extreme observed values. CONCLUSIONS: The output from SmileFinder can be used to plot percentile values to look for population diversity and divergence patterns that may suggest past actions of positive selection along chromosome maps, and to compare lists of suspected candidate genes under random gene sets to test for the overrepresentation of these patterns among gene categories. Both applications of the algorithm have already been used in published studies. Here we present a publicly available, open source program that will serve as a useful tool for preliminary scans of selection using worldwide databases of human genetic variation, as well as population datasets for many non-human species, from which such data is rapidly emerging with the advent of new genotyping and sequencing technologies.


Assuntos
Frequência do Gene , Seleção Genética , Análise de Sequência de DNA/métodos , Software , Algoritmos , Biologia Computacional , Diploide , Genética Populacional/métodos , Heterozigoto
15.
Gigascience ; 1(1): 14, 2012 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-23587420

RESUMO

BACKGROUND: Amazona vittata is a critically endangered Puerto Rican endemic bird, the only surviving native parrot species in the United States territory, and the first parrot in the large Neotropical genus Amazona, to be studied on a genomic scale. FINDINGS: In a unique community-based funded project, DNA from an A. vittata female was sequenced using a HiSeq Illumina platform, resulting in a total of ~42.5 billion nucleotide bases. This provided approximately 26.89x average coverage depth at the completion of this funding phase. Filtering followed by assembly resulted in 259,423 contigs (N50 = 6,983 bp, longest = 75,003 bp), which was further scaffolded into 148,255 fragments (N50 = 19,470, longest = 206,462 bp). This provided ~76% coverage of the genome based on an estimated size of 1.58 Gb. The assembled scaffolds allowed basic genomic annotation and comparative analyses with other available avian whole-genome sequences. CONCLUSIONS: The current data represents the first genomic information from and work carried out with a unique source of funding. This analysis further provides a means for directed training of young researchers in genetic and bioinformatics analyses and will facilitate progress towards a full assembly and annotation of the Puerto Rican parrot genome. It also adds extensive genomic data to a new branch of the avian tree, making it useful for comparative analyses with other avian species. Ultimately, the knowledge acquired from these data will contribute to an improved understanding of the overall population health of this species and aid in ongoing and future conservation efforts.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA