Pesquisa | BVS CLAP/SMR-OPAS/OMS

The EN-TEx resource of multi-tissue personal epigenomes & variant-impact models.

Rozowsky, Joel; Gao, Jiahao; Borsari, Beatrice; Yang, Yucheng T; Galeev, Timur; Gürsoy, Gamze; Epstein, Charles B; Xiong, Kun; Xu, Jinrui; Li, Tianxiao; Liu, Jason; Yu, Keyang; Berthel, Ana; Chen, Zhanlin; Navarro, Fabio; Sun, Maxwell S; Wright, James; Chang, Justin; Cameron, Christopher J F; Shoresh, Noam; Gaskell, Elizabeth; Drenkow, Jorg; Adrian, Jessika; Aganezov, Sergey; Aguet, François; Balderrama-Gutierrez, Gabriela; Banskota, Samridhi; Corona, Guillermo Barreto; Chee, Sora; Chhetri, Surya B; Cortez Martins, Gabriel Conte; Danyko, Cassidy; Davis, Carrie A; Farid, Daniel; Farrell, Nina P; Gabdank, Idan; Gofin, Yoel; Gorkin, David U; Gu, Mengting; Hecht, Vivian; Hitz, Benjamin C; Issner, Robbyn; Jiang, Yunzhe; Kirsche, Melanie; Kong, Xiangmeng; Lam, Bonita R; Li, Shantao; Li, Bian; Li, Xiqi; Lin, Khine Zin.

Cell ; 186(7): 1493-1511.e40, 2023 03 30.

Artigo em Inglês | MEDLINE | ID: mdl-37001506

RESUMO

Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (â¼30 tissues × â¼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.

Assuntos

Epigenoma , Locos de Características Quantitativas , Estudo de Associação Genômica Ampla , Genômica , Fenótipo , Polimorfismo de Nucleotídeo Único

Targeted de novo phasing and long-range assembly by template mutagenesis.

Li, Siran; Park, Sarah; Ye, Catherine; Danyko, Cassidy; Wroten, Matthew; Andrews, Peter; Wigler, Michael; Levy, Dan.

Nucleic Acids Res ; 50(18): e103, 2022 10 14.

Artigo em Inglês | MEDLINE | ID: mdl-35822882

RESUMO

Short-read sequencers provide highly accurate reads at very low cost. Unfortunately, short reads are often inadequate for important applications such as assembly in complex regions or phasing across distant heterozygous sites. In this study, we describe novel bench protocols and algorithms to obtain haplotype-phased sequence assemblies with ultra-low error for regions 10 kb and longer using short reads only. We accomplish this by imprinting each template strand from a target region with a dense and unique mutation pattern. The mutation process randomly and independently converts â¼50% of cytosines to uracils. Sequencing libraries are made from both mutated and unmutated templates. Using de Bruijn graphs and paired-end read information, we assemble each mutated template and use the unmutated library to correct the mutated bases. Templates are partitioned into two or more haplotypes, and the final haplotypes are assembled and corrected for residual template mutations and PCR errors. With sufficient template coverage, the final assemblies have per-base error rates below 10-9. We demonstrate this method on a four-member nuclear family, correctly assembling and phasing three genomic intervals, including the highly polymorphic HLA-B gene.

Assuntos

Genoma , Genômica , Algoritmos , Antígenos HLA-B , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Mutagênese , Análise de Sequência de DNA/métodos

A limited set of transcriptional programs define major cell types.

Breschi, Alessandra; Muñoz-Aguirre, Manuel; Wucher, Valentin; Davis, Carrie A; Garrido-Martín, Diego; Djebali, Sarah; Gillis, Jesse; Pervouchine, Dmitri D; Vlasova, Anna; Dobin, Alexander; Zaleski, Chris; Drenkow, Jorg; Danyko, Cassidy; Scavelli, Alexandra; Reverter, Ferran; Snyder, Michael P; Gingeras, Thomas R; Guigó, Roderic.

Genome Res ; 30(7): 1047-1059, 2020 07.

Artigo em Inglês | MEDLINE | ID: mdl-32759341

RESUMO

We have produced RNA sequencing data for 53 primary cells from different locations in the human body. The clustering of these primary cells reveals that most cells in the human body share a few broad transcriptional programs, which define five major cell types: epithelial, endothelial, mesenchymal, neural, and blood cells. These act as basic components of many tissues and organs. Based on gene expression, these cell types redefine the basic histological types by which tissues have been traditionally classified. We identified genes whose expression is specific to these cell types, and from these genes, we estimated the contribution of the major cell types to the composition of human tissues. We found this cellular composition to be a characteristic signature of tissues and to reflect tissue morphological heterogeneity and histology. We identified changes in cellular composition in different tissues associated with age and sex, and found that departures from the normal cellular composition correlate with histological phenotypes associated with disease.

Assuntos

Transcrição Gênica , Linhagem Celular , Células Endoteliais/metabolismo , Células Epiteliais/metabolismo , Feminino , Perfilação da Expressão Gênica , Ginecomastia/genética , Ginecomastia/metabolismo , Humanos , Masculino , Mesoderma/citologia , Mesoderma/metabolismo , Neoplasias/genética , Especificidade de Órgãos , Análise de Sequência de RNA

Copolymerization of single-cell nucleic acids into balls of acrylamide gel.

Li, Siran; Kendall, Jude; Park, Sarah; Wang, Zihua; Alexander, Joan; Moffitt, Andrea; Ranade, Nissim; Danyko, Cassidy; Gegenhuber, Bruno; Fischer, Stephan; Robinson, Brian D; Lepor, Herbert; Tollkuhn, Jessica; Gillis, Jesse; Brouzes, Eric; Krasnitz, Alex; Levy, Dan; Wigler, Michael.

Genome Res ; 30(1): 49-61, 2020 01.

Artigo em Inglês | MEDLINE | ID: mdl-31727682

RESUMO

We show the use of 5'-Acrydite oligonucleotides to copolymerize single-cell DNA or RNA into balls of acrylamide gel (BAGs). Combining this step with split-and-pool techniques for creating barcodes yields a method with advantages in cost and scalability, depth of coverage, ease of operation, minimal cross-contamination, and efficient use of samples. We perform DNA copy number profiling on mixtures of cell lines, nuclei from frozen prostate tumors, and biopsy washes. As applied to RNA, the method has high capture efficiency of transcripts and sufficient consistency to clearly distinguish the expression patterns of cell lines and individual nuclei from neurons dissected from the mouse brain. By using varietal tags (UMIs) to achieve sequence error correction, we show extremely low levels of cross-contamination by tracking source-specific SNVs. The method is readily modifiable, and we will discuss its adaptability and diverse applications.

Assuntos

Acrilamida , Ácidos Nucleicos , Análise de Célula Única/métodos , Acrilamida/química , DNA , Contaminação por DNA , Variações do Número de Cópias de DNA , Dosagem de Genes , Perfilação da Expressão Gênica/métodos , Perfilação da Expressão Gênica/normas , Biblioteca Gênica , Humanos , Neoplasias/genética , Neoplasias/metabolismo , Neoplasias/patologia , Ácidos Nucleicos/química , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Análise de Sequência com Séries de Oligonucleotídeos/normas , Polimerização , RNA , Análise de Célula Única/normas

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA