Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Nat Biomed Eng ; 7(12): 1627-1635, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37652985

RESUMO

Liquid biopsies provide a means for the profiling of cell-free RNAs secreted by cells throughout the body. Although well-annotated coding and non-coding transcripts in blood are readily detectable and can serve as biomarkers of disease, the overall diagnostic utility of the cell-free transcriptome remains unclear. Here we show that RNAs derived from transposable elements and other repeat elements are enriched in the cell-free transcriptome of patients with cancer, and that they serve as signatures for the accurate classification of the disease. We used repeat-element-aware liquid-biopsy technology and single-molecule nanopore sequencing to profile the cell-free transcriptome in plasma from patients with cancer and to examine millions of genomic features comprising all annotated genes and repeat elements throughout the genome. By aggregating individual repeat elements to the subfamily level, we found that samples with pancreatic cancer are enriched with specific Alu subfamilies, whereas other cancers have their own characteristic cell-free RNA profile. Our findings show that repetitive RNA sequences are abundant in blood and can be used as disease-specific diagnostic biomarkers.


Assuntos
Neoplasias , RNA , Humanos , RNA/genética , Sequência de Bases , Elementos de DNA Transponíveis , Plasma , Neoplasias/diagnóstico , Neoplasias/genética , Biomarcadores
2.
Nature ; 611(7936): 519-531, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-36261518

RESUMO

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.


Assuntos
Mapeamento Cromossômico , Diploide , Genoma Humano , Genômica , Humanos , Mapeamento Cromossômico/normas , Genoma Humano/genética , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas , Padrões de Referência , Genômica/métodos , Genômica/normas , Cromossomos Humanos/genética , Variação Genética/genética
3.
RNA ; 28(2): 162-176, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34728536

RESUMO

Nanopore sequencing devices read individual RNA strands directly. This facilitates identification of exon linkages and nucleotide modifications; however, using conventional direct RNA nanopore sequencing, the 5' and 3' ends of poly(A) RNA cannot be identified unambiguously. This is due in part to RNA degradation in vivo and in vitro that can obscure transcription start and end sites. In this study, we aimed to identify individual full-length human RNA isoforms among ∼4 million nanopore poly(A)-selected RNA reads. First, to identify RNA strands bearing 5' m7G caps, we exchanged the biological cap for a modified cap attached to a 45-nt oligomer. This oligomer adaptation method improved 5' end sequencing and ensured correct identification of the 5' m7G capped ends. Second, among these 5'-capped nanopore reads, we screened for features consistent with a 3' polyadenylation site. Combining these two steps, we identified 294,107 individual high-confidence full-length RNA scaffolds from human GM12878 cells, most of which (257,721) aligned to protein-coding genes. Of these, 4876 scaffolds indicated unannotated isoforms that were often internal to longer, previously identified RNA isoforms. Orthogonal data for m7G caps and open chromatin, such as CAGE and DNase-HS seq, confirmed the validity of these high-confidence RNA scaffolds.


Assuntos
Isoformas de RNA/química , RNA Mensageiro/química , Linhagem Celular Tumoral , Humanos , Sequenciamento por Nanoporos/métodos , Sinais de Poliadenilação na Ponta 3' do RNA , Isoformas de RNA/genética , RNA Mensageiro/genética , Transcriptoma
4.
Cell Syst ; 12(8): 827-838.e5, 2021 08 18.
Artigo em Inglês | MEDLINE | ID: mdl-34146471

RESUMO

The accurate identification and quantitation of RNA isoforms present in the cancer transcriptome is key for analyses ranging from the inference of the impacts of somatic variants to pathway analysis to biomarker development and subtype discovery. The ICGC-TCGA DREAM Somatic Mutation Calling in RNA (SMC-RNA) challenge was a crowd-sourced effort to benchmark methods for RNA isoform quantification and fusion detection from bulk cancer RNA sequencing (RNA-seq) data. It concluded in 2018 with a comparison of 77 fusion detection entries and 65 isoform quantification entries on 51 synthetic tumors and 32 cell lines with spiked-in fusion constructs. We report the entries used to build this benchmark, the leaderboard results, and the experimental features associated with the accurate prediction of RNA species. This challenge required submissions to be in the form of containerized workflows, meaning each of the entries described is easily reusable through CWL and Docker containers at https://github.com/SMC-RNA-challenge. A record of this paper's transparent peer review process is included in the supplemental information.


Assuntos
Neoplasias , Humanos , Neoplasias/genética , Isoformas de Proteínas/genética , RNA/genética , RNA-Seq , Análise de Sequência de RNA
5.
Elife ; 102021 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-34047695

RESUMO

Determining the layers of gene regulation within the innate immune response is critical to our understanding of the cellular responses to infection and dysregulation in disease. We identified a conserved mechanism of gene regulation in human and mouse via changes in alternative first exon (AFE) usage following inflammation, resulting in changes to the isoforms produced. Of these AFE events, we identified 95 unannotated transcription start sites in mice using a de novo transcriptome generated by long-read native RNA-sequencing, one of which is in the cytosolic receptor for dsDNA and known inflammatory inducible gene, Aim2. We show that this unannotated AFE isoform of Aim2 is the predominant isoform expressed during inflammation and contains an iron-responsive element in its 5'UTR enabling mRNA translation to be regulated by iron levels. This work highlights the importance of examining alternative isoform changes and translational regulation in the innate immune response and uncovers novel regulatory mechanisms of Aim2.


Assuntos
Processamento Alternativo , Proteínas de Ligação a DNA/genética , Éxons , Imunidade Inata/genética , Inflamação/genética , Macrófagos/metabolismo , Regiões 5' não Traduzidas , Animais , Células Cultivadas , Proteínas de Ligação a DNA/metabolismo , Perfilação da Expressão Gênica , Humanos , Inflamação/imunologia , Inflamação/metabolismo , Macrófagos/imunologia , Camundongos , Regiões Promotoras Genéticas , Transcriptoma
6.
Proc Natl Acad Sci U S A ; 118(13)2021 03 30.
Artigo em Inglês | MEDLINE | ID: mdl-33758101

RESUMO

Among the large, diverse set of mammalian long noncoding RNAs (lncRNAs), long noncoding primary microRNAs (lnc-pri-miRNAs) are those that host miRNAs. Whether lnc-pri-miRNA loci have important biological function independent of their cognate miRNAs is poorly understood. From a genome-scale lncRNA screen, lnc-pri-miRNA loci were enriched for function in cell proliferation, and in glioblastoma (i.e., GBM) cells with DGCR8 or DROSHA knockdown, lnc-pri-miRNA screen hits still regulated cell growth. To molecularly dissect the function of a lnc-pri-miRNA locus, we studied LOC646329 (also known as MIR29HG), which hosts the miR-29a/b1 cluster. In GBM cells, LOC646329 knockdown reduced miR-29a/b1 levels, and these cells exhibited decreased growth. However, genetic deletion of the miR-29a/b1 cluster (LOC646329-miR29Δ) did not decrease cell growth, while knockdown of LOC646329-miR29Δ transcripts reduced cell proliferation. The miR-29a/b1-independent activity of LOC646329 corresponded to enhancer-like activation of a neighboring oncogene (MKLN1), regulating cell propagation. The LOC646329 locus interacts with the MKLN1 promoter, and antisense oligonucleotide knockdown of the lncRNA disrupts these interactions and reduces the enhancer-like activity. More broadly, analysis of genome-wide data from multiple human cell types showed that lnc-pri-miRNA loci are significantly enriched for DNA looping interactions with gene promoters as well as genomic and epigenetic characteristics of transcriptional enhancers. Functional studies of additional lnc-pri-miRNA loci demonstrated cognate miRNA-independent enhancer-like activity. Together, these data demonstrate that lnc-pri-miRNA loci can regulate cell biology via both miRNA-dependent and miRNA-independent mechanisms.


Assuntos
Proliferação de Células/genética , Loci Gênicos , RNA Longo não Codificante/metabolismo , Apoptose/genética , Técnicas de Silenciamento de Genes , Células HEK293 , Humanos , MicroRNAs/genética , MicroRNAs/metabolismo , RNA Longo não Codificante/genética , RNA-Seq
7.
Am J Hum Genet ; 107(4): 654-669, 2020 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-32937144

RESUMO

There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.


Assuntos
Proteína BRCA1/genética , Epigênese Genética , Doenças Genéticas Inatas/genética , Genoma Humano , Receptores de LDL/genética , Expansão das Repetições de Trinucleotídeos , Proteína BRCA1/metabolismo , Metilação de DNA , Feminino , Ácido Fólico/metabolismo , Inativação Gênica , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/patologia , Loci Gênicos , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Regiões Promotoras Genéticas , Receptores de LDL/metabolismo , Gêmeos Monozigóticos
8.
Genome Biol ; 21(1): 83, 2020 03 31.
Artigo em Inglês | MEDLINE | ID: mdl-32234056

RESUMO

BACKGROUND: Long non-coding RNAs (lncRNAs) exhibit highly cell type-specific expression and function, making this class of transcript attractive for targeted cancer therapy. However, the vast majority of lncRNAs have not been tested as potential therapeutic targets, particularly in the context of currently used cancer treatments. Malignant glioma is rapidly fatal, and ionizing radiation is part of the current standard-of-care used to slow tumor growth in both adult and pediatric patients. RESULTS: We use CRISPR interference (CRISPRi) to screen 5689 lncRNA loci in human glioblastoma (GBM) cells, identifying 467 hits that modify cell growth in the presence of clinically relevant doses of fractionated radiation. Thirty-three of these lncRNA hits sensitize cells to radiation, and based on their expression in adult and pediatric gliomas, nine of these hits are prioritized as lncRNA Glioma Radiation Sensitizers (lncGRS). Knockdown of lncGRS-1, a primate-conserved, nuclear-enriched lncRNA, inhibits the growth and proliferation of primary adult and pediatric glioma cells, but not the viability of normal brain cells. Using human brain organoids comprised of mature neural cell types as a three-dimensional tissue substrate to model the invasive growth of glioma, we find that antisense oligonucleotides targeting lncGRS-1 selectively decrease tumor growth and sensitize glioma cells to radiation therapy. CONCLUSIONS: These studies identify lncGRS-1 as a glioma-specific therapeutic target and establish a generalizable approach to rapidly identify novel therapeutic targets in the vast non-coding genome to enhance radiation therapy.


Assuntos
Neoplasias Encefálicas/terapia , Sistemas CRISPR-Cas , Glioblastoma/terapia , RNA Longo não Codificante/antagonistas & inibidores , Adulto , Astrócitos , Encéfalo , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Neoplasias Encefálicas/radioterapia , Linhagem Celular Tumoral , Terapia Combinada , Glioblastoma/genética , Glioblastoma/patologia , Glioblastoma/radioterapia , Humanos , Oligonucleotídeos Antissenso , Organoides , Tolerância a Radiação
9.
Genome Res ; 28(2): 266-274, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29273626

RESUMO

Advances in long-read single molecule sequencing have opened new possibilities for 'benchtop' whole-genome sequencing. The Oxford Nanopore Technologies MinION is a portable device that uses nanopore technology that can directly sequence DNA molecules. MinION single molecule long sequence reads are well suited for de novo assembly of complex genomes as they facilitate the construction of highly contiguous physical genome maps obviating the need for labor-intensive physical genome mapping. Long sequence reads can also be used to delineate complex chromosomal rearrangements, such as those that occur in tumor cells, that can confound analysis using short reads. Here, we assessed MinION long-read-derived sequences for feasibility concerning: (1) the de novo assembly of a large complex genome, and (2) the elucidation of complex rearrangements. The genomes of two Caenorhabditis elegans strains, a wild-type strain and a strain containing two complex rearrangements, were sequenced with MinION. Up to 42-fold coverage was obtained from a single flow cell, and the best pooled data assembly produced a highly contiguous wild-type C. elegans genome containing 48 contigs (N50 contig length = 3.99 Mb) covering >99% of the 100,286,401-base reference genome. Further, the MinION-derived genome assembly expanded the C. elegans reference genome by >2 Mb due to a more accurate determination of repetitive sequence elements and assembled the complete genomes of two co-extracted bacteria. MinION long-read sequence data also facilitated the elucidation of complex rearrangements in a mutagenized strain. The sequence accuracy of the MinION long-read contigs (∼98%) was improved using Illumina-derived sequence data to polish the final genome assembly to 99.8% nucleotide accuracy when compared to the reference assembly.


Assuntos
Caenorhabditis elegans/genética , Genoma/genética , Anotação de Sequência Molecular , Animais , Mapeamento Cromossômico , Rearranjo Gênico/genética , Sequenciamento de Nucleotídeos em Larga Escala , Sequências Repetitivas de Ácido Nucleico/genética
10.
Nat Methods ; 14(4): 411-413, 2017 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-28218897

RESUMO

DNA chemical modifications regulate genomic function. We present a framework for mapping cytosine and adenosine methylation with the Oxford Nanopore Technologies MinION using this nanopore sequencer's ionic current signal. We map three cytosine variants and two adenine variants. The results show that our model is sensitive enough to detect changes in genomic DNA methylation levels as a function of growth phase in Escherichia coli.


Assuntos
5-Metilcitosina/metabolismo , Metilação de DNA , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Nanoporos , 5-Metilcitosina/análise , Escherichia coli/genética , Genoma Bacteriano , Sequenciamento de Nucleotídeos em Larga Escala/instrumentação , Cadeias de Markov , Modelos Genéticos
11.
Nat Methods ; 12(4): 351-6, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25686389

RESUMO

Speed, single-base sensitivity and long read lengths make nanopores a promising technology for high-throughput sequencing. We evaluated and optimized the performance of the MinION nanopore sequencer using M13 genomic DNA and used expectation maximization to obtain robust maximum-likelihood estimates for insertion, deletion and substitution error rates (4.9%, 7.8% and 5.1%, respectively). Over 99% of high-quality 2D MinION reads mapped to the reference at a mean identity of 85%. We present a single-nucleotide-variant detection tool that uses maximum-likelihood parameter estimates and marginalization over many possible read alignments to achieve precision and recall of up to 99%. By pairing our high-confidence alignment strategy with long MinION reads, we resolved the copy number for a cancer-testis gene family (CT47) within an unresolved region of human chromosome Xq24.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Nanoporos , Algoritmos , Dosagem de Genes , Humanos , Neoplasias/genética
12.
Vaccine ; 29(48): 8802-26, 2011 Nov 08.
Artigo em Inglês | MEDLINE | ID: mdl-21964054

RESUMO

Tumor-associated carbohydrate antigens (TACA) result from the aberrant glycosylation that is seen with transformation to a tumor cell. The carbohydrate antigens that have been found to be tumor-associated include the mucin related Tn, Sialyl Tn, and Thomsen-Friedenreich antigens, the blood group Lewis related Lewis(Y), Sialyl Lewis(X) and Sialyl Lewis(A), and Lewis(X) (also known as stage-specific embryonic antigen-1, SSEA-1), the glycosphingolipids Globo H and stage-specific embryonic antigen-3 (SSEA-3), the sialic acid containing glycosphingolipids, the gangliosides GD2, GD3, GM2, fucosyl GM1, and Neu5GcGM3, and polysialic acid. Recent developments have furthered our understanding of the T-independent type II response that is seen in response to carbohydrate antigens. The selection of a vaccine target antigen is based on not only the presence of the antigen in a variety of tumor tissues but also on the role this antigen plays in tumor growth and metastasis. These roles for TACAs are being elucidated. Newly acquired knowledge in understanding the T-independent immune response and in understanding the key roles that carbohydrates play in metastasis are being applied in attempts to develop an effective vaccine response to TACAs. The role of each of the above mentioned carbohydrate antigens in cancer growth and metastasis and vaccine attempts using these antigens will be described.


Assuntos
Antígenos Glicosídicos Associados a Tumores/imunologia , Vacinas Anticâncer/imunologia , Neoplasias/prevenção & controle , Linfócitos B/imunologia , Células Dendríticas/imunologia , Epitopos/imunologia , Gangliosídeos , Humanos , Antígenos CD15 , Mucinas , Neoplasias/imunologia , Antígenos Embrionários Estágio-Específicos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA