RESUMO
Background: Colon microbiome composition contributes to the pathogenesis of colorectal cancer (CRC) and prognosis. We analyzed 16S rRNA sequencing data from tumor samples of patients with metastatic CRC and determined the clinical implications. Materials and methods: We enrolled 133 patients with metastatic CRC at St. Vincent Hospital in Korea. The V3-V4 regions of the 16S rRNA gene from the tumor DNA were amplified, sequenced on an Illumina MiSeq, and analyzed using the DADA2 package. Results: After excluding samples that retained <5% of the total reads after merging, 120 samples were analyzed. The median age of patients was 63 years (range, 34-82 years), and 76 patients (63.3%) were male. The primary cancer sites were the right colon (27.5%), left colon (30.8%), and rectum (41.7%). All subjects received 5-fluouracil-based systemic chemotherapy. After removing genera with <1% of the total reads in each patient, 523 genera were identified. Rectal origin, high CEA level (≥10 ng/mL), and presence of lung metastasis showed higher richness. Survival analysis revealed that the presence of Prevotella (p = 0.052), Fusobacterium (p = 0.002), Selenomonas (p<0.001), Fretibacterium (p = 0.001), Porphyromonas (p = 0.007), Peptostreptococcus (p = 0.002), and Leptotrichia (p = 0.003) were associated with short overall survival (OS, <24 months), while the presence of Sphingomonas was associated with long OS (p = 0.070). From the multivariate analysis, the presence of Selenomonas (hazard ratio [HR], 6.35; 95% confidence interval [CI], 2.38-16.97; p<0.001) was associated with poor prognosis along with high CEA level. Conclusion: Tumor microbiome features may be useful prognostic biomarkers for metastatic CRC.
RESUMO
BACKGROUND: Gastric cancer is a leading cause of cancer morbidity and mortality. Developing information systems which integrate clinical and genomic data may accelerate discoveries to improve cancer prevention, detection, and treatment. To support translational research in gastric cancer, we developed the Gastric Cancer Registry (GCR), a North American repository of clinical and cancer genomics data. METHODS: Participants self-enrolled online. Entry criteria into the GCR included the following: (i) diagnosis of gastric cancer, (ii) history of gastric cancer in a first- or second-degree relative, or (iii) known germline mutation in the gene CDH1. Participants provided demographic and clinical information through a detailed survey. Some participants provided specimens of saliva and tumor samples. Tumor samples underwent exome sequencing, whole-genome sequencing, and transcriptome sequencing. RESULTS: From 2011 to 2021, 567 individuals registered and returned the clinical questionnaire. For this cohort 65% had a personal history of gastric cancer, 36% reported a family history of gastric cancer, and 14% had a germline CDH1 mutation. 89 patients with gastric cancer provided tumor samples. For the initial study, 41 tumors were sequenced using next-generation sequencing. The data was analyzed for cancer mutations, copy-number variations, gene expression, microbiome, neoantigens, immune infiltrates, and other features. We developed a searchable, web-based interface (the GCR Genome Explorer) to enable researchers' access to these datasets. CONCLUSIONS: The GCR is a unique, North American gastric cancer registry which integrates clinical and genomic annotation. IMPACT: Available for researchers through an open access, web-based explorer, the GCR Genome Explorer will accelerate collaborative gastric cancer research across the United States and world.
Assuntos
Neoplasias Gástricas , Genômica , Mutação em Linhagem Germinativa , Humanos , Sistemas de Informação , Pesquisa Interdisciplinar , Sistema de Registros , Neoplasias Gástricas/genética , Neoplasias Gástricas/patologiaRESUMO
We developed a single-cell approach to detect CRISPR-modified mRNA transcript structures. This method assesses how genetic variants at splicing sites and splicing factors contribute to alternative mRNA isoforms. We determine how alternative splicing is regulated by editing target exon-intron segments or splicing factors by CRISPR-Cas9 and their consequences on transcriptome profile. Our method combines long-read sequencing to characterize the transcript structure and short-read sequencing to match the single-cell gene expression profiles and gRNA sequence and therefore provides targeted genomic edits and transcript isoform structure detection at single-cell resolution.
Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Sequenciamento por Nanoporos/métodos , Isoformas de Proteínas/metabolismo , Processamento Alternativo , Éxons , Genômica , Células HEK293 , Humanos , Proteínas de Neoplasias , Isoformas de Proteínas/genética , Isoformas de RNA/genética , Isoformas de RNA/metabolismo , Splicing de RNA , RNA Guia de Cinetoplastídeos/metabolismo , Receptores de Quinase C Ativada , TranscriptomaRESUMO
BACKGROUND: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contact tracing. METHODS: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies variants, and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. RESULTS: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and a combination of mutations that appear in only a small number of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of 100 SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. CONCLUSIONS: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients and determined their general prevalence when compared to over 70,000 other strains. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing.
Assuntos
COVID-19/virologia , Genoma Viral , Mutação , SARS-CoV-2/genética , Sequência de Bases , Sequência Conservada , Impressões Digitais de DNA , Humanos , RNA Viral , Análise de Sequência de RNARESUMO
Background: The genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contract tracing. Methods: Leveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. Results: We built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and rare ones that occur in only small fraction of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. Conclusions: We conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients, mutations demarcating dominant species and the prevalence of mutation signatures, of which a significant number were relatively unique. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing.