RESUMEN
Multiple signatures of somatic mutations have been identified in cancer genomes. Exome sequences of 1,001 human cancer cell lines and 577 xenografts revealed most common mutational signatures, indicating past activity of the underlying processes, usually in appropriate cancer types. To investigate ongoing patterns of mutational-signature generation, cell lines were cultured for extended periods and subsequently DNA sequenced. Signatures of discontinued exposures, including tobacco smoke and ultraviolet light, were not generated in vitro. Signatures of normal and defective DNA repair and replication continued to be generated at roughly stable mutation rates. Signatures of APOBEC cytidine deaminase DNA-editing exhibited substantial fluctuations in mutation rate over time with episodic bursts of mutations. The initiating factors for the bursts are unclear, although retrotransposon mobilization may contribute. The examined cell lines constitute a resource of live experimental models of mutational processes, which potentially retain patterns of activity and regulation operative in primary human cancers.
Asunto(s)
Desaminasas APOBEC/genética , Neoplasias/genética , Desaminasas APOBEC/metabolismo , Línea Celular , Línea Celular Tumoral , ADN/metabolismo , Análisis Mutacional de ADN/métodos , Bases de Datos Genéticas , Exoma , Genoma Humano/genética , Xenoinjertos , Humanos , Mutagénesis , Mutación/genética , Tasa de Mutación , Retroelementos , Secuenciación del Exoma/métodosRESUMEN
The rates and patterns of somatic mutation in normal tissues are largely unknown outside of humans1-7. Comparative analyses can shed light on the diversity of mutagenesis across species, and on long-standing hypotheses about the evolution of somatic mutation rates and their role in cancer and ageing. Here we performed whole-genome sequencing of 208 intestinal crypts from 56 individuals to study the landscape of somatic mutation across 16 mammalian species. We found that somatic mutagenesis was dominated by seemingly endogenous mutational processes in all species, including 5-methylcytosine deamination and oxidative damage. With some differences, mutational signatures in other species resembled those described in humans8, although the relative contribution of each signature varied across species. Notably, the somatic mutation rate per year varied greatly across species and exhibited a strong inverse relationship with species lifespan, with no other life-history trait studied showing a comparable association. Despite widely different life histories among the species we examined-including variation of around 30-fold in lifespan and around 40,000-fold in body mass-the somatic mutation burden at the end of lifespan varied only by a factor of around 3. These data unveil common mutational processes across mammals, and suggest that somatic mutation rates are evolutionarily constrained and may be a contributing factor in ageing.
Asunto(s)
Longevidad , Tasa de Mutación , Animales , Humanos , Longevidad/genética , Mamíferos/genética , Mutagénesis/genética , MutaciónRESUMEN
The histological diagnosis of peripheral T-cell lymphoma (PTCL) can represent a challenge, particularly in the case of closely related entities such as angioimmunoblastic T-lymphoma (AITL), PTCL-not otherwise specified (PTCL-NOS), and ALK-negative anaplastic large-cell lymphoma (ALCL). Although gene expression profiling and next generations sequencing have been proven to define specific features recurrently associated with distinct entities, genomic-based stratifications have not yet led to definitive diagnostic criteria and/or entered into the routine clinical practice. Herein, to improve the current molecular classification between AITL and PTCL-NOS, we analyzed the transcriptional profiles from 503 PTCLs stratified according to their molecular configuration and integrated them with genomic data of recurrently mutated genes (RHOA G17V , TET2, IDH2 R172 , and DNMT3A) in 53 cases (39 AITLs and 14 PTCL-NOSs) included in the series. Our analysis unraveled that the mutational status of RHOA G17V , TET2, and DNMT3A poorly correlated, individually, with peculiar transcriptional fingerprints. Conversely, in IDH2 R172 samples a strong transcriptional signature was identified that could act as a surrogate for mutational status. The integrated analysis of clinical, mutational, and molecular data led to a simplified 19-gene signature that retains high accuracy in differentiating the main nodal PTCL entities. The expression levels of those genes were confirmed in an independent cohort profiled by RNA-sequencing.
Asunto(s)
Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Linfoma de Células T Periférico , Mutación , Proteínas de Neoplasias , Transcripción Genética , Femenino , Humanos , Linfoma de Células T Periférico/genética , Linfoma de Células T Periférico/metabolismo , Masculino , Proteínas de Neoplasias/biosíntesis , Proteínas de Neoplasias/genéticaRESUMEN
BACKGROUND: Genome editing by CRISPR-Cas9 technology allows large-scale screening of gene essentiality in cancer. A confounding factor when interpreting CRISPR-Cas9 screens is the high false-positive rate in detecting essential genes within copy number amplified regions of the genome. We have developed the computational tool CRISPRcleanR which is capable of identifying and correcting gene-independent responses to CRISPR-Cas9 targeting. CRISPRcleanR uses an unsupervised approach based on the segmentation of single-guide RNA fold change values across the genome, without making any assumption about the copy number status of the targeted genes. RESULTS: Applying our method to existing and newly generated genome-wide essentiality profiles from 15 cancer cell lines, we demonstrate that CRISPRcleanR reduces false positives when calling essential genes, correcting biases within and outside of amplified regions, while maintaining true positive rates. Established cancer dependencies and essentiality signals of amplified cancer driver genes are detectable post-correction. CRISPRcleanR reports sgRNA fold changes and normalised read counts, is therefore compatible with downstream analysis tools, and works with multiple sgRNA libraries. CONCLUSIONS: CRISPRcleanR is a versatile open-source tool for the analysis of CRISPR-Cas9 knockout screens to identify essential genes.
Asunto(s)
Sistemas CRISPR-Cas , Marcación de Gen/métodos , Genoma Humano , Neoplasias/genética , Línea Celular Tumoral , Variaciones en el Número de Copia de ADN , Amplificación de Genes , Técnicas de Inactivación de Genes/métodos , Genes Esenciales , Ensayos Analíticos de Alto Rendimiento , Humanos , Análisis de Secuencia de ADN , Programas InformáticosRESUMEN
Genetic screens in cancer cell lines inform gene function and drug discovery. More comprehensive screen datasets with multi-omics data are needed to enhance opportunities to functionally map genetic vulnerabilities. Here, we construct a second-generation map of cancer dependencies by annotating 930 cancer cell lines with multi-omic data and analyze relationships between molecular markers and cancer dependencies derived from CRISPR-Cas9 screens. We identify dependency-associated gene expression markers beyond driver genes, and observe many gene addiction relationships driven by gain of function rather than synthetic lethal effects. By combining clinically informed dependency-marker associations with protein-protein interaction networks, we identify 370 anti-cancer priority targets for 27 cancer types, many of which have network-based evidence of a functional link with a marker in a cancer type. Mapping these targets to sequenced tumor cohorts identifies tractable targets in different cancer types. This target prioritization map enhances understanding of gene dependencies and identifies candidate anti-cancer targets for drug development.
Asunto(s)
Pruebas Genéticas , Neoplasias , Humanos , Fenotipo , Descubrimiento de Drogas , Neoplasias/genética , Neoplasias/patología , Línea Celular Tumoral , Sistemas CRISPR-CasRESUMEN
Drug resistance is a principal limitation to the long-term efficacy of cancer therapies. Cancer genome sequencing can retrospectively delineate the genetic basis of drug resistance, but this requires large numbers of post-treatment samples to nominate causal variants. Here we prospectively identify genetic mechanisms of resistance to ten oncology drugs from CRISPR base editing mutagenesis screens in four cancer cell lines using a guide RNA library predicted to install 32,476 variants in 11 cancer genes. We identify four functional classes of protein variants modulating drug sensitivity and use single-cell transcriptomics to reveal how these variants operate through distinct mechanisms, including eliciting a drug-addicted cell state. We identify variants that can be targeted with alternative inhibitors to overcome resistance and functionally validate an epidermal growth factor receptor (EGFR) variant that sensitizes lung cancer cells to EGFR inhibitors. Our variant-to-function map has implications for patient stratification, therapy combinations and drug scheduling in cancer treatment.
RESUMEN
Microsatellite-unstable (MSI) cancers require WRN helicase to resolve replication stress due to expanded DNA (TA)n dinucleotide repeats. WRN is a promising synthetic lethal target for MSI tumors, and WRN inhibitors are in development. In this study, we used CRISPR-Cas9 base editing to map WRN residues critical for MSI cells, validating the helicase domain as the primary drug target. Fragment-based screening led to the development of potent and highly selective WRN helicase covalent inhibitors. These compounds selectively suppressed MSI model growth in vitro and in vivo by mimicking WRN loss, inducing DNA double-strand breaks at expanded TA repeats and DNA damage. Assessment of biomarkers in preclinical models linked TA-repeat expansions and mismatch repair alterations to compound activity. Efficacy was confirmed in immunotherapy-resistant organoids and patient-derived xenograft models. The discovery of potent, selective covalent WRN inhibitors provides proof of concept for synthetic lethal targeting of WRN in MSI cancer and tools to dissect WRN biology. Significance: We report the discovery and characterization of potent, selective WRN helicase inhibitors for MSI cancer treatment, with biomarker analysis and evaluation of efficacy in vivo and in immunotherapy-refractory preclinical models. These findings pave the way to translate WRN inhibition into MSI cancer therapies and provide tools to investigate WRN biology. See related commentary by Wainberg, p. 1369.
Asunto(s)
Helicasa del Síndrome de Werner , Humanos , Helicasa del Síndrome de Werner/genética , Ratones , Animales , Inestabilidad de Microsatélites , Neoplasias/tratamiento farmacológico , Neoplasias/genética , Neoplasias/patología , Línea Celular Tumoral , Ensayos Antitumor por Modelo de Xenoinjerto , Inhibidores Enzimáticos/farmacología , Inhibidores Enzimáticos/uso terapéuticoRESUMEN
Interferon-γ (IFN-γ) signaling mediates host responses to infection, inflammation and anti-tumor immunity. Mutations in the IFN-γ signaling pathway cause immunological disorders, hematological malignancies, and resistance to immune checkpoint blockade (ICB) in cancer; however, the function of most clinically observed variants remains unknown. Here, we systematically investigate the genetic determinants of IFN-γ response in colorectal cancer cells using CRISPR-Cas9 screens and base editing mutagenesis. Deep mutagenesis of JAK1 with cytidine and adenine base editors, combined with pathway-wide screens, reveal loss-of-function and gain-of-function mutations, including causal variants in hematological malignancies and mutations detected in patients refractory to ICB. We functionally validate variants of uncertain significance in primary tumor organoids, where engineering missense mutations in JAK1 enhanced or reduced sensitivity to autologous tumor-reactive T cells. We identify more than 300 predicted missense mutations altering IFN-γ pathway activity, generating a valuable resource for interpreting gene variant function.
Asunto(s)
Neoplasias Hematológicas , Neoplasias , Humanos , Interferón gamma/genética , Interferón gamma/metabolismo , Edición Génica , Neoplasias/genética , Mutación , Transducción de Señal/genética , Sistemas CRISPR-CasRESUMEN
Organoid cell culture methodologies are enabling the generation of cell models from healthy and diseased tissue. Patient-derived cancer organoids that recapitulate the genetic and histopathological diversity of patient tumours are being systematically generated, providing an opportunity to investigate new cancer biology and therapeutic approaches. The use of organoid cultures for many applications, including genetic and chemical perturbation screens, is limited due to the technical demands and cost associated with their handling and propagation. Here we report and benchmark a suspension culture technique for cancer organoids which allows for the expansion of models to tens of millions of cells with increased efficiency in comparison to standard organoid culturing protocols. Using whole-genome DNA and RNA sequencing analyses, as well as medium-throughput drug sensitivity testing and genome-wide CRISPR-Cas9 screening, we demonstrate that cancer organoids grown as a suspension culture are genetically and phenotypically similar to their counterparts grown in standard conditions. This culture technique simplifies organoid cell culture and extends the range of organoid applications, including for routine use in large-scale perturbation screens.
Asunto(s)
Neoplasias , Organoides , Técnicas de Cultivo de Célula , ADN , Humanos , Neoplasias/tratamiento farmacológico , Neoplasias/genética , Neoplasias/patología , Organoides/patologíaRESUMEN
CEP server (http://bioinfo.ernet.in/cep.htm) provides a web interface to the conformational epitope prediction algorithm developed in-house. The algorithm, apart from predicting conformational epitopes, also predicts antigenic determinants and sequential epitopes. The epitopes are predicted using 3D structure data of protein antigens, which can be visualized graphically. The algorithm employs structure-based Bioinformatics approach and solvent accessibility of amino acids in an explicit manner. Accuracy of the algorithm was found to be 75% when evaluated using X-ray crystal structures of Ag-Ab complexes available in the PDB. This is the first and the only method available for the prediction of conformational epitopes, which is an attempt to map probable antibody-binding sites of protein antigens.
Asunto(s)
Mapeo Epitopo/métodos , Epítopos/química , Proteínas/química , Proteínas/inmunología , Programas Informáticos , Algoritmos , Internet , Conformación Proteica , Diseño de Software , Interfaz Usuario-ComputadorRESUMEN
Patterns of genomic evolution between primary and metastatic breast cancer have not been studied in large numbers, despite patients with metastatic breast cancer having dismal survival. We sequenced whole genomes or a panel of 365 genes on 299 samples from 170 patients with locally relapsed or metastatic breast cancer. Several lines of analysis indicate that clones seeding metastasis or relapse disseminate late from primary tumors, but continue to acquire mutations, mostly accessing the same mutational processes active in the primary tumor. Most distant metastases acquired driver mutations not seen in the primary tumor, drawing from a wider repertoire of cancer genes than early drivers. These include a number of clinically actionable alterations and mutations inactivating SWI-SNF and JAK2-STAT3 pathways.
Asunto(s)
Biomarcadores de Tumor/genética , Neoplasias de la Mama/genética , Neoplasias de la Mama/patología , Evolución Molecular , Mutación , Recurrencia Local de Neoplasia/genética , Adulto , Anciano , Anciano de 80 o más Años , Proteínas Cromosómicas no Histona/antagonistas & inhibidores , Proteínas Cromosómicas no Histona/genética , Femenino , Humanos , Janus Quinasa 2/antagonistas & inhibidores , Janus Quinasa 2/genética , Masculino , Persona de Mediana Edad , Metástasis de la Neoplasia/genética , Factor de Transcripción STAT3/antagonistas & inhibidores , Factor de Transcripción STAT3/genética , Factores de Transcripción/antagonistas & inhibidores , Factores de Transcripción/genéticaRESUMEN
BACKGROUND: Whole genome sequence data is a step towards generating the 'parts list' of life to understand the underlying principles of Biocomplexity. Genome sequencing initiatives of human and model organisms are targeted efforts towards understanding principles of evolution with an application envisaged to improve human health. These efforts culminated in the development of dedicated resources. Whereas a large number of viral genomes have been sequenced by groups or individuals with an interest to study antigenic variation amongst strains and species. These independent efforts enabled viruses to attain the status of 'best-represented taxa' with the highest number of genomes. However, due to lack of concerted efforts, viral genomic sequences merely remained as entries in the public repositories until recently. RESULTS: VirGen is a curated resource of viral genomes and their analyses. Since its first release, it has grown both in terms of coverage of viral families and development of new modules for annotation and analysis. The current release (2.0) includes data for twenty-five families with broad host range as against eight in the first release. The taxonomic description of viruses in VirGen is in accordance with the ICTV nomenclature. A well-characterised strain is identified as a 'representative entry' for every viral species. This non-redundant dataset is used for subsequent annotation and analyses using sequenced-based Bioinformatics approaches. VirGen archives precomputed data on genome and proteome comparisons. A new data module that provides structures of viral proteins available in PDB has been incorporated recently. One of the unique features of VirGen is predicted conformational and sequential epitopes of known antigenic proteins using in-house developed algorithms, a step towards reverse vaccinology. CONCLUSION: Structured organization of genomic data facilitates use of data mining tools, which provides opportunities for knowledge discovery. One of the approaches to achieve this goal is to carry out functional annotations using comparative genomics. VirGen, a comprehensive viral genome resource that serves as an annotation and analysis pipeline has been developed for the curation of public domain viral genome data http://bioinfo.ernet.in/virgen/virgen.html. Various steps in the curation and annotation of the genomic data and applications of the value-added derived data are substantiated with case studies.
Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas , Genoma Viral , Secuencia de Aminoácidos , Variación Antigénica , Simulación por Computador , Bases de Datos de Proteínas , Variación Genética , Modelos Moleculares , Datos de Secuencia Molecular , Filogenia , Homología de Secuencia de Aminoácido , Vacunas/análisisRESUMEN
VirGen is a comprehensive viral genome resource that organizes the 'sequence space' of viral genomes in a structured fashion. It has been developed with the objective of serving as an annotated and curated database comprising complete genome sequences of viruses, value-added derived data and data mining tools. The current release (v1.1) contains 559 complete genomes in addition to 287 putative genomes of viruses belonging to eight viral families for which the host range includes animals and plants. Viral genomes in VirGen are annotated using sequence-based Bioinformatics approaches. The genomic data is also curated to identify 'alternate names' of viral proteins, where available. VirGen archives the results of comparisons of genomes, proteomes and individual proteins within and between viral species. It is the first resource to provide phylogenetic trees of viral species computed using whole-genome sequence data. The module of predicted B-cell antigenic determinants in VirGen is an attempt to link the genome to its vaccinome. Comparative genome analysis data facilitate the study of genome organization and evolution of viruses, which would have implications in applied research to identify candidates for the design of vaccines and antiviral drugs. VirGen is a relational database and is available at http://bioinfo. ernet.in/virgen/virgen.html.
Asunto(s)
Bases de Datos Genéticas , Genoma Viral , Genómica , Virus/genética , Antígenos Virales/inmunología , Antivirales , Biología Computacional , Internet , Filogenia , Proteoma/genética , Proteómica , Alineación de Secuencia , Terminología como Asunto , Proteínas Virales/genética , Vacunas Virales/inmunología , Virus/clasificaciónRESUMEN
INTRODUCTION: The majority of the genetic variance of systemic lupus erythematosus (SLE) remains unexplained by the common disease-common variant hypothesis. Rare variants, which are not detectable by genome-wide association studies because of their low frequencies, are predicted to explain part of this "missing heritability." However, recent studies identifying rare variants within known disease-susceptibility loci have failed to show genetic associations because of their extremely low frequencies, leading to the questioning of the contribution of rare variants to disease susceptibility. A common (minor allele frequency = 17.4% in cases) nonsynonymous coding variant rs1143679 (R77H) in ITGAM (CD11b), which forms half of the heterodimeric integrin receptor, complement receptor 3 (CR3), is robustly associated with SLE and has been shown to impair CR3-mediated phagocytosis. METHODS: We resequenced ITGAM in 73 SLE cases and identified two previously unidentified, case-specific nonsynonymous variants, F941V and G1145S. Both variants were genotyped in 2,107 and 949 additional SLE cases, respectively, to estimate their frequencies in a disease population. An in vitro model was used to assess the impact of F941V and G1145S, together with two nonsynonymous ITGAM polymorphisms, A858V (rs1143683) and M441T (rs11861251), on CR3-mediated phagocytosis. A paired two-tailed t test was used to compare the phagocytic capabilities of each variant with that of wild-type CR3. RESULTS: Both rare variants, F941V and G1145S, significantly impair CR3-mediated phagocytosis in an in vitro model (61% reduction, P = 0.006; 26% reduction, P = 0.0232). However, neither of the common variants, M441T and A858V, had an effect on phagocytosis. Neither rare variant was observed again in the genotyping of additional SLE cases, suggesting that their frequencies are extremely low. CONCLUSIONS: Our results add further evidence to the functional importance of ITGAM in SLE pathogenesis through impaired phagocytosis. Additionally, this study provides a new example of the identification of rare variants in common-allele-associated loci, which, because of their extremely low frequencies, are not statistically associated. However, the demonstration of their functional effects adds support to their contribution to disease risk, and questions the current notion of dismissing the contribution of very rare variants on purely statistical analyses.
Asunto(s)
Antígeno CD11b/genética , Predisposición Genética a la Enfermedad/genética , Lupus Eritematoso Sistémico/genética , Mutación Missense/genética , Animales , Secuencia de Bases , Antígeno CD11b/inmunología , Células COS , Chlorocebus aethiops , Citometría de Flujo , Frecuencia de los Genes , Genotipo , Humanos , Lupus Eritematoso Sistémico/inmunología , Antígeno de Macrófago-1/genética , Antígeno de Macrófago-1/inmunología , Mutación Missense/inmunología , Fagocitosis/genética , Fagocitosis/inmunología , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: Human variation in susceptibility to hypoxia-induced pulmonary hypertension is well recognized. High-altitude residents who do not develop pulmonary hypertension may host protective gene mutations. METHODS AND RESULTS: Exome sequencing was conducted on 24 unrelated Kyrgyz highlanders living 2400 to 3800 m above sea level, 12 (10 men; mean age, 54 years) with an elevated mean pulmonary artery pressure (mean±SD, 38.7±2.7 mm Hg) and 12 (11 men; mean age, 52 years) with a normal mean pulmonary artery pressure (19.2±0.6 mm Hg) to identify candidate genes that may influence the pulmonary vascular response to hypoxia. A total of 140 789 exomic variants were identified and 26 116 (18.5%) were classified as novel or rare. Thirty-three novel or rare potential pathogenic variants (frameshift, essential splice-site, and nonsynonymous) were found exclusively in either ≥3 subjects with high-altitude pulmonary hypertension or ≥3 highlanders with a normal mean pulmonary artery pressure. A novel missense mutation in GUCY1A3 in 3 subjects with a normal mean pulmonary artery pressure encodes an α1-A680T soluble guanylate cyclase (sGC) variant. Expression of the α1-A680T sGC variant in reporter cells resulted in higher cyclic guanosine monophosphate production compared with the wild-type enzyme and the purified α1-A680T sGC exhibited enhanced sensitivity to nitric oxide in vitro. CONCLUSIONS: The α1-A680T sGC variant may contribute to protection against high-altitude pulmonary hypertension and supports sGC as a pharmacological target for reducing pulmonary artery pressure in humans at altitude.