ABSTRACT
BACKGROUND: There are known disparities in incidence and outcomes of colorectal cancer (CRC) by race and ethnicity. Some of these disparities may be mediated by molecular changes in tumors that occur at different rates across populations. Genetic ancestry is a measure complementary to race and ethnicity that can overcome missing data issues and better capture genetic similarity in admixed populations. We aimed to identify somatic mutations and tumor gene expression differences associated with both genetic ancestry and imputed race and ethnicity. METHODS: Sequencing was performed with the Tempus xT NGS 648-gene panel and whole exome capture RNA-Seq for 8454 primarily late-stage CRC patients. Genetic ancestry proportions for five continental groups, Africa (AFR), American indigenous (AMR), East Asia (EAS), Europe (EUR), and South Asia (SAS), were estimated using ancestry informative markers. To address data gaps, race and ethnicity categories were imputed, resulting in assignments for 952 Hispanic/Latino, 420 non-Hispanic (NH) Asian, 1061 NH Black, and 5763 NH White individuals. We assessed the association of genetic ancestry proportions and imputed race and ethnicity categories with somatic mutations in relevant CRC genes and in 2608 expression profiles, as well as 1957 consensus molecular subtypes (CMS). RESULTS: Increased AFR ancestry was associated with higher odds of somatic mutations in APC, KRAS, and PIK3CA and lower odds of BRAF mutations. Additionally, increased EAS ancestry was associated with lower odds of mutations in KRAS, EUR with higher odds in BRAF, and the Hispanic/Latino category with lower odds in BRAF. Greater AFR ancestry and the NH Black category were associated with higher rates of CMS3, while a higher proportion of Hispanic/Latino patients exhibited indeterminate CMS classifications. CONCLUSIONS: We identified molecular differences in CRC tumor mutation frequencies and gene expression that may underlie observed differences by race and ethnicity.
The association of AFR ancestry with increased KRAS mutations aligns with higher CMS3 subtype rates in NH Black patients. The increase of indeterminate CMS in Hispanic/Latino patients suggests that subtype classification methods could benefit from enhanced patient diversity.
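The abstract reports its results as higher or lower odds of mutation with increasing ancestry proportion, but it does not name the statistical model. As a minimal sketch, assuming a logistic regression in which ancestry proportion (0 to 1) enters as a covariate, a fitted coefficient converts to an odds ratio as follows. The function name and the coefficient value are hypothetical and are not taken from the study.

```python
import math

def odds_ratio_for_increase(beta, increase):
    """Odds ratio implied by a logistic-regression coefficient.

    beta     -- log-odds change per unit of ancestry proportion (assumed model)
    increase -- change in ancestry proportion, e.g. 0.1 for 10 percentage points
    """
    return math.exp(beta * increase)

# Hypothetical coefficient, chosen only for illustration.
beta_afr = math.log(1.5)

or_full = odds_ratio_for_increase(beta_afr, 1.0)   # 0% versus 100% AFR ancestry
or_tenth = odds_ratio_for_increase(beta_afr, 0.1)  # per 10 percentage points
```

Reporting the odds ratio per 10 percentage points is often easier to interpret than the full 0-to-1 contrast, since few patients sit at either extreme.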
Subjects
Colorectal Neoplasms , Mutation , Humans , Colorectal Neoplasms/genetics , Male , Female , Middle Aged , Proto-Oncogene Proteins B-raf/genetics , Aged , Class I Phosphatidylinositol 3-Kinases/genetics , Proto-Oncogene Proteins p21(ras)/genetics , Biomarkers, Tumor/genetics , Adenomatous Polyposis Coli Protein/genetics
ABSTRACT
The incompleteness of race and ethnicity information in real-world data (RWD) hampers its utility in promoting healthcare equity. This study introduces two methods, one heuristic and the other machine learning-based, to impute race and ethnicity from genetic ancestry using tumor profiling data. Analyzing de-identified data from over 100,000 cancer patients sequenced with the Tempus xT panel, we demonstrate that both methods outperform existing geolocation and surname-based methods, with the machine learning approach achieving high recall (range: 0.859-0.993) and precision (range: 0.932-0.981) across four mutually exclusive race and ethnicity categories. This work presents a novel pathway to enhance RWD utility in studying racial disparities in healthcare.
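The recall and precision ranges quoted above are per-category figures. As a hedged illustration of how such figures are computed for mutually exclusive categories, the sketch below scores imputed labels against stated labels. The function name and the six toy records are invented for the example and do not come from the study.

```python
from collections import Counter

def per_class_precision_recall(true_labels, predicted_labels):
    """Return {label: (precision, recall)} for every label seen in either list."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted this label, but it was wrong
            fn[truth] += 1  # missed the true label
    results = {}
    for label in sorted(set(true_labels) | set(predicted_labels)):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else float("nan")
        recall = tp[label] / actual if actual else float("nan")
        results[label] = (precision, recall)
    return results

# Toy data (hypothetical): stated versus imputed categories.
truth = ["NH White", "NH White", "NH Black", "NH Asian", "Hispanic/Latino", "NH Black"]
pred = ["NH White", "NH White", "NH Black", "NH Asian", "NH White", "NH Black"]
metrics = per_class_precision_recall(truth, pred)
```

In this toy set the single Hispanic/Latino record is imputed as NH White, which lowers NH White precision and gives Hispanic/Latino a recall of zero.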
Subjects
Ethnicity , Names , Humans , Ethnicity/genetics , Racial Groups/genetics , Computational Biology , Genetic Testing
ABSTRACT
BACKGROUND: Endocrine-resistant HR+/HER2- breast cancer (BC) and triple-negative BC (TNBC) are of interest for molecularly informed treatment due to their aggressive nature and limited treatment profiles. Patients of African Ancestry (AA) experience higher rates of TNBC and mortality than European Ancestry (EA) patients, despite lower overall BC incidence. Here, we compare the molecular landscapes of AA and EA patients with HR+/HER2- BC and TNBC in a real-world cohort to promote equity in precision oncology by illuminating the heterogeneity of potentially druggable genomic and transcriptomic pathways. METHODS: De-identified records from patients with TNBC or HR+/HER2- BC in the Tempus Database were randomly selected (N = 5000), with most having stage IV disease. Mutations, gene expression, and transcriptional signatures were evaluated from next-generation sequencing data. Genetic ancestry was estimated from DNA-seq. Mutational prevalence, gene expression, and transcriptional signatures were compared between AA and EA patients. EA patients were used as the reference population for log fold-changes (logFC) in expression. RESULTS: After applying inclusion criteria, 3433 samples were evaluated (n = 623 AA and n = 2810 EA). Observed patterns of dysregulated pathways demonstrated significant heterogeneity between the two groups. Notably, PIK3CA mutations were significantly less frequent in AA HR+/HER2- tumors (AA = 34% vs. EA = 42%, P < 0.05) and the overall cohort (AA = 28% vs. EA = 37%, P = 2.08e-05). Conversely, KMT2C mutation was significantly more frequent in AA than EA TNBC (23% vs. 12%, P < 0.05) and HR+/HER2- (24% vs. 15%, P = 3e-03) tumors. Across all subtypes and stages, over 8000 genes were differentially expressed between the two ancestral groups including RPL10 (logFC = 2.26, P = 1.70e-162), HSPA1A (logFC = -2.73, P = 2.43e-49), ATRX (logFC = -1.93, P = 5.89e-83), and NUTM2F (logFC = 2.28, P = 3.22e-196).
Ten differentially expressed gene sets were identified among stage IV HR+/HER2- tumors, of which four were considered relevant to BC treatment and were significantly enriched in EA: ERBB2_UP.V1_UP (P = 3.95e-06), LTE2_UP.V1_UP (P = 2.90e-05), HALLMARK_FATTY_ACID_METABOLISM (P = 0.0073), and HALLMARK_ANDROGEN_RESPONSE (P = 0.0074). CONCLUSIONS: We observed significant differences in mutational spectra, gene expression, and relevant transcriptional signatures between patients with genetically determined African and European ancestries, particularly within the HR+/HER2- BC and TNBC subtypes. These findings could guide future development of treatment strategies by providing opportunities for biomarker-informed research and, ultimately, clinical decisions for precision oncology care in diverse populations.
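The logFC values above are signed relative to the EA reference group, so a positive value means higher expression in AA tumors. The abstract does not state the log base or the normalization, so the sketch below assumes log2 and a pseudocount of 1 on group means. The function name and the toy expression values are hypothetical.

```python
import math
from statistics import mean

def log2_fold_change(group_values, reference_values, pseudocount=1.0):
    """log2 ratio of mean expression in a group versus a reference group.

    The pseudocount guards against division by zero for unexpressed genes.
    Both the log base and the pseudocount are assumptions of this sketch.
    """
    numerator = mean(group_values) + pseudocount
    denominator = mean(reference_values) + pseudocount
    return math.log2(numerator / denominator)

# Toy normalized expression values for one gene (hypothetical).
aa_expression = [7.0, 7.0]
ea_expression = [1.0, 1.0]

lfc = log2_fold_change(aa_expression, ea_expression)           # higher in AA
lfc_reversed = log2_fold_change(ea_expression, aa_expression)  # sign flips
```

Swapping which group serves as the reference only flips the sign, which is why the abstract states its reference population explicitly.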
Subjects
Breast Neoplasms , Triple Negative Breast Neoplasms , Female , Humans , Black People/genetics , Breast Neoplasms/ethnology , Breast Neoplasms/pathology , Mutation , Precision Medicine , Triple Negative Breast Neoplasms/ethnology , Triple Negative Breast Neoplasms/pathology , White People
ABSTRACT
The incidence and mortality of early onset colorectal cancer (EOCRC) are rising, and outcomes appear to differ by race and ethnicity. We aimed to assess differences in mutational landscape and gene expression of EOCRC by racial and ethnic groups (non-Hispanic Asian, non-Hispanic Black, non-Hispanic White, White Hispanic) using data from the American Association for Cancer Research Project GENIE (10.2) and University of Texas Southwestern, the latter enriched in Hispanic patients. All statistical tests were 2-sided. Of 1752 EOCRC patients, non-Hispanic Black patients had higher rates of KRAS mutations (60.9%; P = .001, q = 0.015), and non-Hispanic White and non-Hispanic Black patients had higher rates of APC mutations (77.1% and 76.6% among non-Hispanic White and non-Hispanic Black patients, respectively; P = .001, q = 0.015) via the Fisher exact test with Benjamini-Hochberg correction. Using R packages DESeq2 and clusterProfiler, we found that White Hispanic patients had increased expression of genes involved in oxidative phosphorylation (P < .001, q = 0.025). Genomic profiling has the potential to identify novel diagnostics and influence individualized treatment options to address the currently limited prognosis of EOCRC.
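The P and q values above come from a 2-sided Fisher exact test followed by Benjamini-Hochberg correction. The sketch below is a standard-library illustration of those two steps, not the authors' code. The function names and example inputs are hypothetical, and the two-sided rule used here (summing all tables no more probable than the observed one) is one common convention.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum every table at least as extreme (no more probable) than the observed one.
    total = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-7)
    )
    return min(1.0, total)

def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted q values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest P value down
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values

# Toy inputs (hypothetical): mutated/wild-type counts in two groups, then four P values.
p_example = fisher_exact_two_sided(3, 0, 0, 3)
q_example = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

The q value for each gene is the smallest false discovery rate at which that gene would still be called significant, which is why q is never smaller than the corresponding P.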
Subjects
Colorectal Neoplasms , Ethnicity , Black People , Colorectal Neoplasms/genetics , Ethnicity/genetics , Genomics , Hispanic or Latino/genetics , Humans , United States/epidemiology
ABSTRACT
In patients with invasive breast cancer, fluorescence in situ hybridization (FISH) testing for HER2 typically demonstrates the clear presence or lack of ERBB2 (HER2) amplification (i.e., groups 1 or 5). However, a small subset of patients can present with unusual HER2 FISH patterns (groups 2-4), resulting in diagnostic confusion. To provide clarity, the 2018 CAP/ASCO HER2 testing guideline recommends additional testing using HER2 immunohistochemistry (IHC) for determining the final HER2 status. Despite this effort, the genomic correlates of unusual HER2 FISH groups remain poorly understood. Here, we used droplet digital PCR (ddPCR) and targeted next-generation sequencing (NGS) to characterize the genomic features of both usual and unusual HER2 FISH groups. In this study, 51 clinical samples were selected to represent FISH groups 1-5. Furthermore, group 1 was subdivided into two groups, with groups 1A and 1B corresponding to cases with HER2 signals/cell ≥6.0 and 4-6, respectively. Overall, our findings revealed a wide range of copy number alterations in HER2 across the different FISH groups. As expected, groups 1A and 5 showed the clear presence and lack of HER2 copy number gain, respectively, as measured by ddPCR and NGS. In contrast, group 1B and other uncommon FISH groups (groups 2-4) were characterized by a broader range of HER2 copy levels with only a few select cases showing high-level gain. Notably, these cases with increased HER2 copy levels also showed HER2 overexpression by IHC, thus highlighting the correlation between HER2 copy number and HER2 protein expression. Given the concordance between the genomic and protein results, our findings suggest that HER2 IHC may inform HER2 copy number status in patients with unusual FISH patterns. Hence, our results support the current recommendation for using IHC to resolve HER2 status in FISH groups 2-4.
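The abstract refers to FISH groups 1-5 and to the study's own 1A/1B split without restating the cut-offs. The sketch below assumes the dual-probe thresholds of the 2018 CAP/ASCO guideline as commonly summarized: a HER2/CEP17 ratio cut at 2.0 and average HER2 signals per cell cut at 4.0 and 6.0. The 1A/1B split follows the abstract. The function name and parameter names are hypothetical, and the guideline itself should be consulted before relying on these boundaries.

```python
def her2_fish_group(her2_cep17_ratio, her2_signals_per_cell):
    """Assign a dual-probe HER2 FISH result to group 1A, 1B, 2, 3, 4, or 5.

    Thresholds are assumed from the 2018 CAP/ASCO guideline; group 1 is
    split at 6.0 signals/cell as described in the abstract.
    """
    if her2_cep17_ratio >= 2.0:
        if her2_signals_per_cell >= 6.0:
            return "1A"  # ratio >= 2.0 with high copy number
        if her2_signals_per_cell >= 4.0:
            return "1B"  # ratio >= 2.0 with 4-6 signals/cell
        return "2"       # ratio >= 2.0 but < 4.0 signals/cell
    if her2_signals_per_cell >= 6.0:
        return "3"       # ratio < 2.0 but >= 6.0 signals/cell
    if her2_signals_per_cell >= 4.0:
        return "4"       # ratio < 2.0 with 4-6 signals/cell
    return "5"           # ratio < 2.0 and < 4.0 signals/cell

# Toy cases (hypothetical values), one per group.
examples = [(2.5, 8.0), (2.2, 4.5), (2.1, 3.0), (1.5, 6.5), (1.2, 4.8), (1.1, 2.0)]
groups = [her2_fish_group(ratio, signals) for ratio, signals in examples]
```

Groups 2-4 are the unusual patterns for which the guideline recommends IHC to settle the final HER2 status.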
Subjects
Biomarkers, Tumor/analysis , Breast Neoplasms/genetics , In Situ Hybridization, Fluorescence/methods , Receptor, ErbB-2/genetics , Adult , Aged , Aged, 80 and over , Biomarkers, Tumor/genetics , DNA Copy Number Variations , Female , High-Throughput Nucleotide Sequencing/methods , Humans , Immunohistochemistry/methods , Middle Aged , Polymerase Chain Reaction/methods , Receptor, ErbB-2/analysis , Sequence Analysis, DNA/methods
ABSTRACT
In vitro cancer cultures, including three-dimensional organoids, typically contain exclusively neoplastic epithelium but require artificial reconstitution to recapitulate the tumor microenvironment (TME). The co-culture of primary tumor epithelia with endogenous, syngeneic tumor-infiltrating lymphocytes (TILs) as a cohesive unit has been particularly elusive. Here, an air-liquid interface (ALI) method propagated patient-derived organoids (PDOs) from >100 human biopsies or mouse tumors in syngeneic immunocompetent hosts as tumor epithelia with native embedded immune cells (T, B, NK, macrophages). Robust droplet-based, single-cell simultaneous determination of gene expression and immune repertoire indicated that PDO TILs accurately preserved the original tumor T cell receptor (TCR) spectrum. Crucially, human and murine PDOs successfully modeled immune checkpoint blockade (ICB) with anti-PD-1- and/or anti-PD-L1 expanding and activating tumor antigen-specific TILs and eliciting tumor cytotoxicity. Organoid-based propagation of primary tumor epithelium en bloc with endogenous immune stroma should enable immuno-oncology investigations within the TME and facilitate personalized immunotherapy testing.
Subjects
Models, Immunological , Neoplasms, Experimental/immunology , Organoids/immunology , Receptors, Antigen, T-Cell/immunology , Tumor Microenvironment/immunology , Animals , B7-H1 Antigen/immunology , Coculture Techniques , Female , Humans , Immunotherapy , Male , Mice , Mice, Inbred BALB C , Neoplasm Proteins/immunology , Neoplasms, Experimental/pathology , Neoplasms, Experimental/therapy , Organoids/pathology
ABSTRACT
BACKGROUND: Medulloblastoma is associated with rare hereditary cancer predisposition syndromes; however, consensus medulloblastoma predisposition genes have not been defined and screening guidelines for genetic counselling and testing for paediatric patients are not available. We aimed to assess and define these genes to provide evidence for future screening guidelines. METHODS: In this international, multicentre study, we analysed patients with medulloblastoma from retrospective cohorts (International Cancer Genome Consortium [ICGC] PedBrain, Medulloblastoma Advanced Genomics International Consortium [MAGIC], and the CEFALO series) and from prospective cohorts from four clinical studies (SJMB03, SJMB12, SJYC07, and I-HIT-MED). Whole-genome sequences and exome sequences from blood and tumour samples were analysed for rare damaging germline mutations in cancer predisposition genes. DNA methylation profiling was done to determine consensus molecular subgroups: WNT (MBWNT), SHH (MBSHH), group 3 (MBGroup3), and group 4 (MBGroup4). Medulloblastoma predisposition genes were predicted on the basis of rare variant burden tests against controls without a cancer diagnosis from the Exome Aggregation Consortium (ExAC). Previously defined somatic mutational signatures were used to further classify medulloblastoma genomes into two groups, a clock-like group (signatures 1 and 5) and a homologous recombination repair deficiency-like group (signatures 3 and 8), and chromothripsis was investigated using previously established criteria. Progression-free survival and overall survival were modelled for patients with a genetic predisposition to medulloblastoma. FINDINGS: We included a total of 1022 patients with medulloblastoma from the retrospective cohorts (n=673) and the four prospective studies (n=349), from whom blood samples (n=1022) and tumour samples (n=800) were analysed for germline mutations in 110 cancer predisposition genes. 
In our rare variant burden analysis, we compared these against 53,105 sequenced controls from ExAC and identified APC, BRCA2, PALB2, PTCH1, SUFU, and TP53 as consensus medulloblastoma predisposition genes. We estimated that germline mutations accounted for 6% of medulloblastoma diagnoses in the retrospective cohort. The prevalence of genetic predispositions differed between molecular subgroups in the retrospective cohort and was highest for patients in the MBSHH subgroup (20% in the retrospective cohort). These estimates were replicated in the prospective clinical cohort (germline mutations accounted for 5% of medulloblastoma diagnoses, with the highest prevalence [14%] in the MBSHH subgroup). Patients with germline APC mutations developed MBWNT and accounted for most (five [71%] of seven) cases of MBWNT that had no somatic CTNNB1 exon 3 mutations. Patients with germline mutations in SUFU and PTCH1 mostly developed infant MBSHH. Germline TP53 mutations presented only in childhood patients in the MBSHH subgroup and explained more than half (eight [57%] of 14) of all chromothripsis events in this subgroup. Germline mutations in PALB2 and BRCA2 were observed across the MBSHH, MBGroup3, and MBGroup4 molecular subgroups and were associated with mutational signatures typical of homologous recombination repair deficiency. In patients with a genetic predisposition to medulloblastoma, 5-year progression-free survival was 52% (95% CI 40-69) and 5-year overall survival was 65% (95% CI 52-81); these survival estimates differed significantly across patients with germline mutations in different medulloblastoma predisposition genes. INTERPRETATION: Genetic counselling and testing should be used as a standard-of-care procedure in patients with MBWNT and MBSHH because these patients have the highest prevalence of damaging germline mutations in known cancer predisposition genes.
We propose criteria for routine genetic screening for patients with medulloblastoma based on clinical and molecular tumour characteristics. FUNDING: German Cancer Aid; German Federal Ministry of Education and Research; German Childhood Cancer Foundation (Deutsche Kinderkrebsstiftung); European Research Council; National Institutes of Health; Canadian Institutes for Health Research; German Cancer Research Center; St Jude Comprehensive Cancer Center; American Lebanese Syrian Associated Charities; Swiss National Science Foundation; European Molecular Biology Organization; Cancer Research UK; Hertie Foundation; Alexander and Margaret Stewart Trust; V Foundation for Cancer Research; Sontag Foundation; Musicians Against Childhood Cancer; BC Cancer Foundation; Swedish Council for Health, Working Life and Welfare; Swedish Research Council; Swedish Cancer Society; the Swedish Radiation Protection Authority; Danish Strategic Research Council; Swiss Federal Office of Public Health; Swiss Research Foundation on Mobile Communication; Masaryk University; Ministry of Health of the Czech Republic; Research Council of Norway; Genome Canada; Genome BC; Terry Fox Research Institute; Ontario Institute for Cancer Research; Pediatric Oncology Group of Ontario; The Family of Kathleen Lorette and the Clark H Smith Brain Tumour Centre; Montreal Children's Hospital Foundation; The Hospital for Sick Children: Sonia and Arthur Labatt Brain Tumour Research Centre, Chief of Research Fund, Cancer Genetics Program, Garron Family Cancer Centre, MDT's Garron Family Endowment; BC Childhood Cancer Parents Association; Cure Search Foundation; Pediatric Brain Tumor Foundation; Brainchild; and the Government of Ontario.
Subjects
Biomarkers, Tumor/genetics , Cerebellar Neoplasms/genetics , DNA Methylation , Genetic Testing/methods , Germ-Line Mutation , Medulloblastoma/genetics , Models, Genetic , Adolescent , Adult , Cerebellar Neoplasms/mortality , Cerebellar Neoplasms/pathology , Cerebellar Neoplasms/therapy , Child , Child, Preschool , DNA Mutational Analysis , Female , Gene Expression Profiling , Genetic Predisposition to Disease , Heredity , Humans , Infant , Male , Medulloblastoma/mortality , Medulloblastoma/pathology , Medulloblastoma/therapy , Pedigree , Phenotype , Predictive Value of Tests , Progression-Free Survival , Prospective Studies , Reproducibility of Results , Retrospective Studies , Risk Factors , Transcriptome , Exome Sequencing , Young Adult
ABSTRACT
Next-generation deep sequencing of gene panels is being adopted as a diagnostic test to identify actionable mutations in cancer patient samples. However, clinical samples, such as formalin-fixed, paraffin-embedded specimens, frequently provide low quantities of degraded, poor quality DNA. To overcome these issues, many sequencing assays rely on extensive PCR amplification, leading to an accumulation of bias and artifacts. Thus, there is a need for a targeted sequencing assay that performs well with DNA of low quality and quantity without relying on extensive PCR amplification. We evaluate the performance of a targeted sequencing assay based on Oligonucleotide Selective Sequencing, which permits the enrichment of genes and regions of interest and the identification of sequence variants from low amounts of damaged DNA. This assay utilizes a repair process adapted to clinical FFPE samples, followed by adaptor ligation to single-stranded DNA and a primer-based capture technique. Our approach generates sequence libraries of high fidelity with reduced reliance on extensive PCR amplification; this facilitates the accurate assessment of copy number alterations in addition to delivering accurate single nucleotide variant and insertion/deletion detection. We apply this method to capture and sequence the exons of a panel of 130 cancer-related genes, from which we obtain high read coverage uniformity across the targeted regions at starting input DNA amounts as low as 10 ng per sample. We demonstrate the performance using a series of reference DNA samples, and by identifying sequence variants in DNA from matched clinical samples originating from different tissue types.
ABSTRACT
Although a variety of genetic alterations have been found across cancer types, the identification and functional characterization of candidate driver genetic lesions in an individual patient and their translation into clinically actionable strategies remain major hurdles. Here, we use whole genome sequencing of a prostate cancer tumor, computational analyses, and experimental validation to identify and predict novel oncogenic activity arising from a point mutation in the phosphatase and tensin homolog (PTEN) tumor suppressor protein. We demonstrate that this mutation (p.A126G) produces an enzymatic gain-of-function in PTEN, shifting its function from a phosphoinositide (PI) 3-phosphatase to a phosphoinositide (PI) 5-phosphatase. Using cellular assays, we demonstrate that this gain-of-function activity shifts cellular phosphoinositide levels, hyperactivates the PI3K/Akt cell proliferation pathway, and exhibits increased cell migration beyond canonical PTEN loss-of-function mutants. These findings suggest that mutationally modified PTEN can actively contribute to well-defined hallmarks of cancer. Lastly, we demonstrate that these effects can be substantially mitigated through chemical PI3K inhibitors. These results demonstrate a new dysfunction paradigm for PTEN cancer biology and suggest a potential framework for the translation of genomic data into actionable clinical strategies for targeted patient therapy.
Subjects
Genes, Tumor Suppressor , Neoplasm Proteins/genetics , PTEN Phosphohydrolase/genetics , Phosphoric Monoester Hydrolases/genetics , Prostatic Neoplasms/genetics , Analysis of Variance , Animals , Base Sequence , CHO Cells , Cell Movement/physiology , Cell Proliferation/physiology , Computational Biology/methods , Cricetinae , Cricetulus , Humans , Immunoblotting , Male , Microscopy, Fluorescence , Molecular Sequence Annotation , Molecular Sequence Data , Mutagenesis, Site-Directed , Patch-Clamp Techniques , Phosphatidylinositols/metabolism , Phosphoric Monoester Hydrolases/metabolism , Sequence Analysis, DNA
ABSTRACT
Triple-negative breast cancer (TNBC) is characterized by the absence of expression of estrogen receptor, progesterone receptor, and HER-2. Thirty percent of patients recur after first-line treatment, and metastatic TNBC (mTNBC) has a poor prognosis with median survival of one year. Here, we present initial analyses of whole genome and transcriptome sequencing data from 14 prospective mTNBC. We have cataloged the collection of somatic genomic alterations in these advanced tumors, particularly those that may inform targeted therapies. Genes mutated in multiple tumors included TP53, LRP1B, HERC1, CDH5, RB1, and NF1. Notable genes involved in focal structural events were CTNNA1, PTEN, FBXW7, BRCA2, WT1, FGFR1, KRAS, HRAS, ARAF, BRAF, and PGCP. Homozygous deletion of CTNNA1 was detected in 2 of 6 African Americans. RNA sequencing revealed consistent overexpression of the FOXM1 gene when tumor gene expression was compared with nonmalignant breast samples. Using an outlier analysis of gene expression comparing one cancer with all the others, we detected expression patterns unique to each patient's tumor. Integrative DNA/RNA analysis provided evidence for deregulation of mutated genes, including the monoallelic expression of TP53 mutations. Finally, molecular alterations in several cancers supported targeted therapeutic intervention on clinical trials with known inhibitors, particularly for alterations in the RAS/RAF/MEK/ERK and PI3K/AKT/mTOR pathways. In conclusion, whole genome and transcriptome profiling of mTNBC have provided insights into somatic events occurring in this difficult to treat cancer. These genomic data have guided patients to investigational treatment trials and provide hypotheses for future trials in this irremediable cancer.
Subjects
Breast Neoplasms/genetics , Transcriptome , Adult , Breast Neoplasms/drug therapy , Breast Neoplasms/metabolism , Breast Neoplasms/pathology , Chromosomes, Human, Pair 7 , DNA Mutational Analysis , Female , Forkhead Box Protein M1 , Forkhead Transcription Factors/genetics , Forkhead Transcription Factors/metabolism , Gene Expression , Genes, Neoplasm , Genome, Human , Genome-Wide Association Study , High-Throughput Nucleotide Sequencing , Humans , Middle Aged , Molecular Targeted Therapy , Neoplasm Metastasis , Prospective Studies , Receptor, ErbB-2/metabolism , Receptors, Estrogen/metabolism , Receptors, Progesterone/metabolism , Sequence Analysis, RNA , Sequence Deletion , Signal Transduction , Treatment Outcome , Tumor Suppressor Protein p53/genetics , alpha Catenin/genetics
ABSTRACT
Clinical management of human cancer is dependent on the accurate monitoring of residual and recurrent tumors. The evaluation of patient-specific translocations in leukemias and lymphomas has revolutionized diagnostics for these diseases. We have developed a method, called personalized analysis of rearranged ends (PARE), which can identify translocations in solid tumors. Analysis of four colorectal and two breast cancers with massively parallel sequencing revealed an average of nine rearranged sequences (range, 4 to 15) per tumor. Polymerase chain reaction with primers spanning the breakpoints was able to detect mutant DNA molecules present at levels lower than 0.001% and readily identified mutated circulating DNA in patient plasma samples. This method provides an exquisitely sensitive and broadly applicable approach for the development of personalized biomarkers to enhance the clinical management of cancer patients.
Subjects
Biomarkers, Tumor/genetics , Gene Rearrangement , Neoplasms/genetics , Polymerase Chain Reaction/methods , Precision Medicine , Sequence Analysis, DNA/methods , Translocation, Genetic , Base Sequence , Biomarkers, Tumor/blood , DNA/blood , DNA/genetics , Humans , Molecular Sequence Data , Precision Medicine/instrumentation , Precision Medicine/methods
ABSTRACT
Due to growing throughput and shrinking cost, massively parallel sequencing is rapidly becoming an attractive alternative to microarrays for the genome-wide study of gene expression and copy number alterations in primary tumors. The sequencing of transcripts (RNA-Seq) should offer several advantages over microarray-based methods, including the ability to detect somatic mutations and accurately measure allele-specific expression. To investigate these advantages we have applied a novel, strand-specific RNA-Seq method to tumors and matched normal tissue from three patients with oral squamous cell carcinomas. Additionally, to better understand the genomic determinants of the gene expression changes observed, we have sequenced the tumor and normal genomes of one of these patients. We demonstrate here that our RNA-Seq method accurately measures allelic imbalance and that measurement on the genome-wide scale yields novel insights into cancer etiology. As expected, the set of genes differentially expressed in the tumors is enriched for cell adhesion and differentiation functions, but, unexpectedly, the set of allelically imbalanced genes is also enriched for these same cancer-related functions. By comparing the transcriptomic perturbations observed in one patient to his underlying normal and tumor genomes, we find that allelic imbalance in the tumor is associated with copy number mutations and that copy number mutations are, in turn, strongly associated with changes in transcript abundance. These results support a model in which allele-specific deletions and duplications drive allele-specific changes in gene expression in the developing tumor.
Subjects
Carcinoma, Squamous Cell/genetics , Gene Expression Profiling , Mouth Neoplasms/genetics , Sequence Analysis, DNA/methods , Allelic Imbalance , Cluster Analysis , Gene Deletion , Gene Dosage , Gene Duplication , Gene Expression Regulation, Neoplastic , Genome-Wide Association Study/methods , Humans , Mutation , Oligonucleotide Array Sequence Analysis , Polymorphism, Single Nucleotide , Reverse Transcriptase Polymerase Chain Reaction
ABSTRACT
We describe the genome sequencing of an anonymous individual of African origin using a novel ligation-based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9%, allowing us to accurately call SNPs with as few as two reads per allele. We collected several billion mate-paired reads yielding approximately 18x haploid coverage of aligned sequence and close to 300x clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read, and 99.65% is spanned by at least one uniquely placed mate-paired clone. We identify over 3.8 million SNPs, 19% of which are novel. Mate-paired data are used to physically resolve haplotype phases of nearly two-thirds of the genotypes obtained and produce phased segments of up to 215 kb. We detect 226,529 intra-read indels, 5590 indels between mate-paired reads, 91 inversions, and four gene fusions. We use a novel approach for detecting indels between mate-paired reads that are smaller than the standard deviation of the insert size of the library and discover deletions in common with those detected with our intra-read approach. Dozens of mutations previously described in OMIM and hundreds of nonsynonymous single-nucleotide and structural variants in genes previously implicated in disease are identified in this individual. There is more genetic variation in the human genome still to be uncovered, and we provide guidance for future surveys in populations and cancer biopsies.
Subjects
Base Pairing , Computational Biology/methods , Genetic Variation , Genome, Human , Ligases , Sequence Analysis, DNA/methods , Africa , Base Sequence , Genomics , Genotype , Heterozygote , Homozygote , Humans , Polymorphism, Single Nucleotide , Reference Standards
ABSTRACT
A class of genes known as drug metabolism enzymes (DMEs) is responsible for the metabolism and transport of drugs and other xenobiotics. Variation in DME genes most likely accounts for a proportion of the variability in drug response in humans, and may contribute to complex diseases such as cancer (Nebert DW, Dieter MZ. Pharmacology 2000;61:124-135). To date, assessing the extent of this variation has proven difficult, especially because of sequence paralogy issues that cause difficulty when attempting to genotype polymorphisms in very closely related gene families (Murphy MP. Pharmacogenomics 2000;1:115-123; Ingelman-Sundberg M. Drug Metab Rev 1999;31:449-459). We have developed and genotyped a panel of N=2,325 individual TaqMan genotyping assays for polymorphisms in >200 DME genes; many of the variants in the panel are single nucleotide polymorphisms (SNPs) that are of known or putative function (e.g., missense, nonsense or frameshift). Using these assays, we have examined genetic variation among several groups of populations, including: 1) the two SNP500 Cancer population panels (http://snp500cancer.nci.nih.gov; last accessed: 11 December 2007); and 2) the panel used in the International HapMap Project (www.hapmap.org; last accessed: 11 December 2007). We have developed a comprehensive validation strategy to ensure reproducibility and accuracy of the assays and estimated minor allele frequencies. Here, we present the results of these analyses, which strongly suggest that this panel of DME assays is of extremely high quality and produces robust, accurate, and reproducible results.
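The abstract mentions estimated minor allele frequencies without giving the calculation. As a minimal sketch, assuming a biallelic SNP with genotype calls already counted, the frequency follows directly from allele counting. The function name, parameter names, and counts below are hypothetical.

```python
def minor_allele_frequency(n_hom_ref, n_het, n_hom_alt):
    """Minor allele frequency of a biallelic SNP from genotype counts."""
    total_alleles = 2 * (n_hom_ref + n_het + n_hom_alt)
    if total_alleles == 0:
        raise ValueError("at least one genotyped individual is required")
    # Each heterozygote carries one alternate allele, each alt homozygote two.
    alt_frequency = (2 * n_hom_alt + n_het) / total_alleles
    # The minor allele is whichever of the two alleles is rarer.
    return min(alt_frequency, 1 - alt_frequency)

# Toy genotype counts (hypothetical) for one assay in one population panel.
maf = minor_allele_frequency(n_hom_ref=60, n_het=30, n_hom_alt=10)
maf_flipped = minor_allele_frequency(n_hom_ref=10, n_het=30, n_hom_alt=60)
```

Taking the minimum makes the result independent of which allele was labeled as the reference, so the same SNP gives the same value across panels.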
Subjects
Enzymes/genetics , Pharmaceutical Preparations/metabolism , Polymorphism, Single Nucleotide , Enzymes/metabolism , Genotype , Humans , Mutation
ABSTRACT
We performed a genome-wide association study of 19,779 nonsynonymous SNPs in 735 individuals with Crohn disease and 368 controls. A total of 7,159 of these SNPs were informative. We followed up on all 72 SNPs with P