1.
JAMA Oncol ; 10(5): 652-657, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38512297

ABSTRACT

Importance: Racially minoritized and socioeconomically disadvantaged populations are currently underrepresented in clinical trials. Data-driven, quantitative analyses and strategies are required to help address this inequity. Objective: To systematically analyze the geographical distribution of self-identified racial and socioeconomic demographics within commuting distance to cancer clinical trial centers and other hospitals in the US. Design, Setting, and Participants: This longitudinal quantitative study used data from the US Census 2020 Decennial and American Community Survey (which collects data from all US residents), OpenStreetMap, National Cancer Institute-designated Cancer Centers list, Nature Index of Cancer Research Health Institutions, National Trial registry, and National Homeland Infrastructure Foundation-Level Data. Statistical analyses were performed on data collected between 2006 and 2020. Main Outcomes and Measures: Population distributions of socioeconomic deprivation indices and self-identified race within 30-, 60-, and 120-minute 1-way driving commute times from US cancer trial sites. Map overlay of high deprivation index and high diversity areas with existing hospitals, existing major cancer trial centers, and commuting distance to the closest cancer trial center. Results: The 78 major US cancer trial centers that are involved in 94% of all US cancer trials and included in this study were found to be located in areas with socioeconomically more affluent populations with higher proportions of self-identified White individuals (+10.1% unpaired mean difference; 95% CI, +6.8% to +13.7%) compared with the national average.
The top 10th percentile of all US hospitals has catchment populations with a range of absolute sum difference from 2.4% to 35% from one-third each of Asian/multiracial/other (Asian alone, American Indian or Alaska Native alone, Native Hawaiian or Other Pacific Islander alone, some other race alone, population of 2 or more races), Black or African American, and White populations. Currently available data are sufficient to identify diverse census tracts within preset commuting times (30, 60, or 120 minutes) from all hospitals in the US (N = 7623). Maps are presented for each US city above 500 000 inhabitants, which display all prospective hospitals and major cancer trial sites within commutable distance to racially diverse and socioeconomically disadvantaged populations. Conclusion and Relevance: This study identified biases in the sociodemographics of populations living within commuting distance to US-based cancer trial sites and enables the determination of more equitably commutable prospective satellite hospital sites that could be mobilized for enhanced racial and socioeconomic representation in clinical trials. The maps generated in this work may inform the design of future clinical trials or investigations in enrollment and retention strategies for clinical trials; however, other recruitment barriers still need to be addressed to ensure racial and socioeconomic demographics within the geographical vicinity of a clinical site can translate to equitable trial participant representation.
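
The "absolute sum difference from one-third each" reported above can be illustrated with a small calculation. The abstract does not give the formula, so the following is only a minimal sketch of one plausible reading: the sum of the absolute deviations of the three group shares from one-third. The function name and the example counts are hypothetical and are not taken from the study.

```python
def absolute_sum_difference(asian_multiracial_other, black, white):
    """Sum of absolute deviations of three group shares from one-third.

    Assumed reading of the metric in the abstract; the authors' exact
    definition (e.g. any normalization) is not stated there.
    """
    counts = (asian_multiracial_other, black, white)
    total = sum(counts)
    if total <= 0:
        raise ValueError("catchment population must be positive")
    target = 1.0 / 3.0
    # Convert each count to a share of the catchment, then compare to 1/3.
    return sum(abs(count / total - target) for count in counts)


# Hypothetical catchments: one perfectly balanced, one with a single group.
balanced = absolute_sum_difference(1000, 1000, 1000)
skewed = absolute_sum_difference(0, 0, 3000)
print(f"balanced: {balanced:.3f}, single-group: {skewed:.3f}")
```

Under this reading, a value of 0 corresponds to three equal shares, and larger values indicate a catchment population that departs further from that balance.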


Subjects
Clinical Trials as Topic , Health Services Accessibility , Neoplasms , Travel , Humans , United States , Travel/statistics & numerical data , Health Services Accessibility/statistics & numerical data , Clinical Trials as Topic/statistics & numerical data , Neoplasms/therapy , Neoplasms/ethnology , Socioeconomic Factors , Time Factors , Cancer Care Facilities/statistics & numerical data , Longitudinal Studies
2.
Genes Cancer ; 15: 1-14, 2024.
Article in English | MEDLINE | ID: mdl-38323119

ABSTRACT

Hepatocellular carcinoma (HCC) is the third leading cause of death from cancer worldwide but is often diagnosed at an advanced incurable stage. Yet, despite the urgent need for blood-based biomarkers for early detection, few studies capture ongoing biology to identify risk-stratifying biomarkers. We address this gap using the TGF-β pathway because of its biological role in liver disease and cancer, established through rigorous animal models and human studies. Using machine learning methods with blood levels of 108 proteomic markers in the TGF-β family, we found a pattern that differentiates HCC from non-HCC in a cohort of 216 patients with cirrhosis, which we refer to as TGF-β based Protein Markers for Early Detection of HCC (TPEARLE) comprising 31 markers. Notably, 20 of the patients with cirrhosis alone presented an HCC-like pattern, suggesting that they may be a group with as yet undetected HCC or at high risk for developing HCC. In addition, we found two other biologically relevant markers, Myostatin and Pyruvate Kinase M2 (PKM2), which were significantly associated with HCC. We tested these for risk stratification of HCC in multivariable models adjusted for demographic and clinical variables, as well as batch and site. These markers reflect ongoing biology in the liver. They potentially indicate the presence of HCC early in its evolution and before it is manifest as a detectable lesion, thereby providing a set of markers that may be able to stratify risk for HCC.

4.
Cancer Res ; 83(1): 49-58, 2023 01 04.
Article in English | MEDLINE | ID: mdl-36351074

ABSTRACT

Genetic ancestry-oriented cancer research requires the ability to perform accurate and robust genetic ancestry inference from existing cancer-derived data, including whole-exome sequencing, transcriptome sequencing, and targeted gene panels, very often in the absence of matching cancer-free genomic data. Here we examined the feasibility and accuracy of computational inference of genetic ancestry relying exclusively on cancer-derived data. A data synthesis framework was developed to optimize and assess the performance of the ancestry inference for any given input cancer-derived molecular profile. In its core procedure, the ancestral background of the profiled patient is replaced with one of any number of individuals with known ancestry. The data synthesis framework is applicable to multiple profiling platforms, making it possible to assess the performance of inference specifically for a given molecular profile and separately for each continental-level ancestry; this ability extends to all ancestries, including those without statistically sufficient representation in the existing cancer data. The inference procedure was demonstrated to be accurate and robust in a wide range of sequencing depths. Testing of the approach in four representative cancer types and across three molecular profiling modalities showed that continental-level ancestry of patients can be inferred with high accuracy, as quantified by its agreement with the gold standard of deriving ancestry from matching cancer-free molecular data. This study demonstrates that vast amounts of existing cancer-derived molecular data are potentially amenable to ancestry-oriented studies of the disease without requiring matching cancer-free genomes or patient self-reported ancestry. 
SIGNIFICANCE: The development of a computational approach that enables accurate and robust ancestry inference from cancer-derived molecular profiles without matching cancer-free data provides a valuable methodology for genetic ancestry-oriented cancer research.


Subjects
Neoplasms , Transcriptome , Humans , Genome, Human , Genomics , Gene Expression Profiling , Polymorphism, Single Nucleotide , Neoplasms/genetics
5.
BMC Cancer ; 22(1): 1320, 2022 Dec 16.
Article in English | MEDLINE | ID: mdl-36526993

ABSTRACT

BACKGROUND: Research infrastructures such as biorepositories are essential to facilitate genomics and its growing applications in health research and translational medicine in Africa. Using a cervical cancer cohort, this study describes the establishment of a biorepository consisting of biospecimens and matched phenotype data for use in genomic association analysis and pharmacogenomics research. METHOD: Women aged > 18 years with a recent histologically confirmed cervical cancer diagnosis were recruited. A workflow pipeline was developed to collect, store, and analyse biospecimens comprising donor recruitment and informed consent, followed by data and biospecimen collection, nucleic acid extraction, storage of genomic DNA, genetic characterization, data integration, data analysis and data interpretation. The biospecimen and data storage infrastructure included shared -20 °C to -80 °C freezers, lockable cupboards, secured access-controlled laptop, password protected online data storage on OneDrive software. The biospecimen or data storage, transfer and sharing were compliant with the local and international biospecimen and data protection laws and policies, to ensure donor privacy, trust, and benefits for the wider community. RESULTS: This initial establishment of the biorepository recruited 410 women with cervical cancer. The mean (± SD) age of the donors was 52 (± 12) years, comprising stage I (15%), stage II (44%), stage III (47%) and stage IV (6%) disease. The biorepository includes whole blood and corresponding genomic DNA from 311 (75.9%) donors, and tumour biospecimens and corresponding tumour DNA from 258 (62.9%) donors. Datasets included information on sociodemographic characteristics, lifestyle, family history, clinical information, and HPV genotype. Treatment response was followed up for 12 months, namely, treatment-induced toxicities, survival vs. mortality, and disease status, that is disease-free survival, progression or relapse, 12 months after therapy commencement. CONCLUSION: The current work highlights a framework for developing a cancer genomics cohort-based biorepository on a limited budget. Such a resource plays a central role in advancing genomics research towards the implementation of personalised management of cancer.


Subjects
Biomedical Research , Uterine Cervical Neoplasms , Humans , Female , Uterine Cervical Neoplasms/drug therapy , Uterine Cervical Neoplasms/genetics , Pharmacogenetics , Zimbabwe , Neoplasm Recurrence, Local , Biological Specimen Banks , Specimen Handling
6.
Genes Cancer ; 13: 72-87, 2022.
Article in English | MEDLINE | ID: mdl-36533190

ABSTRACT

Hepatocellular carcinoma (HCC) is the most common primary liver cancer whose incidence continues to rise in many parts of the world due to a concomitant rise in many associated risk factors, such as alcohol use and obesity. Although early-stage HCC can be potentially curable through liver resection, liver-directed therapies, or transplantation, patients usually present with intermediate to advanced disease, which continues to be associated with a poor prognosis. This is because HCC is a cancer with significant complexities, including substantial clinical, histopathologic, and genomic heterogeneity. However, the scientific community has made a major effort to better characterize HCC in those aspects via utilizing tissue sampling and histological classification, whole genome sequencing, and developing viable animal models. These efforts ultimately aim to develop clinically relevant biomarkers and discover molecular targets for new therapies. For example, until recently, there was only one approved systemic therapy for advanced or metastatic HCC in the form of sorafenib. Through these efforts, several additional targeted therapies have gained approval in the United States, although much progress remains to be desired. This review will focus on the link between characterizing the pathogenesis of HCC with current and future HCC management.

8.
STAR Protoc ; 3(3): 101586, 2022 09 16.
Article in English | MEDLINE | ID: mdl-35942349

ABSTRACT

Differential mRNA expression between ancestry groups can be explained by both genetic and environmental factors. We outline a computational workflow to determine the extent to which germline genetic variation explains cancer-specific molecular differences across ancestry groups. Using multi-omics datasets from The Cancer Genome Atlas (TCGA), we enumerate ancestry-informative markers colocalized with cancer-type-specific expression quantitative trait loci (e-QTLs) at ancestry-associated genes. This approach is generalizable to other settings with paired germline genotyping and mRNA expression data for a multi-ethnic cohort. For complete details on the use and execution of this protocol, please refer to Carrot-Zhang et al. (2020), Robertson et al. (2021), and Sayaman et al. (2021).


Subjects
Neoplasms , Quantitative Trait Loci , Gene Expression , Germ Cells , Humans , Neoplasms/genetics , Quantitative Trait Loci/genetics , RNA, Messenger
9.
F1000Res ; 11: 493, 2022.
Article in English | MEDLINE | ID: mdl-36761837

ABSTRACT

Synthetic lethal interactions (SLIs), genetic interactions in which the simultaneous inactivation of two genes leads to a lethal phenotype, are promising targets for therapeutic intervention in cancer, as exemplified by the recent success of PARP inhibitors in treating BRCA1/2-deficient tumors. We present SL-Cloud, a new component of the Institute for Systems Biology Cancer Gateway in the Cloud (ISB-CGC), that provides an integrated framework of cloud-hosted data resources and curated workflows to enable facile prediction of SLIs. This resource addresses two main challenges related to SLI inference: the need to wrangle and preprocess large multi-omic datasets and the availability of multiple comparable prediction approaches. SL-Cloud enables customizable computational inference of SLIs and testing of prediction approaches across multiple datasets. We anticipate that cancer researchers will find utility in this tool for discovery of SLIs to support further investigation into potential drug targets for anticancer therapies.


Subjects
Cloud Computing , Neoplasms , Humans , Neoplasms/genetics , Systems Biology , Multiomics
10.
STAR Protoc ; 2(2): 100483, 2021 06 18.
Article in English | MEDLINE | ID: mdl-33982016

ABSTRACT

Cellular and molecular aberrations contribute to the disparity of human cancer incidence and etiology between ancestry groups. Multiomics profiling in The Cancer Genome Atlas (TCGA) allows for querying of the molecular underpinnings of ancestry-specific discrepancies in human cancer. Here, we provide a protocol for integrative associative analysis of ancestry with molecular correlates, including somatic mutations, DNA methylation, mRNA transcription, miRNA transcription, and pathway activity, using TCGA data. This protocol can be generalized to analyze other cancer cohorts and human diseases. For complete details on the use and execution of this protocol, please refer to Carrot-Zhang et al. (2020).


Subjects
Genomics/methods , Models, Genetic , Neoplasms/genetics , DNA Methylation/genetics , Databases, Genetic , Female , Humans , Male , MicroRNAs/genetics , Transcription, Genetic/genetics
11.
Cancer Cell ; 37(5): 639-654.e6, 2020 05 11.
Article in English | MEDLINE | ID: mdl-32396860

ABSTRACT

We evaluated ancestry effects on mutation rates, DNA methylation, and mRNA and miRNA expression among 10,678 patients across 33 cancer types from The Cancer Genome Atlas. We demonstrated that cancer subtypes and ancestry-related technical artifacts are important confounders that have been insufficiently accounted for. Once accounted for, ancestry-associated differences spanned all molecular features and hundreds of genes. Biologically significant differences were usually tissue specific but not specific to cancer. However, admixture and pathway analyses suggested some of these differences are causally related to cancer. Specific findings included increased FBXW7 mutations in patients of African origin, decreased VHL and PBRM1 mutations in renal cancer patients of African origin, and decreased immune activity in bladder cancer patients of East Asian origin.


Subjects
DNA Methylation , Ethnicity/genetics , Genetic Predisposition to Disease , MicroRNAs/genetics , Mutation , Neoplasm Proteins/genetics , Neoplasms/genetics , DNA-Binding Proteins/genetics , F-Box-WD Repeat-Containing Protein 7/genetics , Gene Expression Regulation, Neoplastic , Genetics, Population , Genome, Human , Genomics , High-Throughput Nucleotide Sequencing , Humans , Neoplasms/ethnology , Neoplasms/pathology , Transcription Factors/genetics , Von Hippel-Lindau Tumor Suppressor Protein/genetics
12.
Proc Natl Acad Sci U S A ; 117(12): 6476-6483, 2020 03 24.
Article in English | MEDLINE | ID: mdl-32152114

ABSTRACT

We tested the hypothesis that underrepresented students in active-learning classrooms experience narrower achievement gaps than underrepresented students in traditional lecturing classrooms, averaged across all science, technology, engineering, and mathematics (STEM) fields and courses. We conducted a comprehensive search for both published and unpublished studies that compared the performance of underrepresented students to their overrepresented classmates in active-learning and traditional-lecturing treatments. This search resulted in data on student examination scores from 15 studies (9,238 total students) and data on student failure rates from 26 studies (44,606 total students). Bayesian regression analyses showed that on average, active learning reduced achievement gaps in examination scores by 33% and narrowed gaps in passing rates by 45%. The reported proportion of time that students spend on in-class activities was important, as only classes that implemented high-intensity active learning narrowed achievement gaps. Sensitivity analyses showed that the conclusions are robust to sampling bias and other issues. To explain the extensive variation in efficacy observed among studies, we propose the heads-and-hearts hypothesis, which holds that meaningful reductions in achievement gaps only occur when course designs combine deliberate practice with inclusive teaching. Our results support calls to replace traditional lecturing with evidence-based, active-learning course designs across the STEM disciplines and suggest that innovations in instructional strategies can increase equity in higher education.


Subjects
Achievement , Minority Groups/education , Problem-Based Learning , Educational Measurement , Engineering/education , Humans , Mathematics/education , Science/education , Students , Technology/education , United States , Universities
13.
Proc Natl Acad Sci U S A ; 116(12): 5819-5827, 2019 03 19.
Article in English | MEDLINE | ID: mdl-30833390

ABSTRACT

Preterm birth (PTB) complications are the leading cause of long-term morbidity and mortality in children. By using whole blood samples, we integrated whole-genome sequencing (WGS), RNA sequencing (RNA-seq), and DNA methylation data for 270 PTB and 521 control families. We analyzed this combined dataset to identify genomic variants associated with PTB and secondary analyses to identify variants associated with very early PTB (VEPTB) as well as other subcategories of disease that may contribute to PTB. We identified differentially expressed genes (DEGs) and methylated genomic loci and performed expression and methylation quantitative trait loci analyses to link genomic variants to these expression and methylation changes. We performed enrichment tests to identify overlaps between new and known PTB candidate gene systems. We identified 160 significant genomic variants associated with PTB-related phenotypes. The most significant variants, DEGs, and differentially methylated loci were associated with VEPTB. Integration of all data types identified a set of 72 candidate biomarker genes for VEPTB, encompassing genes and those previously associated with PTB. Notably, PTB-associated genes RAB31 and RBPJ were identified by all three data types (WGS, RNA-seq, and methylation). Pathways associated with VEPTB include EGFR and prolactin signaling pathways, inflammation- and immunity-related pathways, chemokine signaling, IFN-γ signaling, and Notch1 signaling. Progress in identifying molecular components of a complex disease is aided by integrated analyses of multiple molecular data types and clinical data. With these data, and by stratifying PTB by subphenotype, we have identified associations between VEPTB and the underlying biology.


Subjects
Genetic Predisposition to Disease/genetics , Premature Birth/genetics , DNA Methylation/genetics , Female , Genomics/methods , Humans , Infant, Newborn , Male , Phenotype , Polymorphism, Single Nucleotide/genetics , Signal Transduction/genetics , Whole Genome Sequencing/methods
14.
DNA Repair (Amst) ; 72: 1-9, 2018 12.
Article in English | MEDLINE | ID: mdl-30389308

ABSTRACT

Formaldehyde is a ubiquitous DNA damaging agent, with human exposures occurring from both exogenous and endogenous sources. Formaldehyde exposure can result in multiple types of DNA damage, including DNA-protein crosslinks and thus, is representative of other exposures that induce DNA-protein crosslinks such as cigarette smoke, automobile exhaust, wood smoke, metals, ionizing radiation, and certain chemotherapeutics. Our objective in this study was to identify the genes necessary to mitigate formaldehyde toxicity following chronic exposure in human cells. We used siRNAs that targeted 320 genes representing all major human DNA repair and damage response pathways, in order to assess cell proliferation following siRNA depletion and subsequent formaldehyde treatment. Three unrelated human cell lines frequently used in genotoxicity studies (SW480, U-2 OS and GM00639) were used to identify common pathways involved in mitigating formaldehyde sensitivity. Although there were gene-specific differences among the cell lines, four inter-related cellular pathways were determined to mitigate formaldehyde toxicity: homologous recombination, DNA double-strand break repair, ionizing radiation response and DNA replication. Additional insight into cell line-specific response patterns was obtained by using a combination of exome sequencing and Cancer Cell Line Encyclopedia genomic data. The results of this DNA damage repair pathway-focused siRNA screen for formaldehyde toxicity in human cells provide a foundation for detailed mechanistic analyses of pathway-specific involvement in the response to environmentally-induced DNA-protein crosslinks and, more broadly, genotoxicity studies using human and other mammalian cell lines.


Subjects
DNA Damage , DNA Repair/drug effects , DNA Repair/genetics , Formaldehyde/toxicity , RNA Interference , Cell Line , Cell Proliferation/drug effects , Cell Proliferation/genetics , Genomics , Humans
15.
Blood ; 132(7): e13-e23, 2018 08 16.
Article in English | MEDLINE | ID: mdl-29967128

ABSTRACT

The biological role of extracellular vesicles (EVs) in diffuse large B-cell lymphoma (DLBCL) initiation and progression remains largely unknown. We characterized EVs secreted by 5 DLBCL cell lines, a primary DLBCL tumor, and a normal control B-cell sample, optimized their purification, and analyzed their content. We found that DLBCLs secreted large quantities of CD63, Alix, TSG101, and CD81 EVs, which can be extracted using an ultracentrifugation-based method and traced by their cell of origin surface markers. We also showed that tumor-derived EVs can be exchanged between lymphoma cells, normal tonsillar cells, and HK stromal cells. We then examined the content of EVs, focusing on isolation of high-quality total RNA. We sequenced the total RNA and analyzed the nature of RNA species, including coding and noncoding RNAs. We compared whole-cell and EV-derived RNA composition in benign and malignant B cells and discovered that transcripts from EVs were involved in many critical cellular functions. Finally, we performed mutational analysis and found that mutations detected in EVs exquisitely represented mutations in the cell of origin. These results enhance our understanding and enable future studies of the role that EVs may play in the pathogenesis of DLBCL, particularly with regard to the exchange of genomic information. Current findings open a new strategy for liquid biopsy approaches in disease monitoring.


Subjects
Extracellular Vesicles/metabolism , Lymphoma, Large B-Cell, Diffuse/metabolism , Neoplasm Proteins/metabolism , RNA, Neoplasm/metabolism , Cell Line, Tumor , Extracellular Vesicles/genetics , Extracellular Vesicles/pathology , Humans , Lymphoma, Large B-Cell, Diffuse/genetics , Lymphoma, Large B-Cell, Diffuse/pathology , Neoplasm Proteins/genetics , RNA, Neoplasm/genetics
16.
Cell Rep ; 23(1): 239-254.e6, 2018 04 03.
Article in English | MEDLINE | ID: mdl-29617664

ABSTRACT

DNA damage repair (DDR) pathways modulate cancer risk, progression, and therapeutic response. We systematically analyzed somatic alterations to provide a comprehensive view of DDR deficiency across 33 cancer types. Mutations with accompanying loss of heterozygosity were observed in over 1/3 of DDR genes, including TP53 and BRCA1/2. Other prevalent alterations included epigenetic silencing of the direct repair genes EXO5, MGMT, and ALKBH3 in ∼20% of samples. Homologous recombination deficiency (HRD) was present at varying frequency in many cancer types, most notably ovarian cancer. However, in contrast to ovarian cancer, HRD was associated with worse outcomes in several other cancers. Protein structure-based analyses allowed us to predict functional consequences of rare, recurrent DDR mutations. A new machine-learning-based classifier developed from gene expression data allowed us to identify alterations that phenocopy deleterious TP53 mutations. These frequent DDR gene alterations in many human cancers have functional consequences that may determine cancer progression and guide therapy.


Subjects
Genome, Human , Neoplasms/genetics , Recombinational DNA Repair , Cell Line, Tumor , DNA Damage , Gene Silencing , Humans , Loss of Heterozygosity , Machine Learning , Mutation , Neoplasms/classification , Tumor Suppressor Proteins/genetics , Tumor Suppressor Proteins/metabolism
17.
Cell Rep ; 12(12): 2086-98, 2015 Sep 29.
Article in English | MEDLINE | ID: mdl-26365193

ABSTRACT

Changes in DNA methylation are required for the formation of germinal centers (GCs), but the mechanisms of such changes are poorly understood. Activation-induced cytidine deaminase (AID) has been recently implicated in DNA demethylation through its deaminase activity coupled with DNA repair. We investigated the epigenetic function of AID in vivo in germinal center B cells (GCBs) isolated from wild-type (WT) and AID-deficient (Aicda(-/-)) mice. We determined that the transit of B cells through the GC is associated with marked locus-specific loss of methylation and increased methylation diversity, both of which are lost in Aicda(-/-) animals. Differentially methylated cytosines (DMCs) between GCBs and naive B cells (NBs) are enriched in genes that are targeted for somatic hypermutation (SHM) by AID, and these genes form networks required for B cell development and proliferation. Finally, we observed significant conservation of AID-dependent epigenetic reprogramming between mouse and human B cells.


Subjects
B-Lymphocytes/metabolism , Cytidine Deaminase/metabolism , Epigenesis, Genetic , Germinal Center/metabolism , Animals , B-Lymphocytes/cytology , B-Lymphocytes/immunology , Cell Differentiation , Cell Movement , Cell Proliferation , Conserved Sequence , Cytidine Deaminase/genetics , Cytidine Deaminase/immunology , Cytosine/metabolism , DNA Methylation , Germinal Center/cytology , Germinal Center/immunology , Humans , Lymphocyte Activation , Mice , Mice, Inbred BALB C , Mice, Knockout
18.
Blood ; 123(11): 1699-708, 2014 Mar 13.
Article in English | MEDLINE | ID: mdl-24385541

ABSTRACT

Diffuse large B-cell lymphoma (DLBCL) is the most common aggressive form of non-Hodgkin lymphoma with variable biology and clinical behavior. The current classification does not fully explain the biological and clinical heterogeneity of DLBCLs. In this study, we carried out genomewide DNA methylation profiling of 140 DLBCL samples and 10 normal germinal center B cells using the HpaII tiny fragment enrichment by ligation-mediated polymerase chain reaction assay and hybridization to a custom Roche NimbleGen promoter array. We defined methylation disruption as a main epigenetic event in DLBCLs and designed a method for measuring the methylation variability of individual cases. We then used a novel approach for unsupervised hierarchical clustering based on the extent of DNA methylation variability. This approach identified 6 clusters (A-F). The extent of methylation variability was associated with survival outcomes, with significant differences in overall and progression-free survival. The novel clusters are characterized by disruption of specific biological pathways such as cytokine-mediated signaling, ephrin signaling, and pathways associated with apoptosis and cell-cycle regulation. In a subset of patients, we profiled gene expression and genomic variation to investigate their interplay with methylation changes. This study is the first to identify novel epigenetic clusters of DLBCLs and their aberrantly methylated genes, molecular associations, and survival.


Subjects
DNA Methylation/genetics , Epigenesis, Genetic , Gene Expression Regulation, Neoplastic , Genetic Variation/genetics , Lymphoma, Large B-Cell, Diffuse/genetics , Lymphoma, Large B-Cell, Diffuse/mortality , Neoplasm Proteins/genetics , Case-Control Studies , Cells, Cultured , Follow-Up Studies , Humans , Lymphoma, Large B-Cell, Diffuse/classification , Prognosis , Survival Rate
19.
PLoS One ; 8(11): e79871, 2013.
Article in English | MEDLINE | ID: mdl-24260313

ABSTRACT

Large biological datasets are being produced at a rapid pace and create substantial storage challenges, particularly in the domain of high-throughput sequencing (HTS). Most approaches currently used to store HTS data are either unable to quickly adapt to the requirements of new sequencing or analysis methods (because they do not support schema evolution), or fail to provide state of the art compression of the datasets. We have devised new approaches to store HTS data that support seamless data schema evolution and compress datasets substantially better than existing approaches. Building on these new approaches, we discuss and demonstrate how a multi-tier data organization can dramatically reduce the storage, computational and network burden of collecting, analyzing, and archiving large sequencing datasets. For instance, we show that spliced RNA-Seq alignments can be stored in less than 4% the size of a BAM file with perfect data fidelity. Compared to the previous compression state of the art, these methods reduce dataset size more than 40% when storing exome, gene expression or DNA methylation datasets. The approaches have been integrated in a comprehensive suite of software tools (http://goby.campagnelab.org) that support common analyses for a range of high-throughput sequencing assays.


Subjects
Computational Biology/methods , Data Compression/methods , High-Throughput Nucleotide Sequencing/methods , Software
20.
PLoS One ; 8(7): e69666, 2013.
Article in English | MEDLINE | ID: mdl-23936070

ABSTRACT

We present GobyWeb, a web-based system that facilitates the management and analysis of high-throughput sequencing (HTS) projects. The software provides integrated support for a broad set of HTS analyses and offers a simple plugin extension mechanism. Analyses currently supported include quantification of gene expression for messenger and small RNA sequencing, estimation of DNA methylation (i.e., reduced bisulfite sequencing and whole genome methyl-seq), or the detection of pathogens in sequenced data. In contrast to previous analysis pipelines developed for analysis of HTS data, GobyWeb requires significantly less storage space, runs analyses efficiently on a parallel grid, scales gracefully to process tens or hundreds of multi-gigabyte samples, yet can be used effectively by researchers who are comfortable using a web browser. We conducted performance evaluations of the software and found it to either outperform or have similar performance to analysis programs developed for specialized analyses of HTS data. We found that most biologists who took a one-hour GobyWeb training session were readily able to analyze RNA-Seq data with state of the art analysis tools. GobyWeb can be obtained at http://gobyweb.campagnelab.org and is freely available for non-commercial use. GobyWeb plugins are distributed in source code and licensed under the open source LGPL3 license to facilitate code inspection, reuse and independent extensions http://github.com/CampagneLaboratory/gobyweb2-plugins.


Subjects
DNA Methylation/genetics , Database Management Systems , Gene Expression Regulation , High-Throughput Nucleotide Sequencing , Internet , Software , Base Sequence , Genomics , Humans , RNA Splicing/genetics , User-Computer Interface