Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 35(6): 1070-1072, 2019 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-30875428

RESUMO

SUMMARY: Adjutant is an open-source, interactive and R-based application to support mining PubMed for literature reviews. Given a PubMed-compatible search query, Adjutant downloads the relevant articles and allows the user to perform an unsupervised clustering analysis to identify data-driven topic clusters. Following clustering, users can also sample documents using different strategies to obtain a more manageable dataset for further analysis. Adjutant makes explicit trade-offs between speed and accuracy, which are modifiable by the user, such that a complete analysis of several thousand documents can take a few minutes. All analytic datasets generated by Adjutant are saved, allowing users to easily conduct other downstream analyses that Adjutant does not explicitly support. AVAILABILITY AND IMPLEMENTATION: Adjutant is implemented in R, using Shiny, and is available at https://github.com/amcrisan/Adjutant. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Análise por Conglomerados , PubMed
2.
Bioinformatics ; 35(10): 1668-1676, 2019 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-30256887

RESUMO

MOTIVATION: Data visualization is an important tool for exploring and communicating findings from genomic and healthcare datasets. Yet, without a systematic way of organizing and describing the design space of data visualizations, researchers may not be aware of the breadth of possible visualization design choices or how to distinguish between good and bad options. RESULTS: We have developed a method that systematically surveys data visualizations using the analysis of both text and images. Our method supports the construction of a visualization design space that is explorable along two axes: why the visualization was created and how it was constructed. We applied our method to a corpus of scientific research articles from infectious disease genomic epidemiology and derived a Genomic Epidemiology Visualization Typology (GEViT) that describes how visualizations were created from a series of chart types, combinations and enhancements. We have also implemented an online gallery that allows others to explore our resulting design space of visualizations. Our results have important implications for visualization design and for researchers intending to develop or use data visualization tools. Finally, the method that we introduce is extensible to constructing visualizations design spaces across other research areas. AVAILABILITY AND IMPLEMENTATION: Our browsable gallery is available at http://gevit.net and all project code can be found at https://github.com/amcrisan/gevitAnalysisRelease. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Visualização de Dados , Software , Genoma , Genômica , Inquéritos e Questionários
3.
Nature ; 486(7403): 395-9, 2012 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-22495314

RESUMO

Primary triple-negative breast cancers (TNBCs), a tumour type defined by lack of oestrogen receptor, progesterone receptor and ERBB2 gene amplification, represent approximately 16% of all breast cancers. Here we show in 104 TNBC cases that at the time of diagnosis these cancers exhibit a wide and continuous spectrum of genomic evolution, with some having only a handful of coding somatic aberrations in a few pathways, whereas others contain hundreds of coding somatic mutations. High-throughput RNA sequencing (RNA-seq) revealed that only approximately 36% of mutations are expressed. Using deep re-sequencing measurements of allelic abundance for 2,414 somatic mutations, we determine for the first time-to our knowledge-in an epithelial tumour subtype, the relative abundance of clonal frequencies among cases representative of the population. We show that TNBCs vary widely in their clonal frequencies at the time of diagnosis, with the basal subtype of TNBC showing more variation than non-basal TNBC. Although p53 (also known as TP53), PIK3CA and PTEN somatic mutations seem to be clonally dominant compared to other genes, in some tumours their clonal frequencies are incompatible with founder status. Mutations in cytoskeletal, cell shape and motility proteins occurred at lower clonal frequencies, suggesting that they occurred later during tumour progression. Taken together, our results show that understanding the biology and therapeutic responses of patients with TNBC will require the determination of individual tumour clonal genotypes.


Assuntos
Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Evolução Molecular , Mutação/genética , Alelos , Neoplasias da Mama/diagnóstico , Células Clonais/metabolismo , Células Clonais/patologia , Variações do Número de Cópias de DNA/genética , Análise Mutacional de DNA , Progressão da Doença , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica/genética , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL/genética , Mutação Puntual/genética , Medicina de Precisão , Reprodutibilidade dos Testes , Análise de Sequência de RNA
5.
Can J Infect Dis Med Microbiol ; 2016: 5381871, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27366170

RESUMO

Background. Streptococcus pneumoniae can cause a wide spectrum of disease, including invasive pneumococcal disease (IPD). From 2005 to 2009 an outbreak of IPD occurred in Western Canada, caused by a S. pneumoniae strain with multilocus sequence type (MLST) 289 and serotype 5. We sought to investigate the incidence of IPD due to this S. pneumoniae strain and to characterize the outbreak in British Columbia using whole-genome sequencing. Methods. IPD was defined according to Public Health Agency of Canada guidelines. Two isolates representing the beginning and end of the outbreak were whole-genome sequenced. The sequences were analyzed for single nucleotide variants (SNVs) and putative genomic islands. Results. The peak of the outbreak in British Columbia was in 2006, when 57% of invasive S. pneumoniae isolates were serotype 5. Comparison of two whole-genome sequenced strains showed only 10 SNVs between them. A 15.5 kb genomic island was identified in outbreak strains, allowing the design of a PCR assay to track the spread of the outbreak strain. Discussion. We show that the serotype 5 MLST 289 strain contains a distinguishing genomic island, which remained genetically consistent over time. Whole-genome sequencing holds great promise for real-time characterization of outbreaks in the future and may allow responses tailored to characteristics identified in the genome.

6.
BJU Int ; 116(4): 556-67, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25762434

RESUMO

OBJECTIVE: To better characterize the genomics of patients with biochemical recurrence (BCR) who have metastatic disease progression in order to improve treatment decisions for prostate cancer. METHODS: The expression profiles of three clinical outcome groups after radical prostatectomy (RP) were compared: those with no evidence of disease (NED; n = 108); those with BCR (rise in prostate-specific antigen [PSA] level without metastasis; n = 163); and those with metastasis (n = 192). The patients were profiled using Human Exon 1.0 ST microarrays, and outcomes were supported by a median 18 years of follow-up. A metastasis signature was defined and verified in an independent RP cohort to ensure the robustness of the signature. Furthermore, bioinformatics characterization of the signature was conducted to decipher its biology. RESULTS: Minimal gene expression differences were observed between adjuvant treatment-naïve patients in the NED group and patients without metastasis in the BCR group. More than 95% of the differentially expressed genes (metastasis signature) were found in comparisons between primary tumours of metastasis patients and the two other outcome groups. The metastasis signature was validated in an independent cohort and was significantly associated with cell cycle genes, ubiquitin-mediated proteolysis, DNA repair, androgen, G-protein coupled and NOTCH signal transduction pathways. CONCLUSION: This study shows that metastasis development after BCR is associated with a distinct transcriptional programme that can be detected in the primary tumour. Patients with NED and BCR have highly similar transcriptional profiles, suggesting that measurement of PSA on its own is a poor surrogate for lethal disease. Use of genomic testing in patients undergoing RP with an initial rise in PSA level may be useful to improve secondary therapy decision-making.


Assuntos
Recidiva Local de Neoplasia/sangue , Recidiva Local de Neoplasia/genética , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Transcriptoma/genética , Estudos de Casos e Controles , Progressão da Doença , Perfilação da Expressão Gênica , Humanos , Masculino , Recidiva Local de Neoplasia/epidemiologia , Recidiva Local de Neoplasia/patologia , Período Pós-Operatório , Antígeno Prostático Específico/sangue , Neoplasias da Próstata/epidemiologia , Neoplasias da Próstata/metabolismo , Mapas de Interação de Proteínas/genética , RNA não Traduzido/genética
7.
Bioinformatics ; 28(7): 907-13, 2012 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-22285562

RESUMO

MOTIVATION: Identification of somatic single nucleotide variants (SNVs) in tumour genomes is a necessary step in defining the mutational landscapes of cancers. Experimental designs for genome-wide ascertainment of somatic mutations now routinely include next-generation sequencing (NGS) of tumour DNA and matched constitutional DNA from the same individual. This allows investigators to control for germline polymorphisms and distinguish somatic mutations that are unique to the tumour, thus reducing the burden of labour-intensive and expensive downstream experiments needed to verify initial predictions. In order to make full use of such paired datasets, computational tools for simultaneous analysis of tumour-normal paired sequence data are required, but are currently under-developed and under-represented in the bioinformatics literature. RESULTS: In this contribution, we introduce two novel probabilistic graphical models called JointSNVMix1 and JointSNVMix2 for jointly analysing paired tumour-normal digital allelic count data from NGS experiments. In contrast to independent analysis of the tumour and normal data, our method allows statistical strength to be borrowed across the samples and therefore amplifies the statistical power to identify and distinguish both germline and somatic events in a unified probabilistic framework. AVAILABILITY: The JointSNVMix models and four other models discussed in the article are part of the JointSNVMix software package available for download at http://compbio.bccrc.ca CONTACT: sshah@bccrc.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Análise Mutacional de DNA/métodos , DNA de Neoplasias/genética , Modelos Estatísticos , Neoplasias/genética , Humanos , Mutação , Software
8.
J Urol ; 190(6): 2047-53, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-23770138

RESUMO

PURPOSE: Patients with locally advanced prostate cancer after radical prostatectomy are candidates for secondary therapy. However, this higher risk population is heterogeneous. Many cases do not metastasize even when conservatively managed. Given the limited specificity of pathological features to predict metastasis, newer risk prediction models are needed. We report a validation study of a genomic classifier that predicts metastasis after radical prostatectomy in a high risk population. MATERIALS AND METHODS: A case-cohort design was used to sample 1,010 patients after radical prostatectomy at high risk for recurrence who were treated from 2000 to 2006. Patients had preoperative prostate specific antigen greater than 20 ng/ml, Gleason 8 or greater, pT3b or a Mayo Clinic nomogram score of 10 or greater. Patients with metastasis at diagnosis or any prior treatment for prostate cancer were excluded from analysis. A 20% random sampling created a subcohort that included all patients with metastasis. We generated 22-marker genomic classifier scores for 219 patients with available genomic data. ROC and decision curves, competing risk and weighted regression models were used to assess genomic classifier performance. RESULTS: The genomic classifier AUC was 0.79 for predicting 5-year metastasis after radical prostatectomy. Decision curves showed that the genomic classifier net benefit exceeded that of clinical only models. The genomic classifier was the predominant predictor of metastasis on multivariable analysis. The cumulative incidence of metastasis 5 years after radical prostatectomy was 2.4%, 6.0% and 22.5% in patients with low (60%), intermediate (21%) and high (19%) genomic classifier scores, respectively (p<0.001). CONCLUSIONS: Results indicate that genomic information from the primary tumor can identify patients with adverse pathological features who are most at risk for metastasis and potentially lethal prostate cancer.


Assuntos
Genômica , Prostatectomia , Neoplasias da Próstata/classificação , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Estudos de Coortes , Humanos , Masculino , Metástase Neoplásica , Prognóstico , Neoplasias da Próstata/cirurgia
9.
Artigo em Inglês | MEDLINE | ID: mdl-37844008

RESUMO

Visual and interactive machine learning systems (IML) are becoming ubiquitous as they empower individuals with varied machine learning expertise to analyze data. However, it remains complex to align interactions with visual marks to a user's intent for steering machine learning models. We explore using data and visual design probes to elicit users' desired interactions to steer ML models via visual encodings within IML interfaces. We conducted an elicitation study with 20 data analysts with varying expertise in ML. We summarize our findings as pairs of target-interaction, which we compare to prior systems to assess the utility of the probes. We additionally surfaced insights about factors influencing how and why participants chose to interact with visual encodings, including refraining from interacting. Finally, we reflect on the value of gathering such formative empirical evidence via data and visual design probes ahead of developing IML prototypes.

10.
IEEE Comput Graph Appl ; 43(1): 22-36, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-34928788

RESUMO

Dashboards are the ubiquitous means of data communication within organizations. Yet we have limited understanding of how they factor into data practices in the workplace, particularly for data workers who do not self-identify as professional analysts. We focus on data workers who use dashboards as a primary interface to data, reporting on an interview study that characterizes their data practices and the accompanying barriers to seamless data interaction. While dashboards are typically designed for data consumption, our findings show that dashboard users have far more diverse needs. To capture these activities, we frame data workers' practices as data conversations: conversations with data capture classic analysis (asking and answering data questions), while conversations through and around data involve constructing representations and narratives for sharing and communication. Dashboard users faced substantial barriers in their data conversations: their engagement with data was often intermittent, dependent on experts, and involved an awkward assembly of tools. We challenge the visualization and analytics community to embrace dashboard users as a population and design tools that blend seamlessly into their work contexts.

11.
IEEE Trans Vis Comput Graph ; 28(12): 4855-4872, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-34449391

RESUMO

Genomic Epidemiology (genEpi) is a branch of public health that uses many different data types including tabular, network, genomic, and geographic, to identify and contain outbreaks of deadly diseases. Due to the volume and variety of data, it is challenging for genEpi domain experts to conduct data reconnaissance; that is, have an overview of the data they have and make assessments toward its quality, completeness, and suitability. We present an algorithm for data reconnaissance through automatic visualization recommendation, GEViTRec. Our approach handles a broad variety of dataset types and automatically generates visually coherent combinations of charts, in contrast to existing systems that primarily focus on singleton visual encodings of tabular datasets. We automatically detect linkages across multiple input datasets by analyzing non-numeric attribute fields, creating a data source graph within which we analyze and rank paths. For each high-ranking path, we specify chart combinations with positional and color alignments between shared fields, using a gradual binding approach to transform initial partial specifications of singleton charts to complete specifications that are aligned and oriented consistently. A novel aspect of our approach is its combination of domain-agnostic elements with domain-specific information that is captured through a domain-specific visualization prevalence design space. Our implementation is applied to both synthetic data and real Ebola outbreak data. We compare GEViTRec's output to what previous visualization recommendation systems would generate, and to manually crafted visualizations used by practitioners. We conducted formative evaluations with ten genEpi experts to assess the relevance and interpretability of our results. Code, Data, and Study Materials Availability: https://github.com/amcrisan/GEVitRec.


Assuntos
Gráficos por Computador , Software , Prevalência , Genômica , Genoma
12.
Bioinformatics ; 26(6): 730-6, 2010 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-20130035

RESUMO

MOTIVATION: Next-generation sequencing (NGS) has enabled whole genome and transcriptome single nucleotide variant (SNV) discovery in cancer. NGS produces millions of short sequence reads that, once aligned to a reference genome sequence, can be interpreted for the presence of SNVs. Although tools exist for SNV discovery from NGS data, none are specifically suited to work with data from tumors, where altered ploidy and tumor cellularity impact the statistical expectations of SNV discovery. RESULTS: We developed three implementations of a probabilistic Binomial mixture model, called SNVMix, designed to infer SNVs from NGS data from tumors to address this problem. The first models allelic counts as observations and infers SNVs and model parameters using an expectation maximization (EM) algorithm and is therefore capable of adjusting to deviation of allelic frequencies inherent in genomically unstable tumor genomes. The second models nucleotide and mapping qualities of the reads by probabilistically weighting the contribution of a read/nucleotide to the inference of a SNV based on the confidence we have in the base call and the read alignment. The third combines filtering out low-quality data in addition to probabilistic weighting of the qualities. We quantitatively evaluated these approaches on 16 ovarian cancer RNASeq datasets with matched genotyping arrays and a human breast cancer genome sequenced to >40x (haploid) coverage with ground truth data and show systematically that the SNVMix models outperform competing approaches. AVAILABILITY: Software and data are available at http://compbio.bccrc.ca CONTACT: sshah@bccrc.ca SUPPLEMANTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Variação Genética , Neoplasias/genética , Análise de Sequência de DNA/métodos , Software , Algoritmos , Sequência de Bases , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Genoma Humano , Humanos , Dados de Sequência Molecular , Alinhamento de Sequência
13.
IEEE Trans Vis Comput Graph ; 27(2): 1860-1870, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33048684

RESUMO

Data science is a rapidly growing discipline and organizations increasingly depend on data science work. Yet the ambiguity around data science, what it is, and who data scientists are can make it difficult for visualization researchers to identify impactful research trajectories. We have conducted a retrospective analysis of data science work and workers as described within the data visualization, human computer interaction, and data science literature. From this analysis we synthesis a comprehensive model that describes data science work and breakdown to data scientists into nine distinct roles. We summarise and reflect on the role that visualization has throughout data science work and the varied needs of data scientists themselves for tooling support. Our findings are intended to arm visualization researchers with a more concrete framing of data science with the hope that it will help them surface innovative opportunities for impacting data science work. Data availability: https://osf.io/z2xpd/?view_only=87fa24be486a473884adb9ffbe8db4ec.

14.
PeerJ ; 6: e4218, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29340235

RESUMO

BACKGROUND: Microbial genome sequencing is now being routinely used in many clinical and public health laboratories. Understanding how to report complex genomic test results to stakeholders who may have varying familiarity with genomics-including clinicians, laboratorians, epidemiologists, and researchers-is critical to the successful and sustainable implementation of this new technology; however, there are no evidence-based guidelines for designing such a report in the pathogen genomics domain. Here, we describe an iterative, human-centered approach to creating a report template for communicating tuberculosis (TB) genomic test results. METHODS: We used Design Study Methodology-a human centered approach drawn from the information visualization domain-to redesign an existing clinical report. We used expert consults and an online questionnaire to discover various stakeholders' needs around the types of data and tasks related to TB that they encounter in their daily workflow. We also evaluated their perceptions of and familiarity with genomic data, as well as its utility at various clinical decision points. These data shaped the design of multiple prototype reports that were compared against the existing report through a second online survey, with the resulting qualitative and quantitative data informing the final, redesigned, report. RESULTS: We recruited 78 participants, 65 of whom were clinicians, nurses, laboratorians, researchers, and epidemiologists involved in TB diagnosis, treatment, and/or surveillance. Our first survey indicated that participants were largely enthusiastic about genomic data, with the majority agreeing on its utility for certain TB diagnosis and treatment tasks and many reporting some confidence in their ability to interpret this type of data (between 58.8% and 94.1%, depending on the specific data type). When we compared our four prototype reports against the existing design, we found that for the majority (86.7%) of design comparisons, participants preferred the alternative prototype designs over the existing version, and that both clinicians and non-clinicians expressed similar design preferences. Participants showed clearer design preferences when asked to compare individual design elements versus entire reports. Both the quantitative and qualitative data informed the design of a revised report, available online as a LaTeX template. CONCLUSIONS: We show how a human-centered design approach integrating quantitative and qualitative feedback can be used to design an alternative report for representing complex microbial genomic data. We suggest experimental and design guidelines to inform future design studies in the bioinformatics and microbial genomics domains, and suggest that this type of mixed-methods study is important to facilitate the successful translation of pathogen genomics in the clinic, not only for clinical reports but also more complex bioinformatics data visualization software.

15.
Microb Genom ; 2(5): e000060, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-28348853

RESUMO

We report an updated method for inferring the time at which an infectious disease was transmitted between persons from a time-labelled pathogen genome phylogeny. We applied the method to 48 Mycobacterium tuberculosis genomes as part of a real-time public health outbreak investigation, demonstrating that although active tuberculosis (TB) cases were diagnosed through 2013, no transmission events took place beyond mid-2012. Subsequent cases were the result of progression from latent TB infection to active disease, and not recent transmission. This evolutionary genomic approach was used to declare the outbreak over in January 2015.


Assuntos
Surtos de Doenças , Genômica , Epidemiologia Molecular/métodos , Mycobacterium tuberculosis/genética , Tuberculose/epidemiologia , Humanos , Controle de Infecções , Tuberculose/prevenção & controle , Tuberculose/transmissão
16.
Eur Urol ; 67(2): 326-33, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24998118

RESUMO

BACKGROUND: Risk prediction models that incorporate biomarkers and clinicopathologic variables may be used to improve decision making after radical prostatectomy (RP). We compared two previously validated post-RP classifiers-the Cancer of the Prostate Risk Assessment Postsurgical (CAPRA-S) and the Decipher genomic classifier (GC)-to predict prostate cancer-specific mortality (CSM) in a contemporary cohort of RP patients. OBJECTIVE: To evaluate the combined prognostic ability of CAPRA-S and GC to predict CSM. DESIGN, SETTING, AND PARTICIPANTS: A cohort of 1010 patients at high risk of recurrence after RP were treated at the Mayo Clinic between 2000 and 2006. High risk was defined by any of the following: preoperative prostate-specific antigen >20 ng/ml, pathologic Gleason score ≥8, or stage pT3b. A case-cohort random sample identified 225 patients (with cases defined as patients who experienced CSM), among whom CAPRA-S and GC could be determined for 185 patients. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: The scores were evaluated individually and in combination using concordance index (c-index), decision curve analysis, reclassification, cumulative incidence, and Cox regression for the prediction of CSM. RESULTS AND LIMITATIONS: Among 185 men, 28 experienced CSM. The c-indices for CAPRA-S and GC were 0.75 (95% confidence interval [CI], 0.55-0.84) and 0.78 (95% CI, 0.68-0.87), respectively. GC showed higher net benefit on decision curve analysis, but a score combining CAPRA-S and GC did not improve the area under the receiver-operating characteristic curve after optimism-adjusted bootstrapping. In 82 patients stratified to high risk based on CAPRA-S score ≥6, GC scores were likewise high risk for 33 patients, among whom 17 had CSM events. GC reclassified the remaining 49 men as low to intermediate risk; among these men, three CSM events were observed. In multivariable analysis, GC and CAPRA-S as continuous variables were independently prognostic of CSM, with hazard ratios (HRs) of 1.81 (p<0.001 per 0.1-unit change in score) and 1.36 (p=0.01 per 1-unit change in score). When categorized into risk groups, the multivariable HR for high CAPRA-S scores (≥6) was 2.36 (p=0.04) and was 11.26 (p<0.001) for high GC scores (≥0.6). For patients with both high GC and high CAPRA-S scores, the cumulative incidence of CSM was 45% at 10 yr. The study is limited by its retrospective design. CONCLUSIONS: Both GC and CAPRA-S were significant independent predictors of CSM. GC was shown to reclassify many men stratified to high risk based on CAPRA-S ≥6 alone. Patients with both high GC and high CAPRA-S risk scores were at markedly elevated post-RP risk for lethal prostate cancer. If validated prospectively, these findings suggest that integration of a genomic-clinical classifier may enable better identification of those post-RP patients who should be considered for more aggressive secondary therapies and clinical trials. PATIENT SUMMARY: The Cancer of the Prostate Risk Assessment Postsurgical (CAPRA-S) and the Decipher genomic classifier (GC) were significant independent predictors of prostate cancer-specific mortality. These findings suggest that integration of a genomic-clinical classifier may enable better identification of those post-radical prostatectomy patients who should be considered for more aggressive secondary therapies and clinical trials.


Assuntos
Biomarcadores Tumorais/genética , Genômica , Prostatectomia , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/cirurgia , Idoso , Biomarcadores Tumorais/sangue , Genômica/métodos , Humanos , Calicreínas/sangue , Estimativa de Kaplan-Meier , Masculino , Pessoa de Meia-Idade , Minnesota , Gradação de Tumores , Estadiamento de Neoplasias , Valor Preditivo dos Testes , Modelos de Riscos Proporcionais , Antígeno Prostático Específico/sangue , Prostatectomia/efeitos adversos , Prostatectomia/mortalidade , Neoplasias da Próstata/sangue , Neoplasias da Próstata/genética , Neoplasias da Próstata/mortalidade , Neoplasias da Próstata/patologia , Reprodutibilidade dos Testes , Estudos Retrospectivos , Medição de Risco , Fatores de Risco , Resultado do Tratamento
17.
J Clin Endocrinol Metab ; 98(10): 4072-9, 2013 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-23928671

RESUMO

PURPOSE: Due to the limitations of fine-needle aspiration biopsy (FNAB) cytopathology, many individuals who present with thyroid nodules eventually undergo thyroid surgery to diagnose thyroid cancer. The objective of this study was to use whole-transcriptome profiling to develop and validate a genomic classifier that significantly improves the accuracy of preoperative thyroid cancer diagnosis. MATERIALS AND METHODS: Nucleic acids were extracted and amplified for microarray expression analysis on the Affymetrix Human Exon 1.0 ST GeneChips from 1-mm-diameter formalin-fixed and paraffin-embedded thyroid tumor tissue cores. A training group of 60 thyroidectomy specimens (30 cancers and 30 benign lesions) were used to assess differential expression and for subsequent generation of a genomic classifier. The classifier was validated in a blinded fashion on a group of 31 formalin-fixed and paraffin-embedded thyroid FNAB specimens. RESULTS: Expression profiles of the 57 thyroidectomy training and 31 FNAB validation specimens that passed a series of quality control steps were analyzed. A genomic classifier composed of 249 markers that corresponded to 154 genes, had an overall validated accuracy of 90.0% in the 31 patient FNAB specimens and had positive and negative predictive values of 100% and 85.7%, respectively. The majority of the identified markers that made up the classifier represented non-protein-encoding RNAs. CONCLUSIONS: Whole-transcriptome profiling of thyroid nodule surgical specimens allowed for the development of a genomic classifier that improved the accuracy of preoperative thyroid cancer FNAB diagnosis.


Assuntos
Perfilação da Expressão Gênica , Glândula Tireoide/patologia , Neoplasias da Glândula Tireoide/diagnóstico , Nódulo da Glândula Tireoide/diagnóstico , Nódulo da Glândula Tireoide/genética , Transcriptoma , Adulto , Idoso , Idoso de 80 Anos ou mais , Biópsia por Agulha Fina , Diagnóstico Diferencial , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Glândula Tireoide/cirurgia , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia , Nódulo da Glândula Tireoide/patologia
18.
PLoS One ; 8(6): e66855, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23826159

RESUMO

PURPOSE: Clinicopathologic features and biochemical recurrence are sensitive, but not specific, predictors of metastatic disease and lethal prostate cancer. We hypothesize that a genomic expression signature detected in the primary tumor represents true biological potential of aggressive disease and provides improved prediction of early prostate cancer metastasis. METHODS: A nested case-control design was used to select 639 patients from the Mayo Clinic tumor registry who underwent radical prostatectomy between 1987 and 2001. A genomic classifier (GC) was developed by modeling differential RNA expression using 1.4 million feature high-density expression arrays of men enriched for rising PSA after prostatectomy, including 213 who experienced early clinical metastasis after biochemical recurrence. A training set was used to develop a random forest classifier of 22 markers to predict for cases--men with early clinical metastasis after rising PSA. Performance of GC was compared to prognostic factors such as Gleason score and previous gene expression signatures in a withheld validation set. RESULTS: Expression profiles were generated from 545 unique patient samples, with median follow-up of 16.9 years. GC achieved an area under the receiver operating characteristic curve of 0.75 (0.67-0.83) in validation, outperforming clinical variables and gene signatures. GC was the only significant prognostic factor in multivariable analyses. Within Gleason score groups, cases with high GC scores experienced earlier death from prostate cancer and reduced overall survival. The markers in the classifier were found to be associated with a number of key biological processes in prostate cancer metastatic disease progression. CONCLUSION: A genomic classifier was developed and validated in a large patient cohort enriched with prostate cancer metastasis patients and a rising PSA that went on to experience metastatic disease. This early metastasis prediction model based on genomic expression in the primary tumor may be useful for identification of aggressive prostate cancer.


Assuntos
Genoma Humano/genética , Prostatectomia , Neoplasias da Próstata/genética , Neoplasias da Próstata/cirurgia , Idoso , Biomarcadores Tumorais/metabolismo , Estudos de Casos e Controles , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Estimativa de Kaplan-Meier , Masculino , Pessoa de Meia-Idade , Modelos Biológicos , Análise Multivariada , Metástase Neoplásica , Razão de Chances , Prognóstico , Neoplasias da Próstata/patologia , Reprodutibilidade dos Testes , Fatores de Risco
19.
Front Genet ; 3: 23, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22371711

RESUMO

Prostate cancer is the most diagnosed cancer among men in the United States. While the majority of patients who undergo surgery (prostatectomy) will essentially be cured, about 30-40% men remain at risk for disease progression and recurrence. Currently, patients are deemed at risk by evaluation of clinical factors, but these do not resolve whether adjuvant therapy will significantly attenuate or delay disease progression for a patient at risk. Numerous efforts using mRNA-based biomarkers have been described for this purpose, but none have successfully reached widespread clinical practice in helping to make an adjuvant therapy decision. Here, we assess the utility of non-coding RNAs as biomarkers for prostate cancer recurrence based on high-resolution oligonucleotide microarray analysis of surgical tissue specimens from normal adjacent prostate, primary tumors, and metastases. We identify differentially expressed non-coding RNAs that distinguish between the different prostate tissue types and show that these non-coding RNAs can predict clinical outcomes in primary tumors. Together, these results suggest that non-coding RNAs are emerging from the "dark matter" of the genome as a new source of biomarkers for characterizing disease recurrence and progression. While this study shows that non-coding RNA biomarkers can be highly informative, future studies will be needed to further characterize the specific roles of these non-coding RNA biomarkers in the development of aggressive disease.

20.
PLoS One ; 7(8): e41551, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22916110

RESUMO

Next generation sequencing has now enabled a cost-effective enumeration of the full mutational complement of a tumor genome-in particular single nucleotide variants (SNVs). Most current computational and statistical models for analyzing next generation sequencing data, however, do not account for cancer-specific biological properties, including somatic segmental copy number alterations (CNAs)-which require special treatment of the data. Here we present CoNAn-SNV (Copy Number Annotated SNV): a novel algorithm for the inference of single nucleotide variants (SNVs) that overlap copy number alterations. The method is based on modelling the notion that genomic regions of segmental duplication and amplification induce an extended genotype space where a subset of genotypes will exhibit heavily skewed allelic distributions in SNVs (and therefore render them undetectable by methods that assume diploidy). We introduce the concept of modelling allelic counts from sequencing data using a panel of Binomial mixture models where the number of mixtures for a given locus in the genome is informed by a discrete copy number state given as input. We applied CoNAn-SNV to a previously published whole genome shotgun data set obtained from a lobular breast cancer and show that it is able to discover 21 experimentally revalidated somatic non-synonymous mutations in a lobular breast cancer genome that were not detected using copy number insensitive SNV detection algorithms. Importantly, ROC analysis shows that the increased sensitivity of CoNAn-SNV does not result in disproportionate loss of specificity. This was also supported by analysis of a recently published lymphoma genome with a relatively quiescent karyotype, where CoNAn-SNV showed similar results to other callers except in regions of copy number gain where increased sensitivity was conferred. Our results indicate that in genomically unstable tumors, copy number annotation for SNV detection will be critical to fully characterize the mutational landscape of cancer genomes.


Assuntos
Genes Neoplásicos , Genoma , Mutação , Neoplasias/genética , Algoritmos , Variações do Número de Cópias de DNA , Humanos , Modelos Genéticos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA