Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 23
Filtrar
1.
Clin Pharmacol Ther ; 113(6): 1217-1222, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-36408668

RESUMEN

Legislative and technological advancements over the past decade have given rise to the proliferation of healthcare data generated from routine clinical practice, often referred to as real-world data (RWD). These data have piqued the interest of healthcare stakeholders due to their potential utility in generating evidence to support clinical and regulatory decision making. In the oncology setting, studies leveraging RWD offer distinct advantages that are complementary to randomized controlled trials (RCTs). They also permit the conduct of investigations that may not be possible through prospective designs due to ethics or feasibility. Despite its promise, the use of RWD for the generation of clinical evidence remains controversial due to concerns of unmeasured confounding and other sources of bias that must be carefully addressed in the study design and analysis. To facilitate a better understanding of when RWD can provide reliable conclusions on drug effectiveness, we seek to conduct 10 RWD-based studies that emulate RCTs in oncology using a systematic, protocol-driven approach described herein. Results of this investigation will help inform clinical, scientific, and regulatory stakeholders on the applications of RWD in the context of product labeling expansion, drug safety, and comparative effectiveness in oncology.


Asunto(s)
Oncología Médica , Proyectos de Investigación , Humanos , Ensayos Clínicos Controlados Aleatorios como Asunto
2.
Clin Pharmacol Ther ; 111(1): 77-89, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34839524

RESUMEN

Interest in real-world data (RWD) and real-world evidence (RWE) to expedite and enrich the development of new biopharmaceutical products has proliferated in recent years, spurred by the 21st Century Cures Act in the United States and similar policy efforts in other countries, willingness by regulators to consider RWE in their decisions, demands from third-party payers, and growing concerns about the limitations of traditional clinical trials. Although much of the recent literature on RWE has focused on potential regulatory uses (e.g., product approvals in oncology or rare diseases based on single-arm trials with external control arms), this article reviews how biopharmaceutical companies can leverage RWE to inform internal decisions made throughout the product development process. Specifically, this article will review use of RWD to guide pipeline and portfolio strategy; use of novel sources of RWD to inform product development, use of RWD to inform clinical development, use of advanced analytics to harness "big" RWD, and considerations when using RWD to inform internal decisions. Topics discussed will include the use of molecular, clinicogenomic, medical imaging, radiomic, and patient-derived xenograft data to augment traditional sources of RWE, the use of RWD to inform clinical trial eligibility criteria, enrich trial population based on predicted response, select endpoints, estimate sample size, understand disease progression, and enhance diversity of participants, the growing use of data tokenization and advanced analytical techniques based on artificial intelligence in RWE, as well as the importance of data quality and methodological transparency in RWE.


Asunto(s)
Ensayos Clínicos como Asunto/métodos , Desarrollo de Medicamentos/métodos , Práctica Clínica Basada en la Evidencia/métodos , Ciencia de los Datos , Industria Farmacéutica/organización & administración , Registros Electrónicos de Salud , Humanos
3.
Sci Data ; 6(1): 252, 2019 10 31.
Artículo en Inglés | MEDLINE | ID: mdl-31672983

RESUMEN

Mining of integrated public transcriptomic and ChIP-Seq (cistromic) datasets can illuminate functions of mammalian cellular signaling pathways not yet explored in the research literature. Here, we designed a web knowledgebase, the Signaling Pathways Project (SPP), which incorporates community classifications of signaling pathway nodes (receptors, enzymes, transcription factors and co-nodes) and their cognate bioactive small molecules. We then mapped over 10,000 public transcriptomic or cistromic experiments to their pathway node or biosample of study. To enable prediction of pathway node-gene target transcriptional regulatory relationships through SPP, we generated consensus 'omics signatures, or consensomes, which ranked genes based on measures of their significant differential expression or promoter occupancy across transcriptomic or cistromic experiments mapped to a specific node family. Consensomes were validated using alignment with canonical literature knowledge, gene target-level integration of transcriptomic and cistromic data points, and in bench experiments confirming previously uncharacterized node-gene target regulatory relationships. To expose the SPP knowledgebase to researchers, a web browser interface was designed that accommodates numerous routine data mining strategies. SPP is freely accessible at https://www.signalingpathways.org .


Asunto(s)
Bases de Datos Factuales , Transducción de Señal , Animales , Humanos , Bases del Conocimiento , Mamíferos , Transcriptoma
4.
Learn Health Syst ; 3(1): e10076, 2019 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-31245598

RESUMEN

The benefits of reusing EHR data for clinical research studies are numerous. They portend the opportunity to bring new therapies to patients sooner, potentially at a lower cost, and to accelerate learning health cycles-through faster data acquisition in clinical research studies. Metrics have proven that time can be saved, workflow and processes streamlined, and data quality increased significantly. Pilot projects and now actual investigational trials used for regulatory submissions have shown that these benefits support the transformation of clinical research by leveraging EHRs for research. Panelists at a recent collaborative focused on bridging clinical research and clinical care offered varying perspectives on how the latest standards and technologies could be leveraged to facilitate data transfer from EHR systems into clinical research databases, as well as the associated improvements in data quality. Panelists also discussed other avenues to leverage EHR in clinical research. Improvements and exciting possibilities notwithstanding, much work remains. Data ownership and access, attention to metadata and structured data for data sharing, and broader adoption of global standards are key areas for collaboration. With the steady increase in adoption of EHRs around the world, this is an excellent time for all stakeholders to work together and create an environment such that EHRs can be used more readily for research. The capacity for research can thus be increased to provide more high-quality information that will contribute to rapid continuous learning health systems from which all patients can benefit.

5.
Bioinformatics ; 35(9): 1562-1565, 2019 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-30256906

RESUMEN

MOTIVATION: Standardization and semantic alignment have been considered one of the major challenges for data integration in clinical research. The inclusion of the CDISC SDTM clinical data standard into the tranSMART i2b2 via a guiding master ontology tree positively impacts and supports the efficacy of data sharing, visualization and exploration across datasets. RESULTS: We present here a schema for the organization of SDTM variables into the tranSMART i2b2 tree along with a script and test dataset to exemplify the mapping strategy. The eTRIKS master tree concept is demonstrated by making use of fictitious data generated for four patients, including 16 SDTM clinical domains. We describe how the usage of correct visit names and data labels can help to integrate multiple readouts per patient and avoid ETL crashes when running a tranSMART loading routine. AVAILABILITY AND IMPLEMENTATION: The eTRIKS Master Tree package and test datasets are publicly available at https://doi.org/10.5281/zenodo.1009098 and a functional demo installation at https://public.etriks.org/transmart/datasetExplorer/ under eTRIKS-Master Tree branch, where the discussed examples can be visualized.


Asunto(s)
Almacenamiento y Recuperación de la Información , Exactitud de los Datos , Recolección de Datos , Humanos , Difusión de la Información
6.
AMIA Jt Summits Transl Sci Proc ; 2017: 94-103, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-29888049

RESUMEN

The Clinical Data Interchange Standards Consortium (CDISC) is a global non-profit standards development organization that creates consensus-based standards for clinical and translational research. Several of these standards are now required by regulators for electronic submissions of regulated clinical trials' data and by government funding agencies. These standards are free and open, available for download on the CDISC Website as PDFs. While these documents are human readable, they are not amenable to ready use by electronic systems. CDISC launched the CDISC Shared Health And Research Electronic library (SHARE) to provide the standards metadata in machine-readable formats to facilitate the automated management and implementation of the standards. This paper describes how CDISC SHARE'S standards can facilitate collecting, aggregating and analyzing standardized data from early design to end analysis; and its role as a central resource providing information systems with metadata that drives process automation including study setup and data pipelining.

8.
BMJ Open ; 7(12): e018647, 2017 Dec 14.
Artículo en Inglés | MEDLINE | ID: mdl-29247106

RESUMEN

OBJECTIVES: We examined major issues associated with sharing of individual clinical trial data and developed a consensus document on providing access to individual participant data from clinical trials, using a broad interdisciplinary approach. DESIGN AND METHODS: This was a consensus-building process among the members of a multistakeholder task force, involving a wide range of experts (researchers, patient representatives, methodologists, information technology experts, and representatives from funders, infrastructures and standards development organisations). An independent facilitator supported the process using the nominal group technique. The consensus was reached in a series of three workshops held over 1 year, supported by exchange of documents and teleconferences within focused subgroups when needed. This work was set within the Horizon 2020-funded project CORBEL (Coordinated Research Infrastructures Building Enduring Life-science Services) and coordinated by the European Clinical Research Infrastructure Network. Thus, the focus was on non-commercial trials and the perspective mainly European. OUTCOME: We developed principles and practical recommendations on how to share data from clinical trials. RESULTS: The task force reached consensus on 10 principles and 50 recommendations, representing the fundamental requirements of any framework used for the sharing of clinical trials data. The document covers the following main areas: making data sharing a reality (eg, cultural change, academic incentives, funding), consent for data sharing, protection of trial participants (eg, de-identification), data standards, rights, types and management of access (eg, data request and access models), data management and repositories, discoverability, and metadata. CONCLUSIONS: The adoption of the recommendations in this document would help to promote and support data sharing and reuse among researchers, adequately inform trial participants and protect their rights, and provide effective and efficient systems for preparing, storing and accessing data. The recommendations now need to be implemented and tested in practice. Further work needs to be done to integrate these proposals with those from other geographical areas and other academic domains.


Asunto(s)
Investigación Biomédica/normas , Ensayos Clínicos como Asunto , Consenso , Difusión de la Información/métodos , Comités Consultivos , Humanos
9.
AMIA Jt Summits Transl Sci Proc ; 2017: 158-165, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28815125

RESUMEN

The Food & Drug Administration has begun requiring that electronic submissions of regulated clinical studies utilize the Clinical Data Information Standards Consortium data standards. Within regulated clinical research, traceability is a requirement and indicates that the analysis results can be traced back to the original source data. Current solutions for clinical research data traceability are limited in terms of querying, validation and visualization capabilities. This paper describes (1) the development of metadata models to support computable traceability and traceability visualizations that are compatible with industry data standards for the regulated clinical research domain, (2) adaptation of graph traversal algorithms to make them capable of identifying traceability gaps and validating traceability across the clinical research data lifecycle, and (3) development of a traceability query capability for retrieval and visualization of traceability information.

10.
Sci Signal ; 10(476)2017 Apr 25.
Artículo en Inglés | MEDLINE | ID: mdl-28442630

RESUMEN

We previously developed a web tool, Transcriptomine, to explore expression profiling data sets involving small-molecule or genetic manipulations of nuclear receptor signaling pathways. We describe advances in biocuration, query interface design, and data visualization that enhance the discovery of uncharacterized biology in these pathways using this tool. Transcriptomine currently contains about 45 million data points encompassing more than 2000 experiments in a reference library of nearly 550 data sets retrieved from public archives and systematically curated. To make the underlying data points more accessible to bench biologists, we classified experimental small molecules and gene manipulations into signaling pathways and experimental tissues and cell lines into physiological systems and organs. Incorporation of these mappings into Transcriptomine enables the user to readily evaluate tissue-specific regulation of gene expression by nuclear receptor signaling pathways. Data points from animal and cell model experiments and from clinical data sets elucidate the roles of nuclear receptor pathways in gene expression events accompanying various normal and pathological cellular processes. In addition, data sets targeting non-nuclear receptor signaling pathways highlight transcriptional cross-talk between nuclear receptors and other signaling pathways. We demonstrate with specific examples how data points that exist in isolation in individual data sets validate each other when connected and made accessible to the user in a single interface. In summary, Transcriptomine allows bench biologists to routinely develop research hypotheses, validate experimental data, or model relationships between signaling pathways, genes, and tissues.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Regulación de la Expresión Génica , Genes , Receptores Citoplasmáticos y Nucleares/genética , Programas Informáticos , Transcriptoma , Animales , Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Humanos , Internet , Especificidad de Órganos , Receptores Citoplasmáticos y Nucleares/metabolismo , Transducción de Señal
11.
J Am Med Inform Assoc ; 24(5): 882-890, 2017 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-28339791

RESUMEN

BACKGROUND: It is critical to integrate and analyze data from biological, translational, and clinical studies with data from health systems; however, electronic artifacts are stored in thousands of disparate systems that are often unable to readily exchange data. OBJECTIVE: To facilitate meaningful data exchange, a model that presents a common understanding of biomedical research concepts and their relationships with health care semantics is required. The Biomedical Research Integrated Domain Group (BRIDG) domain information model fulfills this need. Software systems created from BRIDG have shared meaning "baked in," enabling interoperability among disparate systems. For nearly 10 years, the Clinical Data Standards Interchange Consortium, the National Cancer Institute, the US Food and Drug Administration, and Health Level 7 International have been key stakeholders in developing BRIDG. METHODS: BRIDG is an open-source Unified Modeling Language-class model developed through use cases and harmonization with other models. RESULTS: With its 4+ releases, BRIDG includes clinical and now translational research concepts in its Common, Protocol Representation, Study Conduct, Adverse Events, Regulatory, Statistical Analysis, Experiment, Biospecimen, and Molecular Biology subdomains. INTERPRETATION: The model is a Clinical Data Standards Interchange Consortium, Health Level 7 International, and International Standards Organization standard that has been utilized in national and international standards-based software development projects. It will continue to mature and evolve in the areas of clinical imaging, pathology, ontology, and vocabulary support. BRIDG 4.1.1 and prior releases are freely available at https://bridgmodel.nci.nih.gov .


Asunto(s)
Investigación Biomédica , Interoperabilidad de la Información en Salud/normas , Web Semántica , Web Semántica/normas , Programas Informáticos , Terminología como Asunto
12.
J Am Med Inform Assoc ; 24(2): 388-393, 2017 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-27413121

RESUMEN

Although omics datasets represent valuable assets for hypothesis generation, model testing, and data validation, the infrastructure supporting their reuse lacks organization and consistency. Using nuclear receptor signaling transcriptomic datasets as proof of principle, we developed a model to improve the discoverability, accessibility, and citability of published omics datasets. Primary datasets were retrieved from archives, processed to extract data points, then subjected to metadata enrichment and gap filling. The resulting secondary datasets were exposed on responsive web pages to support mining of gene lists, discovery of related datasets, and single-click citation integration with popular reference managers. Automated processes were established to embed digital object identifier-driven links to the secondary datasets in associated journal articles, small molecule and gene-centric databases, and a dataset search engine. Our model creates multiple points of access to reprocessed and reannotated derivative datasets across the digital biomedical research ecosystem, promoting their visibility and usability across disparate research communities.


Asunto(s)
Conjuntos de Datos como Asunto , Transcriptoma , Investigación Biomédica , Bases de Datos Genéticas , Genómica , Humanos , Metadatos
13.
Sci Data ; 3: 160010, 2016 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-26882539

RESUMEN

Genomic data sharing in cancer has been restricted to aggregate or controlled-access initiatives to protect the privacy of research participants. By limiting access to these data, it has been argued that the autonomy of individuals who decide to participate in data sharing efforts has been superseded and the utility of the data as research and educational tools reduced. In a pilot Open Access (OA) project from the CPRIT-funded Texas Cancer Research Biobank, many Texas cancer patients were willing to openly share genomic data from tumor and normal matched pair specimens. For the first time, genetic data from 7 human cancer cases with matched normal are freely available without requirement for data use agreements nor any major restriction except that end users cannot attempt to re-identify the participants (http://txcrb.org/open.html).


Asunto(s)
ADN de Neoplasias , Bases de Datos Genéticas , Genoma Humano , Neoplasias Pancreáticas/genética , Acceso a la Información , Bancos de Muestras Biológicas , Humanos , Difusión de la Información , Texas
14.
PLoS One ; 10(9): e0135615, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26325041

RESUMEN

Signaling pathways involving nuclear receptors (NRs), their ligands and coregulators, regulate tissue-specific transcriptomes in diverse processes, including development, metabolism, reproduction, the immune response and neuronal function, as well as in their associated pathologies. The Nuclear Receptor Signaling Atlas (NURSA) is a Consortium focused around a Hub website (www.nursa.org) that annotates and integrates diverse 'omics datasets originating from the published literature and NURSA-funded Data Source Projects (NDSPs). These datasets are then exposed to the scientific community on an Open Access basis through user-friendly data browsing and search interfaces. Here, we describe the redesign of the Hub, version 3.0, to deploy "Web 2.0" technologies and add richer, more diverse content. The Molecule Pages, which aggregate information relevant to NR signaling pathways from myriad external databases, have been enhanced to include resources for basic scientists, such as post-translational modification sites and targeting miRNAs, and for clinicians, such as clinical trials. A portal to NURSA's Open Access, PubMed-indexed journal Nuclear Receptor Signaling has been added to facilitate manuscript submissions. Datasets and information on reagents generated by NDSPs are available, as is information concerning periodic new NDSP funding solicitations. Finally, the new website integrates the Transcriptomine analysis tool, which allows for mining of millions of richly annotated public transcriptomic data points in the field, providing an environment for dataset re-use and citation, bench data validation and hypothesis generation. We anticipate that this new release of the NURSA database will have tangible, long term benefits for both basic and clinical research in this field.


Asunto(s)
Atlas como Asunto , Receptores Citoplasmáticos y Nucleares/fisiología , Transducción de Señal/fisiología , Animales , Conjuntos de Datos como Asunto , Humanos , Difusión de la Información , Internet
15.
Bioinformatics ; 31(10): 1655-62, 2015 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-25573920

RESUMEN

MOTIVATION: The probability of effective treatment of cancer with a targeted therapeutic can be improved for patients with defined genotypes containing actionable mutations. To this end, many human cancer biobanks are integrating more tightly with genomic sequencing facilities and with those creating and maintaining patient-derived xenografts (PDX) and cell lines to provide renewable resources for translational research. RESULTS: To support the complex data management needs and workflows of several such biobanks, we developed Acquire. It is a robust, secure, web-based, database-backed open-source system that supports all major needs of a modern cancer biobank. Its modules allow for i) up-to-the-minute 'scoreboard' and graphical reporting of collections; ii) end user roles and permissions; iii) specimen inventory through caTissue Suite; iv) shipping forms for distribution of specimens to pathology, genomic analysis and PDX/cell line creation facilities; v) robust ad hoc querying; vi) molecular and cellular quality control metrics to track specimens' progress and quality; vii) public researcher request; viii) resource allocation committee distribution request review and oversight and ix) linkage to available derivatives of specimen.


Asunto(s)
Bancos de Muestras Biológicas , Minería de Datos/métodos , Almacenamiento y Recuperación de la Información/métodos , Neoplasias , Control de Calidad , Programas Informáticos , Biología Computacional/métodos , Sistemas de Administración de Bases de Datos , Bases de Datos Factuales , Genómica , Humanos , Interfaz Usuario-Computador
16.
PLoS One ; 8(6): e65961, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23824211

RESUMEN

Current efforts to understand antibiotic resistance on the whole genome scale tend to focus on known genes even as high throughput sequencing strategies uncover novel mechanisms. To identify genomic variations associated with antibiotic resistance, we employed a modified genome-wide association study; we sequenced genomic DNA from pools of E. coli clinical isolates with similar antibiotic resistance phenotypes using SOLiD technology to uncover single nucleotide polymorphisms (SNPs) unanimously conserved in each pool. The multidrug-resistant pools were genotypically similar to SMS-3-5, a previously sequenced multidrug-resistant isolate from a polluted environment. The similarity was evenly spread across the entire genome and not limited to plasmid or pathogenicity island loci. Among the pools of clinical isolates, genomic variation was concentrated adjacent to previously reported inversion and duplication differences between the SMS-3-5 isolate and the drug-susceptible laboratory strain, DH10B. SNPs that result in non-synonymous changes in gyrA (encoding the well-known S83L allele associated with fluoroquinolone resistance), mutM, ligB, and recG were unanimously conserved in every fluoroquinolone-resistant pool. Alleles of the latter three genes are tightly linked among most sequenced E. coli genomes, and had not been implicated in antibiotic resistance previously. The changes in these genes map to amino acid positions in alpha helices that are involved in DNA binding. Plasmid-encoded complementation of null strains with either allelic variant of mutM or ligB resulted in variable responses to ultraviolet light or hydrogen peroxide treatment as markers of induced DNA damage, indicating their importance in DNA metabolism and revealing a potential mechanism for fluoroquinolone resistance. Our approach uncovered evidence that additional DNA binding enzymes may contribute to fluoroquinolone resistance and further implicate environmental bacteria as a reservoir for antibiotic resistance.


Asunto(s)
Antibacterianos/farmacología , Farmacorresistencia Bacteriana/genética , Escherichia coli/efectos de los fármacos , Fluoroquinolonas/farmacología , Genotipo , ADN Bacteriano/genética , Escherichia coli/genética , Pruebas de Sensibilidad Microbiana , Polimorfismo de Nucleótido Simple
17.
Mol Endocrinol ; 27(3): 548-54, 2013 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-23340253

RESUMEN

In order to understand the biology of the endometrium and potentially develop new diagnostic tools and treatments for endometrial diseases, the highly orchestrated gene expression/regulation that occurs within the uterus must first be understood. Even though a wealth of information on endometrial gene expression/regulation is available, this information is scattered across several different resources in formats that can be difficult for the average bench scientist to query, integrate, and utilize. The Endometrium Database Resource (EDR) was created as a single evolving resource for protein- and micro-RNA-encoding genes that have been shown by gene expression microarray, Northern blot, or other experiments in the literature to have their expression regulated in the uterus of humans, mice, rats, cows, domestic pigs, guinea pigs, and sheep. Genes are annotated in EDR with basic gene information (eg, gene symbol and chromosome), gene orthologs, and gene ontologies. Links are also provided to external resources for publication/s, nucleic and amino acid sequence, gene product function, and Gene Expression Omnibus (GEO) phase expression graph information. The resource also allows for direct comparison of relative gene expression in different microarray experiments for genes shown in the literature to be differentially expressed in the uterus. It is available via a user-friendly, web-based interface and is available without charge or restriction to the entire scientific community. The EDR can be accessed at http://edr.research.bcm.edu.


Asunto(s)
Bases de Datos Genéticas , Endometrio/metabolismo , Investigación , Animales , Femenino , Regulación de la Expresión Génica , Humanos , Anotación de Secuencia Molecular , Análisis de Secuencia por Matrices de Oligonucleótidos
18.
Mol Endocrinol ; 26(10): 1660-74, 2012 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-22902541

RESUMEN

The proteome represents the identity, expression levels, interacting partners, and posttranslational modifications of proteins expressed within any given cell. Proteomic studies aim to census the quantitative and qualitative factors regulating the biological relationships of proteins acting in concert as functional cellular networks. In the field of endocrinology, proteomics has been of considerable value in determining the function and mechanism of action of endocrine signaling molecules in the cell membrane, cytoplasm, and nucleus and for the discovery of proteins as candidates for clinical biomarkers. The volume of data that can be generated by proteomics methodologies, up to gigabytes of data within a few hours, brings with it its own logistical hurdles and presents significant challenges to realizing the full potential of these datasets. In this minireview, we describe selected current proteomics methodologies and their application in basic and translational endocrinology before focusing on mass spectrometry as a model for current progress and challenges in data analysis, management, sharing, and integration.


Asunto(s)
Gestión de la Información , Almacenamiento y Recuperación de la Información , Animales , Interpretación Estadística de Datos , Humanos , Difusión de la Información , Espectrometría de Masas , Análisis por Matrices de Proteínas , Proteoma/química , Proteoma/metabolismo , Proteómica , Transducción de Señal , Electroforesis Bidimensional Diferencial en Gel
19.
Physiol Genomics ; 44(17): 853-63, 2012 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-22786849

RESUMEN

The nuclear receptor (NR) superfamily of ligand-regulated transcription factors directs ligand- and tissue-specific transcriptomes in myriad developmental, metabolic, immunological, and reproductive processes. The NR signaling field has generated a wealth of genome-wide expression data points, but due to deficits in their accessibility, annotation, and integration, the full potential of these studies has not yet been realized. We searched public gene expression databases and MEDLINE for global transcriptomic datasets relevant to NRs, their ligands, and coregulators. We carried out extensive, deep reannotation of the datasets using controlled vocabularies for RNA Source and regulating molecule and resolved disparate gene identifiers to official gene symbols to facilitate comparison of fold changes and their significance across multiple datasets. We assembled these data points into a database, Transcriptomine (http://www.nursa.org/transcriptomine), that allows for multiple, menu-driven querying strategies of this transcriptomic "superdataset," including single and multiple genes, Gene Ontology terms, disease terms, and uploaded custom gene lists. Experimental variables such as regulating molecule, RNA Source, as well as fold-change and P value cutoff values can be modified, and full data records can be either browsed or downloaded for downstream analysis. We demonstrate the utility of Transcriptomine as a hypothesis generation and validation tool using in silico and experimental use cases. Our resource empowers users to instantly and routinely mine the collective biology of millions of previously disparate transcriptomic data points. By incorporating future transcriptome-wide datasets in the NR signaling field, we anticipate Transcriptomine developing into a powerful resource for the NR- and other signal transduction research communities.


Asunto(s)
Bases de Datos Genéticas , Internet , Receptores Citoplasmáticos y Nucleares/metabolismo , Transducción de Señal/genética , Programas Informáticos , Transcriptoma/genética , Animales , Diferenciación Celular/fisiología , Cartilla de ADN/genética , Células Madre Embrionarias/citología , Humanos , Ratones , Ratas , Reacción en Cadena en Tiempo Real de la Polimerasa
20.
Mol Endocrinol ; 26(10): 1675-81, 2012 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-22734043

RESUMEN

The National Institute of Diabetes, Digestive and Kidney Diseases (NIDDK) supports multiple basic science consortia that generate high-content datasets, reagent resources, and methodologies, in the fields of kidney, urology, hematology, digestive, and endocrine diseases, as well as metabolic diseases such as diabetes and obesity. These currently include the Beta Cell Biology Consortium, the Nuclear Receptor Signaling Atlas, the Diabetic Complications Consortium, and the Mouse Metabolic Phenotyping Centers. Recognizing the synergy that would accrue from aggregating information generated and curated by these initiatives in a contiguous informatics network, we created the NIDDK Consortium Interconnectivity Network (dkCOIN; www.dkcoin.org). The goal of this pilot project, organized by the NIDDK, was to establish a single point of access to a toolkit of interconnected resources (datasets, reagents, and protocols) generated from individual consortia that could be readily accessed by biologists of diverse backgrounds and research interests. During the pilot phase of this activity dkCOIN collected nearly 2000 consortium-curated resources, including datasets (functional genomics) and reagents (mouse strains, antibodies, and adenoviral constructs) and built nearly 3000 resource-to-resource connections, thereby demonstrating the feasibility of further extending this database in the future. Thus, dkCOIN promises to be a useful informatics solution for rapidly identifying useful resources generated by participating research consortia.


Asunto(s)
Difusión de la Información , Gestión de la Información , Academias e Institutos , Animales , Recolección de Datos , Bases de Datos Factuales/estadística & datos numéricos , Humanos , Internet , Ratones , National Institute of Diabetes and Digestive and Kidney Diseases (U.S.) , Proyectos Piloto , Investigación , Estados Unidos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA