Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 63
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Biom J ; 65(1): e2100186, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-35818698

RESUMEN

This work presents a joint spatial modeling framework to improve estimation of the spatial distribution of the latent COVID-19 incidence in Belgium, based on test-confirmed COVID-19 cases and crowd-sourced symptoms data as reported in a large-scale online survey. Correction is envisioned for stochastic dependence between the survey's response rate and spatial COVID-19 incidence, commonly known as preferential sampling, but not found significant. Results show that an online survey can provide valuable auxiliary data to optimize spatial COVID-19 incidence estimation based on confirmed cases in situations with limited testing capacity. Furthermore, it is shown that an online survey on COVID-19 symptoms with a sufficiently large sample size per spatial entity is capable of pinpointing the same locations that appear as test-confirmed clusters, approximately 1 week earlier. We conclude that a large-scale online study provides an inexpensive and flexible method to collect timely information of an epidemic during its early phase, which can be used by policy makers in an early phase of an epidemic and in conjunction with other monitoring systems.


Asunto(s)
COVID-19 , Humanos , COVID-19/epidemiología , SARS-CoV-2 , Autoinforme , Incidencia
2.
Proc Natl Acad Sci U S A ; 116(24): 11878-11887, 2019 06 11.
Artículo en Inglés | MEDLINE | ID: mdl-31127050

RESUMEN

Notwithstanding important advances in the context of single-variant pathogenicity identification, novel breakthroughs in discerning the origins of many rare diseases require methods able to identify more complex genetic models. We present here the Variant Combinations Pathogenicity Predictor (VarCoPP), a machine-learning approach that identifies pathogenic variant combinations in gene pairs (called digenic or bilocus variant combinations). We show that the results produced by this method are highly accurate and precise, an efficacy that is endorsed when validating the method on recently published independent disease-causing data. Confidence labels of 95% and 99% are identified, representing the probability of a bilocus combination being a true pathogenic result, providing geneticists with rational markers to evaluate the most relevant pathogenic combinations and limit the search space and time. Finally, the VarCoPP has been designed to act as an interpretable method that can provide explanations on why a bilocus combination is predicted as pathogenic and which biological information is important for that prediction. This work provides an important step toward the genetic understanding of rare diseases, paving the way to clinical knowledge and improved patient care.


Asunto(s)
Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Enfermedades Raras/genética , Marcadores Genéticos/genética , Humanos
3.
Euro Surveill ; 27(7)2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-35177167

RESUMEN

BackgroundCOVID-19 mortality, excess mortality, deaths per million population (DPM), infection fatality ratio (IFR) and case fatality ratio (CFR) are reported and compared for many countries globally. These measures may appear objective, however, they should be interpreted with caution.AimWe examined reported COVID-19-related mortality in Belgium from 9 March 2020 to 28 June 2020, placing it against the background of excess mortality and compared the DPM and IFR between countries and within subgroups.MethodsThe relation between COVID-19-related mortality and excess mortality was evaluated by comparing COVID-19 mortality and the difference between observed and weekly average predictions of all-cause mortality. DPM were evaluated using demographic data of the Belgian population. The number of infections was estimated by a stochastic compartmental model. The IFR was estimated using a delay distribution between infection and death.ResultsIn the study period, 9,621 COVID-19-related deaths were reported, which is close to the excess mortality estimated using weekly averages (8,985 deaths). This translates to 837 DPM and an IFR of 1.5% in the general population. Both DPM and IFR increase with age and are substantially larger in the nursing home population.DiscussionDuring the first pandemic wave, Belgium had no discrepancy between COVID-19-related mortality and excess mortality. In light of this close agreement, it is useful to consider the DPM and IFR, which are both age, sex, and nursing home population-dependent. Comparison of COVID-19 mortality between countries should rather be based on excess mortality than on COVID-19-related mortality.


Asunto(s)
COVID-19 , Bélgica/epidemiología , Humanos , Mortalidad , Casas de Salud , Pandemias , SARS-CoV-2
4.
Nat Methods ; 14(11): 1083-1086, 2017 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-28991892

RESUMEN

We present SCENIC, a computational method for simultaneous gene regulatory network reconstruction and cell-state identification from single-cell RNA-seq data (http://scenic.aertslab.org). On a compendium of single-cell data from tumors and brain, we demonstrate that cis-regulatory analysis can be exploited to guide the identification of transcription factors and cell states. SCENIC provides critical biological insights into the mechanisms driving cellular heterogeneity.


Asunto(s)
Redes Reguladoras de Genes , Análisis de la Célula Individual , Algoritmos , Animales , Encéfalo/metabolismo , Análisis por Conglomerados , Perfilación de la Expresión Génica , Humanos , Ratones
5.
Bioinformatics ; 35(12): 2159-2161, 2019 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-30445495

RESUMEN

SUMMARY: Inferring a Gene Regulatory Network (GRN) from gene expression data is a computationally expensive task, exacerbated by increasing data sizes due to advances in high-throughput gene profiling technology, such as single-cell RNA-seq. To equip researchers with a toolset to infer GRNs from large expression datasets, we propose GRNBoost2 and the Arboreto framework. GRNBoost2 is an efficient algorithm for regulatory network inference using gradient boosting, based on the GENIE3 architecture. Arboreto is a computational framework that scales up GRN inference algorithms complying with this architecture. Arboreto includes both GRNBoost2 and an improved implementation of GENIE3, as a user-friendly open source Python package. AVAILABILITY AND IMPLEMENTATION: Arboreto is available under the 3-Clause BSD license at http://arboreto.readthedocs.io. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Redes Reguladoras de Genes , Biología Computacional , Expresión Génica , Programas Informáticos
6.
Nucleic Acids Res ; 44(W1): W117-21, 2016 07 08.
Artículo en Inglés | MEDLINE | ID: mdl-27131783

RESUMEN

Genomic studies and high-throughput experiments often produce large lists of candidate genes among which only a small fraction are truly relevant to the disease, phenotype or biological process of interest. Gene prioritization tackles this problem by ranking candidate genes by profiling candidates across multiple genomic data sources and integrating this heterogeneous information into a global ranking. We describe an extended version of our gene prioritization method, Endeavour, now available for six species and integrating 75 data sources. The performance (Area Under the Curve) of Endeavour on cross-validation benchmarks using 'gold standard' gene sets varies from 88% (for human phenotypes) to 95% (for worm gene function). In addition, we have also validated our approach using a time-stamped benchmark derived from the Human Phenotype Ontology, which provides a setting close to prospective validation. With this benchmark, using 3854 novel gene-phenotype associations, we observe a performance of 82%. Altogether, our results indicate that this extended version of Endeavour efficiently prioritizes candidate genes. The Endeavour web server is freely available at https://endeavour.esat.kuleuven.be/.


Asunto(s)
Algoritmos , Predisposición Genética a la Enfermedad , Genotipo , Programas Informáticos , Animales , Benchmarking , Estudios de Asociación Genética , Humanos , Internet , Fenotipo
7.
Nat Methods ; 10(11): 1083-4, 2013 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-24076761

RESUMEN

Massively parallel sequencing greatly facilitates the discovery of novel disease genes causing Mendelian and oligogenic disorders. However, many mutations are present in any individual genome, and identifying which ones are disease causing remains a largely open problem. We introduce eXtasy, an approach to prioritize nonsynonymous single-nucleotide variants (nSNVs) that substantially improves prediction of disease-causing variants in exome sequencing data by integrating variant impact prediction, haploinsufficiency prediction and phenotype-specific gene prioritization.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Polimorfismo de Nucleótido Simple , Predisposición Genética a la Enfermedad , Humanos , Mutación , Fenotipo
8.
Nature ; 464(7289): 704-12, 2010 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-19812545

RESUMEN

Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently. For 4,978 of these CNVs, we generated reference genotypes from 450 individuals of European, African or East Asian ancestry. The predominant mutational mechanisms differ among CNV size classes. Retrotransposition has duplicated and inserted some coding and non-coding DNA segments randomly around the genome. Furthermore, by correlation with known trait-associated single nucleotide polymorphisms (SNPs), we identified 30 loci with CNVs that are candidates for influencing disease susceptibility. Despite this, having assessed the completeness of our map and the patterns of linkage disequilibrium between CNVs and SNPs, we conclude that, for complex traits, the heritability void left by genome-wide association studies will not be accounted for by common CNVs.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Predisposición Genética a la Enfermedad/genética , Genoma Humano/genética , Mutagénesis/genética , Duplicación de Gen , Estudio de Asociación del Genoma Completo , Genotipo , Haplotipos/genética , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple/genética , Grupos Raciales/genética , Reproducibilidad de los Resultados
9.
Nature ; 464(7289): 713-20, 2010 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-20360734

RESUMEN

Copy number variants (CNVs) account for a major proportion of human genetic polymorphism and have been predicted to have an important role in genetic susceptibility to common disease. To address this we undertook a large, direct genome-wide study of association between CNVs and eight common human diseases. Using a purpose-designed array we typed approximately 19,000 individuals into distinct copy-number classes at 3,432 polymorphic CNVs, including an estimated approximately 50% of all common CNVs larger than 500 base pairs. We identified several biological artefacts that lead to false-positive associations, including systematic CNV differences between DNAs derived from blood and cell lines. Association testing and follow-up replication analyses confirmed three loci where CNVs were associated with disease-IRGM for Crohn's disease, HLA for Crohn's disease, rheumatoid arthritis and type 1 diabetes, and TSPAN8 for type 2 diabetes-although in each case the locus had previously been identified in single nucleotide polymorphism (SNP)-based studies, reflecting our observation that most common CNVs that are well-typed on our array are well tagged by SNPs and so have been indirectly explored through SNP studies. We conclude that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis of common human diseases.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Enfermedad , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Artritis Reumatoide/genética , Estudios de Casos y Controles , Enfermedad de Crohn/genética , Diabetes Mellitus/genética , Frecuencia de los Genes/genética , Humanos , Hibridación de Ácido Nucleico , Análisis de Secuencia por Matrices de Oligonucleótidos , Proyectos Piloto , Polimorfismo de Nucleótido Simple/genética , Control de Calidad
10.
Nucleic Acids Res ; 41(11): e118, 2013 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-23605045

RESUMEN

The introduction of next generation sequencing methods in genome studies has made it possible to shift research from a gene-centric approach to a genome wide view. Although methods and tools to detect single nucleotide polymorphisms are becoming more mature, methods to identify and visualize structural variation (SV) are still in their infancy. Most genome browsers can only compare a given sequence to a reference genome; therefore, direct comparison of multiple individuals still remains a challenge. Therefore, the implementation of efficient approaches to explore and visualize SVs and directly compare two or more individuals is desirable. In this article, we present a visualization approach that uses space-filling Hilbert curves to explore SVs based on both read-depth and pair-end information. An interactive open-source Java application, called Meander, implements the proposed methodology, and its functionality is demonstrated using two cases. With Meander, users can explore variations at different levels of resolution and simultaneously compare up to four different individuals against a common reference. The application was developed using Java version 1.6 and Processing.org and can be run on any platform. It can be found at http://homes.esat.kuleuven.be/~bioiuser/meander.


Asunto(s)
Variación Estructural del Genoma , Programas Informáticos , Arabidopsis/genética , Neoplasias de la Mama/genética , Cromosomas , Femenino , Genómica , Humanos
11.
Bioinformatics ; 29(14): 1801-2, 2013 Jul 15.
Artículo en Inglés | MEDLINE | ID: mdl-23658417

RESUMEN

SUMMARY: TrioVis is a visual analytics tool developed for filtering on coverage and variant frequency for genomic variants from exome sequencing of parent-child trios. In TrioVis, the variant data are organized by grouping each variant based on the laws of Mendelian inheritance. Taking three Variant Call Format files as input, TrioVis allows the user to test different coverage thresholds (i.e. different levels of stringency), to find the optimal threshold values tailored to their hypotheses and to gain insights into the global effects of filtering through interaction. AVAILABILITY: Executables, source code and sample data are available at https://bitbucket.org/biovizleuven/triovis. Screencast is available at http://vimeo.com/user6757771/triovis. CONTACT: ryo.sakai@esat.kuleuven.be.


Asunto(s)
Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN , Programas Informáticos , Niño , Gráficos por Computador , Exoma , Genómica/métodos , Humanos , Padres
12.
Bioinformatics ; 29(17): 2206-7, 2013 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-23803468

RESUMEN

SUMMARY: Pipit is a gene-centric interactive visualization tool designed to study structural genomic variations. Through focusing on individual genes as the functional unit, researchers are able to study and generate hypotheses on the biological impact of different structural variations, for instance, the deletion of dosage-sensitive genes or the formation of fusion genes. Pipit is a cross-platform Java application that visualizes structural variation data from Genome Variation Format files. AVAILABILITY: Executables, source code, sample data, documentation and screencast are available at https://bitbucket.org/biovizleuven/pipit.


Asunto(s)
Variación Estructural del Genoma , Programas Informáticos , Animales , Gráficos por Computador , Genes , Genoma , Ratones
13.
Hum Reprod ; 29(4): 842-51, 2014 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-24491297

RESUMEN

STUDY QUESTION: What are the analytical and clinical validity and the clinical utility of in vitro screening of embryos by whole-genome sequencing? SUMMARY ANSWER: At present there are still many limitations in terms of analytical and clinical validity and utility and many ethical questions remain. WHAT IS KNOWN ALREADY: Whole-genome sequencing of IVF/ICSI embryos is technically possible. Many loss-of-function mutations exist in the general population without serious effects on the phenotype of the individual. Moreover, annotations of genes and the reference genome are still not 100% correct. STUDY DESIGN, SIZE, DURATION: We used publicly available samples from the 1000 Genomes project and Complete Genomics, together with 42 samples from in-house research samples of parents from trios to investigate the presence of loss-of-function mutations in healthy individuals. PARTICIPANTS/MATERIALS, SETTING, METHODS: In the samples, we looked for mutations in genes that are associated with a selection of severe Mendelian disorders with a known molecular basis. We looked for mutations predicted to be damaging by PolyPhen and SIFT and for mutations annotated as disease causing in Human Genome Mutation Database (HGMD). MAIN RESULTS AND THE ROLE OF CHANCE: More than 40% of individuals who can be considered healthy have mutations that are predicted to be damaging in genes associated with severe Mendelian disorders or are annotated as disease causing. LIMITATIONS, REASONS FOR CAUTION: The analysis relies on current knowledge and databases are continuously updated to reflect our increasing knowledge about the genome. In the process of our analysis several updates were already made. WIDER IMPLICATIONS OF THE FINDINGS: At this moment it is not advisable to use whole-genome sequencing as a tool to set up health profiles to select embryos for transfer. We also raise some ethical questions that have to be addressed before this technology can be used for embryo selection. TRIAL REGISTRATION NUMBER: N/A.


Asunto(s)
Genoma Humano , Diagnóstico Preimplantación/métodos , Blastocisto , Análisis Mutacional de ADN , Humanos , Diagnóstico Preimplantación/ética , Diagnóstico Preimplantación/tendencias , Medición de Riesgo/métodos
14.
Front Bioinform ; 4: 1331043, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38375239

RESUMEN

Current visualizations in microbiome research rely on aggregations in taxonomic classifications or do not show less abundant taxa. We introduce Snowflake: a new visualization method that creates a clear overview of the microbiome composition in collected samples without losing any information due to classification or neglecting less abundant reads. Snowflake displays every observed OTU/ASV in the microbiome abundance table and provides a solution to include the data's hierarchical structure and additional information obtained from downstream analysis (e.g., alpha- and beta-diversity) and metadata. Based on the value-driven ICE-T evaluation methodology, Snowflake was positively received. Experts in microbiome research found the visualizations to be user-friendly and detailed and liked the possibility of including and relating additional information to the microbiome's composition. Exploring the topological structure of the microbiome abundance table allows them to quickly identify which taxa are unique to specific samples and which are shared among multiple samples (i.e., separating sample-specific taxa from the core microbiome), and see the compositional differences between samples. An R package for constructing and visualizing Snowflake microbiome composition graphs is available at https://gitlab.com/vda-lab/snowflake.

15.
IEEE Trans Vis Comput Graph ; 30(1): 649-660, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37934634

RESUMEN

This paper is a call to action for research and discussion on data visualization education. As visualization evolves and spreads through our professional and personal lives, we need to understand how to support and empower a broad and diverse community of learners in visualization. Data Visualization is a diverse and dynamic discipline that combines knowledge from different fields, is tailored to suit diverse audiences and contexts, and frequently incorporates tacit knowledge. This complex nature leads to a series of interrelated challenges for data visualization education. Driven by a lack of consolidated knowledge, overview, and orientation for visualization education, the 21 authors of this paper-educators and researchers in data visualization-identify and describe 19 challenges informed by our collective practical experience. We organize these challenges around seven themes People, Goals & Assessment, Environment, Motivation, Methods, Materials, and Change. Across these themes, we formulate 43 research questions to address these challenges. As part of our call to action, we then conclude with 5 cross-cutting opportunities and respective action items: embrace DIVERSITY+INCLUSION, build COMMUNITIES, conduct RESEARCH, act AGILE, and relish RESPONSIBILITY. We aim to inspire researchers, educators and learners to drive visualization education forward and discuss why, how, who and where we educate, as we learn to use visualization to address challenges across many scales and many domains in a rapidly changing world: viseducationchallenges.github.io.

16.
Bioinformatics ; 28(7): 1035-7, 2012 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-22332238

RESUMEN

SUMMARY: Biogem provides a software development environment for the Ruby programming language, which encourages community-based software development for bioinformatics while lowering the barrier to entry and encouraging best practices. Biogem, with its targeted modular and decentralized approach, software generator, tools and tight web integration, is an improved general model for scaling up collaborative open source software development in bioinformatics. AVAILABILITY: Biogem and modules are free and are OSS. Biogem runs on all systems that support recent versions of Ruby, including Linux, Mac OS X and Windows. Further information at http://www.biogems.info. A tutorial is available at http://www.biogems.info/howto.html CONTACT: bonnal@ingm.org.


Asunto(s)
Biología Computacional/métodos , Internet , Lenguajes de Programación , Programas Informáticos
17.
PLoS One ; 18(12): e0295361, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38096184

RESUMEN

One of the challenges in multi-omics data analysis for precision medicine is the efficient exploration of undiscovered molecular interactions in disease processes. We present BioMOBS, a workflow consisting of two data visualization tools integrated with an open-source molecular information database to perform clinically relevant analyses (https://github.com/driesheylen123/BioMOBS). We performed exploratory pathway analysis with BioMOBS and demonstrate its ability to generate relevant molecular hypotheses, by reproducing recent findings in type 2 diabetes UK biobank data. The central visualisation tool, where data-driven and literature-based findings can be integrated, is available within the github link as well. BioMOBS is a workflow that leverages information from multiple data-driven interactive analyses and visually integrates it with established pathway knowledge. The demonstrated use cases place trust in the usage of BioMOBS as a procedure to offer clinically relevant insights in disease pathway analyses on various types of omics data.


Asunto(s)
Diabetes Mellitus Tipo 2 , Programas Informáticos , Humanos , Multiómica , Flujo de Trabajo
18.
Eur Urol Open Sci ; 56: 39-46, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37822515

RESUMEN

Background: The Prostate Cancer Radiological Estimation of Change in Sequential Evaluation (PRECISE) score has been developed to standardise prostate magnetic resonance imaging (MRI) reporting in men on active surveillance (AS) for prostate cancer (PCa). Objective: To evaluate the feasibility of PRECISE scoring and assess its diagnostic accuracy. Design setting and participants: All PCa patients on AS with a baseline MRI and at least one follow-up MRI scan between January 2008 and September 2022 at a single tertiary referral centre were included in a database. The follow-up protocol of the Prostate Cancer International Active Surveillance (PRIAS) study was used. All scans were retrospectively re-reported by a dedicated uroradiologist and appointed a Prostate Imaging Reporting and Data System (version 2.1) and PRECISE score. Outcome measurements and statistical analysis: Clinically significant progression was defined by histopathological upgrading (on biopsy or radical prostatectomy) to grade group ≥3 and/or evolution to T3 stage. A survival analysis was performed to assess differential progression-free survival (PFS) according to the PRECISE score. Results and limitations: A total of 188 patients were included for an analysis with a total of 358 repeat MRI scans and 144 repeat biopsies. The median follow-up was 46 mo (interquartile range 21-74). Radiological progression (PRECISE 4-5) had sensitivity, specificity, negative predictive value, and positive predictive value of, respectively, 78%, 70%, 90%, and 49% for clinically significant progression. Four-year PFS was 91% for PRECISE 1-3 versus 66% for PRECISE 4-5 (p < 0.001). In total, 137 patients underwent a confirmation MRI scan within 18 mo after diagnosis. Four-year PFS in this group was 81% for PRECISE 1-3 versus 43% for PRECISE 4-5 (p < 0.001). Limitations include retrospective design and no strict adherence to AS protocol. Conclusions: Implementation of PRECISE scoring for PCa patients on AS is feasible and offers a prognostic value. Patients with PRECISE score 4-5 on confirmation MRI within 18 mo after diagnosis have a three-fold higher risk of clinically significant progression after 4 yr. Patient summary: Patients with low-risk prostate cancer can be followed up carefully. In this study, we evaluate the standardised reporting of repeat magnetic resonance imaging scans (using the Prostate Cancer Radiological Estimation of Change in Sequential Evaluation [PRECISE] recommendations). PRECISE scoring is feasible and helps identify patients in need of further treatment.

19.
BMC Bioinformatics ; 13: 45, 2012 Mar 22.
Artículo en Inglés | MEDLINE | ID: mdl-22439608

RESUMEN

BACKGROUND: Elucidating the genotype-phenotype connection is one of the big challenges of modern molecular biology. To fully understand this connection, it is necessary to consider the underlying networks and the time factor. In this context of data deluge and heterogeneous information, visualization plays an essential role in interpreting complex and dynamic topologies. Thus, software that is able to bring the network, phenotypic and temporal information together is needed. Arena3D has been previously introduced as a tool that facilitates link discovery between processes. It uses a layered display to separate different levels of information while emphasizing the connections between them. We present novel developments of the tool for the visualization and analysis of dynamic genotype-phenotype landscapes. RESULTS: Version 2.0 introduces novel features that allow handling time course data in a phenotypic context. Gene expression levels or other measures can be loaded and visualized at different time points and phenotypic comparison is facilitated through clustering and correlation display or highlighting of impacting changes through time. Similarity scoring allows the identification of global patterns in dynamic heterogeneous data. In this paper we demonstrate the utility of the tool on two distinct biological problems of different scales. First, we analyze a medium scale dataset that looks at perturbation effects of the pluripotency regulator Nanog in murine embryonic stem cells. Dynamic cluster analysis suggests alternative indirect links between Nanog and other proteins in the core stem cell network. Moreover, recurrent correlations from the epigenetic to the translational level are identified. Second, we investigate a large scale dataset consisting of genome-wide knockdown screens for human genes essential in the mitotic process. Here, a potential new role for the gene lsm14a in cytokinesis is suggested. We also show how phenotypic patterning allows for extensive comparison and identification of high impact knockdown targets. CONCLUSIONS: We present a new visualization approach for perturbation screens with multiple phenotypic outcomes. The novel functionality implemented in Arena3D enables effective understanding and comparison of temporal patterns within morphological layers, to help with the system-wide analysis of dynamic processes. Arena3D is available free of charge for academics as a downloadable standalone application from: http://arena3d.org/.


Asunto(s)
Genotipo , Fenotipo , Programas Informáticos , Animales , Análisis por Conglomerados , Células Madre Embrionarias/metabolismo , Epigénesis Genética , Técnicas de Silenciamiento del Gen , Humanos , Ratones , Biosíntesis de Proteínas , Proteínas/metabolismo , Transcripción Genética
20.
BMC Bioinformatics ; 13: 240, 2012 Sep 21.
Artículo en Inglés | MEDLINE | ID: mdl-22994508

RESUMEN

BACKGROUND: The University of California, Santa Cruz (UCSC) genome database is among the most used sources of genomic annotation in human and other organisms. The database offers an excellent web-based graphical user interface (the UCSC genome browser) and several means for programmatic queries. A simple application programming interface (API) in a scripting language aimed at the biologist was however not yet available. Here, we present the Ruby UCSC API, a library to access the UCSC genome database using Ruby. RESULTS: The API is designed as a BioRuby plug-in and built on the ActiveRecord 3 framework for the object-relational mapping, making writing SQL statements unnecessary. The current version of the API supports databases of all organisms in the UCSC genome database including human, mammals, vertebrates, deuterostomes, insects, nematodes, and yeast.The API uses the bin index-if available-when querying for genomic intervals. The API also supports genomic sequence queries using locally downloaded *.2bit files that are not stored in the official MySQL database. The API is implemented in pure Ruby and is therefore available in different environments and with different Ruby interpreters (including JRuby). CONCLUSIONS: Assisted by the straightforward object-oriented design of Ruby and ActiveRecord, the Ruby UCSC API will facilitate biologists to query the UCSC genome database programmatically. The API is available through the RubyGem system. Source code and documentation are available at https://github.com/misshie/bioruby-ucsc-api/ under the Ruby license. Feedback and help is provided via the website at http://rubyucscapi.userecho.com/.


Asunto(s)
Bases de Datos Genéticas , Genómica , Programas Informáticos , Animales , Genoma , Humanos , Internet , Invertebrados/genética , Vertebrados/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA