Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
BMC Bioinformatics ; 24(1): 221, 2023 May 31.
Artículo en Inglés | MEDLINE | ID: mdl-37259021

RESUMEN

BACKGROUND: As genome sequencing becomes better integrated into scientific research, government policy, and personalized medicine, the primary challenge for researchers is shifting from generating raw data to analyzing these vast datasets. Although much work has been done to reduce compute times using various configurations of traditional CPU computing infrastructures, Graphics Processing Units (GPUs) offer opportunities to accelerate genomic workflows by orders of magnitude. Here we benchmark one GPU-accelerated software suite called NVIDIA Parabricks on Amazon Web Services (AWS), Google Cloud Platform (GCP), and an NVIDIA DGX cluster. We benchmarked six variant calling pipelines, including two germline callers (HaplotypeCaller and DeepVariant) and four somatic callers (Mutect2, Muse, LoFreq, SomaticSniper). RESULTS: We achieved up to 65 × acceleration with germline variant callers, bringing HaplotypeCaller runtimes down from 36 h to 33 min on AWS, 35 min on GCP, and 24 min on the NVIDIA DGX. Somatic callers exhibited more variation between the number of GPUs and computing platforms. On cloud platforms, GPU-accelerated germline callers resulted in cost savings compared with CPU runs, whereas some somatic callers were more expensive than CPU runs because their GPU acceleration was not sufficient to overcome the increased GPU cost. CONCLUSIONS: Germline variant callers scaled well with the number of GPUs across platforms, whereas somatic variant callers exhibited more variation in the number of GPUs with the fastest runtimes, suggesting that, at least with the version of Parabricks used here, these workflows are less GPU optimized and require benchmarking on the platform of choice before being deployed at production scales. Our study demonstrates that GPUs can be used to greatly accelerate genomic workflows, thus bringing closer to grasp urgent societal advances in the areas of biosurveillance and personalized medicine.


Asunto(s)
Gráficos por Computador , Programas Informáticos , Flujo de Trabajo , Genómica
4.
Cell Syst ; 9(5): 417-421, 2019 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-31677972

RESUMEN

As more digital resources are produced by the research community, it is becoming increasingly important to harmonize and organize them for synergistic utilization. The findable, accessible, interoperable, and reusable (FAIR) guiding principles have prompted many stakeholders to consider strategies for tackling this challenge. The FAIRshake toolkit was developed to enable the establishment of community-driven FAIR metrics and rubrics paired with manual and automated FAIR assessments. FAIR assessments are visualized as an insignia that can be embedded within digital-resources-hosting websites. Using FAIRshake, a variety of biomedical digital resources were manually and automatically evaluated for their level of FAIRness.


Asunto(s)
Difusión de la Información/métodos , Internet/tendencias , Sistemas en Línea/normas , Recursos en Salud/normas , Humanos
5.
PLoS Biol ; 15(4): e2001818, 2017 04.
Artículo en Inglés | MEDLINE | ID: mdl-28388615

RESUMEN

The thesis presented here is that biomedical research is based on the trusted exchange of services. That exchange would be conducted more efficiently if the trusted software platforms to exchange those services, if they exist, were more integrated. While simpler and narrower in scope than the services governing biomedical research, comparison to existing internet-based platforms, like Airbnb, can be informative. We illustrate how the analogy to internet-based platforms works and does not work and introduce The Commons, under active development at the National Institutes of Health (NIH) and elsewhere, as an example of the move towards platforms for research.


Asunto(s)
Investigación Biomédica/normas , Sistemas de Administración de Bases de Datos/normas , Difusión de la Información/métodos , Evaluación de Programas y Proyectos de Salud/normas , Cambio Social , Confianza , Animales , Investigación Biomédica/tendencias , Barreras de Comunicación , Sistemas de Administración de Bases de Datos/tendencias , Eficiencia , Humanos , Internet , National Institutes of Health (U.S.) , Publicaciones Periódicas como Asunto/normas , Publicaciones Periódicas como Asunto/tendencias , Evaluación de Programas y Proyectos de Salud/tendencias , Apoyo a la Investigación como Asunto/tendencias , Mala Conducta Científica , Programas Informáticos , Transferencia de Tecnología , Estados Unidos , Recursos Humanos
7.
Nat Biotechnol ; 29(5): 415-20, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21552244

RESUMEN

Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences--the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental packages' apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere.


Asunto(s)
Biomarcadores , Ambiente , Metagenómica/normas , Análisis de Secuencia de ADN/normas , Lista de Verificación , Bases de Datos Genéticas , Genes de ARNr , Variación Genética , Humanos , Almacenamiento y Recuperación de la Información/normas , Internet , Lenguajes de Programación , Programas Informáticos
8.
Genome Res ; 19(12): 2317-23, 2009 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-19819907

RESUMEN

The Human Microbiome Project (HMP), funded as an initiative of the NIH Roadmap for Biomedical Research (http://nihroadmap.nih.gov), is a multi-component community resource. The goals of the HMP are: (1) to take advantage of new, high-throughput technologies to characterize the human microbiome more fully by studying samples from multiple body sites from each of at least 250 "normal" volunteers; (2) to determine whether there are associations between changes in the microbiome and health/disease by studying several different medical conditions; and (3) to provide both a standardized data resource and new technological approaches to enable such studies to be undertaken broadly in the scientific community. The ethical, legal, and social implications of such research are being systematically studied as well. The ultimate objective of the HMP is to demonstrate that there are opportunities to improve human health through monitoring or manipulation of the human microbiome. The history and implementation of this new program are described here.


Asunto(s)
Bacterias , Tracto Gastrointestinal/microbiología , Metagenoma/genética , Boca/microbiología , National Institutes of Health (U.S.) , Piel/microbiología , Vagina/microbiología , Bacterias/clasificación , Bacterias/genética , Bacterias/aislamiento & purificación , Femenino , Humanos , Programas Nacionales de Salud , ARN Ribosómico 16S/genética , Análisis de Secuencia de ADN , Estados Unidos
9.
Nucleic Acids Res ; 30(1): 129-36, 2002 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-11752274

RESUMEN

The Celera Discovery System (CDS) is a web-accessible research workbench for mining genomic and related biological information. Users have access to the human and mouse genome sequences with annotation presented in summary form in BioMolecule Reports for genes, transcripts and proteins. Over 40 additional databases are available, including sequence, mapping, mutation, genetic variation, mRNA expression, protein structure, motif and classification data. Data are accessible by browsing reports, through a variety of interactive graphical viewers, and by advanced query capability provided by the LION SRS search engine. A growing number of sequence analysis tools are available, including sequence similarity, pattern searching, multiple sequence alignment and Hidden Markov Model search. A user workspace keeps track of queries and analyses. CDS is widely used by the academic research community and requires a subscription for access. The system and academic pricing information are available at http://cds.celera.com.


Asunto(s)
Bases de Datos Genéticas , Genoma , Animales , Mapeo Cromosómico , Cromosomas Humanos , Gráficos por Computador , Secuencia Conservada , Predicción , Variación Genética , Humanos , Almacenamiento y Recuperación de la Información , Internet , Cadenas de Markov , Ratones , Mutación , Proteínas/clasificación , Proteínas/genética , Proteínas/fisiología , ARN Mensajero/genética , Alineación de Secuencia , Homología de Secuencia , Transcripción Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...