Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
STAR Protoc ; 4(1): 102069, 2023 03 17.
Artículo en Inglés | MEDLINE | ID: mdl-36853701

RESUMEN

Understanding cellular metabolism is important across biotechnology and biomedical research and has critical implications in a broad range of normal and pathological conditions. Here, we introduce the user-friendly web-based platform ImmCellFie, which allows the comprehensive analysis of metabolic functions inferred from transcriptomic or proteomic data. We explain how to set up a run using publicly available omics data and how to visualize the results. The ImmCellFie algorithm pushes beyond conventional statistical enrichment and incorporates complex biological mechanisms to quantify cell activity. For complete details on the use and execution of this protocol, please refer to Richelle et al. (2021).1.


Asunto(s)
Biología Computacional , Proteómica , Proteómica/métodos , Biología Computacional/métodos , Algoritmos , Internet
2.
Mol Psychiatry ; 28(2): 822-833, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36266569

RESUMEN

Autism Spectrum Disorder (ASD) diagnosis remains behavior-based and the median age of diagnosis is ~52 months, nearly 5 years after its first-trimester origin. Accurate and clinically-translatable early-age diagnostics do not exist due to ASD genetic and clinical heterogeneity. Here we collected clinical, diagnostic, and leukocyte RNA data from 240 ASD and typically developing (TD) toddlers (175 toddlers for training and 65 for test). To identify gene expression ASD diagnostic classifiers, we developed 42,840 models composed of 3570 gene expression feature selection sets and 12 classification methods. We found that 742 models had AUC-ROC ≥ 0.8 on both Training and Test sets. Weighted Bayesian model averaging of these 742 models yielded an ensemble classifier model with accurate performance in Training and Test gene expression datasets with ASD diagnostic classification AUC-ROC scores of 85-89% and AUC-PR scores of 84-92%. ASD toddlers with ensemble scores above and below the overall ASD ensemble mean of 0.723 (on a scale of 0 to 1) had similar diagnostic and psychometric scores, but those below this ASD ensemble mean had more prenatal risk events than TD toddlers. Ensemble model feature genes were involved in cell cycle, inflammation/immune response, transcriptional gene regulation, cytokine response, and PI3K-AKT, RAS and Wnt signaling pathways. We additionally collected targeted DNA sequencing smMIPs data on a subset of ASD risk genes from 217 of the 240 ASD and TD toddlers. This DNA sequencing found about the same percentage of SFARI Level 1 and 2 ASD risk gene mutations in TD (12 of 105) as in ASD (13 of 112) toddlers, and classification based only on the presence of mutation in these risk genes performed at a chance level of 49%. By contrast, the leukocyte ensemble gene expression classifier correctly diagnostically classified 88% of TD and ASD toddlers with ASD risk gene mutations. Our ensemble ASD gene expression classifier is diagnostically predictive and replicable across different toddler ages, races, and ethnicities; out-performs a risk gene mutation classifier; and has potential for clinical translation.


Asunto(s)
Trastorno del Espectro Autista , Humanos , Preescolar , Lactante , Trastorno del Espectro Autista/diagnóstico , Trastorno del Espectro Autista/genética , Teorema de Bayes , Fosfatidilinositol 3-Quinasas , Inmunidad , Expresión Génica
3.
Cell Rep Methods ; 1(3)2021 07 26.
Artículo en Inglés | MEDLINE | ID: mdl-34761247

RESUMEN

Omics experiments are ubiquitous in biological studies, leading to a deluge of data. However, it is still challenging to connect changes in these data to changes in cell functions because of complex interdependencies between genes, proteins, and metabolites. Here, we present a framework allowing researchers to infer how metabolic functions change on the basis of omics data. To enable this, we curated and standardized lists of metabolic tasks that mammalian cells can accomplish. Genome-scale metabolic networks were used to define gene sets associated with each metabolic task. We further developed a framework to overlay omics data on these sets and predict pathway usage for each metabolic task. We demonstrated how this approach can be used to quantify metabolic functions of diverse biological samples from the single cell to whole tissues and organs by using multiple transcriptomic datasets. To facilitate its adoption, we integrated the approach into GenePattern (www.genepattern.org-CellFie).


Asunto(s)
Genoma , Redes y Vías Metabólicas , Animales , Redes y Vías Metabólicas/genética , Fenómenos Fisiológicos Celulares , Perfilación de la Expresión Génica , Transcriptoma/genética , Mamíferos/genética
4.
BMC Bioinformatics ; 22(1): 374, 2021 Jul 20.
Artículo en Inglés | MEDLINE | ID: mdl-34284719

RESUMEN

BACKGROUND: As exome sequencing (ES) integrates into clinical practice, we should make every effort to utilize all information generated. Copy-number variation can lead to Mendelian disorders, but small copy-number variants (CNVs) often get overlooked or obscured by under-powered data collection. Many groups have developed methodology for detecting CNVs from ES, but existing methods often perform poorly for small CNVs and rely on large numbers of samples not always available to clinical laboratories. Furthermore, methods often rely on Bayesian approaches requiring user-defined priors in the setting of insufficient prior knowledge. This report first demonstrates the benefit of multiplexed exome capture (pooling samples prior to capture), then presents a novel detection algorithm, mcCNV ("multiplexed capture CNV"), built around multiplexed capture. RESULTS: We demonstrate: (1) multiplexed capture reduces inter-sample variance; (2) our mcCNV method, a novel depth-based algorithm for detecting CNVs from multiplexed capture ES data, improves the detection of small CNVs. We contrast our novel approach, agnostic to prior information, with the the commonly-used ExomeDepth. In a simulation study mcCNV demonstrated a favorable false discovery rate (FDR). When compared to calls made from matched genome sequencing, we find the mcCNV algorithm performs comparably to ExomeDepth. CONCLUSION: Implementing multiplexed capture increases power to detect single-exon CNVs. The novel mcCNV algorithm may provide a more favorable FDR than ExomeDepth. The greatest benefits of our approach derive from (1) not requiring a database of reference samples and (2) not requiring prior information about the prevalance or size of variants.


Asunto(s)
Variaciones en el Número de Copia de ADN , Exoma , Algoritmos , Teorema de Bayes , Exoma/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Secuenciación del Exoma
5.
Cell Syst ; 9(5): 417-421, 2019 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-31677972

RESUMEN

As more digital resources are produced by the research community, it is becoming increasingly important to harmonize and organize them for synergistic utilization. The findable, accessible, interoperable, and reusable (FAIR) guiding principles have prompted many stakeholders to consider strategies for tackling this challenge. The FAIRshake toolkit was developed to enable the establishment of community-driven FAIR metrics and rubrics paired with manual and automated FAIR assessments. FAIR assessments are visualized as an insignia that can be embedded within digital-resources-hosting websites. Using FAIRshake, a variety of biomedical digital resources were manually and automatically evaluated for their level of FAIRness.


Asunto(s)
Difusión de la Información/métodos , Internet/tendencias , Sistemas en Línea/normas , Recursos en Salud/normas , Humanos
6.
Artículo en Inglés | MEDLINE | ID: mdl-31119199

RESUMEN

Electronic Health Record (EHR) systems typically define laboratory test results using the Laboratory Observation Identifier Names and Codes (LOINC) and can transmit them using Fast Healthcare Interoperability Resource (FHIR) standards. LOINC has not yet been semantically integrated with computational resources for phenotype analysis. Here, we provide a method for mapping LOINC-encoded laboratory test results transmitted in FHIR standards to Human Phenotype Ontology (HPO) terms. We annotated the medical implications of 2923 commonly used laboratory tests with HPO terms. Using these annotations, our software assesses laboratory test results and converts each result into an HPO term. We validated our approach with EHR data from 15,681 patients with respiratory complaints and identified known biomarkers for asthma. Finally, we provide a freely available SMART on FHIR application that can be used within EHR systems. Our approach allows readily available laboratory tests in EHR to be reused for deep phenotyping and exploits the hierarchical structure of HPO to integrate distinct tests that have comparable medical interpretations for association studies.

7.
J Pers Med ; 8(2)2018 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-29710874

RESUMEN

As part of the Heart Healthy Lenoir Project, we developed a practice level intervention to improve blood pressure control. The goal of this study was: (i) to determine if single nucleotide polymorphisms (SNPs) that associate with blood pressure variation, identified in large studies, are applicable to blood pressure control in subjects from a rural population; (ii) to measure the association of these SNPs with subjects' responsiveness to the hypertension intervention; and (iii) to identify other SNPs that may help understand patient-specific responses to an intervention. We used a combination of candidate SNPs and genome-wide analyses to test associations with either baseline systolic blood pressure (SBP) or change in systolic blood pressure one year after the intervention in two genetically defined ancestral groups: African Americans (AA) and Caucasian Americans (CAU). Of the 48 candidate SNPs, 13 SNPs associated with baseline SBP in our study; however, one candidate SNP, rs592582, also associated with a change in SBP after one year. Using our study data, we identified 4 and 15 additional loci that associated with a change in SBP in the AA and CAU groups, respectively. Our analysis of gene-age interactions identified genotypes associated with SBP improvement within different age groups of our populations. Moreover, our integrative analysis identified AQP4-AS1 and PADI2 as genes whose expression levels may contribute to the pleiotropy of complex traits involved in cardiovascular health and blood pressure regulation in response to an intervention targeting hypertension. In conclusion, the identification of SNPs associated with the success of a hypertension treatment intervention suggests that genetic factors in combination with age may contribute to an individual's success in lowering SBP. If these findings prove to be applicable to other populations, the use of this genetic variation in making patient-specific interventions may help providers with making decisions to improve patient outcomes. Further investigation is required to determine the role of this genetic variance with respect to the management of hypertension such that more precise treatment recommendations may be made in the future as part of personalized medicine.

8.
G3 (Bethesda) ; 8(6): 2107-2119, 2018 05 31.
Artículo en Inglés | MEDLINE | ID: mdl-29686110

RESUMEN

Although vegetable consumption associates with decreased risk for a variety of diseases, few Americans meet dietary recommendations for vegetable intake. TAS2R38 encodes a taste receptor that confers bitter taste sensing from chemicals found in some vegetables. Common polymorphisms in TAS2R38 lead to coding substitutions that alter receptor function and result in the loss of bitter taste perception. Our study examined whether bitter taste perception TAS2R38 diplotypes associated with vegetable consumption in participants enrolled in either an enhanced or a minimal nutrition counseling intervention. DNA was isolated from the peripheral blood cells of study participants (N = 497) and analyzed for polymorphisms. Vegetable consumption was determined using the Block Fruit and Vegetable screener. We tested for differences in the frequency of vegetable consumption between intervention and genotype groups over time using mixed effects models. Baseline vegetable consumption frequency did not associate with bitter taste diplotypes (P = 0.937), however after six months of the intervention, we observed an interaction between bitter taste diplotypes and time (P = 0.046). Participants in the enhanced intervention increased their vegetable consumption frequency (P = 0.020) and within this intervention group, the bitter non-tasters and intermediate-bitter tasters had the largest increase in vegetable consumption. In contrast, in the minimal intervention group, the bitter tasting participants reported a decrease in vegetable consumption. Bitter-non tasters and intermediate-bitter tasters increased vegetable consumption in either intervention more than those who perceive bitterness. Future precision medicine applications could consider genetic variation in bitter taste perception genes when designing dietary interventions.


Asunto(s)
Dieta , Predisposición Genética a la Enfermedad , Receptores Acoplados a Proteínas G/genética , Características de la Residencia , Gusto/genética , Verduras , Adolescente , Adulto , Anciano , Estudios de Cohortes , Demografía , Femenino , Haplotipos/genética , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Persona de Mediana Edad , Análisis Multivariante , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Análisis de Regresión , Percepción del Gusto/genética , Adulto Joven
9.
Genome Med ; 9(1): 86, 2017 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-28954626

RESUMEN

BACKGROUND: The human leukocyte antigen (HLA) system is a genomic region involved in regulating the human immune system by encoding cell membrane major histocompatibility complex (MHC) proteins that are responsible for self-recognition. Understanding the variation in this region provides important insights into autoimmune disorders, disease susceptibility, oncological immunotherapy, regenerative medicine, transplant rejection, and toxicogenomics. Traditional approaches to HLA typing are low throughput, target only a few genes, are labor intensive and costly, or require specialized protocols. RNA sequencing promises a relatively inexpensive, high-throughput solution for HLA calling across all genes, with the bonus of complete transcriptome information and widespread availability of historical data. Existing tools have been limited in their ability to accurately and comprehensively call HLA genes from RNA-seq data. RESULTS: We created HLAProfiler ( https://github.com/ExpressionAnalysis/HLAProfiler ), a k-mer profile-based method for HLA calling in RNA-seq data which can identify rare and common HLA alleles with > 99% accuracy at two-field precision in both biological and simulated data. For 68% of novel alleles not present in the reference database, HLAProfiler can correctly identify the two-field precision or exact coding sequence, a significant advance over existing algorithms. CONCLUSIONS: HLAProfiler allows for accurate HLA calls in RNA-seq data, reliably expanding the utility of these data in HLA-related research and enabling advances across a broad range of disciplines. Additionally, by using the observed data to identify potential novel alleles and update partial alleles, HLAProfiler will facilitate further improvements to the existing database of reference HLA alleles. HLAProfiler is available at https://expressionanalysis.github.io/HLAProfiler/ .


Asunto(s)
Antígenos HLA/genética , Prueba de Histocompatibilidad/métodos , Análisis de Secuencia de ARN , Programas Informáticos , Alelos , Línea Celular , Humanos , Datos de Secuencia Molecular , Valores de Referencia
10.
Nat Rev Genet ; 15(1): 56-62, 2014 01.
Artículo en Inglés | MEDLINE | ID: mdl-24322726

RESUMEN

Advances in next-generation sequencing (NGS) technologies have rapidly improved sequencing fidelity and substantially decreased sequencing error rates. However, given that there are billions of nucleotides in a human genome, even low experimental error rates yield many errors in variant calls. Erroneous variants can mimic true somatic and rare variants, thus requiring costly confirmatory experiments to minimize the number of false positives. Here, we discuss sources of experimental errors in NGS and how replicates can be used to abate such errors.


Asunto(s)
Genoma Humano/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/normas , Proyectos de Investigación/normas , Humanos , Tamaño de la Muestra
11.
Genome Biol ; 14(9): R100, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-24028704

RESUMEN

BACKGROUND: Haplotypes are important for assessing genealogy and disease susceptibility of individual genomes,but are difficult to obtain with routine sequencing approaches. Experimental haplotype reconstruction based on assembling fragments of individual chromosomes is promising, but with variable yields due to incompletely understood parameter choices. RESULTS: We parameterize the clone-based haplotyping problem in order to provide theoretical and empirical assessments of the impact of different parameters on haplotype assembly. We confirm the intuition that long clones help link together heterozygous variants and thus improve haplotype length. Furthermore, given the length of the clones, we address how to choose the other parameters, including number of pools, clone coverage and sequencing coverage, so as to maximize haplotype length. We model the problem theoretically and show empirically the benefits of using larger clones with moderate number of pools and sequencing coverage. In particular, using 140 kb BAC clones, we construct haplotypes for a personal genome and assemble haplotypes with N50 values greater than 2.6 Mb. These assembled haplotypes are longer and at least as accurate as haplotypes of existing clone-based strategies, whether in vivo or in vitro. CONCLUSIONS: Our results provide practical guidelines for the development and design of clone-based methods to achieve long range, high-resolution and accurate haplotypes.


Asunto(s)
Algoritmos , Mapeo Contig/métodos , Genoma Humano , Antígenos HLA/genética , Haplotipos , Tipificación Molecular/métodos , Cromosomas Artificiales Bacterianos , Clonación Molecular , Mapeo Contig/estadística & datos numéricos , Humanos , Tipificación Molecular/estadística & datos numéricos , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN
12.
Proc Natl Acad Sci U S A ; 109(30): 11920-7, 2012 Jul 24.
Artículo en Inglés | MEDLINE | ID: mdl-22797899

RESUMEN

Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved "open consent" process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain-we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.


Asunto(s)
Bases de Datos Genéticas , Variación Genética , Genoma Humano/genética , Fenotipo , Medicina de Precisión/métodos , Programas Informáticos , Línea Celular , Recolección de Datos , Humanos , Medicina de Precisión/tendencias , Análisis de Secuencia de ADN
13.
Nature ; 487(7406): 190-5, 2012 Jul 11.
Artículo en Inglés | MEDLINE | ID: mdl-22785314

RESUMEN

Recent advances in whole-genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, long fragment read (LFR) technology, which is similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ∼100 picograms of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants were assembled into long haplotype contigs. Removal of false positive single nucleotide variants not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 megabases. Cost-effective and accurate genome sequencing and haplotyping from 10-20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.


Asunto(s)
Genoma Humano , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Alelos , Línea Celular , Femenino , Silenciador del Gen , Variación Genética , Haplotipos , Humanos , Mutación , Reproducibilidad de los Resultados , Análisis de Secuencia de ADN/economía , Análisis de Secuencia de ADN/normas
14.
Nucleic Acids Res ; 39(Database issue): D124-8, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21037262

RESUMEN

The Universal PBM Resource for Oligonucleotide-Binding Evaluation (UniPROBE) database is a centralized repository of information on the DNA-binding preferences of proteins as determined by universal protein-binding microarray (PBM) technology. Each entry for a protein (or protein complex) in UniPROBE provides the quantitative preferences for all possible nucleotide sequence variants ('words') of length k ('k-mers'), as well as position weight matrix (PWM) and graphical sequence logo representations of the k-mer data. In this update, we describe >130% expansion of the database content, incorporation of a protein BLAST (blastp) tool for finding protein sequence matches in UniPROBE, the introduction of UniPROBE accession numbers and additional database enhancements. The UniPROBE database is available at http://uniprobe.org.


Asunto(s)
Proteínas de Unión al ADN/metabolismo , Bases de Datos de Proteínas , Animales , Sitios de Unión , ADN/química , ADN/metabolismo , Proteínas de Unión al ADN/química , Humanos , Internet , Análisis por Matrices de Proteínas , Análisis de Secuencia de Proteína , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...