Results 1 - 20 of 29
1.
Am J Hum Genet ; 110(1): 3-12, 2023 01 05.
Article in English | MEDLINE | ID: mdl-36608682

ABSTRACT

Although genomic research has predominantly relied on phenotypic ascertainment of individuals affected with heritable disease, the falling costs of sequencing allow consideration of genomic ascertainment and reverse phenotyping (the ascertainment of individuals with specific genomic variants and subsequent evaluation of physical characteristics). In this research modality, the scientific question is inverted: investigators gather individuals with a genomic variant and test the hypothesis that there is an associated phenotype via targeted phenotypic evaluations. Genomic ascertainment research is thus a model of predictive genomic medicine and genomic screening. Here, we provide our experience implementing this research method. We describe the infrastructure we developed to perform reverse phenotyping studies, including aggregating a super-cohort of sequenced individuals who consented to recontact for genomic ascertainment research. We assessed 13 studies completed at the National Institutes of Health (NIH) that piloted our reverse phenotyping approach. The studies can be broadly categorized as (1) facilitating novel genotype-disease associations, (2) expanding the phenotypic spectra, or (3) demonstrating ex vivo functional mechanisms of disease. We highlight three examples of reverse phenotyping studies in detail and describe how using a targeted reverse phenotyping approach (as opposed to phenotypic ascertainment or clinical informatics approaches) was crucial to the conclusions reached. Finally, we propose a framework and address challenges to building collaborative genomic ascertainment research programs at other institutions. Our goal is for more researchers to take advantage of this approach, which will expand our understanding of the predictive capability of genomic medicine and increase the opportunity to mitigate genomic disease.


Subject(s)
Genome , Medical Informatics , Phenotype , Genotype , Genomics/methods
2.
BMC Biol ; 21(1): 32, 2023 02 13.
Article in English | MEDLINE | ID: mdl-36782149

ABSTRACT

BACKGROUND: Sex determination occurs across animal species, but most of our knowledge about its mechanisms comes from only a handful of bilaterian taxa. This limits our ability to infer the evolutionary history of sex determination within animals. RESULTS: In this study, we generated a linkage map of the genome of the colonial cnidarian Hydractinia symbiolongicarpus and used it to demonstrate that this species has an XX/XY sex determination system. We demonstrate that the X and Y chromosomes have pseudoautosomal and non-recombining regions. We then use the linkage map and a method based on the depth of sequencing coverage to identify genes encoded in the non-recombining region and show that many of them have male gonad-specific expression. In addition, we demonstrate that recombination rates are enhanced in the female genome and that the haploid chromosome number in Hydractinia is n = 15. CONCLUSIONS: These findings establish Hydractinia as a tractable non-bilaterian model system for the study of sex determination and the evolution of sex chromosomes.


Subject(s)
Hydrozoa , Sex Chromosomes , Male , Female , Animals , Sex Chromosomes/genetics , Chromosome Mapping , Y Chromosome/genetics , Hydrozoa/genetics , Evolution, Molecular
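
The Hydractinia abstract above mentions a method based on depth of sequencing coverage for finding genes in the non-recombining region. The abstract does not give the procedure, so what follows is only a generic male/female depth-ratio heuristic, not the authors' method. The function name, the thresholds `tol` and `min_norm`, and the labels are all hypothetical. The idea is that in an XX/XY system an X-specific window should show roughly half the normalized depth in a male compared with a female, while a Y-specific window should be covered only in the male.

```python
import math


def classify_window(male_depth, female_depth, male_median, female_median,
                    tol=0.3, min_norm=0.1):
    """Label one genomic window from male and female read depth.

    Depths are first divided by each sample's genome-wide median depth
    (both medians must be > 0). `tol` and `min_norm` are illustrative
    thresholds, not values taken from the paper.
    """
    m = male_depth / male_median      # normalized male depth
    f = female_depth / female_median  # normalized female depth

    if f < min_norm:
        # Essentially no female coverage: Y-specific if the male is covered.
        return "Y-specific candidate" if m >= min_norm else "uncallable"
    if m < min_norm:
        # Covered in the female only; not expected under XX/XY.
        return "ambiguous"

    log_ratio = math.log2(m / f)
    if abs(log_ratio + 1.0) <= tol:
        # Male depth is about half the female depth (log2 ratio near -1).
        return "X-specific candidate"
    if abs(log_ratio) <= tol:
        # Equal depth in both sexes.
        return "autosomal or pseudoautosomal"
    return "ambiguous"
```

In practice such window calls would be cross-checked against the linkage map, as the abstract describes, because depth alone cannot separate pseudoautosomal regions from autosomes.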
3.
Am J Hum Genet ; 100(5): 695-705, 2017 May 04.
Article in English | MEDLINE | ID: mdl-28475856

ABSTRACT

Provision of a molecularly confirmed diagnosis in a timely manner for children and adults with rare genetic diseases shortens their "diagnostic odyssey," improves disease management, and fosters genetic counseling with respect to recurrence risks while assuring reproductive choices. In a general clinical genetics setting, the current diagnostic rate is approximately 50%, but for those who do not receive a molecular diagnosis after the initial genetics evaluation, that rate is much lower. Diagnostic success for these more challenging affected individuals depends to a large extent on progress in the discovery of genes associated with, and mechanisms underlying, rare diseases. Thus, continued research is required for moving toward a more complete catalog of disease-related genes and variants. The International Rare Diseases Research Consortium (IRDiRC) was established in 2011 to bring together researchers and organizations invested in rare disease research to develop a means of achieving molecular diagnosis for all rare diseases. Here, we review the current and future bottlenecks to gene discovery and suggest strategies for enabling progress in this regard. Each successful discovery will define potential diagnostic, preventive, and therapeutic opportunities for the corresponding rare disease, enabling precision medicine for this patient population.


Subject(s)
International Cooperation , Rare Diseases/diagnosis , Rare Diseases/genetics , Databases, Factual , Exome , Genome, Human , Humans
4.
Nucleic Acids Res ; 45(D1): D985-D994, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899665

ABSTRACT

We have designed and developed a data integration and visualization platform that provides evidence about the association of known and potential drug targets with diseases. The platform is designed to support identification and prioritization of biological targets for follow-up. Each drug target is linked to a disease using integrated genome-wide data from a broad range of data sources. The platform provides either a target-centric workflow to identify diseases that may be associated with a specific target, or a disease-centric workflow to identify targets that may be associated with a specific disease. Users can easily transition between these target- and disease-centric workflows. The Open Targets Validation Platform is accessible at https://www.targetvalidation.org.


Subject(s)
Computational Biology/methods , Molecular Targeted Therapy , Search Engine , Software , Databases, Factual , Humans , Molecular Targeted Therapy/methods , Reproducibility of Results , Web Browser , Workflow
5.
Nat Rev Genet ; 12(10): 730-6, 2011 09 16.
Article in English | MEDLINE | ID: mdl-21921928

ABSTRACT

Access to genetic data across studies is an important aspect of identifying new genetic associations through genome-wide association studies (GWASs). Meta-analysis across multiple GWASs with combined cohort sizes of tens of thousands of individuals often uncovers many more genome-wide associated loci than the original individual studies; this emphasizes the importance of tools and mechanisms for data sharing. However, even sharing summary-level data, such as allele frequencies, inherently carries some degree of privacy risk to study participants. Here we discuss mechanisms and resources for sharing data from GWASs, particularly focusing on approaches for assessing and quantifying the privacy risks to participants that result from the sharing of summary-level data.


Subject(s)
Data Collection , Genetic Variation , Genome-Wide Association Study , Information Dissemination/methods , Cohort Studies , Confidentiality , Data Collection/legislation & jurisprudence , Databases, Genetic , Genetic Variation/physiology , Genome-Wide Association Study/methods , Genome-Wide Association Study/statistics & numerical data , Humans , Information Dissemination/legislation & jurisprudence , Meta-Analysis as Topic , Polymorphism, Single Nucleotide , Risk Assessment
6.
Hum Mutat ; 36(10): 915-21, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26295439

ABSTRACT

There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.


Subject(s)
Genetic Predisposition to Disease/genetics , Information Dissemination/methods , Rare Diseases/genetics , Database Management Systems , Databases, Genetic , Genetic Association Studies , Humans , Software
7.
Nucleic Acids Res ; 41(Database issue): D936-41, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23193291

ABSTRACT

Much has changed in the last two years at DGVa (http://www.ebi.ac.uk/dgva) and dbVar (http://www.ncbi.nlm.nih.gov/dbvar). We are now processing direct submissions rather than only curating data from the literature and our joint study catalog includes data from over 100 studies in 11 organisms. Studies from human dominate with data from control and case populations, tumor samples as well as three large curated studies derived from multiple sources. During the processing of these data, we have made improvements to our data model, submission process and data representation. Additionally, we have made significant improvements in providing access to these data via web and FTP interfaces.


Subject(s)
Databases, Nucleic Acid , Genomic Structural Variation , Genotype , Humans , Internet , Phenotype
8.
J Gen Intern Med ; 29 Suppl 3: S780-7, 2014 Aug.
Article in English | MEDLINE | ID: mdl-25029978

ABSTRACT

Research into rare diseases is typically fragmented by data type and disease. Individual efforts often have poor interoperability and do not systematically connect data across clinical phenotype, genomic data, biomaterial availability, and research/trial data sets. Such data must be linked at both an individual-patient and whole-cohort level to enable researchers to gain a complete view of their disease and patient population of interest. Data access and authorization procedures are required to allow researchers in multiple institutions to securely compare results and gain new insights. Funded by the European Union's Seventh Framework Programme under the International Rare Diseases Research Consortium (IRDiRC), RD-Connect is a global infrastructure project initiated in November 2012 that links genomic data with registries, biobanks, and clinical bioinformatics tools to produce a central research resource for rare diseases.


Subject(s)
Biological Specimen Banks , Computational Biology , Databases, Factual , Health Information Exchange , Rare Diseases , Registries , Humans
9.
Genet Epidemiol ; 35(8): 887-98, 2011 Dec.
Article in English | MEDLINE | ID: mdl-22125226

ABSTRACT

Genome-wide association studies (GWAS) are a useful approach in the study of the genetic components of complex phenotypes. Aside from large cohorts, GWAS have generally been limited to the study of one or a few diseases or traits. The emergence of biobanks linked to electronic medical records (EMRs) allows the efficient reuse of genetic data to yield meaningful genotype-phenotype associations for multiple phenotypes or traits. Phase I of the electronic MEdical Records and GEnomics (eMERGE-I) Network is a National Human Genome Research Institute-supported consortium composed of five sites to perform various genetic association studies using DNA repositories and EMR systems. Each eMERGE site has developed EMR-based algorithms to comprise a core set of 14 phenotypes for extraction of study samples from each site's DNA repository. Each eMERGE site selected samples for a specific phenotype, and these samples were genotyped at either the Broad Institute or at the Center for Inherited Disease Research using the Illumina Infinium BeadChip technology. In all, approximately 17,000 samples from across the five sites were genotyped. A unified quality control (QC) pipeline was developed by the eMERGE Genomics Working Group and used to ensure thorough cleaning of the data. This process includes examination of sample and marker quality and various batch effects. Upon completion of the genotyping and QC analyses for each site's primary study, eMERGE Coordinating Center merged the datasets from all five sites. This larger merged dataset reentered the established eMERGE QC pipeline. Based on lessons learned during the process, additional analyses and QC checkpoints were added to the pipeline to ensure proper merging. Here, we explore the challenges associated with combining datasets from different genotyping centers and describe the expansion to eMERGE QC pipeline for merged datasets. 
These additional steps will be useful as the eMERGE project expands to include additional sites in eMERGE-II, and also serve as a starting point for investigators merging multiple genotype datasets accessible through the National Center for Biotechnology Information in the database of Genotypes and Phenotypes. Our experience demonstrates that merging multiple datasets after additional QC can be an efficient use of genotype data despite new challenges that appear in the process.


Subject(s)
Electronic Health Records , Genome-Wide Association Study/standards , Quality Control , Algorithms , Genotype , Humans , National Human Genome Research Institute (U.S.) , Phenotype , United States
10.
Genes (Basel) ; 13(5)2022 05 03.
Article in English | MEDLINE | ID: mdl-35627201

ABSTRACT

Craniosynostosis (CS) is a major birth defect in which one or more skull sutures fuse prematurely. We previously performed a genome-wide association study (GWAS) for sagittal non-syndromic CS (sNCS), identifying associations downstream from BMP2 on 20p12.3 and intronic to BBS9 on 7p14.3; analyses of imputed variants in DLG1 on 3q29 were also genome-wide significant. We followed this work with a GWAS for metopic non-syndromic NCS (mNCS), discovering a significant association intronic to BMP7 on 20q13.31. In the current study, we sequenced the associated regions on 3q29, 7p14.3, and 20p12.3, including two candidate genes (BMP2 and BMPER) near some of these regions in 83 sNCS child-parent trios, and sequenced regions on 7p14.3 and 20q13.2-q13.32 in 80 mNCS child-parent trios. These child-parent trios were selected from the original GWAS cohorts if the probands carried at least one copy of the top associated GWAS variant (rs1884302 C allele for sNCS; rs6127972 T allele for mNCS). Many of the variants sequenced in these targeted regions are strongly predicted to be within binding sites for transcription factors involved in craniofacial development or bone morphogenesis. Variants enriched in more than one trio and predicted to be damaging to gene function are prioritized for functional studies.


Subject(s)
Craniosynostoses , Genome-Wide Association Study , Alleles , Carrier Proteins/genetics , Craniosynostoses/genetics , Humans
11.
Genes (Basel) ; 13(7)2022 07 18.
Article in English | MEDLINE | ID: mdl-35886053

ABSTRACT

The Hawaiian monk seal (HMS) is the single extant species of tropical earless seals of the genus Neomonachus. The species survived a severe bottleneck in the late 19th century and experienced subsequent population declines until becoming the subject of a NOAA-led species recovery effort beginning in 1976 when the population was fewer than 1000 animals. Like other recovering species, the Hawaiian monk seal has been reported to have reduced genetic heterogeneity due to the bottleneck and subsequent inbreeding. Here, we report a chromosomal reference assembly for a male animal produced using a variety of methods. The final assembly consisted of 16 autosomes, an X, and portions of the Y chromosomes. We compared variants in this animal to other HMS and to a frequently sequenced human sample, confirming about 12% of the variation seen in man. To confirm that the reference animal was representative of the HMS, we compared his sequence to that of 10 other individuals and noted similarly low variation in all. Variation in the major histocompatibility (MHC) genes was nearly absent compared to the orthologous human loci. Demographic analysis predicts that Hawaiian monk seals have had a long history of small populations preceding the bottleneck, and their current low levels of heterozygosity may indicate specialization to a stable environment. When we compared our reference assembly to that of other species, we observed significant conservation of chromosomal architecture with other pinnipeds, especially other phocids. This reference should be a useful tool for future evolutionary studies as well as the long-term management of this species.


Subject(s)
Seals, Earless , Animals , Chromosomes , Genomic Instability , Hawaii/epidemiology , Humans , Male , Seals, Earless/genetics
12.
Front Immunol ; 13: 941839, 2022.
Article in English | MEDLINE | ID: mdl-36466872

ABSTRACT

Rationale: Previous studies identified an interaction between HLA and oral peanut exposure. HLA-DQA1*01:02 had a protective role with the induction of Ara h 2 epitope-specific IgG4 associated with peanut consumption during the LEAP clinical trial for prevention of peanut allergy, while it was a risk allele for peanut allergy in the peanut avoidance group. We have now evaluated this gene-environment interaction in two subsequent peanut oral immunotherapy (OIT) trials - IMPACT and POISED - to better understand the potential for the HLA-DQA1*01:02 allele as an indicator of higher likelihood of desensitization, sustained unresponsiveness, and peanut allergy remission. Methods: We determined HLA-DQA1*01:02 carrier status using genome sequencing from POISED (N=118, age: 7-55yr) and IMPACT (N=126, age: 12-<48mo). We tested for association with remission, sustained unresponsiveness (SU), and desensitization in the OIT groups, as well as peanut component specific IgG4 (psIgG4) using generalized linear models and adjusting for relevant covariates and ancestry. Results: While not quite statistically significant, a higher proportion of HLA-DQA1*01:02 carriers receiving OIT in IMPACT were desensitized (93%) compared to non-carriers (78%); odds ratio (OR)=5.74 (p=0.06). In this sample we also observed that a higher proportion of carriers achieved remission (35%) compared to non-carriers (22%); OR=1.26 (p=0.80). In POISED, carriers more frequently attained continued desensitization (80% versus 61% among non-carriers; OR=1.28, p=0.86) and achieved SU (52% versus 31%; OR=2.32, p=0.19). psIgG4 associations with HLA-DQA1*01:02 in the OIT arm of IMPACT which included younger study subjects recapitulated patterns noted in LEAP, but no associations of note were observed in the older POISED study subjects. Conclusions: Findings across three clinical trials show a pattern of a gene environment interaction between HLA and oral peanut exposure. 
Age, and prior sensitization contribute additional determinants of outcomes, consistent with a mechanism of restricted antigen recognition fundamental to driving protective immune responses to OIT.


Subject(s)
Arachis , Peanut Hypersensitivity , Adolescent , Adult , Child , Humans , Middle Aged , Young Adult , Immunoglobulin G , Immunologic Factors , Immunotherapy , Peanut Hypersensitivity/genetics , Peanut Hypersensitivity/therapy , Clinical Trials as Topic
13.
Genet Epidemiol ; 34(6): 591-602, 2010 Sep.
Article in English | MEDLINE | ID: mdl-20718045

ABSTRACT

Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies (GWAS). This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium test P-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis to SNP selection. The methods are illustrated with examples from the "Gene Environment Association Studies" (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of GWAS.


Subject(s)
Genome-Wide Association Study/standards , Genotype , Aneuploidy , Artifacts , Case-Control Studies , Chromosome Aberrations , Female , Gene Frequency , Genetic Variation , Genetics, Population , Genome-Wide Association Study/methods , Humans , Lung Neoplasms/genetics , Male , Polymorphism, Single Nucleotide , Quality Control , Sex Chromosome Aberrations/statistics & numerical data , Substance-Related Disorders/genetics
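
Item (5) in the QC/QA abstract above concerns how Hardy-Weinberg equilibrium (HWE) test P-values depend on allele frequency. As a rough illustration, the sketch below computes an allele frequency and a one-degree-of-freedom chi-square HWE P-value from genotype counts for a single biallelic SNP. The abstract does not say which form of the test the authors used (exact tests are common in GWAS QC), so the chi-square version and the function name `hwe_chi_square` are assumptions made for this example.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square HWE test for one biallelic SNP.

    Takes counts of the three genotypes (AA, AB, BB) and returns
    (frequency of allele A, chi-square statistic, P-value with 1 df).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")

    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p                       # frequency of allele B

    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)

    # Skip classes with zero expectation (monomorphic SNPs).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return p, chi2, p_value
```

Running this over every SNP and plotting the P-values against the returned allele frequency is one simple way to look for the kind of frequency-dependent pattern the abstract describes.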
14.
Genet Epidemiol ; 34(4): 364-72, 2010 May.
Article in English | MEDLINE | ID: mdl-20091798

ABSTRACT

Genome-wide association studies (GWAS) have emerged as powerful means for identifying genetic loci related to complex diseases. However, the role of environment and its potential to interact with key loci has not been adequately addressed in most GWAS. Networks of collaborative studies involving different study populations and multiple phenotypes provide a powerful approach for addressing the challenges in analysis and interpretation shared across studies. The Gene, Environment Association Studies (GENEVA) consortium was initiated to: identify genetic variants related to complex diseases; identify variations in gene-trait associations related to environmental exposures; and ensure rapid sharing of data through the database of Genotypes and Phenotypes. GENEVA consists of several academic institutions, including a coordinating center, two genotyping centers and 14 independently designed studies of various phenotypes, as well as several Institutes and Centers of the National Institutes of Health led by the National Human Genome Research Institute. Minimum detectable effect sizes include relative risks ranging from 1.24 to 1.57 and proportions of variance explained ranging from 0.0097 to 0.02. Given the large number of research participants (N>80,000), an important feature of GENEVA is harmonization of common variables, which allow analyses of additional traits. Environmental exposure information available from most studies also enables testing of gene-environment interactions. Facilitated by its sizeable infrastructure for promoting collaboration, GENEVA has established a unified framework for genotyping, data quality control, analysis and interpretation. By maximizing knowledge obtained through collaborative GWAS incorporating environmental exposure information, GENEVA aims to enhance our understanding of disease etiology, potentially identifying opportunities for intervention.


Subject(s)
Genome-Wide Association Study , Environment , Genotype , Humans , Models, Genetic , Molecular Epidemiology/methods , Phenotype , Polymorphism, Genetic , Population Groups , Quality Control , Quantitative Trait Loci , Risk
15.
Genet Med ; 13(9): 777-84, 2011 Sep.
Article in English | MEDLINE | ID: mdl-21844811

ABSTRACT

PURPOSE: Copy number variants have emerged as a major cause of human disease such as autism and intellectual disabilities. Because copy number variants are common in normal individuals, determining the functional and clinical significance of rare copy number variants in patients remains challenging. The adoption of whole-genome chromosomal microarray analysis as a first-tier diagnostic test for individuals with unexplained developmental disabilities provides a unique opportunity to obtain large copy number variant datasets generated through routine patient care. METHODS: A consortium of diagnostic laboratories was established (the International Standards for Cytogenomic Arrays consortium) to share copy number variant and phenotypic data in a central, public database. We present the largest copy number variant case-control study to date comprising 15,749 International Standards for Cytogenomic Arrays cases and 10,118 published controls, focusing our initial analysis on recurrent deletions and duplications involving 14 copy number variant regions. RESULTS: Compared with controls, 14 deletions and seven duplications were significantly overrepresented in cases, providing a clinical diagnosis as pathogenic. CONCLUSION: Given the rapid expansion of clinical chromosomal microarray analysis testing, very large datasets will be available to determine the functional significance of increasingly rare copy number variants. This data will provide an evidence-based guide to clinicians across many disciplines involved in the diagnosis, management, and care of these patients and their families.


Subject(s)
DNA Copy Number Variations , Developmental Disabilities/genetics , Evidence-Based Medicine/methods , Intellectual Disability/genetics , Cytogenetic Analysis , Gene Dosage , Genome, Human , Humans
16.
Mol Cell Proteomics ; 7(4): 739-49, 2008 Apr.
Article in English | MEDLINE | ID: mdl-18281724

ABSTRACT

Chemical cross-linking and high resolution MS have been integrated successfully to capture protein interactions and provide low resolution structural data for proteins that are refractive to analyses by NMR or crystallography. Despite the versatility of these combined techniques, the array of products that is generated from the cross-linking and proteolytic digestion of proteins is immense and generally requires the use of labeling strategies and/or data base search algorithms to distinguish actual cross-linked peptides from the many side products of cross-linking. Most strategies reported to date have focused on the analysis of small cross-linked protein complexes (<60 kDa) because the number of potential forms of covalently modified peptides increases dramatically with the number of peptides generated from the digestion of such complexes. We report herein the development of a user-friendly search engine, CrossSearch, that provides the foundation for an overarching strategy to detect cross-linked peptides from the digests of large (≥170-kDa) cross-linked proteins, i.e. conjugates. Our strategy combines the use of a low excess of cross-linker, data base searching, and Fourier transform ion cyclotron resonance MS to experimentally minimize and theoretically cull the side products of cross-linking. Using this strategy, the (alpha beta gamma delta)(4) phosphorylase kinase model complex was cross-linked to form with high specificity a 170-kDa betagamma conjugate in which we identified residues involved in the intramolecular cross-linking of the 125-kDa beta subunit between its regulatory N terminus and its C terminus. This finding provides an explanation for previously published homodimeric two-hybrid interactions of the beta subunit and suggests a dynamic structural role for the regulatory N terminus of that subunit.
The results offer proof of concept for the CrossSearch strategy for analyzing conjugates and are the first to reveal a tertiary structural element of either homologous alpha or beta regulatory subunit of phosphorylase kinase.


Subject(s)
Cross-Linking Reagents/chemistry , Peptides/analysis , Protein Interaction Mapping/methods , Proteins/chemistry , Software , Animals , Cyclotrons , Fourier Analysis , Internet , Mass Spectrometry/methods , Peptides/chemistry , Phosphorylase Kinase/chemistry , Protein Subunits/chemistry , Rabbits
17.
Cell Syst ; 9(6): 609-613.e3, 2019 12 18.
Article in English | MEDLINE | ID: mdl-31812694

ABSTRACT

The decreasing cost of DNA sequencing over the past decade has led to an explosion of sequencing datasets, leaving us with petabytes of data to analyze. However, current sequencing visualization tools are designed to run on single machines, which limits their scalability and interactivity on modern genomic datasets. Here, we leverage the scalability of Apache Spark to provide Mango, consisting of a Jupyter notebook and genome browser, which removes scalability and interactivity constraints by leveraging multi-node compute clusters to allow interactive analysis over terabytes of sequencing data. We demonstrate scalability of the Mango tools by performing quality control analyses on 10 terabytes of 100 high-coverage sequencing samples from the Simons Genome Diversity Project, enabling capability for interactive genomic exploration of multi-sample datasets that surpass the computational limitations of single-node visualization tools. Mango is freely available for download with full documentation at https://bdg-mango.readthedocs.io/en/latest/.


Subject(s)
Genomics/methods , Sequence Analysis, DNA/methods , Algorithms , Big Data , Data Analysis , Genome/genetics , High-Throughput Nucleotide Sequencing/methods , Software
18.
J Mol Biol ; 365(5): 1429-45, 2007 Feb 02.
Article in English | MEDLINE | ID: mdl-17123541

ABSTRACT

Phosphorylase kinase (PhK), an (αβγδ)₄ complex, regulates glycogenolysis. Its activity, catalyzed by the gamma subunit, is tightly controlled by phosphorylation and activators acting through allosteric sites on its regulatory alpha, beta and delta subunits. Activation by phosphorylation is predominantly mediated by the regulatory beta subunit, which undergoes a conformational change that is structurally linked with the gamma subunit and that is characterized by the ability of a short chemical crosslinker to form beta-beta dimers. To determine potential regions of interaction of the beta and gamma subunits, we have used chemical crosslinking and two-hybrid screening. The beta and gamma subunits were crosslinked to each other in phosphorylated PhK, and crosslinked peptides from digests were identified by Fourier transform mass spectrometry, beginning with a search engine developed "in house" that generates a hypothetical list of crosslinked peptides. A conjugate between beta and gamma that was verified by MS/MS corresponded to crosslinking between K303 in the C-terminal regulatory domain of gamma (gammaCRD) and R18 in the N-terminal regulatory region of beta (beta1-31), which contains the phosphorylatable serines 11 and 26. A synthetic peptide corresponding to residues 1-22 of beta inhibited the crosslinking between beta and gamma, and was itself crosslinked to K303 of gamma. In two-hybrid screening, the beta1-31 region controlled beta subunit self-interactions, in that they were favored by truncation of this region or by mutation of the phosphorylatable serines 11 and 26, thus providing structural evidence for a phosphorylation-dependent subunit communication network in the PhK complex involving at least these two regulatory regions of the beta and gamma subunits. The sum of our results considered together with previous findings implicates the gammaCRD as being an allosteric activation switch in PhK that interacts with all three of the enzyme's regulatory subunits and is proximal to the active site cleft.
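
The abstract mentions an in-house search engine that generates a hypothetical list of crosslinked peptides, but gives no details of how it works. The following is only a minimal sketch of the general idea, assuming each candidate is a beta-gamma peptide pair whose mass is the sum of the two peptide masses plus a fixed crosslinker increment. The peptide identifiers, all masses, the increment, and the function names are hypothetical and are not taken from the paper.

```python
from itertools import product


def candidate_crosslinks(beta_peptides, gamma_peptides, linker_delta):
    """Enumerate every beta-gamma peptide pair with its summed mass.

    beta_peptides / gamma_peptides: dict of peptide id -> mass (hypothetical values).
    linker_delta: mass added by the crosslinker (hypothetical value).
    """
    return [
        (b_id, g_id, b_mass + g_mass + linker_delta)
        for (b_id, b_mass), (g_id, g_mass) in product(
            beta_peptides.items(), gamma_peptides.items()
        )
    ]


def match_observed(candidates, observed_mass, tol_ppm=10.0):
    """Keep candidates whose mass is within tol_ppm of an observed mass."""
    return [
        c for c in candidates
        if abs(c[2] - observed_mass) / observed_mass * 1e6 <= tol_ppm
    ]


# Hypothetical digest peptides; none of these values come from the paper.
beta = {"beta_pep1": 1200.50, "beta_pep2": 850.25}
gamma = {"gamma_pep1": 980.75, "gamma_pep2": 1500.00}

candidates = candidate_crosslinks(beta, gamma, linker_delta=100.0)
hits = match_observed(candidates, observed_mass=2281.25)
```

In this toy setup the candidate list has one entry per beta-gamma pair, and a match against the precursor mass would still need confirmation by fragmentation, as the abstract describes for the MS/MS-verified conjugate.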


Subject(s)
Allosteric Regulation/drug effects , Allosteric Site/drug effects , Cross-Linking Reagents/pharmacology , Mass Spectrometry/methods , Peptides/metabolism , Phosphorylase Kinase/metabolism , Amino Acid Sequence , Amino Acids/metabolism , Animals , Biological Models , Molecular Sequence Data , Mutant Proteins/analysis , Mutant Proteins/chemistry , Mutant Proteins/metabolism , Phosphorylase Kinase/analysis , Phosphorylase Kinase/chemistry , Phosphorylation/drug effects , Phosphoserine/metabolism , Point Mutation/genetics , Protein Binding/drug effects , Protein Interaction Mapping , Quaternary Protein Structure/drug effects , Tertiary Protein Structure/drug effects , Protein Subunits/analysis , Protein Subunits/chemistry , Protein Subunits/metabolism , Rabbits , Sequence Deletion/genetics , Structural Homology of Proteins , Succinimides/pharmacology
19.
Bioinformatics ; 22(22): 2835-7, 2006 Nov 15.
Article in English | MEDLINE | ID: mdl-16966361

ABSTRACT

MOTIVATION: The abundance of nucleotide sequence information available has expanded horizons of inquiry for molecular evolution; however, the full potential of whole-genome analysis has not been realized because of inadequate tools. Here, we present one of the first toolkits to aid multidisciplinary high-throughput analysis. SUMMARY: SPEED was created to integrate molecular evolutionary data with existing genetic resources and provide a straightforward user interface to 17,352 orthologous gene groups, containing representatives from eight mammalian species and an avian outgroup. AVAILABILITY: See http://bioinfobase.umkc.edu/speed/ for access.


Subject(s)
Computational Biology/methods , Protein Databases , Molecular Evolution , Algorithms , Animals , Genetic Databases , Genome , Humans , Phylogeny , Sequence Alignment , Software
20.
F1000Res ; 6: 1795, 2017.
Article in English | MEDLINE | ID: mdl-29123647

ABSTRACT

The impact of structural variants (SVs) on a variety of organisms and diseases like cancer has become increasingly evident. Methods for SV detection when studying genomic differences across cells, individuals or populations are being actively developed. Currently, just a few methods are available to compare different SV callsets, and no specialized methods are available to annotate SVs that account for the unique characteristics of these variant types. Here, we introduce SURVIVOR_ant, a tool that compares types and breakpoints for candidate SVs from different callsets and enables fast comparison of SVs to genomic features such as genes and repetitive regions, as well as to previously established SV datasets such as from the 1000 Genomes Project. As proof of concept we compared 16 SV callsets generated by different SV calling methods on a single genome, the Genome in a Bottle sample HG002 (Ashkenazi son), and annotated the SVs with gene annotations, 1000 Genomes Project SV calls, and four different types of repetitive regions. Computation time to annotate 134,528 SVs with 33,954 annotations was 22 seconds on a laptop.
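
Comparing SVs to genomic features, as described above, comes down to interval overlap at its simplest. The sketch below illustrates that idea only: it assumes each SV and each feature is a span on one chromosome, and it treats a feature as a hit when it lies within a chosen distance of the SV. It is not SURVIVOR_ant's implementation or interface; the `Interval` class, the `max_dist` parameter, and all coordinates and names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    chrom: str
    start: int
    end: int
    name: str


def near(sv, feature, max_dist=0):
    """True if the feature lies within max_dist bp of the SV span."""
    return (
        sv.chrom == feature.chrom
        and sv.start - max_dist <= feature.end
        and feature.start <= sv.end + max_dist
    )


def annotate(svs, features, max_dist=0):
    """Map each SV name to the names of nearby features (naive all-pairs scan)."""
    return {
        sv.name: [f.name for f in features if near(sv, f, max_dist)]
        for sv in svs
    }


# Hypothetical SV calls and gene annotations, for illustration only.
svs = [
    Interval("chr1", 1000, 5000, "DEL_1"),
    Interval("chr2", 200, 300, "DUP_1"),
]
genes = [
    Interval("chr1", 4500, 9000, "GENE_A"),
    Interval("chr1", 20000, 30000, "GENE_B"),
    Interval("chr2", 350, 900, "GENE_C"),
]

annotations = annotate(svs, genes, max_dist=100)
```

The all-pairs scan keeps the sketch short but grows with the product of the two list sizes; a tool working at the scale quoted in the abstract would more plausibly sort the intervals or use an interval index.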
