Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 44
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 48(D1): D570-D578, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31696235

RESUMO

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline with multiple analysis pipelines that are tailored according to the input data, and that are formally described using the Common Workflow Language, enabling greater provenance, reusability, and reproducibility. MGnify's new analysis pipelines offer additional approaches for taxonomic assertions based on ribosomal internal transcribed spacer regions (ITS1/2) and expanded protein functional annotations. Biochemical pathways and systems predictions have also been added for assembled contigs. MGnify's growing focus on the assembly of metagenomic data has also seen the number of datasets it has assembled and analysed increase six-fold. The non-redundant protein database constructed from the proteins encoded by these assemblies now exceeds 1 billion sequences. Meanwhile, a newly developed contig viewer provides fine-grained visualisation of the assembled contigs and their enriched annotations.


Assuntos
Metagenoma , Microbiota , Filogenia , Software , Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , DNA Espaçador Ribossômico/genética , Bases de Dados Genéticas , Metagenômica/métodos
2.
Nucleic Acids Res ; 47(W1): W636-W641, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-30976793

RESUMO

The EMBL-EBI provides free access to popular bioinformatics sequence analysis applications as well as to a full-featured text search engine with powerful cross-referencing and data retrieval capabilities. Access to these services is provided via user-friendly web interfaces and via established RESTful and SOAP Web Services APIs (https://www.ebi.ac.uk/seqdb/confluence/display/JDSAT/EMBL-EBI+Web+Services+APIs+-+Data+Retrieval). Both systems have been developed with the same core principles that allow them to integrate an ever-increasing volume of biological data, making them an integral part of many popular data resources provided at the EMBL-EBI. Here, we describe the latest improvements made to the frameworks which enhance the interconnectivity between public EMBL-EBI resources and ultimately enhance biological data discoverability, accessibility, interoperability and reusability.


Assuntos
Análise de Sequência , Software , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Alinhamento de Sequência , Análise de Sequência de Proteína
3.
Nucleic Acids Res ; 47(D1): D427-D432, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30357350

RESUMO

The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0. New additions have been coupled with efforts to improve existing families, including refinement of domain boundaries, their classification into Pfam clans, as well as their functional annotation. We recently began to collaborate with the RepeatsDB resource to improve the definition of tandem repeat families within Pfam. We carried out a significant comparison to the structural classification database, namely the Evolutionary Classification of Protein Domains (ECOD) that led to the creation of 825 new families based on their set of uncharacterized families (EUFs). Furthermore, we also connected Pfam entries to the Sequence Ontology (SO) through mapping of the Pfam type definitions to SO terms. Since Pfam has many community contributors, we recently enabled the linking between authorship of all Pfam entries with the corresponding authors' ORCID identifiers. This effectively permits authors to claim credit for their Pfam curation and link them to their ORCID record.


Assuntos
Bases de Dados de Proteínas , Proteínas/classificação , Anotação de Sequência Molecular , Domínios Proteicos , Proteínas/química , Sequências Repetitivas de Aminoácidos
4.
Nucleic Acids Res ; 47(D1): D351-D360, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30398656

RESUMO

The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities.


Assuntos
Bases de Dados de Proteínas , Anotação de Sequência Molecular , Animais , Bases de Dados Genéticas , Ontologia Genética , Humanos , Internet , Família Multigênica , Domínios Proteicos/genética , Homologia de Sequência de Aminoácidos , Software , Interface Usuário-Computador
5.
Nucleic Acids Res ; 46(W1): W200-W204, 2018 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-29905871

RESUMO

The HMMER webserver [http://www.ebi.ac.uk/Tools/hmmer] is a free-to-use service which provides fast searches against widely used sequence databases and profile hidden Markov model (HMM) libraries using the HMMER software suite (http://hmmer.org). The results of a sequence search may be summarized in a number of ways, allowing users to view and filter the significant hits by domain architecture or taxonomy. For large scale usage, we provide an application programmatic interface (API) which has been expanded in scope, such that all result presentations are available via both HTML and API. Furthermore, we have refactored our JavaScript visualization library to provide standalone components for different result representations. These consume the aforementioned API and can be integrated into third-party websites. The range of databases that can be searched against has been expanded, adding four sequence datasets (12 in total) and one profile HMM library (6 in total). To help users explore the biological context of their results, and to discover new data resources, search results are now supplemented with cross references to other EMBL-EBI databases.


Assuntos
Análise de Sequência , Software , Domínio Catalítico , Bases de Dados Genéticas , Internet , Cadeias de Markov , Análise de Sequência de Proteína , Interface Usuário-Computador
6.
Nucleic Acids Res ; 46(D1): D726-D735, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29069476

RESUMO

EBI metagenomics (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the analysis and archiving of sequence data derived from the microbial populations found in a particular environment. Over the past two years, EBI metagenomics has increased the number of datasets analysed 10-fold. In addition to increased throughput, the underlying analysis pipeline has been overhauled to include both new or updated tools and reference databases. Of particular note is a new workflow for taxonomic assignments that has been extended to include assignments based on both the large and small subunit RNA marker genes and to encompass all cellular micro-organisms. We also describe the addition of metagenomic assembly as a new analysis service. Our pilot studies have produced over 2400 assemblies from datasets in the public domain. From these assemblies, we have produced a searchable, non-redundant protein database of over 50 million sequences. To provide improved access to the data stored within the resource, we have developed a programmatic interface that provides access to the analysis results and associated sample metadata. Finally, we have integrated the results of a series of statistical analyses that provide estimations of diversity and sample comparisons.


Assuntos
Bases de Dados Genéticas , Metagenômica , Microbiota , Algoritmos , Sequência de Bases , Classificação/métodos , Conjuntos de Dados como Assunto , Metagenômica/métodos , RNA Arqueal/genética , RNA Bacteriano/genética , RNA Viral/genética , Ribotipagem , Software , Transcriptoma , Interface Usuário-Computador , Navegador , Fluxo de Trabalho
7.
Nucleic Acids Res ; 46(D1): D802-D808, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29092050

RESUMO

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.


Assuntos
Archaea/genética , Bactérias/genética , Bases de Dados Genéticas , Bases de Dados de Proteínas , Eucariotos/genética , Genômica , Sequência de Aminoácidos , Animais , Sequência de Bases , Mineração de Dados , Previsões , Genoma , Anotação de Sequência Molecular , RNA/genética , Interface Usuário-Computador
8.
Nucleic Acids Res ; 45(D1): D190-D199, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899635

RESUMO

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Domínios e Motivos de Interação entre Proteínas , Software , Humanos , Anotação de Sequência Molecular , Filogenia
9.
Nucleic Acids Res ; 44(D1): D595-603, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26582919

RESUMO

EBI metagenomics (https://www.ebi.ac.uk/metagenomics/) is a freely available hub for the analysis and archiving of metagenomic and metatranscriptomic data. Over the last 2 years, the resource has undergone rapid growth, with an increase of over five-fold in the number of processed samples and consequently represents one of the largest resources of analysed shotgun metagenomes. Here, we report the status of the resource in 2016 and give an overview of new developments. In particular, we describe updates to data content, a complete overhaul of the analysis pipeline, streamlining of data presentation via the website and the development of a new web based tool to compare functional analyses of sequence runs within a study. We also highlight two of the higher profile projects that have been analysed using the resource in the last year: the oceanographic projects Ocean Sampling Day and Tara Oceans.


Assuntos
Bases de Dados de Ácidos Nucleicos , Metagenômica , Perfilação da Expressão Gênica , Internet , Oceanos e Mares , Software
10.
Nucleic Acids Res ; 44(D1): D279-85, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26673716

RESUMO

In the last two years the Pfam database (http://pfam.xfam.org) has undergone a substantial reorganisation to reduce the effort involved in making a release, thereby permitting more frequent releases. Arguably the most significant of these changes is that Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set. Building families on reference proteomes sequences brings greater stability, which decreases the amount of manual curation required to maintain them. It also reduces the number of sequences displayed on the website, whilst still providing access to many important model organisms. Matches to the full UniProtKB database are, however, still available and Pfam annotations for individual UniProtKB sequences can still be retrieved. Some Pfam entries (1.6%) which have no matches to reference proteomes remain; we are working with UniProt to see if sequences from them can be incorporated into reference proteomes. Pfam-B, the automatically-generated supplement to Pfam, has been removed. The current release (Pfam 29.0) includes 16 295 entries and 559 clans. The facility to view the relationship between families within a clan has been improved by the introduction of a new tool.


Assuntos
Bases de Dados de Proteínas , Proteínas/classificação , Proteoma/química , Alinhamento de Sequência , Análise de Sequência de Proteína , Anotação de Sequência Molecular
11.
Nature ; 476(7359): 214-9, 2011 Aug 10.
Artigo em Inglês | MEDLINE | ID: mdl-21833088

RESUMO

Multiple sclerosis is a common disease of the central nervous system in which the interplay between inflammatory and neurodegenerative processes typically results in intermittent neurological disturbance followed by progressive accumulation of disability. Epidemiological studies have shown that genetic factors are primarily responsible for the substantially increased frequency of the disease seen in the relatives of affected individuals, and systematic attempts to identify linkage in multiplex families have confirmed that variation within the major histocompatibility complex (MHC) exerts the greatest individual effect on risk. Modestly powered genome-wide association studies (GWAS) have enabled more than 20 additional risk loci to be identified and have shown that multiple variants exerting modest individual effects have a key role in disease susceptibility. Most of the genetic architecture underlying susceptibility to the disease remains to be defined and is anticipated to require the analysis of sample sizes that are beyond the numbers currently available to individual research groups. In a collaborative GWAS involving 9,772 cases of European descent collected by 23 research groups working in 15 different countries, we have replicated almost all of the previously suggested associations and identified at least a further 29 novel susceptibility loci. Within the MHC we have refined the identity of the HLA-DRB1 risk alleles and confirmed that variation in the HLA-A gene underlies the independent protective effect attributable to the class I region. Immunologically relevant genes are significantly overrepresented among those mapping close to the identified loci and particularly implicate T-helper-cell differentiation in the pathogenesis of multiple sclerosis.


Assuntos
Predisposição Genética para Doença/genética , Imunidade Celular/imunologia , Esclerose Múltipla/genética , Esclerose Múltipla/imunologia , Alelos , Diferenciação Celular/imunologia , Europa (Continente)/etnologia , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Antígenos HLA-A/genética , Antígenos HLA-DR/genética , Cadeias HLA-DRB1 , Humanos , Imunidade Celular/genética , Complexo Principal de Histocompatibilidade/genética , Polimorfismo de Nucleotídeo Único/genética , Tamanho da Amostra , Linfócitos T Auxiliares-Indutores/citologia , Linfócitos T Auxiliares-Indutores/imunologia
12.
PLoS Genet ; 8(8): e1002793, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22876189

RESUMO

Genome-wide association studies have identified hundreds of loci for type 2 diabetes, coronary artery disease and myocardial infarction, as well as for related traits such as body mass index, glucose and insulin levels, lipid levels, and blood pressure. These studies also have pointed to thousands of loci with promising but not yet compelling association evidence. To establish association at additional loci and to characterize the genome-wide significant loci by fine-mapping, we designed the "Metabochip," a custom genotyping array that assays nearly 200,000 SNP markers. Here, we describe the Metabochip and its component SNP sets, evaluate its performance in capturing variation across the allele-frequency spectrum, describe solutions to methodological challenges commonly encountered in its analysis, and evaluate its performance as a platform for genotype imputation. The metabochip achieves dramatic cost efficiencies compared to designing single-trait follow-up reagents, and provides the opportunity to compare results across a range of related traits. The metabochip and similar custom genotyping arrays offer a powerful and cost-effective approach to follow-up large-scale genotyping and sequencing studies and advance our understanding of the genetic basis of complex human diseases and traits.


Assuntos
Antropometria/instrumentação , Metabolômica/instrumentação , Análise de Sequência com Séries de Oligonucleotídeos/instrumentação , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Alelos , Antropometria/métodos , Doenças Cardiovasculares/diagnóstico , Doenças Cardiovasculares/genética , Doenças Cardiovasculares/metabolismo , Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Frequência do Gene , Genoma Humano , Estudo de Associação Genômica Ampla , Genótipo , Técnicas de Genotipagem , Humanos , Metabolômica/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Fenótipo
13.
Am J Hum Genet ; 89(5): 619-27, 2011 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-22055160

RESUMO

Abdominal aortic aneurysm (AAA) is a common cause of morbidity and mortality and has a significant heritability. We carried out a genome-wide association discovery study of 1866 patients with AAA and 5435 controls and replication of promising signals (lead SNP with a p value < 1 × 10(-5)) in 2871 additional cases and 32,687 controls and performed further follow-up in 1491 AAA and 11,060 controls. In the discovery study, nine loci demonstrated association with AAA (p < 1 × 10(-5)). In the replication sample, the lead SNP at one of these loci, rs1466535, located within intron 1 of low-density-lipoprotein receptor-related protein 1 (LRP1) demonstrated significant association (p = 0.0042). We confirmed the association of rs1466535 and AAA in our follow-up study (p = 0.035). In a combined analysis (6228 AAA and 49182 controls), rs1466535 had a consistent effect size and direction in all sample sets (combined p = 4.52 × 10(-10), odds ratio 1.15 [1.10-1.21]). No associations were seen for either rs1466535 or the 12q13.3 locus in independent association studies of coronary artery disease, blood pressure, diabetes, or hyperlipidaemia, suggesting that this locus is specific to AAA. Gene-expression studies demonstrated a trend toward increased LRP1 expression for the rs1466535 CC genotype in arterial tissues; there was a significant (p = 0.029) 1.19-fold (1.04-1.36) increase in LRP1 expression in CC homozygotes compared to TT homozygotes in aortic adventitia. Functional studies demonstrated that rs1466535 might alter a SREBP-1 binding site and influence enhancer activity at the locus. In conclusion, this study has identified a biologically plausible genetic variant associated specifically with AAA, and we suggest that this variant has a possible functional role in LRP1 expression.


Assuntos
Aorta/metabolismo , Aneurisma da Aorta Abdominal/genética , Loci Gênicos/genética , Proteína-1 Relacionada a Receptor de Lipoproteína de Baixa Densidade/genética , Polimorfismo de Nucleotídeo Único , Idoso , Estudos de Casos e Controles , Linhagem Celular Tumoral , Interpretação Estatística de Dados , Feminino , Seguimentos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Homozigoto , Humanos , Masculino , Razão de Chances , Especificidade de Órgãos , Fatores de Risco , Proteína de Ligação a Elemento Regulador de Esterol 1/genética
14.
PLoS Genet ; 7(2): e1002003, 2011 Feb 03.
Artigo em Inglês | MEDLINE | ID: mdl-21304890

RESUMO

While there have been studies exploring regulatory variation in one or more tissues, the complexity of tissue-specificity in multiple primary tissues is not yet well understood. We explore in depth the role of cis-regulatory variation in three human tissues: lymphoblastoid cell lines (LCL), skin, and fat. The samples (156 LCL, 160 skin, 166 fat) were derived simultaneously from a subset of well-phenotyped healthy female twins of the MuTHER resource. We discover an abundance of cis-eQTLs in each tissue similar to previous estimates (858 or 4.7% of genes). In addition, we apply factor analysis (FA) to remove effects of latent variables, thus more than doubling the number of our discoveries (1,822 eQTL genes). The unique study design (Matched Co-Twin Analysis--MCTA) permits immediate replication of eQTLs using co-twins (93%-98%) and validation of the considerable gain in eQTL discovery after FA correction. We highlight the challenges of comparing eQTLs between tissues. After verifying previous significance threshold-based estimates of tissue-specificity, we show their limitations given their dependency on statistical power. We propose that continuous estimates of the proportion of tissue-shared signals and direct comparison of the magnitude of effect on the fold change in expression are essential properties that jointly provide a biologically realistic view of tissue-specificity. Under this framework we demonstrate that 30% of eQTLs are shared among the three tissues studied, while another 29% appear exclusively tissue-specific. However, even among the shared eQTLs, a substantial proportion (10%-20%) have significant differences in the magnitude of fold change between genotypic classes across tissues. Our results underline the need to account for the complexity of eQTL tissue-specificity in an effort to assess consequences of such variants for complex traits.


Assuntos
Tecido Adiposo/metabolismo , Genes Reguladores/genética , Locos de Características Quantitativas/genética , Pele/metabolismo , Linhagem Celular , Células Cultivadas , Interpretação Estatística de Dados , Feminino , Perfilação da Expressão Gênica , Genótipo , Humanos , Especificidade de Órgãos/genética , Fenótipo , Gêmeos
15.
Hum Mol Genet ; 20(2): 345-53, 2011 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-21044948

RESUMO

We performed a genome-wide association study (GWAS) in 1705 Parkinson's disease (PD) UK patients and 5175 UK controls, the largest sample size so far for a PD GWAS. Replication was attempted in an additional cohort of 1039 French PD cases and 1984 controls for the 27 regions showing the strongest evidence of association (P< 10(-4)). We replicated published associations in the 4q22/SNCA and 17q21/MAPT chromosome regions (P< 10(-10)) and found evidence for an additional independent association in 4q22/SNCA. A detailed analysis of the haplotype structure at 17q21 showed that there are three separate risk groups within this region. We found weak but consistent evidence of association for common variants located in three previously published associated regions (4p15/BST1, 4p16/GAK and 1q32/PARK16). We found no support for the previously reported SNP association in 12q12/LRRK2. We also found an association of the two SNPs in 4q22/SNCA with the age of onset of the disease.


Assuntos
Cromossomos Humanos Par 17/genética , Predisposição Genética para Doença , Doença de Parkinson/genética , alfa-Sinucleína/genética , Idade de Início , Estudos de Casos e Controles , Estudo de Associação Genômica Ampla , Haplótipos , Humanos , Polimorfismo de Nucleotídeo Único , Tamanho da Amostra , População Branca
16.
Lancet ; 380(9844): 815-23, 2012 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-22763110

RESUMO

BACKGROUND: Osteoarthritis is the most common form of arthritis worldwide and is a major cause of pain and disability in elderly people. The health economic burden of osteoarthritis is increasing commensurate with obesity prevalence and longevity. Osteoarthritis has a strong genetic component but the success of previous genetic studies has been restricted due to insufficient sample sizes and phenotype heterogeneity. METHODS: We undertook a large genome-wide association study (GWAS) in 7410 unrelated and retrospectively and prospectively selected patients with severe osteoarthritis in the arcOGEN study, 80% of whom had undergone total joint replacement, and 11,009 unrelated controls from the UK. We replicated the most promising signals in an independent set of up to 7473 cases and 42,938 controls, from studies in Iceland, Estonia, the Netherlands, and the UK. All patients and controls were of European descent. FINDINGS: We identified five genome-wide significant loci (binomial test p≤5·0×10(-8)) for association with osteoarthritis and three loci just below this threshold. The strongest association was on chromosome 3 with rs6976 (odds ratio 1·12 [95% CI 1·08-1·16]; p=7·24×10(-11)), which is in perfect linkage disequilibrium with rs11177. This SNP encodes a missense polymorphism within the nucleostemin-encoding gene GNL3. Levels of nucleostemin were raised in chondrocytes from patients with osteoarthritis in functional studies. Other significant loci were on chromosome 9 close to ASTN2, chromosome 6 between FILIP1 and SENP6, chromosome 12 close to KLHDC5 and PTHLH, and in another region of chromosome 12 close to CHST11. One of the signals close to genome-wide significance was within the FTO gene, which is involved in regulation of bodyweight-a strong risk factor for osteoarthritis. All risk variants were common in frequency and exerted small effects. INTERPRETATION: Our findings provide insight into the genetics of arthritis and identify new pathways that might be amenable to future therapeutic intervention. FUNDING: arcOGEN was funded by a special purpose grant from Arthritis Research UK.


Assuntos
Osteoartrite/genética , Artroplastia de Substituição , Estudos de Casos e Controles , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Desequilíbrio de Ligação , Masculino , Osteoartrite/cirurgia , Osteoartrite do Quadril/genética , Osteoartrite do Quadril/cirurgia , Osteoartrite do Joelho/genética , Osteoartrite do Joelho/cirurgia , Polimorfismo de Nucleotídeo Único
17.
Nat Genet ; 36(9): 1003-7, 2004 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-15286788

RESUMO

Hartnup disorder (OMIM 234500) is an autosomal recessive abnormality of renal and gastrointestinal neutral amino acid transport noted for its clinical variability. We localized a gene causing Hartnup disorder to chromosome 5p15.33 and cloned a new gene, SLC6A19, in this region. SLC6A19 is a sodium-dependent and chloride-independent neutral amino acid transporter, expressed predominately in kidney and intestine, with properties of system B(0). We identified six mutations in SLC6A19 that cosegregated with disease in the predicted recessive manner, with most affected individuals being compound heterozygotes. The disease-causing mutations that we tested reduced neutral amino acid transport function in vitro. Population frequencies for the most common mutated SLC6A19 alleles are 0.007 for 517G --> A and 0.001 for 718C --> T. Our findings indicate that SLC6A19 is the long-sought gene that is mutated in Hartnup disorder; its identification provides the opportunity to examine the inconsistent multisystemic features of this disorder.


Assuntos
Sistemas de Transporte de Aminoácidos Neutros/genética , Doença de Hartnup/genética , Mutação , Sequência de Aminoácidos , Cromossomos Humanos Par 5 , Clonagem Molecular , Frequência do Gene , Humanos , Rim/metabolismo , Dados de Sequência Molecular , Linhagem
18.
PLoS Genet ; 5(4): e1000445, 2009 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-19343178

RESUMO

Recent genome-wide (GW) scans have identified several independent loci affecting human stature, but their contribution through the different skeletal components of height is still poorly understood. We carried out a genome-wide scan in 12,611 participants, followed by replication in an additional 7,187 individuals, and identified 17 genomic regions with GW-significant association with height. Of these, two are entirely novel (rs11809207 in CATSPER4, combined P-value = 6.1x10(-8) and rs910316 in TMED10, P-value = 1.4x10(-7)) and two had previously been described with weak statistical support (rs10472828 in NPR3, P-value = 3x10(-7) and rs849141 in JAZF1, P-value = 3.2x10(-11)). One locus (rs1182188 at GNA12) identifies the first height eQTL. We also assessed the contribution of height loci to the upper- (trunk) and lower-body (hip axis and femur) skeletal components of height. We find evidence for several loci associated with trunk length (including rs6570507 in GPR126, P-value = 4x10(-5) and rs6817306 in LCORL, P-value = 4x10(-4)), hip axis length (including rs6830062 at LCORL, P-value = 4.8x10(-4) and rs4911494 at UQCC, P-value = 1.9x10(-4)), and femur length (including rs710841 at PRKG2, P-value = 2.4x10(-5) and rs10946808 at HIST1H1D, P-value = 6.4x10(-6)). Finally, we used conditional analyses to explore a possible differential contribution of the height loci to these different skeletal size measurements. In addition to validating four novel loci controlling adult stature, our study represents the first effort to assess the contribution of genetic loci to three skeletal components of height. Further statistical tests in larger numbers of individuals will be required to verify if the height loci affect height preferentially through these subcomponents of height.


Assuntos
Estatura , Osso e Ossos/química , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Coortes , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Esqueleto , População Branca/genética , Adulto Jovem
19.
Blood ; 113(16): 3831-7, 2009 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-19221038

RESUMO

Mean platelet volume (MPV) and platelet count (PLT) are highly heritable and tightly regulated traits. We performed a genome-wide association study for MPV and identified one SNP, rs342293, as having highly significant and reproducible association with MPV (per-G allele effect 0.016 +/- 0.001 log fL; P < 1.08 x 10(-24)) and PLT (per-G effect -4.55 +/- 0.80 10(9)/L; P < 7.19 x 10(-8)) in 8586 healthy subjects. Whole-genome expression analysis in the 1-MB region showed a significant association with platelet transcript levels for PIK3CG (n = 35; P = .047). The G allele at rs342293 was also associated with decreased binding of annexin V to platelets activated with collagen-related peptide (n = 84; P = .003). The region 7q22.3 identifies the first QTL influencing platelet volume, counts, and function in healthy subjects. Notably, the association signal maps to a chromosome region implicated in myeloid malignancies, indicating this site as an important regulatory site for hematopoiesis. The identification of loci regulating MPV by this and other studies will increase our insight in the processes of megakaryopoiesis and proplatelet formation, and it may aid the identification of genes that are somatically mutated in essential thrombocytosis.


Assuntos
Plaquetas , Cromossomos Humanos Par 7/genética , Genoma Humano/genética , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas/genética , Trombopoese/genética , Adulto , Idoso , Mapeamento Cromossômico , Estudos de Coortes , Feminino , Regulação da Expressão Gênica/genética , Neoplasias Hematológicas/genética , Humanos , Masculino , Pessoa de Meia-Idade , Contagem de Plaquetas , Trombocitemia Essencial/genética
20.
Arterioscler Thromb Vasc Biol ; 30(7): 1467-73, 2010 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-20395598

RESUMO

OBJECTIVE: To examine variants at the 9p21 locus in a case-control study of acute myocardial infarction (MI) in Pakistanis and to perform an updated meta-analysis of published studies in people of European ancestry. METHODS AND RESULTS: A total of 1851 patients with first-ever confirmed MI and 1903 controls were genotyped for 89 tagging single-nucleotide polymorphisms at locus 9p21, including the lead variant (rs1333049) identified by the Wellcome Trust Case Control Consortium. Minor allele frequencies and extent of linkage disequilibrium observed in Pakistanis were broadly similar to those seen in Europeans. In the Pakistani study, 6 variants were associated with MI (P<10(-2)) in the initial sample set, and in an additional 741 cases and 674 controls in whom further genotyping was performed for these variants. For Pakistanis, the odds ratio for MI was 1.13 (95% CI, 1.05 to 1.22; P=2 x 10(-3)) for each copy of the C allele at rs1333049. In comparison, a meta-analysis of studies in Europeans yielded an odds ratio of 1.31 (95% CI, 1.26 to 1.37) for the same variant (P=1 x 10(-3) for heterogeneity). Meta-analyses of 23 variants, in up to 38,250 cases and 84,820 controls generally yielded higher values in Europeans than in Pakistanis. CONCLUSIONS: To our knowledge, this study provides the first demonstration that variants at the 9p21 locus are significantly associated with MI risk in Pakistanis. However, association signals at this locus were weaker in Pakistanis than those in European studies.


Assuntos
Povo Asiático/genética , Cromossomos Humanos Par 9 , Infarto do Miocárdio/genética , Polimorfismo de Nucleotídeo Único , População Branca/genética , Estudos de Casos e Controles , Europa (Continente) , Feminino , Frequência do Gene , Predisposição Genética para Doença , Humanos , Desequilíbrio de Ligação , Modelos Logísticos , Masculino , Pessoa de Meia-Idade , Infarto do Miocárdio/etnologia , Razão de Chances , Paquistão , Fenótipo , Medição de Risco , Fatores de Risco
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA