Pesquisa | BVS IEC

1.

Uganda Genome Resource Enables Insights into Population History and Genomic Discovery in Africa.

Gurdasani, Deepti; Carstensen, Tommy; Fatumo, Segun; Chen, Guanjie; Franklin, Chris S; Prado-Martinez, Javier; Bouman, Heleen; Abascal, Federico; Haber, Marc; Tachmazidou, Ioanna; Mathieson, Iain; Ekoru, Kenneth; DeGorter, Marianne K; Nsubuga, Rebecca N; Finan, Chris; Wheeler, Eleanor; Chen, Li; Cooper, David N; Schiffels, Stephan; Chen, Yuan; Ritchie, Graham R S; Pollard, Martin O; Fortune, Mary D; Mentzer, Alex J; Garrison, Erik; Bergström, Anders; Hatzikotoulas, Konstantinos; Adeyemo, Adebowale; Doumatey, Ayo; Elding, Heather; Wain, Louise V; Ehret, Georg; Auer, Paul L; Kooperberg, Charles L; Reiner, Alexander P; Franceschini, Nora; Maher, Dermot; Montgomery, Stephen B; Kadie, Carl; Widmer, Chris; Xue, Yali; Seeley, Janet; Asiki, Gershim; Kamali, Anatoli; Young, Elizabeth H; Pomilla, Cristina; Soranzo, Nicole; Zeggini, Eleftheria; Pirie, Fraser; Morris, Andrew P.

Cell ; 179(4): 984-1002.e36, 2019 10 31.

Artigo em Inglês | MEDLINE | ID: mdl-31675503

RESUMO

Genomic studies in African populations provide unique opportunities to understand disease etiology, human diversity, and population history. In the largest study of its kind, comprising genome-wide data from 6,400 individuals and whole-genome sequences from 1,978 individuals from rural Uganda, we find evidence of geographically correlated fine-scale population substructure. Historically, the ancestry of modern Ugandans was best represented by a mixture of ancient East African pastoralists. We demonstrate the value of the largest sequence panel from Africa to date as an imputation resource. Examining 34 cardiometabolic traits, we show systematic differences in trait heritability between European and African populations, probably reflecting the differential impact of genes and environment. In a multi-trait pan-African GWAS of up to 14,126 individuals, we identify novel loci associated with anthropometric, hematological, lipid, and glycemic traits. We find that several functionally important signals are driven by Africa-specific variants, highlighting the value of studying diverse populations across the region.

Assuntos

População Negra/genética , Predisposição Genética para Doença , Genoma Humano/genética , Genômica , Feminino , Frequência do Gene/genética , Estudo de Associação Genômica Ampla , Humanos , Masculino , Polimorfismo de Nucleotídeo Único/genética , Uganda/epidemiologia , Sequenciamento Completo do Genoma

2.

The African Genome Variation Project shapes medical genetics in Africa.

Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O; Choudhury, Ananyo; Ritchie, Graham R S; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N; Young, Elizabeth H; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S.

Nature ; 517(7534): 327-32, 2015 Jan 15.

Artigo em Inglês | MEDLINE | ID: mdl-25470054

RESUMO

Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.

Assuntos

Variação Genética/genética , Genética Médica/tendências , Genoma Humano/genética , Genômica/tendências , África , África Subsaariana , Ásia/etnologia , Europa (Continente)/etnologia , Humanos , Fatores de Risco , Seleção Genética/genética

3.

Whole-Genome Sequencing Coupled to Imputation Discovers Genetic Signals for Anthropometric Traits.

Tachmazidou, Ioanna; Süveges, Dániel; Min, Josine L; Ritchie, Graham R S; Steinberg, Julia; Walter, Klaudia; Iotchkova, Valentina; Schwartzentruber, Jeremy; Huang, Jie; Memari, Yasin; McCarthy, Shane; Crawford, Andrew A; Bombieri, Cristina; Cocca, Massimiliano; Farmaki, Aliki-Eleni; Gaunt, Tom R; Jousilahti, Pekka; Kooijman, Marjolein N; Lehne, Benjamin; Malerba, Giovanni; Männistö, Satu; Matchan, Angela; Medina-Gomez, Carolina; Metrustry, Sarah J; Nag, Abhishek; Ntalla, Ioanna; Paternoster, Lavinia; Rayner, Nigel W; Sala, Cinzia; Scott, William R; Shihab, Hashem A; Southam, Lorraine; St Pourcain, Beate; Traglia, Michela; Trajanoska, Katerina; Zaza, Gialuigi; Zhang, Weihua; Artigas, María S; Bansal, Narinder; Benn, Marianne; Chen, Zhongsheng; Danecek, Petr; Lin, Wei-Yu; Locke, Adam; Luan, Jian'an; Manning, Alisa K; Mulas, Antonella; Sidore, Carlo; Tybjaerg-Hansen, Anne; Varbo, Anette.

Am J Hum Genet ; 100(6): 865-884, 2017 Jun 01.

Artigo em Inglês | MEDLINE | ID: mdl-28552196

RESUMO

Deep sequence-based imputation can enhance the discovery power of genome-wide association studies by assessing previously unexplored variation across the common- and low-frequency spectra. We applied a hybrid whole-genome sequencing (WGS) and deep imputation approach to examine the broader allelic architecture of 12 anthropometric traits associated with height, body mass, and fat distribution in up to 267,616 individuals. We report 106 genome-wide significant signals that have not been previously identified, including 9 low-frequency variants pointing to functional candidates. Of the 106 signals, 6 are in genomic regions that have not been implicated with related traits before, 28 are independent signals at previously reported regions, and 72 represent previously reported signals for a different anthropometric trait. 71% of signals reside within genes and fine mapping resolves 23 signals to one or two likely causal variants. We confirm genetic overlap between human monogenic and polygenic anthropometric traits and find signal enrichment in cis expression QTLs in relevant tissues. Our results highlight the potential of WGS strategies to enhance biologically relevant discoveries across the frequency spectrum.

Assuntos

Antropometria , Genoma Humano , Estudo de Associação Genômica Ampla , Locos de Características Quantitativas/genética , Análise de Sequência de DNA/métodos , Estatura/genética , Estudos de Coortes , Metilação de DNA/genética , Bases de Dados Genéticas , Feminino , Variação Genética , Humanos , Lipodistrofia/genética , Masculino , Metanálise como Assunto , Obesidade/genética , Mapeamento Físico do Cromossomo , Caracteres Sexuais , Síndrome , Reino Unido

4.

Very low-depth whole-genome sequencing in complex trait association studies.

Gilly, Arthur; Southam, Lorraine; Suveges, Daniel; Kuchenbaecker, Karoline; Moore, Rachel; Melloni, Giorgio E M; Hatzikotoulas, Konstantinos; Farmaki, Aliki-Eleni; Ritchie, Graham; Schwartzentruber, Jeremy; Danecek, Petr; Kilian, Britt; Pollard, Martin O; Ge, Xiangyu; Tsafantakis, Emmanouil; Dedoussis, George; Zeggini, Eleftheria.

Bioinformatics ; 35(15): 2555-2561, 2019 08 01.

Artigo em Inglês | MEDLINE | ID: mdl-30576415

RESUMO

MOTIVATION: Very low-depth sequencing has been proposed as a cost-effective approach to capture low-frequency and rare variation in complex trait association studies. However, a full characterization of the genotype quality and association power for very low-depth sequencing designs is still lacking. RESULTS: We perform cohort-wide whole-genome sequencing (WGS) at low depth in 1239 individuals (990 at 1× depth and 249 at 4× depth) from an isolated population, and establish a robust pipeline for calling and imputing very low-depth WGS genotypes from standard bioinformatics tools. Using genotyping chip, whole-exome sequencing (75× depth) and high-depth (22×) WGS data in the same samples, we examine in detail the sensitivity of this approach, and show that imputed 1× WGS recapitulates 95.2% of variants found by imputed GWAS with an average minor allele concordance of 97% for common and low-frequency variants. In our study, 1× further allowed the discovery of 140 844 true low-frequency variants with 73% genotype concordance when compared to high-depth WGS data. Finally, using association results for 57 quantitative traits, we show that very low-depth WGS is an efficient alternative to imputed GWAS chip designs, allowing the discovery of up to twice as many true association signals than the classical imputed GWAS design. AVAILABILITY AND IMPLEMENTATION: The HELIC genotype and WGS datasets have been deposited to the European Genome-phenome Archive (https://www.ebi.ac.uk/ega/home): EGAD00010000518; EGAD00010000522; EGAD00010000610; EGAD00001001636, EGAD00001001637. The peakplotter software is available at https://github.com/wtsi-team144/peakplotter, the transformPhenotype app can be downloaded at https://github.com/wtsi-team144/transformPhenotype. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo de Nucleotídeo Único , Genótipo , Humanos , Herança Multifatorial , Sequenciamento Completo do Genoma

5.

Evaluation of shared genetic aetiology between osteoarthritis and bone mineral density identifies SMAD3 as a novel osteoarthritis risk locus.

Hackinger, Sophie; Trajanoska, Katerina; Styrkarsdottir, Unnur; Zengini, Eleni; Steinberg, Julia; Ritchie, Graham R S; Hatzikotoulas, Konstantinos; Gilly, Arthur; Evangelou, Evangelos; Kemp, John P; Evans, David; Ingvarsson, Thorvaldur; Jonsson, Helgi; Thorsteinsdottir, Unnur; Stefansson, Kari; McCaskie, Andrew W; Brooks, Roger A; Wilkinson, Jeremy M; Rivadeneira, Fernando; Zeggini, Eleftheria.

Hum Mol Genet ; 26(19): 3850-3858, 2017 10 01.

Artigo em Inglês | MEDLINE | ID: mdl-28934396

RESUMO

Osteoarthritis (OA) is a common complex disease with high public health burden and no curative therapy. High bone mineral density (BMD) is associated with an increased risk of developing OA, suggesting a shared underlying biology. Here, we performed the first systematic overlap analysis of OA and BMD on a genome wide scale. We used summary statistics from the GEFOS consortium for lumbar spine (n = 31,800) and femoral neck (n = 32,961) BMD, and from the arcOGEN consortium for three OA phenotypes (hip, ncases=3,498; knee, ncases=3,266; hip and/or knee, ncases=7,410; ncontrols=11,009). Performing LD score regression we found a significant genetic correlation between the combined OA phenotype (hip and/or knee) and lumbar spine BMD (rg=0.18, P = 2.23 × 10-2), which may be driven by the presence of spinal osteophytes. We identified 143 variants with evidence for cross-phenotype association which we took forward for replication in independent large-scale OA datasets, and subsequent meta-analysis with arcOGEN for a total sample size of up to 23,425 cases and 236,814 controls. We found robustly replicating evidence for association with OA at rs12901071 (OR 1.08 95% CI 1.05-1.11, Pmeta=3.12 × 10-10), an intronic variant in the SMAD3 gene, which is known to play a role in bone remodeling and cartilage maintenance. We were able to confirm expression of SMAD3 in intact and degraded cartilage of the knee and hip. Our findings provide the first systematic evaluation of pleiotropy between OA and BMD, highlight genes with biological relevance to both traits, and establish a robust new OA genetic risk locus at SMAD3.

Assuntos

Densidade Óssea/genética , Osteoartrite/genética , Proteína Smad3/genética , Bases de Dados de Ácidos Nucleicos , Colo do Fêmur/química , Colo do Fêmur/fisiologia , Estudos de Associação Genética/métodos , Pleiotropia Genética/genética , Humanos , Vértebras Lombares/fisiologia , Osteoartrite/etiologia , Osteoartrite do Quadril/genética , Osteoartrite do Joelho/genética , Fatores de Risco , Proteína Smad3/metabolismo

6.

Very low-depth sequencing in a founder population identifies a cardioprotective APOC3 signal missed by genome-wide imputation.

Gilly, Arthur; Ritchie, Graham Rs; Southam, Lorraine; Farmaki, Aliki-Eleni; Tsafantakis, Emmanouil; Dedoussis, George; Zeggini, Eleftheria.

Hum Mol Genet ; 25(11): 2360-2365, 2016 06 01.

Artigo em Inglês | MEDLINE | ID: mdl-27146844

RESUMO

Cohort-wide very low-depth whole-genome sequencing (WGS) can comprehensively capture low-frequency sequence variation for the cost of a dense genome-wide genotyping array. Here, we analyse 1x sequence data across the APOC3 gene in a founder population from the island of Crete in Greece (n = 1239) and find significant evidence for association with blood triglyceride levels with the previously reported R19X cardioprotective null mutation (ß = -1.09,σ = 0.163, P = 8.2 × 10-11) and a second loss of function mutation, rs138326449 (ß = -1.17,σ = 0.188, P = 1.14 × 10-9). The signal cannot be recapitulated by imputing genome-wide genotype data on a large reference panel of 5122 individuals including 249 with 4x WGS data from the same population. Gene-level meta-analysis with other studies reporting burden signals at APOC3 provides robust evidence for a replicable cardioprotective rare variant aggregation (P = 3.2 × 10-31, n = 13 480).

Assuntos

Apolipoproteína C-III/genética , Doenças Cardiovasculares/genética , Sequenciamento de Nucleotídeos em Larga Escala , Triglicerídeos/genética , Alelos , Doenças Cardiovasculares/sangue , Doenças Cardiovasculares/patologia , Feminino , Efeito Fundador , Genética Populacional , Genoma Humano , Estudo de Associação Genômica Ampla , Genótipo , Grécia , Humanos , Masculino , Mutação , Polimorfismo de Nucleotídeo Único , Triglicerídeos/sangue , População Branca/genética

7.

Revisiting the thrifty gene hypothesis via 65 loci associated with susceptibility to type 2 diabetes.

Ayub, Qasim; Moutsianas, Loukas; Chen, Yuan; Panoutsopoulou, Kalliope; Colonna, Vincenza; Pagani, Luca; Prokopenko, Inga; Ritchie, Graham R S; Tyler-Smith, Chris; McCarthy, Mark I; Zeggini, Eleftheria; Xue, Yali.

Am J Hum Genet ; 94(2): 176-85, 2014 Feb 06.

Artigo em Inglês | MEDLINE | ID: mdl-24412096

RESUMO

We have investigated the evidence for positive selection in samples of African, European, and East Asian ancestry at 65 loci associated with susceptibility to type 2 diabetes (T2D) previously identified through genome-wide association studies. Selection early in human evolutionary history is predicted to lead to ancestral risk alleles shared between populations, whereas late selection would result in population-specific signals at derived risk alleles. By using a wide variety of tests based on the site frequency spectrum, haplotype structure, and population differentiation, we found no global signal of enrichment for positive selection when we considered all T2D risk loci collectively. However, in a locus-by-locus analysis, we found nominal evidence for positive selection at 14 of the loci. Selection favored the protective and risk alleles in similar proportions, rather than the risk alleles specifically as predicted by the thrifty gene hypothesis, and may not be related to influence on diabetes. Overall, we conclude that past positive selection has not been a powerful influence driving the prevalence of T2D risk alleles.

Assuntos

Diabetes Mellitus Tipo 2/epidemiologia , Diabetes Mellitus Tipo 2/genética , Loci Gênicos , Predisposição Genética para Doença , Seleção Genética , Alelos , Povo Asiático/genética , População Negra/genética , Frequência do Gene , Estudo de Associação Genômica Ampla , Haplótipos , Humanos , Polimorfismo de Nucleotídeo Único , Fatores de Risco , População Branca/genética

8.

Functional annotation of noncoding sequence variants.

Ritchie, Graham R S; Dunham, Ian; Zeggini, Eleftheria; Flicek, Paul.

Nat Methods ; 11(3): 294-6, 2014 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-24487584

RESUMO

Identifying functionally relevant variants against the background of ubiquitous genetic variation is a major challenge in human genetics. For variants in protein-coding regions, our understanding of the genetic code and splicing allows us to identify likely candidates, but interpreting variants outside genic regions is more difficult. Here we present genome-wide annotation of variants (GWAVA), a tool that supports prioritization of noncoding variants by integrating various genomic and epigenomic annotations.

Assuntos

Anotação de Sequência Molecular , Regiões não Traduzidas/genética , Algoritmos , Simulação por Computador , Variação Genética , Humanos

9.

Computational approaches to identify functional genetic variants in cancer genomes.

Gonzalez-Perez, Abel; Mustonen, Ville; Reva, Boris; Ritchie, Graham R S; Creixell, Pau; Karchin, Rachel; Vazquez, Miguel; Fink, J Lynn; Kassahn, Karin S; Pearson, John V; Bader, Gary D; Boutros, Paul C; Muthuswamy, Lakshmi; Ouellette, B F Francis; Reimand, Jüri; Linding, Rune; Shibata, Tatsuhiro; Valencia, Alfonso; Butler, Adam; Dronov, Serge; Flicek, Paul; Shannon, Nick B; Carter, Hannah; Ding, Li; Sander, Chris; Stuart, Josh M; Stein, Lincoln D; Lopez-Bigas, Nuria.

Nat Methods ; 10(8): 723-9, 2013 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-23900255

RESUMO

The International Cancer Genome Consortium (ICGC) aims to catalog genomic abnormalities in tumors from 50 different cancer types. Genome sequencing reveals hundreds to thousands of somatic mutations in each tumor but only a minority of these drive tumor progression. We present the result of discussions within the ICGC on how to address the challenge of identifying mutations that contribute to oncogenesis, tumor maintenance or response to therapy, and recommend computational techniques to annotate somatic variants and predict their impact on cancer phenotype.

Assuntos

Biologia Computacional/métodos , Genoma Humano , Neoplasias/genética , Variação Genética , Humanos , Mutação

10.

The Ensembl REST API: Ensembl Data for Any Language.

Yates, Andrew; Beal, Kathryn; Keenan, Stephen; McLaren, William; Pignatelli, Miguel; Ritchie, Graham R S; Ruffier, Magali; Taylor, Kieron; Vullo, Alessandro; Flicek, Paul.

Bioinformatics ; 31(1): 143-5, 2015 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-25236461

RESUMO

MOTIVATION: We present a Web service to access Ensembl data using Representational State Transfer (REST). The Ensembl REST server enables the easy retrieval of a wide range of Ensembl data by most programming languages, using standard formats such as JSON and FASTA while minimizing client work. We also introduce bindings to the popular Ensembl Variant Effect Predictor tool permitting large-scale programmatic variant analysis independent of any specific programming language. AVAILABILITY AND IMPLEMENTATION: The Ensembl REST API can be accessed at http://rest.ensembl.org and source code is freely available under an Apache 2.0 license from http://github.com/Ensembl/ensembl-rest.

Assuntos

Biologia Computacional/métodos , Bases de Dados Factuais , Linguagens de Programação , Software , Variação Genética , Genômica , Humanos

11.

Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction.

Frankish, Adam; Uszczynska, Barbara; Ritchie, Graham R S; Gonzalez, Jose M; Pervouchine, Dmitri; Petryszak, Robert; Mudge, Jonathan M; Fonseca, Nuno; Brazma, Alvis; Guigo, Roderic; Harrow, Jennifer.

BMC Genomics ; 16 Suppl 8: S2, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26110515

RESUMO

BACKGROUND: A vast amount of DNA variation is being identified by increasingly large-scale exome and genome sequencing projects. To be useful, variants require accurate functional annotation and a wide range of tools are available to this end. McCarthy et al recently demonstrated the large differences in prediction of loss-of-function (LoF) variation when RefSeq and Ensembl transcripts are used for annotation, highlighting the importance of the reference transcripts on which variant functional annotation is based. RESULTS: We describe a detailed analysis of the similarities and differences between the gene and transcript annotation in the GENCODE and RefSeq genesets. We demonstrate that the GENCODE Comprehensive set is richer in alternative splicing, novel CDSs, novel exons and has higher genomic coverage than RefSeq, while the GENCODE Basic set is very similar to RefSeq. Using RNAseq data we show that exons and introns unique to one geneset are expressed at a similar level to those common to both. We present evidence that the differences in gene annotation lead to large differences in variant annotation where GENCODE and RefSeq are used as reference transcripts, although this is predominantly confined to non-coding transcripts and UTR sequence, with at most ~30% of LoF variants annotated discordantly. We also describe an investigation of dominant transcript expression, showing that it both supports the utility of the GENCODE Basic set in providing a smaller set of more highly expressed transcripts and provides a useful, biologically-relevant filter for further reducing the complexity of the transcriptome. CONCLUSIONS: The reference transcripts selected for variant functional annotation do have a large effect on the outcome. The GENCODE Comprehensive transcripts contain more exons, have greater genomic coverage and capture many more variants than RefSeq in both genome and exome datasets, while the GENCODE Basic set shows a higher degree of concordance with RefSeq and has fewer unique features. We propose that the GENCODE Comprehensive set has great utility for the discovery of new variants with functional potential, while the GENCODE Basic set is more suitable for applications demanding less complex interpretation of functional variants.

Assuntos

Biologia Computacional , Genoma Humano , Anotação de Sequência Molecular , Isoformas de Proteínas/metabolismo , Software , Processamento Alternativo , Bases de Dados Genéticas , Humanos , Isoformas de Proteínas/genética , Transcriptoma

12.

Ensembl 2013.

Flicek, Paul; Ahmed, Ikhlak; Amode, M Ridwan; Barrell, Daniel; Beal, Kathryn; Brent, Simon; Carvalho-Silva, Denise; Clapham, Peter; Coates, Guy; Fairley, Susan; Fitzgerald, Stephen; Gil, Laurent; García-Girón, Carlos; Gordon, Leo; Hourlier, Thibaut; Hunt, Sarah; Juettemann, Thomas; Kähäri, Andreas K; Keenan, Stephen; Komorowska, Monika; Kulesha, Eugene; Longden, Ian; Maurel, Thomas; McLaren, William M; Muffato, Matthieu; Nag, Rishi; Overduin, Bert; Pignatelli, Miguel; Pritchard, Bethan; Pritchard, Emily; Riat, Harpreet Singh; Ritchie, Graham R S; Ruffier, Magali; Schuster, Michael; Sheppard, Daniel; Sobral, Daniel; Taylor, Kieron; Thormann, Anja; Trevanion, Stephen; White, Simon; Wilder, Steven P; Aken, Bronwen L; Birney, Ewan; Cunningham, Fiona; Dunham, Ian; Harrow, Jennifer; Herrero, Javier; Hubbard, Tim J P; Johnson, Nathan; Kinsella, Rhoda.

Nucleic Acids Res ; 41(Database issue): D48-55, 2013 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-23203987

RESUMO

The Ensembl project (http://www.ensembl.org) provides genome information for sequenced chordate genomes with a particular focus on human, mouse, zebrafish and rat. Our resources include evidenced-based gene sets for all supported species; large-scale whole genome multiple species alignments across vertebrates and clade-specific alignments for eutherian mammals, primates, birds and fish; variation data resources for 17 species and regulation annotations based on ENCODE and other data sets. Ensembl data are accessible through the genome browser at http://www.ensembl.org and through other tools and programmatic interfaces.

Assuntos

Bases de Dados Genéticas , Genômica , Animais , Regulação da Expressão Gênica , Variação Genética , Humanos , Internet , Camundongos , Anotação de Sequência Molecular , Ratos , Software , Peixe-Zebra/genética

13.

Ensembl 2012.

Flicek, Paul; Amode, M Ridwan; Barrell, Daniel; Beal, Kathryn; Brent, Simon; Carvalho-Silva, Denise; Clapham, Peter; Coates, Guy; Fairley, Susan; Fitzgerald, Stephen; Gil, Laurent; Gordon, Leo; Hendrix, Maurice; Hourlier, Thibaut; Johnson, Nathan; Kähäri, Andreas K; Keefe, Damian; Keenan, Stephen; Kinsella, Rhoda; Komorowska, Monika; Koscielny, Gautier; Kulesha, Eugene; Larsson, Pontus; Longden, Ian; McLaren, William; Muffato, Matthieu; Overduin, Bert; Pignatelli, Miguel; Pritchard, Bethan; Riat, Harpreet Singh; Ritchie, Graham R S; Ruffier, Magali; Schuster, Michael; Sobral, Daniel; Tang, Y Amy; Taylor, Kieron; Trevanion, Stephen; Vandrovcova, Jana; White, Simon; Wilson, Mark; Wilder, Steven P; Aken, Bronwen L; Birney, Ewan; Cunningham, Fiona; Dunham, Ian; Durbin, Richard; Fernández-Suarez, Xosé M; Harrow, Jennifer; Herrero, Javier; Hubbard, Tim J P.

Nucleic Acids Res ; 40(Database issue): D84-90, 2012 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-22086963

RESUMO

The Ensembl project (http://www.ensembl.org) provides genome resources for chordate genomes with a particular focus on human genome data as well as data for key model organisms such as mouse, rat and zebrafish. Five additional species were added in the last year including gibbon (Nomascus leucogenys) and Tasmanian devil (Sarcophilus harrisii) bringing the total number of supported species to 61 as of Ensembl release 64 (September 2011). Of these, 55 species appear on the main Ensembl website and six species are provided on the Ensembl preview site (Pre!Ensembl; http://pre.ensembl.org) with preliminary support. The past year has also seen improvements across the project.

Assuntos

Bases de Dados Genéticas , Genômica , Animais , Regulação da Expressão Gênica , Variação Genética , Humanos , Camundongos , Anotação de Sequência Molecular , Ratos

14.

Ensembl 2011.

Flicek, Paul; Amode, M Ridwan; Barrell, Daniel; Beal, Kathryn; Brent, Simon; Chen, Yuan; Clapham, Peter; Coates, Guy; Fairley, Susan; Fitzgerald, Stephen; Gordon, Leo; Hendrix, Maurice; Hourlier, Thibaut; Johnson, Nathan; Kähäri, Andreas; Keefe, Damian; Keenan, Stephen; Kinsella, Rhoda; Kokocinski, Felix; Kulesha, Eugene; Larsson, Pontus; Longden, Ian; McLaren, William; Overduin, Bert; Pritchard, Bethan; Riat, Harpreet Singh; Rios, Daniel; Ritchie, Graham R S; Ruffier, Magali; Schuster, Michael; Sobral, Daniel; Spudich, Giulietta; Tang, Y Amy; Trevanion, Stephen; Vandrovcova, Jana; Vilella, Albert J; White, Simon; Wilder, Steven P; Zadissa, Amonida; Zamora, Jorge; Aken, Bronwen L; Birney, Ewan; Cunningham, Fiona; Dunham, Ian; Durbin, Richard; Fernández-Suarez, Xosé M; Herrero, Javier; Hubbard, Tim J P; Parker, Anne; Proctor, Glenn.

Nucleic Acids Res ; 39(Database issue): D800-6, 2011 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-21045057

RESUMO

The Ensembl project (http://www.ensembl.org) seeks to enable genomic science by providing high quality, integrated annotation on chordate and selected eukaryotic genomes within a consistent and accessible infrastructure. All supported species include comprehensive, evidence-based gene annotations and a selected set of genomes includes additional data focused on variation, comparative, evolutionary, functional and regulatory annotation. The most advanced resources are provided for key species including human, mouse, rat and zebrafish reflecting the popularity and importance of these species in biomedical research. As of Ensembl release 59 (August 2010), 56 species are supported of which 5 have been added in the past year. Since our previous report, we have substantially improved the presentation and integration of both data of disease relevance and the regulatory state of different cell types.

Assuntos

Bases de Dados Genéticas , Genômica , Animais , Variação Genética , Humanos , Camundongos , Anotação de Sequência Molecular , Ratos , Sequências Reguladoras de Ácido Nucleico , Software , Peixe-Zebra/genética

15.

A combined functional annotation score for non-synonymous variants.

Lopes, Margarida C; Joyce, Chris; Ritchie, Graham R S; John, Sally L; Cunningham, Fiona; Asimit, Jennifer; Zeggini, Eleftheria.

Hum Hered ; 73(1): 47-51, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-22261837

RESUMO

AIMS: Next-generation sequencing has opened the possibility of large-scale sequence-based disease association studies. A major challenge in interpreting whole-exome data is predicting which of the discovered variants are deleterious or neutral. To address this question in silico, we have developed a score called Combined Annotation scoRing toOL (CAROL), which combines information from 2 bioinformatics tools: PolyPhen-2 and SIFT, in order to improve the prediction of the effect of non-synonymous coding variants. METHODS: We used a weighted Z method that combines the probabilistic scores of PolyPhen-2 and SIFT. We defined 2 dataset pairs to train and test CAROL using information from the dbSNP: 'HGMD-PUBLIC' and 1000 Genomes Project databases. The training pair comprises a total of 980 positive control (disease-causing) and 4,845 negative control (non-disease-causing) variants. The test pair consists of 1,959 positive and 9,691 negative controls. RESULTS: CAROL has higher predictive power and accuracy for the effect of non-synonymous variants than each individual annotation tool (PolyPhen-2 and SIFT) and benefits from higher coverage. CONCLUSION: The combination of annotation tools can help improve automated prediction of whole-genome/exome non-synonymous variant functional consequences.

Assuntos

Genômica/métodos , Anotação de Sequência Molecular/métodos , Software , Algoritmos , Humanos , Polimorfismo de Nucleotídeo Único , Curva ROC

16.

Modernizing reference genome assemblies.

Church, Deanna M; Schneider, Valerie A; Graves, Tina; Auger, Katherine; Cunningham, Fiona; Bouk, Nathan; Chen, Hsiu-Chuan; Agarwala, Richa; McLaren, William M; Ritchie, Graham R S; Albracht, Derek; Kremitzki, Milinn; Rock, Susan; Kotkiewicz, Holland; Kremitzki, Colin; Wollam, Aye; Trani, Lee; Fulton, Lucinda; Fulton, Robert; Matthews, Lucy; Whitehead, Siobhan; Chow, Will; Torrance, James; Dunn, Matthew; Harden, Glenn; Threadgold, Glen; Wood, Jonathan; Collins, Joanna; Heath, Paul; Griffiths, Guy; Pelan, Sarah; Grafham, Darren; Eichler, Evan E; Weinstock, George; Mardis, Elaine R; Wilson, Richard K; Howe, Kerstin; Flicek, Paul; Hubbard, Tim.

PLoS Biol ; 9(7): e1001091, 2011 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-21750661

Assuntos

Bases de Dados Genéticas , Genoma Humano , Humanos , Cooperação Internacional

17.

GARFIELD classifies disease-relevant genomic features through integration of functional annotations with association signals.

Iotchkova, Valentina; Ritchie, Graham R S; Geihs, Matthias; Morganella, Sandro; Min, Josine L; Walter, Klaudia; Timpson, Nicholas John; Dunham, Ian; Birney, Ewan; Soranzo, Nicole.

Nat Genet ; 51(2): 343-353, 2019 02.

Artigo em Inglês | MEDLINE | ID: mdl-30692680

RESUMO

Loci discovered by genome-wide association studies predominantly map outside protein-coding genes. The interpretation of the functional consequences of non-coding variants can be greatly enhanced by catalogs of regulatory genomic regions in cell lines and primary tissues. However, robust and readily applicable methods are still lacking by which to systematically evaluate the contribution of these regions to genetic variation implicated in diseases or quantitative traits. Here we propose a novel approach that leverages genome-wide association studies' findings with regulatory or functional annotations to classify features relevant to a phenotype of interest. Within our framework, we account for major sources of confounding not offered by current methods. We further assess enrichment of genome-wide association studies for 19 traits within Encyclopedia of DNA Elements- and Roadmap-derived regulatory regions. We characterize unique enrichment patterns for traits and annotations driving novel biological insights. The method is implemented in standalone software and an R package, to facilitate its application by the research community.

Assuntos

Doença/genética , Genoma/genética , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Humanos , Anotação de Sequência Molecular/métodos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Sequências Reguladoras de Ácido Nucleico/genética , Software

18.

Song learning as an indicator mechanism: modelling the developmental stress hypothesis.

Ritchie, Graham R S; Kirby, Simon; Hawkey, David J C.

J Theor Biol ; 251(4): 570-83, 2008 Apr 21.

Artigo em Inglês | MEDLINE | ID: mdl-18294656

RESUMO

The 'developmental stress hypothesis' attempts to provide a functional explanation of the evolutionary maintenance of song learning in songbirds. It argues that song learning can be viewed as an indicator mechanism that allows females to use learned features of song as a window on a male's early development, a potentially stressful period that may have long-term phenotypic effects. In this paper we formally model this hypothesis for the first time, presenting a population genetic model that takes into account both the evolution of genetic learning preferences and cultural transmission of song. The models demonstrate that a preference for song types that reveal developmental stress can evolve in a population, and that cultural transmission of these song types can be stable, lending more support to the hypothesis.

Assuntos

Período Crítico Psicológico , Aprendizagem/fisiologia , Modelos Genéticos , Aves Canoras/crescimento & desenvolvimento , Estresse Fisiológico , Vocalização Animal/fisiologia , Animais , Evolução Biológica , Feminino , Genética Populacional , Masculino , Comportamento Sexual Animal , Aves Canoras/genética

19.

Integrative epigenomics, transcriptomics and proteomics of patient chondrocytes reveal genes and pathways involved in osteoarthritis.

Steinberg, Julia; Ritchie, Graham R S; Roumeliotis, Theodoros I; Jayasuriya, Raveen L; Clark, Matthew J; Brooks, Roger A; Binch, Abbie L A; Shah, Karan M; Coyle, Rachael; Pardo, Mercedes; Le Maitre, Christine L; Ramos, Yolande F M; Nelissen, Rob G H H; Meulenbelt, Ingrid; McCaskie, Andrew W; Choudhary, Jyoti S; Wilkinson, J Mark; Zeggini, Eleftheria.

Sci Rep ; 7(1): 8935, 2017 08 21.

Artigo em Inglês | MEDLINE | ID: mdl-28827734

RESUMO

Osteoarthritis (OA) is a common disease characterized by cartilage degeneration and joint remodeling. The underlying molecular changes underpinning disease progression are incompletely understood. We investigated genes and pathways that mark OA progression in isolated primary chondrocytes taken from paired intact versus degraded articular cartilage samples across 38 patients undergoing joint replacement surgery (discovery cohort: 12 knee OA, replication cohorts: 17 knee OA, 9 hip OA patients). We combined genome-wide DNA methylation, RNA sequencing, and quantitative proteomics data. We identified 49 genes differentially regulated between intact and degraded cartilage in at least two -omics levels, 16 of which have not previously been implicated in OA progression. Integrated pathway analysis implicated the involvement of extracellular matrix degradation, collagen catabolism and angiogenesis in disease progression. Using independent replication datasets, we showed that the direction of change is consistent for over 90% of differentially expressed genes and differentially methylated CpG probes. AQP1, COL1A1 and CLEC3B were significantly differentially regulated across all three -omics levels, confirming their differential expression in human disease. Through integration of genome-wide methylation, gene and protein expression data in human primary chondrocytes, we identified consistent molecular players in OA progression that replicated across independent datasets and that have translational potential.

Assuntos

Aquaporina 1/genética , Condrócitos/metabolismo , Colágeno Tipo I/genética , Metilação de DNA , Lectinas Tipo C/genética , Osteoartrite do Quadril/cirurgia , Osteoartrite do Joelho/cirurgia , Aquaporina 1/metabolismo , Artroplastia de Quadril , Artroplastia do Joelho , Estudos de Casos e Controles , Condrócitos/química , Cromatografia Líquida , Colágeno Tipo I/metabolismo , Cadeia alfa 1 do Colágeno Tipo I , Progressão da Doença , Epigênese Genética , Epigenômica/métodos , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Humanos , Lectinas Tipo C/metabolismo , Masculino , Espectrometria de Massas , Osteoartrite do Quadril/genética , Osteoartrite do Quadril/metabolismo , Osteoartrite do Joelho/genética , Osteoartrite do Joelho/metabolismo , Proteômica/métodos , Análise de Sequência de RNA

20.

Enrichment of low-frequency functional variants revealed by whole-genome sequencing of multiple isolated European populations.

Xue, Yali; Mezzavilla, Massimo; Haber, Marc; McCarthy, Shane; Chen, Yuan; Narasimhan, Vagheesh; Gilly, Arthur; Ayub, Qasim; Colonna, Vincenza; Southam, Lorraine; Finan, Christopher; Massaia, Andrea; Chheda, Himanshu; Palta, Priit; Ritchie, Graham; Asimit, Jennifer; Dedoussis, George; Gasparini, Paolo; Palotie, Aarno; Ripatti, Samuli; Soranzo, Nicole; Toniolo, Daniela; Wilson, James F; Durbin, Richard; Tyler-Smith, Chris; Zeggini, Eleftheria.

Nat Commun ; 8: 15927, 2017 06 23.

Artigo em Inglês | MEDLINE | ID: mdl-28643794

RESUMO

The genetic features of isolated populations can boost power in complex-trait association studies, and an in-depth understanding of how their genetic variation has been shaped by their demographic history can help leverage these advantageous characteristics. Here, we perform a comprehensive investigation using 3,059 newly generated low-depth whole-genome sequences from eight European isolates and two matched general populations, together with published data from the 1000 Genomes Project and UK10K. Sequencing data give deeper and richer insights into population demography and genetic characteristics than genotype-chip data, distinguishing related populations more effectively and allowing their functional variants to be studied more fully. We demonstrate relaxation of purifying selection in the isolates, leading to enrichment of rare and low-frequency functional variants, using novel statistics, DVxy and SVxy. We also develop an isolation-index (Isx) that predicts the overall level of such key genetic characteristics and can thus help guide population choice in future complex-trait association studies.

Assuntos

Genoma Humano , População Branca/genética , Frequência do Gene , Variação Genética , Genética Populacional , Humanos , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA