Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
1.
Plant Cell ; 36(3): 559-584, 2024 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-37971938

RESUMO

Cellular condensates are usually ribonucleoprotein assemblies with liquid- or solid-like properties. Because these subcellular structures lack a delineating membrane, determining their compositions is difficult. Here we describe a proximity-biotinylation approach for capturing the RNAs of the condensates known as processing bodies (PBs) in Arabidopsis (Arabidopsis thaliana). By combining this approach with RNA detection, in silico, and high-resolution imaging approaches, we studied PBs under normal conditions and heat stress. PBs showed a much more dynamic RNA composition than the total transcriptome. RNAs involved in cell wall development and regeneration, plant hormonal signaling, secondary metabolism/defense, and RNA metabolism were enriched in PBs. RNA-binding proteins and the liquidity of PBs modulated RNA recruitment, while RNAs were frequently recruited together with their encoded proteins. In PBs, RNAs follow distinct fates: in small liquid-like PBs, RNAs get degraded while in more solid-like larger ones, they are stored. PB properties can be regulated by the actin-polymerizing SCAR (suppressor of the cyclic AMP)-WAVE (WASP family verprolin homologous) complex. SCAR/WAVE modulates the shuttling of RNAs between PBs and the translational machinery, thereby adjusting ethylene signaling. In summary, we provide an approach to identify RNAs in condensates that allowed us to reveal a mechanism for regulating RNA fate.


Assuntos
Arabidopsis , RNA , Corpos de Processamento , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Resposta ao Choque Térmico , Arabidopsis/genética , Arabidopsis/metabolismo
2.
Front Immunol ; 14: 1251067, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38077337

RESUMO

Introduction: The two-stage molecular profile of the progression of SARS-CoV-2 (SCOV2) infection is explored in terms of five key biological/clinical questions: (a) does SCOV2 exhibits a two-stage infection profile? (b) SARS-CoV-1 (SCOV1) vs. SCOV2: do they differ? (c) does and how SCOV2 differs from Influenza/INFL infection? (d) does low viral-load and (e) does COVID-19 early host response relate to the two-stage SCOV2 infection profile? We provide positive answers to the above questions by analyzing the time-series gene-expression profiles of preserved cell-lines infected with SCOV1/2 or, the gene-expression profiles of infected individuals with different viral-loads levels and different host-response phenotypes. Methods: Our analytical methodology follows an in-silico quest organized around an elaborate multi-step analysis pipeline including: (a) utilization of fifteen gene-expression datasets from NCBI's gene expression omnibus/GEO repository; (b) thorough designation of SCOV1/2 and INFL progression stages and COVID-19 phenotypes; (c) identification of differentially expressed genes (DEGs) and enriched biological processes and pathways that contrast and differentiate between different infection stages and phenotypes; (d) employment of a graph-based clustering process for the induction of coherent groups of networked genes as the representative core molecular fingerprints that characterize the different SCOV2 progression stages and the different COVID-19 phenotypes. In addition, relying on a sensibly selected set of induced fingerprint genes and following a Machine Learning approach, we devised and assessed the performance of different classifier models for the differentiation of acute respiratory illness/ARI caused by SCOV2 or other infections (diagnostic classifiers), as well as for the prediction of COVID-19 disease severity (prognostic classifiers), with quite encouraging results. Results: The central finding of our experiments demonstrates the down-regulation of type-I interferon genes (IFN-1), interferon induced genes (ISGs) and fundamental innate immune and defense biological processes and molecular pathways during the early SCOV2 infection stages, with the inverse to hold during the later ones. It is highlighted that upregulation of these genes and pathways early after infection may prove beneficial in preventing subsequent uncontrolled hyperinflammatory and potentially lethal events. Discussion: The basic aim of our study was to utilize in an intuitive, efficient and productive way the most relevant and state-of-the-art bioinformatics methods to reveal the core molecular mechanisms which govern the progression of SCOV2 infection and the different COVID-19 phenotypes.


Assuntos
COVID-19 , Interferon Tipo I , Humanos , COVID-19/genética , SARS-CoV-2 , Diferenciação Celular , Gravidade do Paciente
3.
Neurobiol Aging ; 123: 111-128, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36117051

RESUMO

Using exome sequencing, we analyzed 196 participants of the Cretan Aging Cohort (CAC; 95 with Alzheimer's disease [AD], 20 with mild cognitive impairment [MCI], and 81 cognitively normal controls). The APOE ε4 allele was more common in AD patients (23.2%) than in controls (7.4%; p < 0.01) and the PSEN2 p.Arg29His and p.Cys391Arg variants were found in 3 AD and 1 MCI patient, respectively. Also, we found the frontotemporal dementia (FTD)-associated TARDBP gene p.Ile383Val variant in 2 elderly patients diagnosed with AD and in 2 patients, non CAC members, with the amyotrophic lateral sclerosis/FTD phenotype. Furthermore, the p.Ser498Ala variant in the positively selected GLUD2 gene was less frequent in AD patients (2.11%) than in controls (16%; p < 0.01), suggesting a possible protective effect. While the same trend was found in another local replication cohort (n = 406) and in section of the ADNI cohort (n = 808), this finding did not reach statistical significance and therefore it should be considered preliminary. Our results attest to the value of genetic testing to study aged adults with AD phenotype.


Assuntos
Doença de Alzheimer , Disfunção Cognitiva , Demência Frontotemporal , Doença de Pick , Humanos , Doença de Alzheimer/genética , Doença de Alzheimer/diagnóstico , Demência Frontotemporal/genética , Demência Frontotemporal/diagnóstico
4.
Cytotherapy ; 24(2): 183-192, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34465516

RESUMO

BACKGROUND AIMS: The high genetic diversity of HLA across populations significantly confines the effectiveness of a donor or umbilical cord blood search for allogeneic hematopoietic stem cell transplantation (HSCT). This study aims to probe the HLA immunogenetic profile of the population of Crete, a Greek region with specific geographic and historical characteristics, and to investigate potential patterns in HLA distribution following comparison with the Deutsche Knochenmarkspenderdatei (DKMS) donor registry. It also aims to highlight the importance of regional public cord blood banks (PCBBs) in fulfilling HSCT needs, especially in countries with significant genetic diversity. METHODS: A cohort of 1835 samples representative of the Cretan population was typed for HLA class I (HLA-A, HLA-B, HLA-C) and class II (HLA-DRB1, HLA-DQB1, HLA-DPB1) loci by high-resolution second field next-generation sequencing. Data were compared with the respective HLA profiles of 12 DKMS populations (n = 20 032). Advanced statistical and bioinformatics methods were employed to assess specific intra- and inter-population genetic indexes associated with the regional and geographic distribution of HLA alleles and haplotypes. RESULTS: A considerable HLA allelic and haplotypic diversity was identified among the Cretan samples and between the latter and the pooled DKMS cohort. Even though the HLA allele and haplotype frequency distribution was similar to regions of close geographic proximity to Crete, a clinal distribution pattern from the northern to southern regions was identified. Significant differences were also observed between Crete and the Greek population of DKMS. CONCLUSIONS: This study provides an in-depth characterization of the HLA immunogenetic profile in Crete and reveals the importance of demographic history in HLA heterogeneity and donor selection. The novel HLA allele and haplotype frequency comparative data between the Cretan and other European populations signify the importance of regional PCBBs in prioritizing HLA diversity to efficiently promote the HSCT program at the national level and beyond.


Assuntos
Medula Óssea , Sangue Fetal , Antígenos HLA , Bancos de Sangue , Frequência do Gene , Variação Genética , Grécia , Antígenos HLA/genética , Haplótipos/genética , Humanos , Imunogenética , Sistema de Registros , Doadores de Tecidos
5.
HLA ; 97(2): 163-166, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33124731

RESUMO

Characterization of the novel HLA-DRB1*04:311 and HLA-DRB1*11:277 alleles in two Greek individuals of Cretan origin.


Assuntos
Cadeias HLA-DRB1 , Alelos , Frequência do Gene , Grécia , Cadeias HLA-DRB1/genética , Haplótipos , Humanos
6.
HLA ; 97(4): 353-356, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33241918

RESUMO

Characterization of two novel HLA-A alleles in two Greek individuals of Cretan origin.


Assuntos
Antígenos HLA-A , Alelos , Grécia , Antígenos HLA-A/genética , Humanos
7.
Hum Mutat ; 41(6): 1112-1122, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32248568

RESUMO

FINDbase (http://www.findbase.org) is a comprehensive data resource recording the prevalence of clinically relevant genomic variants in various populations worldwide, such as pathogenic variants underlying genetic disorders as well as pharmacogenomic biomarkers that can guide drug treatment. Here, we report significant new developments and technological advancements in the database architecture, leading to a completely revamped database structure, querying interface, accompanied with substantial extensions of data content and curation. In particular, the FINDbase upgrade further improves the user experience by introducing responsive features that support a wide variety of mobile and stationary devices, while enhancing computational runtime due to the use of a modern Javascript framework such as ReactJS. Data collection is significantly enriched, with the data records being divided in a Public and Private version, the latter being accessed on the basis of data contribution, according to the microattribution approach, while the front end was redesigned to support the new functionalities and querying tools. The abovementioned updates further enhance the impact of FINDbase, improve the overall user experience, facilitate further data sharing by microattribution, and strengthen the role of FINDbase as a key resource for personalized medicine applications and personalized public health.


Assuntos
Bases de Dados Genéticas , Frequência do Gene , Marcadores Genéticos , Biologia Computacional , Documentação , Genômica , Humanos , Internet , Farmacogenética , Software , Interface Usuário-Computador
8.
Front Genet ; 10: 675, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31447879

RESUMO

Gilthead sea bream (Sparus aurata) is a teleost of considerable economic importance in Southern European aquaculture. The aquaculture industry shows a growing interest in the application of genetic methods that can locate phenotype-genotype associations with high economic impact. Through selective breeding, the aquaculture industry can exploit this information to maximize the financial yield. Here, we present a Genome Wide Association Study (GWAS) of 112 samples belonging to seven different sea bream families collected from a Greek commercial aquaculture company. Through double digest Random Amplified DNA (ddRAD) Sequencing, we generated a per-sample genetic profile consisting of 2,258 high-quality Single Nucleotide Polymorphisms (SNPs). These profiles were tested for association with four phenotypes of major financial importance: Fat, Weight, Tag Weight, and the Length to Width ratio. We applied two methods of association analysis. The first is the typical single-SNP to phenotype test, and the second is a feature selection (FS) method through two novel algorithms that are employed for the first time in aquaculture genomics and produce groups with multiple SNPs associated to a phenotype. In total, we identified 9 single SNPs and 6 groups of SNPs associated with weight-related phenotypes (Weight and Tag Weight), 2 groups associated with Fat, and 16 groups associated with the Length to Width ratio. Six identified loci (Chr4:23265532, Chr6:12617755, Chr:8:11613979, Chr13:1098152, Chr15:3260819, and Chr22:14483563) were present in genes associated with growth in other teleosts or even mammals, such as semaphorin-3A and neurotrophin-3. These loci are strong candidates for future studies that will help us unveil the genetic mechanisms underlying growth and improve the sea bream aquaculture productivity by providing genomic anchors for selection programs.

9.
PLoS One ; 12(8): e0182138, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28771511

RESUMO

Racial and ethnic differences in drug responses are now well studied and documented. Pharmacogenomics research seeks to unravel the genetic underpinnings of inter-individual variability with the aim of tailored-made theranostics and therapeutics. Taking into account the differential expression of pharmacogenes coding for key metabolic enzymes and transporters that affect drug pharmacokinetics and pharmacodynamics, we advise that data interpretation and analysis need to occur in light of geographical ancestry, if implications for drug development and global health are to be considered. Herein, we exploit ePGA, a web-based electronic Pharmacogenomics Assistant and publicly available genetic data from the 1000 Genomes Project to explore genotype to phenotype associations among the 1000 Genomes Project populations.


Assuntos
Genoma Humano , Metagenômica , Grupos Populacionais/genética , Sistema Enzimático do Citocromo P-450/genética , Bases de Dados Factuais , Frequência do Gene , Estudos de Associação Genética , Genótipo , Haplótipos , Humanos , Fenótipo , Interface Usuário-Computador
10.
PLoS Comput Biol ; 12(11): e1005187, 2016 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-27832067

RESUMO

Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the unique characteristic to color regulatory relations between genes and reveal their phenotype inclination. This unique characteristic makes MinePath a valuable tool for in silico molecular biology experimentation as it serves the biomedical researchers' exploratory needs to reveal and interpret the regulatory mechanisms that underlie and putatively govern the expression of target phenotypes.


Assuntos
Mineração de Dados/métodos , Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Modelos Biológicos , Proteoma/metabolismo , Transdução de Sinais/fisiologia , Algoritmos , Simulação por Computador , Proteoma/genética , Software
11.
PLoS One ; 11(9): e0162801, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27631363

RESUMO

One of the challenges that arise from the advent of personal genomics services is to efficiently couple individual data with state of the art Pharmacogenomics (PGx) knowledge. Existing services are limited to either providing static views of PGx variants or applying a simplistic match between individual genotypes and existing PGx variants. Moreover, there is a considerable amount of haplotype variation associated with drug metabolism that is currently insufficiently addressed. Here, we present a web-based electronic Pharmacogenomics Assistant (ePGA; http://www.epga.gr/) that provides personalized genotype-to-phenotype translation, linked to state of the art clinical guidelines. ePGA's translation service matches individual genotype-profiles with PGx gene haplotypes and infers the corresponding diplotype and phenotype profiles, accompanied with summary statistics. Additional features include i) the ability to customize translation based on subsets of variants of clinical interest, and ii) to update the knowledge base with novel PGx findings. We demonstrate ePGA's functionality on genetic variation data from the 1000 Genomes Project.


Assuntos
Sistemas de Informação , Internet , Farmacogenética , Modelos Teóricos
12.
Eur J Hum Genet ; 24(2): 263-70, 2016 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25966632

RESUMO

Segmental duplications (SDs) comprise about 5% of the human genome and are enriched for immune genes. SD loci often show copy numbers variations (CNV), which are difficult to tag with genotyping methods. CNV in the Fcγ receptor region (FCGR) has been suggested to be associated with rheumatic diseases. The objective of this study was to delineate association of FCGR-CNV with rheumatoid arthritis (RA), coeliac disease and Inflammatory bowel disease incidence. We developed a method to accurately quantify CNV in SD loci based on the intensity values from the Immunochip platform and applied it to the FCGR locus. We determined the method's validity using three independent assays: segregation analysis in families, arrayCGH, and whole genome sequencing. Our data showed the presence of two separate CNVs in the FCGR locus. The first region encodes FCGR2A, FCGR3A and part of FCGR2C gene, the second encodes another part of FCGR2C, FCGR3B and FCGR2B. Analysis of CNV status in 4578 individuals with RA and 5457 controls indicated association of duplications in the FCGR3B gene in antibody-negative RA (P=0.002, OR=1.43). Deletion in FCGR3B was associated with increased risk of antibody-positive RA, consistently with previous reports (P=0.023, OR=1.23). A clear genotype-phenotype relationship was observed: CNV polymorphisms of the FCGR3A gene correlated to CD16A expression (encoded by FCGR3A) on CD8 T-cells. In conclusion, our method allows determining the CNV status of the FCGR locus, we identified association of CNV in FCGR3B to RA and showed a functional relationship between CNV in the FCGR3A gene and CD16A expression.


Assuntos
Artrite Reumatoide/genética , Doenças Inflamatórias Intestinais/genética , Receptores de IgG/biossíntese , Receptores de IgG/genética , Artrite Reumatoide/imunologia , Artrite Reumatoide/patologia , Variações do Número de Cópias de DNA/genética , Feminino , Proteínas Ligadas por GPI/genética , Estudos de Associação Genética , Predisposição Genética para Doença , Genótipo , Humanos , Doenças Inflamatórias Intestinais/patologia , Masculino , Fenótipo , Polimorfismo de Nucleotídeo Único
13.
Source Code Biol Med ; 10: 14, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26587054

RESUMO

BACKGROUND: Today researchers can choose from many bioinformatics protocols for all types of life sciences research, computational environments and coding languages. Although the majority of these are open source, few of them possess all virtues to maximize reuse and promote reproducible science. Wikipedia has proven a great tool to disseminate information and enhance collaboration between users with varying expertise and background to author qualitative content via crowdsourcing. However, it remains an open question whether the wiki paradigm can be applied to bioinformatics protocols. RESULTS: We piloted PyPedia, a wiki where each article is both implementation and documentation of a bioinformatics computational protocol in the python language. Hyperlinks within the wiki can be used to compose complex workflows and induce reuse. A RESTful API enables code execution outside the wiki. Initial content of PyPedia contains articles for population statistics, bioinformatics format conversions and genotype imputation. Use of the easy to learn wiki syntax effectively lowers the barriers to bring expert programmers and less computer savvy researchers on the same page. CONCLUSIONS: PyPedia demonstrates how wiki can provide a collaborative development, sharing and even execution environment for biologists and bioinformaticians that complement existing resources, useful for local and multi-center research teams. AVAILABILITY: PyPedia is available online at: http://www.pypedia.com. The source code and installation instructions are available at: https://github.com/kantale/PyPedia_server. The PyPedia python library is available at: https://github.com/kantale/pypedia. PyPedia is open-source, available under the BSD 2-Clause License.

14.
BMC Res Notes ; 8: 359, 2015 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-26286716

RESUMO

BACKGROUND: Genotype imputation is an important procedure in current genomic analysis such as genome-wide association studies, meta-analyses and fine mapping. Although high quality tools are available that perform the steps of this process, considerable effort and expertise is required to set up and run a best practice imputation pipeline, particularly for larger genotype datasets, where imputation has to scale out in parallel on computer clusters. RESULTS: Here we present MOLGENIS-impute, an 'imputation in a box' solution that seamlessly and transparently automates the set up and running of all the steps of the imputation process. These steps include genome build liftover (liftovering), genotype phasing with SHAPEIT2, quality control, sample and chromosomal chunking/merging, and imputation with IMPUTE2. MOLGENIS-impute builds on MOLGENIS-compute, a simple pipeline management platform for submission and monitoring of bioinformatics tasks in High Performance Computing (HPC) environments like local/cloud servers, clusters and grids. All the required tools, data and scripts are downloaded and installed in a single step. Researchers with diverse backgrounds and expertise have tested MOLGENIS-impute on different locations and imputed over 30,000 samples so far using the 1,000 Genomes Project and new Genome of the Netherlands data as the imputation reference. The tests have been performed on PBS/SGE clusters, cloud VMs and in a grid HPC environment. CONCLUSIONS: MOLGENIS-impute gives priority to the ease of setting up, configuring and running an imputation. It has minimal dependencies and wraps the pipeline in a simple command line interface, without sacrificing flexibility to adapt or limiting the options of underlying imputation tools. It does not require knowledge of a workflow system or programming, and is targeted at researchers who just want to apply best practices in imputation via simple commands. It is built on the MOLGENIS compute workflow framework to enable customization with additional computational steps or it can be included in other bioinformatics pipelines. It is available as open source from: https://github.com/molgenis/molgenis-imputation.


Assuntos
Algoritmos , Estudo de Associação Genômica Ampla , Genoma , Genômica/estatística & dados numéricos , Genótipo , Software , Mapeamento Cromossômico , Marcadores Genéticos , Genômica/métodos , Humanos , Polimorfismo de Nucleotídeo Único
15.
Nat Protoc ; 10(9): 1285-96, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26226460

RESUMO

In order to meaningfully analyze common and rare genetic variants, results from genome-wide association studies (GWASs) of multiple cohorts need to be combined in a meta-analysis in order to obtain enough power. This requires all cohorts to have the same single-nucleotide polymorphisms (SNPs) in their GWASs. To this end, genotypes that have not been measured in a given cohort can be imputed on the basis of a set of reference haplotypes. This protocol provides guidelines for performing imputations with two widely used tools: minimac and IMPUTE2. These guidelines were developed and used by the Genome of the Netherlands (GoNL) consortium, which has created a population-specific reference panel for genetic imputations and used this reference to impute various Dutch biobanks. We also describe several factors that might influence the final imputation quality. This protocol, which has been used by the largest Dutch biobanks, should take approximately several days, depending on the sample size of the biobank and the computer resources available.


Assuntos
Estudo de Associação Genômica Ampla , Haplótipos , Metanálise como Assunto , Polimorfismo de Nucleotídeo Único , Software
16.
Nat Commun ; 6: 6065, 2015 Mar 09.
Artigo em Inglês | MEDLINE | ID: mdl-25751400

RESUMO

Variants associated with blood lipid levels may be population-specific. To identify low-frequency variants associated with this phenotype, population-specific reference panels may be used. Here we impute nine large Dutch biobanks (~35,000 samples) with the population-specific reference panel created by the Genome of The Netherlands Project and perform association testing with blood lipid levels. We report the discovery of five novel associations at four loci (P value <6.61 × 10(-4)), including a rare missense variant in ABCA6 (rs77542162, p.Cys1359Arg, frequency 0.034), which is predicted to be deleterious. The frequency of this ABCA6 variant is 3.65-fold increased in the Dutch and its effect (ßLDL-C=0.135, ßTC=0.140) is estimated to be very similar to those observed for single variants in well-known lipid genes, such as LDLR.


Assuntos
Transportadores de Cassetes de Ligação de ATP/genética , Colesterol/sangue , Mutação de Sentido Incorreto/genética , Frequência do Gene , Estudos de Associação Genética , Humanos , Países Baixos
17.
Eur J Hum Genet ; 22(11): 1321-6, 2014 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-24896149

RESUMO

Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with 'true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05-0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r(2), increased from 0.61 to 0.71. We also saw improved imputation accuracy for other European populations (in the British samples, r(2) improved from 0.58 to 0.65, and in the Italians from 0.43 to 0.47). A combined reference set comprising 1000G and GoNL improved the imputation of rare variants even further. The Italian samples benefitted the most from this combined reference (the mean r(2) increased from 0.47 to 0.50). We conclude that the creation of a large population-specific reference is advantageous for imputing rare variants and that a combined reference panel across multiple populations yields the best imputation results.


Assuntos
Frequência do Gene , Genoma Humano , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , População Branca/genética , Estudos de Casos e Controles , Análise por Conglomerados , Dinamarca , Genótipo , Técnicas de Genotipagem , Humanos , Itália , Países Baixos , Fenótipo , Análise de Componente Principal , Reino Unido
18.
Hum Mol Genet ; 23(9): 2481-9, 2014 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-24334606

RESUMO

Using the Immunochip for genotyping, we identified 39 non-human leukocyte antigen (non-HLA) loci associated to celiac disease (CeD), an immune-mediated disease with a worldwide frequency of ∼1%. The most significant non-HLA signal mapped to the intronic region of 70 kb in the LPP gene. Our aim was to fine map and identify possible functional variants in the LPP locus. We performed a meta-analysis in a cohort of 25 169 individuals from six different populations previously genotyped using Immunochip. Imputation using data from the Genome of the Netherlands and 1000 Genomes projects, followed by meta-analysis, confirmed the strong association signal on the LPP locus (rs2030519, P = 1.79 × 10(-49)), without any novel associations. The conditional analysis on this top SNP-indicated association to a single common haplotype. By performing haplotype analyses in each population separately, as well as in a combined group of the four populations that reach the significant threshold after correction (P < 0.008), we narrowed down the CeD-associated region from 70 to 2.8 kb (P = 1.35 × 10(-44)). By intersecting regulatory data from the ENCODE project, we found a functional SNP, rs4686484 (P = 3.12 × 10(-49)), that maps to several B-cell enhancer elements and a highly conserved region. This SNP was also predicted to change the binding motif of the transcription factors IRF4, IRF11, Nkx2.7 and Nkx2.9, suggesting its role in transcriptional regulation. We later found significantly low levels of LPP mRNA in CeD biopsies compared with controls, thus our results suggest that rs4686484 is the functional variant in this locus, while LPP expression is decreased in CeD.


Assuntos
Doença Celíaca/genética , Proteínas do Citoesqueleto/genética , Proteínas com Domínio LIM/genética , Polimorfismo de Nucleotídeo Único/genética , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Haplótipos , Humanos , Fatores Reguladores de Interferon/genética , Desequilíbrio de Ligação , Fatores de Transcrição/genética
19.
Eur J Hum Genet ; 22(2): 221-7, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23714750

RESUMO

Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.


Assuntos
Variação Genética , Genoma Humano , Adulto , Idoso , Idoso de 80 Anos ou mais , Bases de Dados Genéticas , Feminino , Frequência do Gene , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Pessoa de Meia-Idade , Países Baixos , Filogeografia , Análise de Sequência de DNA , Adulto Jovem
20.
Stud Health Technol Inform ; 160(Pt 2): 1304-8, 2010.
Artigo em Inglês | MEDLINE | ID: mdl-20841895

RESUMO

Scientific workflow technologies and tools have become an important weapon in the arsenal of the bioinformaticians and computational biologists. To support this view we present a typical exploratory data analysis scenario involving the combination of information from Gene Regulatory Networks and gene expression data. We further describe the implementation of this scenario using the Workflow Environment implemented in the context of a large EU funded project. In this process desirable features that similar environments should offer are identified and analyzed. The ICT platform presented is evaluated using the chosen scenario as a benchmark. Finally we conclude with an outlook to future work.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Software , Interface Usuário-Computador , Fluxo de Trabalho
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA