Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
2.
Genome Res ; 28(5): 759-765, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29650552

RESUMO

Scientific research plays a key role in the advancement of human knowledge and pursuit of solutions to important societal challenges. Typically, research occurs within specific institutions where data are generated and subsequently analyzed. Although collaborative science bringing together multiple institutions is now common, in such collaborations the analytical processing of the data is often performed by individual researchers within the team, with only limited internal oversight and critical analysis of the workflow prior to publication. Here, we show how hackathons can be a means of enhancing collaborative science by enabling peer review before results of analyses are published by cross-validating the design of studies or underlying data sets and by driving reproducibility of scientific analyses. Traditionally, in data analysis processes, data generators and bioinformaticians are divided and do not collaborate on analyzing the data. Hackathons are a good strategy to build bridges over the traditional divide and are potentially a great agile extension to the more structured collaborations between multiple investigators and institutions.


Assuntos
Pesquisa Biomédica/métodos , Sistemas de Informação/estatística & dados numéricos , Comunicação Interdisciplinar , Transferência de Tecnologia , Pesquisa Biomédica/organização & administração , Comportamento Cooperativo , Humanos , Sistemas de Informação/organização & administração , Malária Falciparum/parasitologia , Malária Falciparum/prevenção & controle , Plasmodium falciparum/fisiologia , África do Sul
3.
PLoS Pathog ; 14(11): e1007438, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30475919

RESUMO

Streptococcus pneumoniae serotype 3 remains a significant cause of morbidity and mortality worldwide, despite inclusion in the 13-valent pneumococcal conjugate vaccine (PCV13). Serotype 3 increased in carriage since the implementation of PCV13 in the USA, while invasive disease rates remain unchanged. We investigated the persistence of serotype 3 in carriage and disease, through genomic analyses of a global sample of 301 serotype 3 isolates of the Netherlands3-31 (PMEN31) clone CC180, combined with associated patient data and PCV utilization among countries of isolate collection. We assessed phenotypic variation between dominant clades in capsule charge (zeta potential), capsular polysaccharide shedding, and susceptibility to opsonophagocytic killing, which have previously been associated with carriage duration, invasiveness, and vaccine escape. We identified a recent shift in the CC180 population attributed to a lineage termed Clade II, which was estimated by Bayesian coalescent analysis to have first appeared in 1968 [95% HPD: 1939-1989] and increased in prevalence and effective population size thereafter. Clade II isolates are divergent from the pre-PCV13 serotype 3 population in non-capsular antigenic composition, competence, and antibiotic susceptibility, the last of which resulting from the acquisition of a Tn916-like conjugative transposon. Differences in recombination rates among clades correlated with variations in the ATP-binding subunit of Clp protease, as well as amino acid substitutions in the comCDE operon. Opsonophagocytic killing assays elucidated the low observed efficacy of PCV13 against serotype 3. Variation in PCV13 use among sampled countries was not independently correlated with the CC180 population shift; therefore, genotypic and phenotypic differences in protein antigens and, in particular, antibiotic resistance may have contributed to the increase of Clade II. Our analysis emphasizes the need for routine, representative sampling of isolates from disperse geographic regions, including historically under-sampled areas. We also highlight the value of genomics in resolving antigenic and epidemiological variations within a serotype, which may have implications for future vaccine development.


Assuntos
Infecções Pneumocócicas/imunologia , Streptococcus pneumoniae/genética , Streptococcus pneumoniae/imunologia , Teorema de Bayes , Portador Sadio/epidemiologia , Evolução Molecular , Genética Populacional/métodos , Humanos , Filogenia , Infecções Pneumocócicas/transmissão , Vacinas Pneumocócicas/imunologia , Dinâmica Populacional , Prevalência , Sorogrupo , Sorotipagem/métodos , Streptococcus pneumoniae/patogenicidade , Vacinas Conjugadas , Sequenciamento Completo do Genoma/métodos
4.
PLoS Comput Biol ; 13(10): e1005715, 2017 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-28981516

RESUMO

Africa is not unique in its need for basic bioinformatics training for individuals from a diverse range of academic backgrounds. However, particular logistical challenges in Africa, most notably access to bioinformatics expertise and internet stability, must be addressed in order to meet this need on the continent. H3ABioNet (www.h3abionet.org), the Pan African Bioinformatics Network for H3Africa, has therefore developed an innovative, free-of-charge "Introduction to Bioinformatics" course, taking these challenges into account as part of its educational efforts to provide on-site training and develop local expertise inside its network. A multiple-delivery-mode learning model was selected for this 3-month course in order to increase access to (mostly) African, expert bioinformatics trainers. The content of the course was developed to include a range of fundamental bioinformatics topics at the introductory level. For the first iteration of the course (2016), classrooms with a total of 364 enrolled participants were hosted at 20 institutions across 10 African countries. To ensure that classroom success did not depend on stable internet, trainers pre-recorded their lectures, and classrooms downloaded and watched these locally during biweekly contact sessions. The trainers were available via video conferencing to take questions during contact sessions, as well as via online "question and discussion" forums outside of contact session time. This learning model, developed for a resource-limited setting, could easily be adapted to other settings.


Assuntos
Biologia Computacional/educação , Instrução por Computador/métodos , Internet , África , Biologia Computacional/organização & administração , Bases de Dados Factuais , Humanos , Interface Usuário-Computador
5.
Brief Bioinform ; 16(2): 355-64, 2015 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-24990350

RESUMO

The discipline of bioinformatics has developed rapidly since the complete sequencing of the first genomes in the 1990s. The development of many high-throughput techniques during the last decades has ensured that bioinformatics has grown into a discipline that overlaps with, and is required for, the modern practice of virtually every field in the life sciences. This has placed a scientific premium on the availability of skilled bioinformaticians, a qualification that is extremely scarce on the African continent. The reasons for this are numerous, although the absence of a skilled bioinformatician at academic institutions to initiate a training process and build sustained capacity seems to be a common African shortcoming. This dearth of bioinformatics expertise has had a knock-on effect on the establishment of many modern high-throughput projects at African institutes, including the comprehensive and systematic analysis of genomes from African populations, which are among the most genetically diverse anywhere on the planet. Recent funding initiatives from the National Institutes of Health and the Wellcome Trust are aimed at ameliorating this shortcoming. In this paper, we discuss the problems that have limited the establishment of the bioinformatics field in Africa, as well as propose specific actions that will help with the education and training of bioinformaticians on the continent. This is an absolute requirement in anticipation of a boom in high-throughput approaches to human health issues unique to data from African populations.


Assuntos
Biologia Computacional/educação , África , Biologia Computacional/história , Educação , Genômica , História do Século XX , História do Século XXI , Humanos , Internet/provisão & distribuição , Universidades
8.
Infect Genet Evol ; 50: 110-120, 2017 06.
Artigo em Inglês | MEDLINE | ID: mdl-27818279

RESUMO

Zoonotic cutaneous leishmaniasis caused by Leishmania (L.) major parasites affects urban and suburban areas in the center and south of Tunisia where the disease is endemo-epidemic. Several cases were reported in human patients for which infection due to L. major induced lesions with a broad range of severity. However, very little is known about the mechanisms underlying this diversity. Our hypothesis is that parasite genomic variability could, in addition to the host immunological background, contribute to the intra-species clinical variability observed in patients and explain the lesion size differences observed in the experimental model. Based on several epidemiological, in vivo and in vitro experiments, we focused on two clinical isolates showing contrasted severity in patients and BALB/c experimental mice model. We used DNA-seq as a high-throughput technology to facilitate the identification of genetic variants with discriminating potential between both isolates. Our results demonstrate that various levels of heterogeneity could be found between both L. major isolates in terms of chromosome or gene copy number variation (CNV), and that the intra-species divergence could surprisingly be related to single nucleotide polymorphisms (SNPs) and Insertion/Deletion (InDels) events. Interestingly, we particularly focused here on genes affected by both types of variants and correlated them with the observed gene CNV. Whether these differences are sufficient to explain the severity in patients is obviously still open to debate, but we do believe that additional layers of -omic information is needed to complement the genomic screen in order to draw a more complete map of severity determinants.


Assuntos
Cromossomos/química , Doenças Endêmicas , Dosagem de Genes , Leishmania major/genética , Leishmaniose Cutânea/epidemiologia , Filogenia , Animais , DNA de Protozoário/genética , Feminino , Seguimentos , Genômica , Humanos , Mutação INDEL , Leishmania major/classificação , Leishmania major/isolamento & purificação , Leishmaniose Cutânea/parasitologia , Leishmaniose Cutânea/transmissão , Camundongos , Camundongos Endogâmicos BALB C , Filogeografia , Polimorfismo de Nucleotídeo Único , Índice de Gravidade de Doença , Tunísia/epidemiologia
9.
Glob Heart ; 12(2): 91-98, 2017 06.
Artigo em Inglês | MEDLINE | ID: mdl-28302555

RESUMO

BACKGROUND: Although pockets of bioinformatics excellence have developed in Africa, generally, large-scale genomic data analysis has been limited by the availability of expertise and infrastructure. H3ABioNet, a pan-African bioinformatics network, was established to build capacity specifically to enable H3Africa (Human Heredity and Health in Africa) researchers to analyze their data in Africa. Since the inception of the H3Africa initiative, H3ABioNet's role has evolved in response to changing needs from the consortium and the African bioinformatics community. OBJECTIVES: H3ABioNet set out to develop core bioinformatics infrastructure and capacity for genomics research in various aspects of data collection, transfer, storage, and analysis. METHODS AND RESULTS: Various resources have been developed to address genomic data management and analysis needs of H3Africa researchers and other scientific communities on the continent. NetMap was developed and used to build an accurate picture of network performance within Africa and between Africa and the rest of the world, and Globus Online has been rolled out to facilitate data transfer. A participant recruitment database was developed to monitor participant enrollment, and data is being harmonized through the use of ontologies and controlled vocabularies. The standardized metadata will be integrated to provide a search facility for H3Africa data and biospecimens. Because H3Africa projects are generating large-scale genomic data, facilities for analysis and interpretation are critical. H3ABioNet is implementing several data analysis platforms that provide a large range of bioinformatics tools or workflows, such as Galaxy, the Job Management System, and eBiokits. A set of reproducible, portable, and cloud-scalable pipelines to support the multiple H3Africa data types are also being developed and dockerized to enable execution on multiple computing infrastructures. In addition, new tools have been developed for analysis of the uniquely divergent African data and for downstream interpretation of prioritized variants. To provide support for these and other bioinformatics queries, an online bioinformatics helpdesk backed by broad consortium expertise has been established. Further support is provided by means of various modes of bioinformatics training. CONCLUSIONS: For the past 4 years, the development of infrastructure support and human capacity through H3ABioNet, have significantly contributed to the establishment of African scientific networks, data analysis facilities, and training programs. Here, we describe the infrastructure and how it has affected genomics and bioinformatics research in Africa.


Assuntos
Pesquisa Biomédica/métodos , Biologia Computacional/tendências , Genômica/métodos , África , Humanos
10.
PLoS One ; 9(6): e95275, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24901648

RESUMO

Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence--the general domain tendency to preferentially appear along with some favorite domains in the proteins--to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced.


Assuntos
Biologia Computacional , Cadeias de Markov , Domínios e Motivos de Interação entre Proteínas , Proteínas/química , Biologia Computacional/métodos , Anotação de Sequência Molecular , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
12.
Infect Genet Evol ; 11(4): 698-707, 2011 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-20920608

RESUMO

Eukaryotic pathogens (e.g. Plasmodium, Leishmania, Trypanosomes, etc.) are a major source of morbidity and mortality worldwide. In Africa, one of the most impacted continents, they cause millions of deaths and constitute an immense economic burden. While the genome sequence of several of these organisms is now available, the biological functions of more than half of their proteins are still unknown. This is a serious issue for bringing to the foreground the expected new therapeutic targets. In this context, the identification of protein domains is a key step to improve the functional annotation of the proteins. However, several domains are missed in eukaryotic pathogens because of the high phylogenetic distance of these organisms from the classical eukaryote models. We recently proposed a method, co-occurrence domain detection (CODD), that improves the sensitivity of Pfam domain detection by exploiting the tendency of domains to appear preferentially with a few other favorite domains in a protein. In this paper, we present EuPathDomains (http://www.atgc-montpellier.fr/EuPathDomains/), an extended database of protein domains belonging to ten major eukaryotic human pathogens. EuPathDomains gathers known and new domains detected by CODD, along with the associated confidence measurements and the GO annotations that can be deduced from the new domains. This database significantly extends the Pfam domain coverage of all selected genomes, by proposing new occurrences of domains as well as new domain families that have never been reported before. For example, with a false discovery rate lower than 20%, EuPathDomains increases the number of detected domains by 13% in Toxoplasma gondii genome and up to 28% in Cryptospordium parvum, and the total number of domain families by 10% in Plasmodium falciparum and up to 16% in C. parvum genome. The database can be queried by protein names, domain identifiers, Pfam or Interpro identifiers, or organisms, and should become a valuable resource to decipher the protein functions of eukaryotic pathogens.


Assuntos
Bases de Dados de Proteínas , Eucariotos/genética , Domínios e Motivos de Interação entre Proteínas/genética , Proteínas de Protozoários/genética , Biologia Computacional , Cryptosporidium parvum/genética , Eucariotos/metabolismo , Giardia lamblia/genética , Humanos , Leishmania/genética , Anotação de Sequência Molecular , Plasmodium/genética , Ligação Proteica , Proteínas de Protozoários/química , Proteínas de Protozoários/metabolismo , Toxoplasma/genética , Trypanosoma brucei brucei/genética
13.
Infect Genet Evol ; 9(3): 328-36, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-18992849

RESUMO

The production of increasingly reliable and accessible gene expression data has stimulated the development of computational tools to interpret such data and to organize them efficiently. The clustering techniques are largely recognized as useful exploratory tools for gene expression data analysis. Genes that show similar expression patterns over a wide range of experimental conditions can be clustered together. This relies on the hypothesis that genes that belong to the same cluster are coregulated and involved in related functions. Nevertheless, clustering algorithms still show limits, particularly for the estimation of the number of clusters and the interpretation of hierarchical dendrogram, which may significantly influence the outputs of the analysis process. We propose here a multi level SOM based clustering algorithm named Multi-SOM. Through the use of clustering validity indices, Multi-SOM overcomes the problem of the estimation of clusters number. To test the validity of the proposed clustering algorithm, we first tested it on supervised training data sets. Results were evaluated by computing the number of misclassified samples. We have then used Multi-SOM for the analysis of macrophage gene expression data generated in vitro from the same individual blood infected with 5 different pathogens. This analysis led to the identification of sets of tightly coregulated genes across different pathogens. Gene Ontology tools were then used to estimate the biological significance of the clustering, which showed that the obtained clusters are coherent and biologically significant.


Assuntos
Análise por Conglomerados , Perfilação da Expressão Gênica/métodos , Macrófagos/fisiologia , Redes Neurais de Computação , Algoritmos , Animais , Neoplasias da Mama/diagnóstico , Diabetes Mellitus/diagnóstico , Feminino , Regulação da Expressão Gênica , Humanos , Família Multigênica , Análise de Sequência com Séries de Oligonucleotídeos , Reconhecimento Automatizado de Padrão , Infecções por Protozoários/genética , Tuberculose Pulmonar/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA