RESUMO
The African continent is regarded as the cradle of modern humans and African genomes contain more genetic variation than those from any other continent, yet only a fraction of the genetic diversity among African individuals has been surveyed1. Here we performed whole-genome sequencing analyses of 426 individuals-comprising 50 ethnolinguistic groups, including previously unsampled populations-to explore the breadth of genomic diversity across Africa. We uncovered more than 3 million previously undescribed variants, most of which were found among individuals from newly sampled ethnolinguistic groups, as well as 62 previously unreported loci that are under strong selection, which were predominantly found in genes that are involved in viral immunity, DNA repair and metabolism. We observed complex patterns of ancestral admixture and putative-damaging and novel variation, both within and between populations, alongside evidence that Zambia was a likely intermediate site along the routes of expansion of Bantu-speaking populations. Pathogenic variants in genes that are currently characterized as medically relevant were uncommon-but in other genes, variants denoted as 'likely pathogenic' in the ClinVar database were commonly observed. Collectively, these findings refine our current understanding of continental migration, identify gene flow and the response to human disease as strong drivers of genome-level population variation, and underscore the scientific imperative for a broader characterization of the genomic diversity of African individuals to understand human ancestry and improve health.
Assuntos
Variação Genética , Genoma Humano/genética , Genômica , Saúde , Migração Humana , África/etnologia , Reparo do DNA/genética , Conjuntos de Dados como Assunto , Feminino , Fluxo Gênico , Genética Médica , Genética Populacional , Saúde/história , História Antiga , Migração Humana/história , Humanos , Imunidade/genética , Idioma , Masculino , Metabolismo/genética , Seleção Genética , Sequenciamento Completo do GenomaRESUMO
BACKGROUND: Effective vector control is key to malaria prevention. However, this is now compromised by increased insecticide resistance due to continued reliance on insecticide-based control interventions. In Kenya, we have observed heterogenous resistance to pyrethroids and organophosphates in Anopheles arabiensis which is one of the most widespread malaria vectors in the country. We investigated the gene expression profiles of insecticide resistant An. arabiensis populations from Migori and Siaya counties in Western Kenya using RNA-Sequencing. Centers for Disease Control and Prevention (CDC) bottle assays were conducted using deltamethrin (DELTA), alphacypermethrin (ACYP) and pirimiphos-methyl (PMM) to determine the resistance status in both sites. RESULTS: Mosquitoes from Migori had average mortalities of 91%, 92% and 58% while those from Siaya had 85%, 86%, and 30% when exposed to DELTA, ACYP and PMM, respectively. RNA-Seq analysis was done on pools of mosquitoes which survived exposure ('resistant'), mosquitoes that were not exposed, and the insecticide-susceptible An. arabiensis Dongola strain. Gene expression profiles of resistant mosquitoes from both Migori and Siaya showed an overexpression mainly of salivary gland proteins belonging to both the short and long form D7 genes, and cuticular proteins (including CPR9, CPR10, CPR15, CPR16). Additionally, the overexpression of detoxification genes including cytochrome P450s (CYP9M1, CYP325H1, CYP4C27, CYP9L1 and CYP307A1), 2 carboxylesterases and a glutathione-S-transferase (GSTE4) were also shared between DELTA, ACYP, and PMM survivors, pointing to potential contribution to cross resistance to both pyrethroid and organophosphate insecticides. CONCLUSION: This study provides novel insights into the molecular basis of insecticide resistance in An. arabiensis in Western Kenya and suggests that salivary gland proteins and cuticular proteins are associated with resistance to multiple classes of insecticides.
Assuntos
Anopheles , Inseticidas , Malária , Compostos Organotiofosforados , Piretrinas , Animais , Inseticidas/farmacologia , Resistência a Inseticidas/genética , Anopheles/genética , Quênia , Mosquitos Vetores , Glutationa Transferase , Perfilação da Expressão Gênica , Proteínas e Peptídeos Salivares/genética , Glândulas SalivaresRESUMO
BACKGROUND: Insecticide resistance (IR) is one of the major threats to malaria vector control programs in endemic countries. However, the mechanisms underlying IR are poorly understood. Thus, investigating gene expression patterns related to IR can offer important insights into the molecular basis of IR in mosquitoes. In this study, RNA-Seq was used to characterize gene expression in Anopheles gambiae surviving exposure to pyrethroids (deltamethrin, alphacypermethrin) and an organophosphate (pirimiphos-methyl). RESULTS: Larvae of An. gambiae s.s. collected from Bassila and Djougou in Benin were reared to adulthood and phenotyped for IR using a modified CDC intensity bottle bioassay. The results showed that mosquitoes from Djougou were more resistant to pyrethroids (5X deltamethrin: 51.7% mortality; 2X alphacypermethrin: 47.4%) than Bassila (1X deltamethrin: 70.7%; 1X alphacypermethrin: 77.7%), while the latter were more resistant to pirimiphos-methyl (1.5X: 48.3% in Bassila and 1X: 21.5% in Djougou). RNA-seq was then conducted on resistant mosquitoes, non-exposed mosquitoes from the same locations and the laboratory-susceptible An. gambiae s.s. Kisumu strain. The results showed overexpression of detoxification genes, including cytochrome P450s (CYP12F2, CYP12F3, CYP4H15, CYP4H17, CYP6Z3, CYP9K1, CYP4G16, and CYP4D17), carboxylesterase genes (COEJHE5E, COE22933) and glutathione S-transferases (GSTE2 and GSTMS3) in all three resistant mosquito groups analyzed. Genes encoding cuticular proteins (CPR130, CPR10, CPR15, CPR16, CPR127, CPAP3-C, CPAP3-B, and CPR76) were also overexpressed in all the resistant groups, indicating their potential role in cross resistance in An. gambiae. Salivary gland protein genes related to 'salivary cysteine-rich peptide' and 'salivary secreted mucin 3' were also over-expressed and shared across all resistant groups. CONCLUSION: Our results suggest that in addition to metabolic enzymes, cuticular and salivary gland proteins could play an important role in cross-resistance to multiple classes of insecticides in Benin. These genes warrant further investigation to validate their functional role in An. gambiae resistance to insecticides.
Assuntos
Anopheles , Inseticidas , Malária , Nitrilas , Piretrinas , Animais , Inseticidas/farmacologia , Anopheles/genética , Benin , Organofosfatos/farmacologia , Mosquitos Vetores , Piretrinas/farmacologia , Resistência a Inseticidas/genética , Perfilação da Expressão GênicaRESUMO
Indoor residual spraying (IRS) and insecticide-treated nets (ITNs) are the main methods used to control mosquito populations for malaria prevention. The efficacy of these strategies is threatened by the spread of insecticide resistance (IR), limiting the success of malaria control. Studies of the genetic evolution leading to insecticide resistance could enable the identification of molecular markers that can be used for IR surveillance and an improved understanding of the molecular mechanisms associated with IR. This study used a weighted gene co-expression network analysis (WGCNA) algorithm, a systems biology approach, to identify genes with similar co-expression patterns (modules) and hub genes that are potential molecular markers for insecticide resistance surveillance in Kenya and Benin. A total of 20 and 26 gene co-expression modules were identified via average linkage hierarchical clustering from Anopheles arabiensis and An. gambiae, respectively, and hub genes (highly connected genes) were identified within each module. Three specific genes stood out: serine protease, E3 ubiquitin-protein ligase, and cuticular proteins, which were top hub genes in both species and could serve as potential markers and targets for monitoring IR in these malaria vectors. In addition to the identified markers, we explored molecular mechanisms using enrichment maps that revealed a complex process involving multiple steps, from odorant binding and neuronal signaling to cellular responses, immune modulation, cellular metabolism, and gene regulation. Incorporation of these dynamics into the development of new insecticides and the tracking of insecticide resistance could improve the sustainable and cost-effective deployment of interventions.
Assuntos
Anopheles , Resistência a Inseticidas , Piretrinas , Biologia de Sistemas , Anopheles/genética , Anopheles/efeitos dos fármacos , Animais , Resistência a Inseticidas/genética , Piretrinas/farmacologia , Inseticidas/farmacologia , Redes Reguladoras de Genes , Organofosfatos/farmacologia , Mosquitos Vetores/genética , Mosquitos Vetores/efeitos dos fármacos , Quênia , Perfilação da Expressão GênicaRESUMO
BACKGROUND: Microbiome research is providing important new insights into the metabolic interactions of complex microbial ecosystems involved in fields as diverse as the pathogenesis of human diseases, agriculture and climate change. Poor correlations typically observed between RNA and protein expression datasets make it hard to accurately infer microbial protein synthesis from metagenomic data. Additionally, mass spectrometry-based metaproteomic analyses typically rely on focused search sequence databases based on prior knowledge for protein identification that may not represent all the proteins present in a set of samples. Metagenomic 16S rRNA sequencing only targets the bacterial component, while whole genome sequencing is at best an indirect measure of expressed proteomes. Here we describe a novel approach, MetaNovo, that combines existing open-source software tools to perform scalable de novo sequence tag matching with a novel algorithm for probabilistic optimization of the entire UniProt knowledgebase to create tailored sequence databases for target-decoy searches directly at the proteome level, enabling metaproteomic analyses without prior expectation of sample composition or metagenomic data generation and compatible with standard downstream analysis pipelines. RESULTS: We compared MetaNovo to published results from the MetaPro-IQ pipeline on 8 human mucosal-luminal interface samples, with comparable numbers of peptide and protein identifications, many shared peptide sequences and a similar bacterial taxonomic distribution compared to that found using a matched metagenome sequence database-but simultaneously identified many more non-bacterial peptides than the previous approaches. MetaNovo was also benchmarked on samples of known microbial composition against matched metagenomic and whole genomic sequence database workflows, yielding many more MS/MS identifications for the expected taxa, with improved taxonomic representation, while also highlighting previously described genome sequencing quality concerns for one of the organisms, and identifying an experimental sample contaminant without prior expectation. CONCLUSIONS: By estimating taxonomic and peptide level information directly on microbiome samples from tandem mass spectrometry data, MetaNovo enables the simultaneous identification of peptides from all domains of life in metaproteome samples, bypassing the need for curated sequence databases to search. We show that the MetaNovo approach to mass spectrometry metaproteomics is more accurate than current gold standard approaches of tailored or matched genomic sequence database searches, can identify sample contaminants without prior expectation and yields insights into previously unidentified metaproteomic signals, building on the potential for complex mass spectrometry metaproteomic data to speak for itself.
Assuntos
Microbiota , Espectrometria de Massas em Tandem , Humanos , RNA Ribossômico 16S/genética , Bases de Dados de Proteínas , Peptídeos/genética , Peptídeos/análise , Microbiota/genética , Bactérias/genética , Proteoma/genéticaRESUMO
Over the past few years, meta-analysis has become popular among biomedical researchers for detecting biomarkers across multiple cohort studies with increased predictive power. Combining datasets from different sources increases sample size, thus overcoming the issue related to limited sample size from each individual study and boosting the predictive power. This leads to an increased likelihood of more accurately predicting differentially expressed genes/proteins or significant biomarkers underlying the biological condition of interest. Currently, several meta-analysis methods and tools exist, each having its own strengths and limitations. In this paper, we survey existing meta-analysis methods, and assess the performance of different methods based on results from different datasets as well as assessment from prior knowledge of each method. This provides a reference summary of meta-analysis models and tools, which helps to guide end-users on the choice of appropriate models or tools for given types of datasets and enables developers to consider current advances when planning the development of new meta-analysis models and more practical integrative tools.
Assuntos
Algoritmos , Análise de Dados , Metanálise como Assunto , Software , Árvores de Decisões , Humanos , Fluxo de TrabalhoRESUMO
Advances in high-throughput sequencing technologies have resulted in an exponential growth of publicly accessible biological datasets. In the 'big data' driven 'post-genomic' context, much work is being done to explore human protein-protein interactions (PPIs) for a systems level based analysis to uncover useful signals and gain more insights to advance current knowledge and answer specific biological and health questions. These PPIs are experimentally or computationally predicted, stored in different online databases and some of PPI resources are updated regularly. As with many biological datasets, such regular updates continuously render older PPI datasets potentially outdated. Moreover, while many of these interactions are shared between these online resources, each resource includes its own identified PPIs and none of these databases exhaustively contains all existing human PPI maps. In this context, it is essential to enable the integration of or combining interaction datasets from different resources, to generate a PPI map with increased coverage and confidence. To allow researchers to produce an integrated human PPI datasets in real-time, we introduce the integrated human protein-protein interaction network generator (IHP-PING) tool. IHP-PING is a flexible python package which generates a human PPI network from freely available online resources. This tool extracts and integrates heterogeneous PPI datasets to generate a unified PPI network, which is stored locally for further applications.
Assuntos
Bases de Dados de Proteínas , Linguagens de Programação , Mapeamento de Interação de Proteínas , Mapas de Interação de Proteínas , HumanosRESUMO
Current variant calling (VC) approaches have been designed to leverage populations of long-range haplotypes and were benchmarked using populations of European descent, whereas most genetic diversity is found in non-European such as Africa populations. Working with these genetically diverse populations, VC tools may produce false positive and false negative results, which may produce misleading conclusions in prioritization of mutations, clinical relevancy and actionability of genes. The most prominent question is which tool or pipeline has a high rate of sensitivity and precision when analysing African data with either low or high sequence coverage, given the high genetic diversity and heterogeneity of this data. Here, a total of 100 synthetic Whole Genome Sequencing (WGS) samples, mimicking the genetics profile of African and European subjects for different specific coverage levels (high/low), have been generated to assess the performance of nine different VC tools on these contrasting datasets. The performances of these tools were assessed in false positive and false negative call rates by comparing the simulated golden variants to the variants identified by each VC tool. Combining our results on sensitivity and positive predictive value (PPV), VarDict [PPV = 0.999 and Matthews correlation coefficient (MCC) = 0.832] and BCFtools (PPV = 0.999 and MCC = 0.813) perform best when using African population data on high and low coverage data. Overall, current VC tools produce high false positive and false negative rates when analysing African compared with European data. This highlights the need for development of VC approaches with high sensitivity and precision tailored for populations characterized by high genetic variations and low linkage disequilibrium.
Assuntos
População Negra/genética , Bases de Dados de Ácidos Nucleicos , Variação Genética , Genoma Humano , População Branca/genética , Sequenciamento Completo do Genoma , Humanos , Desequilíbrio de LigaçãoRESUMO
[This corrects the article DOI: 10.1371/journal.pcbi.1009218.].
RESUMO
With more microbiome studies being conducted by African-based research groups, there is an increasing demand for knowledge and skills in the design and analysis of microbiome studies and data. However, high-quality bioinformatics courses are often impeded by differences in computational environments, complicated software stacks, numerous dependencies, and versions of bioinformatics tools along with a lack of local computational infrastructure and expertise. To address this, H3ABioNet developed a 16S rRNA Microbiome Intermediate Bioinformatics Training course, extending its remote classroom model. The course was developed alongside experienced microbiome researchers, bioinformaticians, and systems administrators, who identified key topics to address. Development of containerised workflows has previously been undertaken by H3ABioNet, and Singularity containers were used here to enable the deployment of a standard replicable software stack across different hosting sites. The pilot ran successfully in 2019 across 23 sites registered in 11 African countries, with more than 200 participants formally enrolled and 106 volunteer staff for onsite support. The pulling, running, and testing of the containers, software, and analyses on various clusters were performed prior to the start of the course by hosting classrooms. The containers allowed the replication of analyses and results across all participating classrooms running a cluster and remained available posttraining ensuring analyses could be repeated on real data. Participants thus received the opportunity to analyse their own data, while local staff were trained and supported by experienced experts, increasing local capacity for ongoing research support. This provides a model for delivering topic-specific bioinformatics courses across Africa and other remote/low-resourced regions which overcomes barriers such as inadequate infrastructures, geographical distance, and access to expertise and educational materials.
Assuntos
Biologia Computacional/educação , Biologia Computacional/métodos , RNA Ribossômico 16S , Software , África , Algoritmos , Currículo , Genoma Humano , Geografia , Humanos , MicrobiotaRESUMO
Scientific research plays a key role in the advancement of human knowledge and pursuit of solutions to important societal challenges. Typically, research occurs within specific institutions where data are generated and subsequently analyzed. Although collaborative science bringing together multiple institutions is now common, in such collaborations the analytical processing of the data is often performed by individual researchers within the team, with only limited internal oversight and critical analysis of the workflow prior to publication. Here, we show how hackathons can be a means of enhancing collaborative science by enabling peer review before results of analyses are published by cross-validating the design of studies or underlying data sets and by driving reproducibility of scientific analyses. Traditionally, in data analysis processes, data generators and bioinformaticians are divided and do not collaborate on analyzing the data. Hackathons are a good strategy to build bridges over the traditional divide and are potentially a great agile extension to the more structured collaborations between multiple investigators and institutions.
Assuntos
Pesquisa Biomédica/métodos , Sistemas de Informação/estatística & dados numéricos , Comunicação Interdisciplinar , Transferência de Tecnologia , Pesquisa Biomédica/organização & administração , Comportamento Cooperativo , Humanos , Sistemas de Informação/organização & administração , Malária Falciparum/parasitologia , Malária Falciparum/prevenção & controle , Plasmodium falciparum/fisiologia , África do SulRESUMO
Over the past decade, studies of admixed populations have increasingly gained interest in both medical and population genetics. These studies have so far shed light on the patterns of genetic variation throughout modern human evolution and have improved our understanding of the demographics and adaptive processes of human populations. To date, there exist about 20 methods or tools to deconvolve local ancestry. These methods have merits and drawbacks in estimating local ancestry in multiway admixed populations. In this article, we survey existing ancestry deconvolution methods, with special emphasis on multiway admixture, and compare these methods based on simulation results reported by different studies, computational approaches used, including mathematical and statistical models, and biological challenges related to each method. This should orient users on the choice of an appropriate method or tool for given population admixture characteristics and update researchers on current advances, challenges and opportunities behind existing ancestry deconvolution methods.
Assuntos
Evolução Molecular , Genoma Humano , Modelos Genéticos , HumanosRESUMO
PURPOSE: Widespread, quality genomics education for health professionals is required to create a competent genomic workforce. A lack of standards for reporting genomics education and evaluation limits the evidence base for replication and comparison. We therefore undertook a consensus process to develop a recommended minimum set of information to support consistent reporting of design, development, delivery, and evaluation of genomics education interventions. METHODS: Draft standards were derived from literature (25 items from 21 publications). Thirty-six international experts were purposively recruited for three rounds of a modified Delphi process to reach consensus on relevance, clarity, comprehensiveness, utility, and design. RESULTS: The final standards include 18 items relating to development and delivery of genomics education interventions, 12 relating to evaluation, and 1 on stakeholder engagement. CONCLUSION: These Reporting Item Standards for Education and its Evaluation in Genomics (RISE2 Genomics) are intended to be widely applicable across settings and health professions. Their use by those involved in reporting genomics education interventions and evaluation, as well as adoption by journals and policy makers as the expected standard, will support greater transparency, consistency, and comprehensiveness of reporting. Consequently, the genomics education evidence base will be more robust, enabling high-quality education and evaluation across diverse settings.
Assuntos
Genômica , Relatório de Pesquisa , Consenso , Técnica Delphi , Humanos , Participação dos InteressadosRESUMO
BACKGROUND: Asthmatic children on corticosteroids can develop hypothalamic-pituitary-adrenal axis suppression (HPAS). Single nucleotide polymorphisms (SNPs) rs242941 and rs1876828 of the corticotrophin-releasing hormone receptor 1 (CRHR1) gene were associated with lower stimulated cortisol (F) levels, whereas rs41423247 of the glucocorticoid receptor (NR3C1) gene was associated with higher basal F levels. The objective of the current study was to confirm whether these three SNPs are associated with HPAS in asthmatic children. METHODS: DNA was extracted from saliva obtained from 95 asthmatic children, who had previously undergone basal F and metyrapone testing. Thirty-six children were classified as suppressed. Non-suppressed children were subclassified according to their post-metyrapone adrenocorticotropin (PMTP ACTH) level into a middle (106-319 pg/mL) and a high (>319 pg/mL) ACTH response group. TaqMan® polymerase chain reaction assays were utilized. RESULTS: Only rs41423247 was inversely associated with HPAS (OR = 0.27 [95% CI 0.06-0.90]). Its GC genotype was inversely associated with HPAS (log odds = -1.28, P = .021). âPMTP ACTH was associated with CC (effect size = 10.85, P = .005) and GC genotypes (effect size = 4.06, P = .023). The C allele is inherited as a dominant trait (effect size = -1.31 (95% CI -2.39--0.33; P = .012). In the high ACTH response group, both genotypes affected the PMTP ACTH (effect sizes 1.41 and 15.46; P-values .023 and <2 × 10-26 for GC and CC, respectively). CONCLUSIONS: The C allele of rs41423247 was found to be protective against HPAS. CC genotype is associated with the highest PMTP ACTH response.
Assuntos
Asma , Sistema Hipotálamo-Hipofisário , Hormônio Adrenocorticotrópico , Asma/genética , Criança , Humanos , Hidrocortisona , Sistema Hipófise-Suprarrenal , Receptores de Glucocorticoides/genéticaRESUMO
Drafting and writing a data management plan (DMP) is increasingly seen as a key part of the academic research process. A DMP is a document that describes how a researcher will collect, document, describe, share, and preserve the data that will be generated as part of a research project. The DMP illustrates the importance of utilizing best practices through all stages of working with data while ensuring accessibility, quality, and longevity of the data. The benefits of writing a DMP include compliance with funder and institutional mandates; making research more transparent (for reproduction and validation purposes); and FAIR (findable, accessible, interoperable, reusable); protecting data subjects and compliance with the General Data Protection Regulation (GDPR) and/or local data protection policies. In this review, we highlight the importance of a DMP in modern biomedical research, explaining both the rationale and current best practices associated with DMPs. In addition, we outline various funders' requirements concerning DMPs and discuss open-source tools that facilitate the development and implementation of a DMP. Finally, we discuss DMPs in the context of African research, and the considerations that need to be made in this regard.
Assuntos
Pesquisa Biomédica , Gerenciamento de Dados , África , Genômica , Humanos , Projetos de PesquisaRESUMO
Populations worldwide currently face several public health challenges, including growing prevalence of infections and the emergence of new pathogenic organisms. The cost and risk associated with drug development make the development of new drugs for several diseases, especially orphan or rare diseases, unappealing to the pharmaceutical industry. Proof of drug safety and efficacy is required before market approval, and rigorous testing makes the drug development process slow, expensive and frequently result in failure. This failure is often because of the use of irrelevant targets identified in the early steps of the drug discovery process, suggesting that target identification and validation are cornerstones for the success of drug discovery and development. Here, we present a large-scale data-driven integrative computational framework to extract essential targets and processes from an existing disease-associated data set and enhance target selection by leveraging drug-target-disease association at the systems level. We applied this framework to tuberculosis and Ebola virus diseases combining heterogeneous data from multiple sources, including protein-protein functional interaction, functional annotation and pharmaceutical data sets. Results obtained demonstrate the effectiveness of the pipeline, leading to the extraction of essential drug targets and to the rational use of existing approved drugs. This provides an opportunity to move toward optimal target-based strategies for screening available drugs and for drug discovery. There is potential for this model to bridge the gap in the production of orphan disease therapies, offering a systematic approach to predict new uses for existing drugs, thereby harnessing their full therapeutic potential.
Assuntos
Conjuntos de Dados como Assunto , Antituberculosos/química , Antituberculosos/farmacologia , Antivirais/química , Antivirais/farmacologia , Desenvolvimento de Medicamentos , Ebolavirus/efeitos dos fármacos , Doença pelo Vírus Ebola/genética , Interações Hospedeiro-Patógeno , Humanos , Anotação de Sequência Molecular , Mycobacterium tuberculosis/efeitos dos fármacos , Reprodutibilidade dos Testes , Tuberculose/genéticaRESUMO
BACKGROUND: As sequencing technology improves, the concept of a single reference genome is becoming increasingly restricting. In the case of Mycobacterium tuberculosis, one must often choose between using a genome that is closely related to the isolate, or one that is annotated in detail. One promising solution to this problem is through the graph based representation of collections of genomes as a single genome graph. Though there are currently a handful of tools that can create genome graphs and have demonstrated the advantages of this new paradigm, there still exists a need for flexible tools that can be used by researchers to overcome challenges in genomics studies. RESULTS: We present GenGraph, a Python toolkit and accompanying modules that use existing multiple sequence alignment tools to create genome graphs. Python is one of the most popular coding languages for the biological sciences, and by providing these tools, GenGraph makes it easier to experiment and develop new tools that utilise genome graphs. The conceptual model used is highly intuitive, and as much as possible the graph structure represents the biological relationship between the genomes. This design means that users will quickly be able to start creating genome graphs and using them in their own projects. We outline the methods used in the generation of the graphs, and give some examples of how the created graphs may be used. GenGraph utilises existing file formats and methods in the generation of these graphs, allowing graphs to be visualised and imported with widely used applications, including Cytoscape, R, and Java Script. CONCLUSIONS: GenGraph provides a set of tools for generating graph based representations of sets of sequences with a simple conceptual model, written in the widely used coding language Python, and publicly available on Github.
Assuntos
Genômica/métodos , Alinhamento de Sequência , Software , GenomaRESUMO
BACKGROUND: Currently, formal mechanisms for bioinformatics support are limited. The H3Africa Bioinformatics Network has implemented a public and freely available Helpdesk (HD), which provides generic bioinformatics support to researchers through an online ticketing platform. The following article reports on the H3ABioNet HD (H3A-HD)'s development, outlining its design, management, usage and evaluation framework, as well as the lessons learned through implementation. RESULTS: The H3A-HD evaluated using automatically generated usage logs, user feedback and qualitative ticket evaluation. Evaluation revealed that communication methods, ticketing strategies and the technical platforms used are some of the primary factors which may influence the effectivity of HD. CONCLUSION: To continuously improve the H3A-HD services, the resource should be regularly monitored and evaluated. The H3A-HD design, implementation and evaluation framework could be easily adapted for use by interested stakeholders within the Bioinformatics community and beyond.