Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 54
Filtrar
1.
F1000Res ; 62017.
Artigo em Inglês | MEDLINE | ID: mdl-28781748

RESUMO

ELIXIR-UK is the UK node of ELIXIR, the European infrastructure for life science data. Since its foundation in 2014, ELIXIR-UK has played a leading role in training both within the UK and in the ELIXIR Training Platform, which coordinates and delivers training across all ELIXIR members. ELIXIR-UK contributes to the Training Platform's coordination and supports the development of training to address key skill gaps amongst UK scientists. As part of this work it acts as a conduit for nationally-important bioinformatics training resources to promote their activities to the ELIXIR community. ELIXIR-UK also leads ELIXIR's flagship Training Portal, TeSS, which collects information about a diverse range of training and makes it easily accessible to the community. ELIXIR-UK also works with others to provide key digital skills training, partnering with the Software Sustainability Institute to provide Software Carpentry training to the ELIXIR community and to establish the Data Carpentry initiative, and taking a lead role amongst national stakeholders to deliver the StaTS project - a coordinated effort to drive engagement with training in statistics.

2.
Protein Eng Des Sel ; 26(10): 695-704, 2013 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-23840071

RESUMO

The inability to generate soluble, correctly folded recombinant protein is often a barrier to successful structural and functional studies. Access to affordable synthetic genes has, however, made it possible to design, make and test many more variants of a target protein to identify suitable constructs. We have used rational design and gene synthesis to create a controlled randomised library of the EphB4 receptor tyrosine kinase, with the aim of obtaining soluble, purifiable and active catalytic domain material at multi-milligram levels in Escherichia coli. Three main parameters were tested in designing the library--construct length, functional mutations and stability grafting. These variables were combined to generate a total of 9720 possible variants. The screening of 480 clones generated a 3% hit rate, with a purifiable solubility of up to 15 mg/L for some EphB4 constructs that was largely independent of construct length. Sequencing of the positive clones revealed a pair of hydrophobic core mutations that were key to obtaining soluble material. A minimal kinase domain construct containing these two mutations exhibited a +4.5°C increase in thermal stability over the wild-type protein. These approaches will be broadly applicable for solubility engineering of many different protein target classes. Atomic coordinates and structural factors have been deposited in PDB under the accession 2yn8 (EphB4 HP + staurosporine).


Assuntos
Domínio Catalítico , Biblioteca de Peptídeos , Engenharia de Proteínas/métodos , Receptor EphB4/química , Receptor EphB4/genética , Humanos , Interações Hidrofóbicas e Hidrofílicas , Modelos Moleculares , Mutação , Estabilidade Proteica , Receptor EphB4/metabolismo , Solubilidade , Temperatura
3.
Bioinformatics ; 26(18): i568-74, 2010 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-20823323

RESUMO

MOTIVATION: In recent years, the gulf between the mass of accumulating-research data and the massive literature describing and analyzing those data has widened. The need for intelligent tools to bridge this gap, to rescue the knowledge being systematically isolated in literature and data silos, is now widely acknowledged. RESULTS: To this end, we have developed Utopia Documents, a novel PDF reader that semantically integrates visualization and data-analysis tools with published research articles. In a successful pilot with editors of the Biochemical Journal (BJ), the system has been used to transform static document features into objects that can be linked, annotated, visualized and analyzed interactively (http://www.biochemj.org/bj/424/3/). Utopia Documents is now used routinely by BJ editors to mark up article content prior to publication. Recent additions include integration of various text-mining and biodatabase plugins, demonstrating the system's ability to seamlessly integrate on-line content with PDF articles. AVAILABILITY: http://getutopia.com.


Assuntos
Serviços de Informação , Literatura , Publicações , Pesquisa , Software , Internet , Publicações Periódicas como Assunto , Publicações/classificação , Editoração
4.
Nucleic Acids Res ; 38(Web Server issue): W719-23, 2010 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-20501602

RESUMO

The WHAT IF molecular-modelling and drug design program is widely distributed in the world of protein structure bioinformatics. Although originally designed as an interactive application, its highly modular design and inbuilt control language have recently enabled its deployment as a collection of programmatically accessible web services. We report here a collection of WHAT IF-based protein structure bioinformatics web services: these relate to structure quality, the use of symmetry in crystal structures, structure correction and optimization, adding hydrogens and optimizing hydrogen bonds and a series of geometric calculations. The freely accessible web services are based on the industry standard WS-I profile and the EMBRACE technical guidelines, and are available via both REST and SOAP paradigms. The web services run on a dedicated computational cluster; their function and availability is monitored daily.


Assuntos
Conformação Proteica , Software , Biologia Computacional , Gráficos por Computador , Internet , Modelos Moleculares
5.
Bioinformatics ; 21(22): 4196-7, 2005 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-16159915

RESUMO

SUMMARY: METIS is a web-based integrated annotation tool. From single query sequences, the PRECIS component allows users to generate structured protein family reports from sets of related Swiss-Prot entries. These reports may then be augmented with pertinent sentences extracted from online biomedical literature via support vector machine and rule-based sentence classification systems. AVAILABILITY: http://umber.sbs.man.ac.uk/dbbrowser/metis/


Assuntos
Biologia Computacional/métodos , Algoritmos , Automação , DNA/química , Interpretação Estatística de Dados , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Bases de Dados de Proteínas , Processamento Eletrônico de Dados , Armazenamento e Recuperação da Informação , Internet , Idioma , Processamento de Linguagem Natural , Linguagens de Programação , Proteínas , Alinhamento de Sequência , Análise de Sequência de Proteína , Software , Interface Usuário-Computador
6.
Nucleic Acids Res ; 32(Database issue): D401-5, 2004 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-14681443

RESUMO

CADRE is a public resource for housing and analysing genomic data extracted from species of Aspergillus. It arose to enable maintenance of the complete annotated genomic sequence of Aspergillus fumigatus and to provide tools for searching, analysing and visualizing features of fungal genomes. By implementing CADRE using Ensembl, a framework is in place for storing and comparing several genomes: the resource will thus expand by including other Aspergillus genomes (such as Aspergillus nidulans) as they become available. CADRE is accessible at http://www.cadre. man.ac.uk.


Assuntos
Aspergillus/genética , Bases de Dados Genéticas , Genoma Fúngico , Aspergillus fumigatus/genética , Biologia Computacional , Genes Fúngicos , Genômica , Armazenamento e Recuperação da Informação , Internet , Software
7.
Comp Funct Genomics ; 5(1): 56-60, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-18629035

RESUMO

Bioinformaticians routinely analyse vast amounts of information held both in large remote databases and in flat data files hosted on local machines. The contemporary toolkit available for this purpose consists of an ad hoc collection of data manipulation tools, scripting languages and visualization systems; these must often be combined in complex and bespoke ways, the result frequently being an unwieldy artefact capable of one specific task, which cannot easily be exploited or extended by other practitioners. Owing to the sizes of current databases and the scale of the analyses necessary, routine bioinformatics tasks are often automated, but many still require the unique experience and intuition of human researchers: this requires tools that support real-time interaction with complex datasets. Many existing tools have poor user interfaces and limited real-time performance when applied to realistically large datasets; much of the user's cognitive capacity is therefore focused on controlling the tool rather than on performing the research. The UTOPIA project is addressing some of these issues by building reusable software components that can be combined to make useful applications in the field of bioinformatics. Expertise in the fields of human computer interaction, high-performance rendering, and distributed systems is being guided by bioinformaticians and end-user biologists to create a toolkit that is both architecturally sound from a computing point of view, and directly addresses end-user and application-developer requirements.

8.
Recept Channels ; 9(6): 363-77, 2003.
Artigo em Inglês | MEDLINE | ID: mdl-14698964

RESUMO

Potassium channels govern the permeability of cells to potassium ions, thereby controlling the membrane potential. In metazoa, potassium channels are encoded by a large, diverse gene family. Previous analyses of this gene family have focused on its diversity in mammals. Here we have pursued a more comprehensive study in Caenorhabditis elegans, Drosophila melanogaster, and mammalian genomes. The investigation revealed 164 potassium channel encoding genes in C. elegans, D. melanogaster, and mammals, classified into seven conserved families, which we applied to phylogenetic analysis. The trees are discussed in relation to the assignment of orthologous relationships between genes and vertebrate genome duplication.


Assuntos
Canais de Potássio de Abertura Dependente da Tensão da Membrana , Canais de Potássio/genética , Animais , Caenorhabditis elegans , Cálcio/química , Biologia Computacional , Bases de Dados como Assunto , Drosophila melanogaster , Canal de Potássio ERG1 , Canais de Potássio Éter-A-Go-Go , Evolução Molecular , Humanos , Íons , Filogenia , Potássio/química , Potássio/metabolismo , Canais de Potássio Cálcio-Ativados/genética , Estrutura Terciária de Proteína , Ratos , Software
9.
Bioinformatics ; 19(13): 1664-71, 2003 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-12967963

RESUMO

MOTIVATION: There have been several endeavours to address the problem of annotating sequence data computationally, but the task is non-trivial and few tools have emerged that gather useful information on a given sequence, or set of sequences, in a simple and convenient manner. As more genome projects bear fruit, the mass of uncharacterized sequence data accumulating in public repositories grows ever larger. There is thus a pressing need for tools to support the process of automatic analysis and annotation of newly determined sequences. With this in mind, we have developed PRECIS, which automatically creates protein reports from sets of SWISS-PROT entries, collating results into structured reports, detailing known biological and medical information, literature and database cross-references, and relevant keywords.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Proteínas , Documentação , Armazenamento e Recuperação da Informação/métodos , Proteínas/química , Proteínas/metabolismo , Análise de Sequência de Proteína , Interface Usuário-Computador , Indexação e Redação de Resumos/métodos , Animais , Catalogação/métodos , Humanos , Proteínas/análise , Proteínas/classificação , Software , Design de Software , Vocabulário Controlado
10.
Nucleic Acids Res ; 31(1): 400-2, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12520033

RESUMO

The PRINTS database houses a collection of protein fingerprints. These may be used to assign uncharacterised sequences to known families and hence to infer tentative functions. The September 2002 release (version 36.0) includes 1800 fingerprints, encoding approximately 11 000 motifs, covering a range of globular and membrane proteins, modular polypeptides and so on. In addition to its continued steady growth, we report here the development of an automatic supplement, prePRINTS, designed to increase the coverage of the resource and reduce some of the manual burdens inherent in its maintenance. The databases are accessible for interrogation and searching at http://www.bioinf.man.ac.uk/dbbrowser/PRINTS/.


Assuntos
Motivos de Aminoácidos , Bases de Dados de Proteínas , Proteínas/química , Animais , Automação , Sequência Conservada , Software
11.
Biotechnol Annu Rev ; 8: 1-54, 2002.
Artigo em Inglês | MEDLINE | ID: mdl-12436914

RESUMO

In silico biology has gathered momentum as, worldwide, scientists have united in a common quest to sequence, store and analyse complete genomes. This year, a pivotal achievement of this cooperative endeavour was realised in the release of a public draft of the human genome, and with it the promises to improve our understanding of diverse aspects of biology and to yield a healthier future with safe personalized medicines. Key to these goals will be the need to elucidate and characterise the genes and gene products encoded not just in the human genome, but in many genomes. These tasks are underpinned by the concepts and processes of genome and gene/protein evolution, regulation of gene expression, mechanisms of protein folding, the manifestation of protein function, and so on, all of which must be understood in the context of complex, dynamic biological systems. Our use of computers to model such concepts and systems must be placed in the context of the current limits of our understanding of them:- it is important to recognise, for example, that we don't have a common understanding either of what constitutes a gene or a protein function; we can't invariably say that a particular sequence or fold has arisen via divergent or convergent evolution; and we don't fully understand the rules of protein folding. Accepting what we can't do in silico is essential in appreciating what we can do. Without this understanding, it is easy to be misled, as notions of what particular computational approaches can achieve are sometimes rather optimistic. There are valuable lessons to be learned here from the field of Artificial Intelligence, principal among which is the realisation that capturing and representing complex knowledge is time consuming, expensive and hard. Thus, we argue here that if bioinformatics is to tackle biological complexity in earnest, it would be wise to absorb the experience distilled from decades of artificial intelligence research, and to approach the road ahead with caution, rigour and pragmatism.


Assuntos
Inteligência Artificial , Biologia Computacional/métodos , Biologia Computacional/tendências , Bases de Dados Genéticas , Genômica/métodos , Análise de Sequência/métodos , DNA/química , DNA/genética , Genoma , Projeto Genoma Humano , Humanos , Proteínas/química , Proteínas/genética , Proteômica/métodos , Análise de Sequência/tendências , Integração de Sistemas
12.
Bioinformatics ; 18(10): 1402-3, 2002 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-12376388

RESUMO

UNLABELLED: Analyzing and visualizing multiple sequence alignments is a common task in many areas of molecular biology and bioinformatics. Many tools exist for this purpose, but are not easily customizable for specific in-house uses. Here we report the development of an editor, CINEMA-MX, that addresses these issues. CINEMA-MX is highly modular and configurable, and we present examples to illustrate its extensibility. AVAILABILITY: The program and full source code, which are available from http://www.bioinf.man.ac.uk/dbbrowser/cinema-mx, are being released under a combination of the LGPL and GPL, for Unix or Windows platforms.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Armazenamento e Recuperação da Informação/métodos , Alinhamento de Sequência/métodos , Software , Interface Usuário-Computador , Gráficos por Computador , Bases de Dados de Ácidos Nucleicos , Processamento de Texto
13.
Protein Eng ; 15(1): 7-12, 2002 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-11842232

RESUMO

G protein-coupled receptors (GPCRs) constitute the largest known family of cell-surface receptors. With hundreds of members populating the rhodopsin-like GPCR superfamily and many more awaiting discovery in the human genome, they are of interest to the pharmaceutical industry because of the opportunities they afford for yielding potentially lucrative drug targets. Typical sequence analysis strategies for identifying novel GPCRs tend to involve similarity searches using standard primary database search tools. This will reveal the most similar sequence, generally without offering any insight into its family or superfamily relationships. Conversely, searches of most 'pattern' or family databases are likely to identify the superfamily, but not the closest matching subtype. Here we describe a diagnostic resource that allows identification of GPCRs in a hierarchical fashion, based principally upon their ligand preference. This resource forms part of the PRINTS database, which now houses approximately 250 GPCR-specific fingerprints (http://www.bioinf.man.ac.uk/dbbrowser/gpcrPRINTS/). This collection of fingerprints is able to provide more sensitive diagnostic opportunities than have been realized by related approaches and is currently the only diagnostic tool for assigning GPCR subtypes. Mapping such fingerprints on to three-dimensional GPCR models offers powerful insights into the structural and functional determinants of subtype specificity.


Assuntos
Proteínas Heterotriméricas de Ligação ao GTP/classificação , Análise de Sequência de Proteína/métodos , Algoritmos , Motivos de Aminoácidos , Sequência de Aminoácidos , Animais , Bases de Dados de Proteínas , Proteínas Heterotriméricas de Ligação ao GTP/química , Proteínas Heterotriméricas de Ligação ao GTP/metabolismo , Humanos , Ligantes , Conformação Proteica , Estrutura Terciária de Proteína , Ratos , Receptor Tipo 4 de Melanocortina , Receptores de Superfície Celular/metabolismo , Receptores de Peptídeos/química , Receptores de Peptídeos/metabolismo , Rodopsina/química , Rodopsina/metabolismo , Sensibilidade e Especificidade , Alinhamento de Sequência , Ovinos
14.
Nucleic Acids Res ; 30(1): 239-41, 2002 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-11752304

RESUMO

The PRINTS database houses a collection of protein fingerprints. These may be used to make family and tentative functional assignments for uncharacterised sequences. The September 2001 release (version 32.0) includes 1600 fingerprints, encoding approximately 10 000 motifs, covering a range of globular and membrane proteins, modular polypeptides and so on. In addition to its continued steady growth, we report here its use as a source of annotation in the InterPro resource, and the use of its relational cousin, PRINTS-S, to model relationships between families, including those beyond the reach of conventional sequence analysis approaches. The database is accessible for BLAST, fingerprint and text searches at http://www.bioinf.man.ac.uk/dbbrowser/PRINTS/.


Assuntos
Bases de Dados de Proteínas , Evolução Molecular , Proteínas/genética , Motivos de Aminoácidos , Animais , Armazenamento e Recuperação da Informação , Internet , Proteínas/fisiologia , Alinhamento de Sequência
15.
Comput Chem ; 25(4): 329-39, 2001 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-11459349

RESUMO

'Silicon-based' biology has gathered momentum as the world-wide sequencing projects have made possible the investigation and comparative analysis of complete genomes. Central to the quest to elucidate and characterise the genes and gene products encoded within genomes are pivotal concepts concerning the processes of evolution, the mechanisms of protein folding, and, crucially, the manifestation of protein function. Our use of computers to model such concepts is limited by, and must be placed in the context of, the current limits of our understanding of these biological processes. It is important to recognise that we do not have a common understanding of what constitutes a gene; we cannot invariably say that a particular sequence or fold has arisen via divergence or convergence; we do not fully understand the rules of protein folding, so we cannot predict protein structure; and we cannot invariably diagnose protein function, given knowledge only of its sequence or structure in isolation. Accepting what we cannot do with computers plays an essential role in forming an appreciation of what we can do. Without this understanding, it is easy to be misled, as spurious arguments are often used to promote over-enthusiastic notions of what particular programs can achieve. There are valuable lessons to be learned here from the field of artificial intelligence, principal among which is the realisation that capturing and representing complex knowledge is time consuming, expensive and hard. If bioinformatics is to tackle biological complexity meaningfully, the road ahead must therefore be paved with caution, rigour and pragmatism.


Assuntos
Inteligência Artificial , Biologia Computacional , Sequência de Aminoácidos , Sequência de Bases , Biologia Computacional/métodos , Biologia Computacional/tendências , Bases de Dados Factuais , Genoma Humano , Humanos , Modelos Moleculares , Dados de Sequência Molecular , Dobramento de Proteína , Estrutura Secundária de Proteína , Proteínas/química , Análise de Sequência de Proteína , Homologia de Sequência do Ácido Nucleico
16.
Drug Discov Today ; 6(10): 509-513, 2001 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-11369288
17.
Trends Pharmacol Sci ; 22(4): 162-5, 2001 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-11282406

RESUMO

Analysis of G-protein-coupled receptor (GPCR) subtypes has attracted considerable interest because some drugs that act on GPCRs cause therapeutic problems as a result of their failure to differentiate between subtypes. In this article, an extensive compendium of diagnostic 'fingerprints' for GPCR subtypes and their families will be described. These fingerprints offer new opportunities to investigate correlations between specific sequence motifs and ligand binding or G-protein coupling, and are likely to prove valuable both in seeking novel receptors in genome data and in the characterization of orphan receptors.


Assuntos
Mapeamento de Peptídeos , Receptores de Superfície Celular/química , Biologia Computacional , Bases de Dados Factuais , Indústria Farmacêutica , Humanos , Receptores de Superfície Celular/efeitos dos fármacos , Receptores de Superfície Celular/genética , Alinhamento de Sequência
18.
Bioinformatics ; 17(1): 105-6, 2001 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-11222269

RESUMO

With the ever-increasing need to handle large volumes of sequence data efficiently and reliably, we have developed the EASY system for performing combined protein sequence and pattern database searches. EASY runs searches simultaneously and distils results into a concise 1-line diagnosis. By bringing together results of several different analyses, EASY provides a rapid means of evaluating biological significance, minimising the risk of inferring false relationships, for example from relying exclusively on top BLAST hits. The program has been tested using a variety of protein families and was instrumental in resolving family assignments in a major update of the PRINTS database.


Assuntos
Bases de Dados Factuais , Sistemas Inteligentes , Análise de Sequência de Proteína/métodos , Biologia Computacional , Proteínas/genética , Análise de Sequência de Proteína/estatística & dados numéricos , Software
20.
Nucleic Acids Res ; 29(1): 37-40, 2001 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-11125043

RESUMO

Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, literature references and links back to the relevant member database(s). Release 2.0 of InterPro (October 2000) contains over 3000 entries, representing families, domains, repeats and sites of post-translational modification encoded by a total of 6804 different regular expressions, profiles, fingerprints and Hidden Markov Models. Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (more than 1,000,000 hits from 462,500 proteins in SWISS-PROT and TrEMBL). The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to interhelp@ebi.ac.uk.


Assuntos
Bases de Dados Factuais , Proteínas , Serviços de Informação , Internet , Estrutura Terciária de Proteína , Proteínas/química , Proteínas/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...