Search | Secretaria de Estado da Saúde

Petabyte-scale innovations at the European Nucleotide Archive.

Cochrane, Guy; Akhtar, Ruth; Bonfield, James; Bower, Lawrence; Demiralp, Fehmi; Faruque, Nadeem; Gibson, Richard; Hoad, Gemma; Hubbard, Tim; Hunter, Christopher; Jang, Mikyung; Juhos, Szilveszter; Leinonen, Rasko; Leonard, Steven; Lin, Quan; Lopez, Rodrigo; Lorenc, Dariusz; McWilliam, Hamish; Mukherjee, Gaurab; Plaister, Sheila; Radhakrishnan, Rajesh; Robinson, Stephen; Sobhany, Siamak; Hoopen, Petra Ten; Vaughan, Robert; Zalunin, Vadim; Birney, Ewan.

Nucleic Acids Res ; 37(Database issue): D19-25, 2009 Jan.

Article in English | MEDLINE | ID: mdl-18978013

ABSTRACT

Dramatic increases in the throughput of nucleotide sequencing machines, and the promise of ever greater performance, have thrust bioinformatics into the era of petabyte-scale data sets. Sequence repositories, which provide the feed for these data sets into the worldwide computational infrastructure, are challenged by the impact of these data volumes. The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/embl), comprising the EMBL Nucleotide Sequence Database and the Ensembl Trace Archive, has identified challenges in the storage, movement, analysis, interpretation and visualization of petabyte-scale data sets. We present here our new repository for next-generation sequence data and a brief summary of the contents of the ENA, and provide details of major developments to submission pipelines, high-throughput rule-based validation infrastructure and data integration approaches.

Subjects

Nucleic Acid Databases , Sequence Analysis/trends , Internet , Systems Integration

Priorities for nucleotide trace, sequence and annotation data capture at the Ensembl Trace Archive and the EMBL Nucleotide Sequence Database.

Cochrane, Guy; Akhtar, Ruth; Aldebert, Philippe; Althorpe, Nicola; Baldwin, Alastair; Bates, Kirsty; Bhattacharyya, Sumit; Bonfield, James; Bower, Lawrence; Browne, Paul; Castro, Matias; Cox, Tony; Demiralp, Fehmi; Eberhardt, Ruth; Faruque, Nadeem; Hoad, Gemma; Jang, Mikyung; Kulikova, Tamara; Labarga, Alberto; Leinonen, Rasko; Leonard, Steven; Lin, Quan; Lopez, Rodrigo; Lorenc, Dariusz; McWilliam, Hamish; Mukherjee, Gaurab; Nardone, Francesco; Plaister, Sheila; Robinson, Stephen; Sobhany, Siamak; Vaughan, Robert; Wu, Dan; Zhu, Weimin; Apweiler, Rolf; Hubbard, Tim; Birney, Ewan.

Nucleic Acids Res ; 36(Database issue): D5-12, 2008 Jan.

Article in English | MEDLINE | ID: mdl-18039715

ABSTRACT

The Ensembl Trace Archive (http://trace.ensembl.org/) and the EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/), known together as the European Nucleotide Archive, continue to see growth in data volume and diversity. Selected major developments of 2007 are presented briefly, along with data submission and retrieval information. In the face of increasing requirements for nucleotide trace, sequence and annotation data archiving, data capture priority decisions have been taken at the European Nucleotide Archive. Priorities are discussed in terms of how reliably information can be captured, the long-term benefits of its capture and the ease with which it can be captured.

Subjects

Nucleic Acid Databases , DNA Sequence Analysis , Animals , Archives , Genomics , Internet

EMBL Nucleotide Sequence Database in 2006.

Kulikova, Tamara; Akhtar, Ruth; Aldebert, Philippe; Althorpe, Nicola; Andersson, Mikael; Baldwin, Alastair; Bates, Kirsty; Bhattacharyya, Sumit; Bower, Lawrence; Browne, Paul; Castro, Matias; Cochrane, Guy; Duggan, Karyn; Eberhardt, Ruth; Faruque, Nadeem; Hoad, Gemma; Kanz, Carola; Lee, Charles; Leinonen, Rasko; Lin, Quan; Lombard, Vincent; Lopez, Rodrigo; Lorenc, Dariusz; McWilliam, Hamish; Mukherjee, Gaurab; Nardone, Francesco; Pastor, Maria Pilar Garcia; Plaister, Sheila; Sobhany, Siamak; Stoehr, Peter; Vaughan, Robert; Wu, Dan; Zhu, Weimin; Apweiler, Rolf.

Nucleic Acids Res ; 35(Database issue): D16-20, 2007 Jan.

Article in English | MEDLINE | ID: mdl-17148479

ABSTRACT

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl) at the EMBL European Bioinformatics Institute, UK, offers a large and freely accessible collection of nucleotide sequences and accompanying annotation. The database is maintained in collaboration with DDBJ and GenBank. Data are exchanged between the collaborating databases on a daily basis to achieve optimal synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation, alignments and bulk data. Automated procedures are provided for submissions from large-scale sequencing projects and data from the European Patent Office. In 2006, the volume of data continued to grow exponentially. Access to the data is provided via SRS, FTP and a variety of other methods. Extensive external and internal cross-references enable users to search for related information across other databases and within the database. All available resources can be accessed via the EBI home page at http://www.ebi.ac.uk/. Developments over the past year include changes to the file format, further development of the EMBLCDS dataset and updates to the XML format.

Subjects

Nucleic Acid Databases , Base Sequence , Nucleic Acid Databases/trends , Internet , User-Computer Interface
