Your browser doesn't support javascript.
loading
SureChEMBL: a large-scale, chemically annotated patent document database.
Papadatos, George; Davies, Mark; Dedman, Nathan; Chambers, Jon; Gaulton, Anna; Siddle, James; Koks, Richard; Irvine, Sean A; Pettersson, Joe; Goncharoff, Nicko; Hersey, Anne; Overington, John P.
Afiliação
  • Papadatos G; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK.
  • Davies M; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK.
  • Dedman N; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK.
  • Chambers J; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK.
  • Gaulton A; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK.
  • Siddle J; Digital Science, London N1 9XW, UK.
  • Koks R; Digital Science, London N1 9XW, UK.
  • Irvine SA; NetValue Ltd, Hamilton 3240, New Zealand.
  • Pettersson J; McKinsey & Company, London SW1Y 4UH, UK.
  • Goncharoff N; Digital Science, London N1 9XW, UK n.goncharoff@digital-science.com.
  • Hersey A; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK ahersey@ebi.ac.uk.
  • Overington JP; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK john.overington@stratifiedmedical.com.
Nucleic Acids Res ; 44(D1): D1220-8, 2016 Jan 04.
Article em En | MEDLINE | ID: mdl-26582922
ABSTRACT
SureChEMBL is a publicly available large-scale resource containing compounds extracted from the full text, images and attachments of patent documents. The data are extracted from the patent literature according to an automated text and image-mining pipeline on a daily basis. SureChEMBL provides access to a previously unavailable, open and timely set of annotated compound-patent associations, complemented with sophisticated combined structure and keyword-based search capabilities against the compound repository and patent document corpus; given the wealth of knowledge hidden in patent documents, analysis of SureChEMBL data has immediate applications in drug discovery, medicinal chemistry and other commercial areas of chemical science. Currently, the database contains 17 million compounds extracted from 14 million patent documents. Access is available through a dedicated web-based interface and data downloads at https//www.surechembl.org/.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Patentes como Assunto / Bases de Dados de Compostos Químicos Idioma: En Ano de publicação: 2016 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Patentes como Assunto / Bases de Dados de Compostos Químicos Idioma: En Ano de publicação: 2016 Tipo de documento: Article