Developing Collaborative QSAR Models Without Sharing Structures.

Gedeck, Peter; Skolnik, Suzanne; Rodde, Stephane

Gedeck, Peter; Skolnik, Suzanne; Rodde, Stephane.

Afiliação

Gedeck P; Peter Gedeck LLC , 2309 Grove Avenue, Falls Church, Virginia 22046, United States.
Skolnik S; Novartis Institute for Biomedical Research , 250 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States.
Rodde S; Novartis Institute for Biomedical Research , Postfach, CH-4002 Basel, Switzerland.

J Chem Inf Model ; 57(8): 1847-1858, 2017 08 28.

Article em En | MEDLINE | ID: mdl-28723087

RESUMO

It is widely understood that QSAR models greatly improve if more data are used. However, irrespective of model quality, once chemical structures diverge too far from the initial data set, the predictive performance of a model degrades quickly. To increase the applicability domain we need to increase the diversity of the training set. This can be achieved by combining data from diverse sources. Public data can be easily included; however, proprietary data may be more difficult to add due to intellectual property concerns. In this contribution, we will present a method for the collaborative development of linear regression models that addresses this problem. The method differs from other past approaches, because data are only shared in an aggregated form. This prohibits access to individual data points and therefore avoids the disclosure of confidential structural information. The final models are equivalent to models that were built with combined data sets.

Assuntos

Modelos Teóricos; Relação Quantitativa Estrutura-Atividade

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Relação Quantitativa Estrutura-Atividade / Modelos Teóricos Tipo de estudo: Prognostic_studies Idioma: En Revista: J Chem Inf Model Ano de publicação: 2017 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google