Your browser doesn't support javascript.
loading
Quality Control of Quantitative High Throughput Screening Data.
Shockley, Keith R; Gupta, Shuva; Harris, Shawn F; Lahiri, Soumendra N; Peddada, Shyamal D.
Affiliation
  • Shockley KR; Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, United States.
  • Gupta S; Statistics Department, University of Pennsylvania, Philadelphia, PA, United States.
  • Harris SF; Social and Scientific Systems, Durham, NC, United States.
  • Lahiri SN; Department of Statistics, North Carolina State University, Raleigh, NC, United States.
  • Peddada SD; Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States.
Front Genet ; 10: 387, 2019.
Article in En | MEDLINE | ID: mdl-31143201
Quantitative high throughput screening (qHTS) experiments can generate 1000s of concentration-response profiles to screen compounds for potentially adverse effects. However, potency estimates for a single compound can vary considerably in study designs incorporating multiple concentration-response profiles for each compound. We introduce an automated quality control procedure based on analysis of variance (ANOVA) to identify and filter out compounds with multiple cluster response patterns and improve potency estimation in qHTS assays. Our approach, called Cluster Analysis by Subgroups using ANOVA (CASANOVA), clusters compound-specific response patterns into statistically supported subgroups. Applying CASANOVA to 43 publicly available qHTS data sets, we found that only about 20% of compounds with response values outside of the noise band have single cluster responses. The error rates for incorrectly separating true clusters and incorrectly clumping disparate clusters were both less than 5% in extensive simulation studies. Simulation studies also showed that the bias and variance of concentration at half-maximal response (AC50 ) estimates were usually within 10-fold when using a weighted average approach for potency estimation. In short, CASANOVA effectively sorts out compounds with "inconsistent" response patterns and produces trustworthy AC50 values.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Diagnostic_studies / Screening_studies Language: En Journal: Front Genet Year: 2019 Document type: Article Affiliation country: United States Country of publication: Switzerland

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Diagnostic_studies / Screening_studies Language: En Journal: Front Genet Year: 2019 Document type: Article Affiliation country: United States Country of publication: Switzerland