Search | VHL Regional Portal

1.

Author Correction: Comparative cellular analysis of motor cortex in human, marmoset and mouse.

Bakken, Trygve E; Jorstad, Nikolas L; Hu, Qiwen; Lake, Blue B; Tian, Wei; Kalmbach, Brian E; Crow, Megan; Hodge, Rebecca D; Krienen, Fenna M; Sorensen, Staci A; Eggermont, Jeroen; Yao, Zizhen; Aevermann, Brian D; Aldridge, Andrew I; Bartlett, Anna; Bertagnolli, Darren; Casper, Tamara; Castanon, Rosa G; Crichton, Kirsten; Daigle, Tanya L; Dalley, Rachel; Dee, Nick; Dembrow, Nikolai; Diep, Dinh; Ding, Song-Lin; Dong, Weixiu; Fang, Rongxin; Fischer, Stephan; Goldman, Melissa; Goldy, Jeff; Graybuck, Lucas T; Herb, Brian R; Hou, Xiaomeng; Kancherla, Jayaram; Kroll, Matthew; Lathia, Kanan; van Lew, Baldur; Li, Yang Eric; Liu, Christine S; Liu, Hanqing; Lucero, Jacinta D; Mahurkar, Anup; McMillen, Delissa; Miller, Jeremy A; Moussa, Marmar; Nery, Joseph R; Nicovich, Philip R; Niu, Sheng-Yong; Orvis, Joshua; Osteen, Julia K.

Nature ; 604(7904): E8, 2022 Apr.

Article in English | MEDLINE | ID: mdl-35319013

2.

Comparative cellular analysis of motor cortex in human, marmoset and mouse.

Bakken, Trygve E; Jorstad, Nikolas L; Hu, Qiwen; Lake, Blue B; Tian, Wei; Kalmbach, Brian E; Crow, Megan; Hodge, Rebecca D; Krienen, Fenna M; Sorensen, Staci A; Eggermont, Jeroen; Yao, Zizhen; Aevermann, Brian D; Aldridge, Andrew I; Bartlett, Anna; Bertagnolli, Darren; Casper, Tamara; Castanon, Rosa G; Crichton, Kirsten; Daigle, Tanya L; Dalley, Rachel; Dee, Nick; Dembrow, Nikolai; Diep, Dinh; Ding, Song-Lin; Dong, Weixiu; Fang, Rongxin; Fischer, Stephan; Goldman, Melissa; Goldy, Jeff; Graybuck, Lucas T; Herb, Brian R; Hou, Xiaomeng; Kancherla, Jayaram; Kroll, Matthew; Lathia, Kanan; van Lew, Baldur; Li, Yang Eric; Liu, Christine S; Liu, Hanqing; Lucero, Jacinta D; Mahurkar, Anup; McMillen, Delissa; Miller, Jeremy A; Moussa, Marmar; Nery, Joseph R; Nicovich, Philip R; Niu, Sheng-Yong; Orvis, Joshua; Osteen, Julia K.

Nature ; 598(7879): 111-119, 2021 10.

Article in English | MEDLINE | ID: mdl-34616062

ABSTRACT

The primary motor cortex (M1) is essential for voluntary fine-motor control and is functionally conserved across mammals [1]. Here, using high-throughput transcriptomic and epigenomic profiling of more than 450,000 single nuclei in humans, marmoset monkeys and mice, we demonstrate a broadly conserved cellular makeup of this region, with similarities that mirror evolutionary distance and are consistent between the transcriptome and epigenome. The core conserved molecular identities of neuronal and non-neuronal cell types allow us to generate a cross-species consensus classification of cell types, and to infer conserved properties of cell types across species. Despite the overall conservation, however, many species-dependent specializations are apparent, including differences in cell-type proportions, gene expression, DNA methylation and chromatin state. Few cell-type marker genes are conserved across species, revealing a short list of candidate genes and regulatory mechanisms that are responsible for conserved features of homologous cell types, such as the GABAergic chandelier cells. This consensus transcriptomic classification allows us to use patch-seq (a combination of whole-cell patch-clamp recordings, RNA sequencing and morphological characterization) to identify corticospinal Betz cells from layer 5 in non-human primates and humans, and to characterize their highly specialized physiology and anatomy. These findings highlight the robust molecular underpinnings of cell-type diversity in M1 across mammals, and point to the genes and regulatory pathways responsible for the functional identity of cell types and their species-specific adaptations.

Subject(s)

Motor Cortex/cytology , Neurons/classification , Single-Cell Analysis , Animals , Atlases as Topic , Callithrix/genetics , Epigenesis, Genetic , Epigenomics , Female , GABAergic Neurons/cytology , GABAergic Neurons/metabolism , Gene Expression Profiling , Glutamates/metabolism , Humans , In Situ Hybridization, Fluorescence , Male , Mice , Middle Aged , Motor Cortex/anatomy & histology , Neurons/cytology , Neurons/metabolism , Organ Specificity , Phylogeny , Species Specificity , Transcriptome

3.

A transcriptomic and epigenomic cell atlas of the mouse primary motor cortex.

Yao, Zizhen; Liu, Hanqing; Xie, Fangming; Fischer, Stephan; Adkins, Ricky S; Aldridge, Andrew I; Ament, Seth A; Bartlett, Anna; Behrens, M Margarita; Van den Berge, Koen; Bertagnolli, Darren; de Bézieux, Hector Roux; Biancalani, Tommaso; Booeshaghi, A Sina; Bravo, Héctor Corrada; Casper, Tamara; Colantuoni, Carlo; Crabtree, Jonathan; Creasy, Heather; Crichton, Kirsten; Crow, Megan; Dee, Nick; Dougherty, Elizabeth L; Doyle, Wayne I; Dudoit, Sandrine; Fang, Rongxin; Felix, Victor; Fong, Olivia; Giglio, Michelle; Goldy, Jeff; Hawrylycz, Mike; Herb, Brian R; Hertzano, Ronna; Hou, Xiaomeng; Hu, Qiwen; Kancherla, Jayaram; Kroll, Matthew; Lathia, Kanan; Li, Yang Eric; Lucero, Jacinta D; Luo, Chongyuan; Mahurkar, Anup; McMillen, Delissa; Nadaf, Naeem M; Nery, Joseph R; Nguyen, Thuc Nghi; Niu, Sheng-Yong; Ntranos, Vasilis; Orvis, Joshua; Osteen, Julia K.

Nature ; 598(7879): 103-110, 2021 10.

Article in English | MEDLINE | ID: mdl-34616066

ABSTRACT

Single-cell transcriptomics can provide quantitative molecular signatures for large, unbiased samples of the diverse cell types in the brain [1-3]. With the proliferation of multi-omics datasets, a major challenge is to validate and integrate results into a biological understanding of cell-type organization. Here we generated transcriptomes and epigenomes from more than 500,000 individual cells in the mouse primary motor cortex, a structure that has an evolutionarily conserved role in locomotion. We developed computational and statistical methods to integrate multimodal data and quantitatively validate cell-type reproducibility. The resulting reference atlas, containing over 56 neuronal cell types that are highly replicable across analysis methods, sequencing technologies and modalities, is a comprehensive molecular and genomic account of the diverse neuronal and non-neuronal cell types in the mouse primary motor cortex. The atlas includes a population of excitatory neurons that resemble pyramidal cells in layer 4 in other cortical regions [4]. We further discovered thousands of concordant marker genes and gene regulatory elements for these cell types. Our results highlight the complex molecular regulation of cell types in the brain and will directly enable the design of reagents to target specific cell types in the mouse primary motor cortex for functional analysis.

Subject(s)

Epigenomics , Gene Expression Profiling , Motor Cortex/cytology , Neurons/classification , Single-Cell Analysis , Transcriptome , Animals , Atlases as Topic , Datasets as Topic , Epigenesis, Genetic , Female , Male , Mice , Motor Cortex/anatomy & histology , Neurons/cytology , Neurons/metabolism , Organ Specificity , Reproducibility of Results

4.

gEAR: Gene Expression Analysis Resource portal for community-driven, multi-omic data exploration.

Orvis, Joshua; Gottfried, Brian; Kancherla, Jayaram; Adkins, Ricky S; Song, Yang; Dror, Amiel A; Olley, Dustin; Rose, Kevin; Chrysostomou, Elena; Kelly, Michael C; Milon, Beatrice; Matern, Maggie S; Azaiez, Hela; Herb, Brian; Colantuoni, Carlo; Carter, Robert L; Ament, Seth A; Kelley, Matthew W; White, Owen; Bravo, Hector Corrada; Mahurkar, Anup; Hertzano, Ronna.

Nat Methods ; 18(8): 843-844, 2021 08.

Article in English | MEDLINE | ID: mdl-34172972

Subject(s)

Algorithms , Brain/metabolism , Computational Biology/methods , Gene Expression Regulation , Genomics/methods , Software , Transcriptome , Computer Graphics , Humans

5.

Epiviz File Server: Query, transform and interactively explore data from indexed genomic files.

Kancherla, Jayaram; Yang, Yifan; Chae, Hyeyun; Corrada Bravo, Hector.

Bioinformatics ; 36(18): 4682-4690, 2020 09 15.

Article in English | MEDLINE | ID: mdl-32618995

ABSTRACT

MOTIVATION: Genomic data repositories like The Cancer Genome Atlas, Encyclopedia of DNA Elements, Bioconductor's AnnotationHub and ExperimentHub etc., provide public access to large amounts of genomic data as flat files. Researchers often download a subset of data files from these repositories to perform exploratory data analysis. We developed Epiviz File Server, a Python library that implements an in situ data query system for local or remotely hosted indexed genomic files, not only for visualization but also data transformation. The File Server library decouples data retrieval and transformation from specific visualization and analysis tools and provides an abstract interface to define computations independent of the location, format or structure of the file. We demonstrate the File Server in two use cases: (i) integration with Galaxy workflows and (ii) using Epiviz to create a custom genome browser from the Epigenome Roadmap dataset. AVAILABILITY AND IMPLEMENTATION: Epiviz File Server is open source and is available on GitHub at http://github.com/epiviz/epivizFileServer. The documentation for the File Server library is available at http://epivizfileserver.rtfd.io.

Subject(s)

Genome , Genomics , Computers , Information Storage and Retrieval , Software

6.

Evidence-Based Network Approach to Recommending Targeted Cancer Therapies.

Kancherla, Jayaram; Rao, Shruti; Bhuvaneshwar, Krithika; Riggins, Rebecca B; Beckman, Robert A; Madhavan, Subha; Corrada Bravo, Héctor; Boca, Simina M.

JCO Clin Cancer Inform ; 4: 71-88, 2020 01.

Article in English | MEDLINE | ID: mdl-31990579

ABSTRACT

PURPOSE: In this work, we introduce CDGnet (Cancer-Drug-Gene Network), an evidence-based network approach for recommending targeted cancer therapies. CDGnet represents a user-friendly informatics tool that expands the range of targeted therapy options for patients with cancer who undergo molecular profiling by including the biologic context via pathway information. METHODS: CDGnet considers biologic pathway information specifically by looking at targets or biomarkers downstream of oncogenes and is personalized for individual patients via user-inputted molecular alterations and cancer type. It integrates a number of different sources of knowledge: patient-specific inputs (molecular alterations and cancer type), US Food and Drug Administration-approved therapies and biomarkers (curated from DailyMed), pathways for specific cancer types (from Kyoto Encyclopedia of Genes and Genomes [KEGG]), gene-drug connections (from DrugBank), and oncogene information (from KEGG). We consider 4 different evidence-based categories for therapy recommendations. Our tool is delivered via an R/Shiny Web application. For the 2 categories that use pathway information, we include an interactive Sankey visualization built on top of d3.js that also provides links to PubChem. RESULTS: We present a scenario for a patient who has estrogen receptor (ER)-positive breast cancer with FGFR1 amplification. Although many therapies exist for patients with ER-positive breast cancer, FGFR1 amplifications may confer resistance to such treatments. CDGnet provides therapy recommendations, including PIK3CA, MAPK, and RAF inhibitors, by considering targets or biomarkers downstream of FGFR1. CONCLUSION: CDGnet provides results in a number of easily accessible and usable forms, separating targeted cancer therapies into categories in an evidence-based manner that incorporates biologic pathway information.

Subject(s)

Antineoplastic Agents/therapeutic use , Biomarkers, Tumor/genetics , Evidence-Based Medicine , Gene Regulatory Networks , Molecular Targeted Therapy , Neoplasms/drug therapy , Precision Medicine , Biomarkers, Tumor/antagonists & inhibitors , Humans , Neoplasms/genetics , Neoplasms/pathology , Patient Selection

7.

Interactive exploratory data analysis of Integrative Human Microbiome Project data using Metaviz.

Wagner, Justin; Kancherla, Jayaram; Braccia, Domenick; Matsumara, James; Felix, Victor; Crabtree, Jonathan; Mahurkar, Anup; Corrada Bravo, Héctor.

F1000Res ; 9: 601, 2020.

Article in English | MEDLINE | ID: mdl-32742640

ABSTRACT

The rich data produced by the second phase of the Human Microbiome Project (iHMP) offers a unique opportunity to test hypotheses that interactions between microbial communities and a human host might impact an individual's health or disease status. In this work we describe infrastructure that integrates Metaviz, an interactive microbiome data analysis and visualization tool, with the iHMP Data Coordination Center web portal and the HMP2Data R/Bioconductor package. We describe integrative statistical and visual analyses of two datasets from iHMP using Metaviz along with the metagenomeSeq R/Bioconductor package for statistical analysis of differential abundance. These use cases demonstrate the utility of a combined approach to access and analyze data from this resource.

Subject(s)

Data Analysis , Microbiota , Data Interpretation, Statistical , Humans , Research Design

8.

Proactive visual and statistical analysis of genomic data in Epiviz.

Cui, Zhe; Kancherla, Jayaram; Chang, Kyle W; Elmqvist, Niklas; Corrada Bravo, Héctor.

Bioinformatics ; 36(7): 2195-2201, 2020 04 01.

Article in English | MEDLINE | ID: mdl-31782758

ABSTRACT

MOTIVATION: Integrative analysis of genomic data that includes statistical methods in combination with visual exploration has gained widespread adoption. Many existing methods involve a combination of tools and resources: user interfaces that provide visualization of large genomic datasets, and computational environments that focus on data analyses over various subsets of a given dataset. Over the last few years, we have developed Epiviz as an integrative and interactive genomic data analysis tool that incorporates visualization tightly with a state-of-the-art statistical analysis framework. RESULTS: In this article, we present Epiviz Feed, a proactive and automatic visual analytics system integrated with Epiviz that alleviates the burden of manually executing data analysis required to test biologically meaningful hypotheses. Results of interest that are proactively identified by server-side computations are listed as notifications in a feed. The feed turns genomic data analysis into a collaborative work between the analyst and the computational environment, which shortens the analysis time and allows the analyst to explore results efficiently. We discuss three ways in which the proposed system advances the field of genomic data analysis: (i) takes the first step of proactive data analysis by utilizing available CPU power from the server to automate the analysis process; (ii) summarizes hypothesis test results in a way that analysts can easily understand and investigate; (iii) enables filtering and grouping of analysis results for quick search. This effort provides initial work on systems that substantially expand how computational and visualization frameworks can be tightly integrated to facilitate interactive genomic data analysis. AVAILABILITY AND IMPLEMENTATION: The source code for Epiviz Feed application is available at http://github.com/epiviz/epiviz_feed_polymer. The Epiviz Computational Server is available at http://github.com/epiviz/epiviz-feed-computation. Please refer to the Epiviz documentation site for details: http://epiviz.github.io/.

Subject(s)

Genomics , Software , Genome , Research Design

9.

metagenomeFeatures: an R package for working with 16S rRNA reference databases and marker-gene survey feature data.

Olson, Nathan D; Shah, Nidhi; Kancherla, Jayaram; Wagner, Justin; Paulson, Joseph N; Corrada Bravo, Hector.

Bioinformatics ; 35(19): 3870-3872, 2019 10 01.

Article in English | MEDLINE | ID: mdl-30821316

ABSTRACT

SUMMARY: We developed the metagenomeFeatures R Bioconductor package along with annotation packages for three 16S rRNA databases (Greengenes, RDP and SILVA) to facilitate working with 16S rRNA databases and marker-gene survey feature data. The metagenomeFeatures package defines two classes, MgDb for working with 16S rRNA sequence databases, and mgFeatures for marker-gene survey feature data. The associated annotation packages provide a consistent interface to the different databases facilitating database comparison and exploration. The mgFeatures-class represents a crucial step in the development of a common data structure for working with 16S marker-gene survey data in R. AVAILABILITY AND IMPLEMENTATION: https://bioconductor.org/packages/release/bioc/html/metagenomeFeatures.html. SUPPLEMENTARY INFORMATION: Supplementary material is available at Bioinformatics online.

Subject(s)

Databases, Nucleic Acid , Software , RNA, Ribosomal, 16S , Surveys and Questionnaires

10.

Epiviz Web Components: reusable and extensible component library to visualize functional genomic datasets.

Kancherla, Jayaram; Zhang, Alexander; Gottfried, Brian; Bravo, Hector Corrada.

F1000Res ; 7: 1096, 2018.

Article in English | MEDLINE | ID: mdl-30135734

ABSTRACT

Interactive and integrative data visualization tools and libraries are integral to exploration and analysis of genomic data. Web-based genome browsers allow integrative data exploration of a large number of data sets for a specific region in the genome. Currently available web-based genome browsers are developed for specific use cases and datasets; therefore, integration and extensibility of the visualizations and the underlying libraries from these tools is a challenging task. Genomic data visualization and software libraries that enable bioinformatic researchers and developers to implement customized genomic data viewers and data analyses for their application are much needed. Using recent advances in core web platform APIs and technologies including Web Components, we developed the Epiviz Component Library, a reusable and extensible data visualization library and application framework for genomic data. Epiviz Components can be integrated with most JavaScript libraries and frameworks designed for HTML. To demonstrate the ease of integration with other frameworks, we developed an R/Bioconductor package, epivizrChart, that provides interactive, shareable and reproducible visualizations of genomic data objects in R and Shiny and can also create standalone HTML documents. The component library is modular by design, reusable and natively extensible and therefore simplifies the process of managing and developing bioinformatic applications.

Subject(s)

Computer Graphics , Databases, Nucleic Acid , Genomics , Software , Web Browser

11.

Metaviz: interactive statistical and visual analysis of metagenomic data.

Wagner, Justin; Chelaru, Florin; Kancherla, Jayaram; Paulson, Joseph N; Zhang, Alexander; Felix, Victor; Mahurkar, Anup; Elmqvist, Niklas; Corrada Bravo, Héctor.

Nucleic Acids Res ; 46(6): 2777-2787, 2018 04 06.

Article in English | MEDLINE | ID: mdl-29529268

ABSTRACT

Large studies profiling microbial communities and their association with healthy or disease phenotypes are now commonplace. Processed data from many of these studies are publicly available but significant effort is required for users to effectively organize, explore and integrate it, limiting the utility of these rich data resources. Effective integrative and interactive visual and statistical tools to analyze many metagenomic samples can greatly increase the value of these data for researchers. We present Metaviz, a tool for interactive exploratory data analysis of annotated microbiome taxonomic community profiles derived from marker gene or whole metagenome shotgun sequencing. Metaviz is uniquely designed to address the challenge of browsing the hierarchical structure of metagenomic data features while rendering visualizations of data values that are dynamically updated in response to user navigation. We use Metaviz to provide the UMD Metagenome Browser web service, allowing users to browse and explore data for more than 7000 microbiomes from published studies. Users can also deploy Metaviz as a web service, or use it to analyze data through the metavizr package to interoperate with state-of-the-art analysis tools available through Bioconductor. Metaviz is free and open source with the code, documentation and tutorials publicly accessible.

Subject(s)

Computational Biology/methods , Metagenome/genetics , Metagenomics/methods , Whole Genome Sequencing/methods , Bacteria/classification , Bacteria/genetics , Child , Computational Biology/statistics & numerical data , Diarrhea/diagnosis , Diarrhea/genetics , Humans , Internet , Metagenomics/statistics & numerical data , Reproducibility of Results , Web Browser , Whole Genome Sequencing/statistics & numerical data

12.

A framework and case studies for evaluation of enzyme ontogeny in children's health risk evaluation.

Ginsberg, Gary; Vulimiri, Suryanarayana V; Lin, Yu-Sheng; Kancherla, Jayaram; Foos, Brenda; Sonawane, Babasaheb.

J Toxicol Environ Health A ; 80(10-12): 569-593, 2017.

Article in English | MEDLINE | ID: mdl-28891786

ABSTRACT

Knowledge of the ontogeny of Phase I and Phase II metabolizing enzymes may be used to inform children's vulnerability based upon likely differences in internal dose from xenobiotic exposure. This might provide a qualitative assessment of toxicokinetic (TK) variability and uncertainty pertinent to early lifestages and help scope a more quantitative physiologically based toxicokinetic (PBTK) assessment. Although much is known regarding the ontogeny of metabolizing systems, this is not commonly utilized in the scoping and problem formulation stage of human health risk evaluation. A framework is proposed for introducing this information into problem formulation which combines data on enzyme ontogeny and chemical-specific TK to explore potential child/adult differences in internal dose and whether such metabolic differences may be important factors in risk evaluation. The framework is illustrated with five case study chemicals, including some which are data rich and provide proof of concept, while others are data poor. Case studies for toluene and chlorpyrifos indicate potentially important child/adult TK differences while scoping for acetaminophen suggests enzyme ontogeny is unlikely to increase early-life risks. Scoping for trichloroethylene and aromatic amines indicates numerous ways that enzyme ontogeny may affect internal dose which necessitates further evaluation. PBTK modeling is a critical and feasible next step to further evaluate child-adult differences in internal dose for a number of these chemicals.

Subject(s)

Child Health , Enzymes/metabolism , Models, Theoretical , Acetaminophen/toxicity , Amines/toxicity , Child , Chlorpyrifos/toxicity , Environmental Pollutants/toxicity , Humans , Research Design , Risk Assessment , Toluene/toxicity , Toxicokinetics , Trichloroethylene/toxicity

13.

ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology.

Richard, Ann M; Judson, Richard S; Houck, Keith A; Grulke, Christopher M; Volarath, Patra; Thillainadarajah, Inthirany; Yang, Chihae; Rathman, James; Martin, Matthew T; Wambaugh, John F; Knudsen, Thomas B; Kancherla, Jayaram; Mansouri, Kamel; Patlewicz, Grace; Williams, Antony J; Little, Stephen B; Crofton, Kevin M; Thomas, Russell S.

Chem Res Toxicol ; 29(8): 1225-51, 2016 08 15.

Article in English | MEDLINE | ID: mdl-27367298

ABSTRACT

The U.S. Environmental Protection Agency's (EPA) ToxCast program is testing a large library of Agency-relevant chemicals using in vitro high-throughput screening (HTS) approaches to support the development of improved toxicity prediction models. Launched in 2007, Phase I of the program screened 310 chemicals, mostly pesticides, across hundreds of ToxCast assay end points. In Phase II, the ToxCast library was expanded to 1878 chemicals, culminating in the public release of screening data at the end of 2013. Subsequent expansion in Phase III has resulted in more than 3800 chemicals actively undergoing ToxCast screening, 96% of which are also being screened in the multi-Agency Tox21 project. The chemical library underpinning these efforts plays a central role in defining the scope and potential application of ToxCast HTS results. The history of the phased construction of EPA's ToxCast library is reviewed, followed by a survey of the library contents from several different vantage points. CAS Registry Numbers are used to assess ToxCast library coverage of important toxicity, regulatory, and exposure inventories. Structure-based representations of ToxCast chemicals are then used to compute physicochemical properties, substructural features, and structural alerts for toxicity and biotransformation. Cheminformatics approaches using these varied representations are applied to defining the boundaries of HTS testability, evaluating chemical diversity, and comparing the ToxCast library to potential target application inventories, such as those used in EPA's Endocrine Disruptor Screening Program (EDSP). Through several examples, the ToxCast chemical library is demonstrated to provide comprehensive coverage of the knowledge domains and target inventories of potential interest to EPA. Furthermore, the varied representations and approaches presented here define local chemistry domains potentially worthy of further investigation (e.g., not currently covered in the testing library or defined by toxicity "alerts") to strategically support data mining and predictive toxicology modeling moving forward.

Subject(s)

Toxicology

14.

CERAPP: Collaborative Estrogen Receptor Activity Prediction Project.

Mansouri, Kamel; Abdelaziz, Ahmed; Rybacka, Aleksandra; Roncaglioni, Alessandra; Tropsha, Alexander; Varnek, Alexandre; Zakharov, Alexey; Worth, Andrew; Richard, Ann M; Grulke, Christopher M; Trisciuzzi, Daniela; Fourches, Denis; Horvath, Dragos; Benfenati, Emilio; Muratov, Eugene; Wedebye, Eva Bay; Grisoni, Francesca; Mangiatordi, Giuseppe F; Incisivo, Giuseppina M; Hong, Huixiao; Ng, Hui W; Tetko, Igor V; Balabin, Ilya; Kancherla, Jayaram; Shen, Jie; Burton, Julien; Nicklaus, Marc; Cassotti, Matteo; Nikolov, Nikolai G; Nicolotti, Orazio; Andersson, Patrik L; Zang, Qingda; Politi, Regina; Beger, Richard D; Todeschini, Roberto; Huang, Ruili; Farag, Sherif; Rosenberg, Sine A; Slavov, Svetoslav; Hu, Xin; Judson, Richard S.

Environ Health Perspect ; 124(7): 1023-33, 2016 07.

Article in English | MEDLINE | ID: mdl-26908244

ABSTRACT

BACKGROUND: Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program. OBJECTIVES: We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing. METHODS: CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies. RESULTS: Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing. CONCLUSION: This project demonstrated the feasibility of screening large libraries of chemicals using a consensus of different in silico approaches.
This concept will be applied in future projects related to other end points. CITATION: Mansouri K, Abdelaziz A, Rybacka A, Roncaglioni A, Tropsha A, Varnek A, Zakharov A, Worth A, Richard AM, Grulke CM, Trisciuzzi D, Fourches D, Horvath D, Benfenati E, Muratov E, Wedebye EB, Grisoni F, Mangiatordi GF, Incisivo GM, Hong H, Ng HW, Tetko IV, Balabin I, Kancherla J, Shen J, Burton J, Nicklaus M, Cassotti M, Nikolov NG, Nicolotti O, Andersson PL, Zang Q, Politi R, Beger RD, Todeschini R, Huang R, Farag S, Rosenberg SA, Slavov S, Hu X, Judson RS. 2016. CERAPP: Collaborative Estrogen Receptor Activity Prediction Project. Environ Health Perspect 124:1023-1033; http://dx.doi.org/10.1289/ehp.1510267.

Subject(s)

Endocrine Disruptors/toxicity , Receptors, Estrogen/metabolism , Toxicity Tests , Computer Simulation , Endocrine Disruptors/classification , Environmental Policy , Quantitative Structure-Activity Relationship , United States
