RESUMO
New approach methodologies (NAMs) aim to accelerate the pace of chemical risk assessment while simultaneously reducing cost and dependency on animal studies. High Throughput Transcriptomics (HTTr) is an emerging NAM in the field of chemical hazard evaluation for establishing in vitro points-of-departure and providing mechanistic insight. In the current study, 1201 test chemicals were screened for bioactivity at eight concentrations using a 24-h exposure duration in the human- derived U-2 OS osteosarcoma cell line with HTTr. Assay reproducibility was assessed using three reference chemicals that were screened on every assay plate. The resulting transcriptomics data were analyzed by aggregating signal from genes into signature scores using gene set enrichment analysis, followed by concentration-response modeling of signatures scores. Signature scores were used to predict putative mechanisms of action, and to identify biological pathway altering concentrations (BPACs). BPACs were consistent across replicates for each reference chemical, with replicate BPAC standard deviations as low as 5.6 × 10-3 µM, demonstrating the internal reproducibility of HTTr-derived potency estimates. BPACs of test chemicals showed modest agreement (R2 = 0.55) with existing phenotype altering concentrations from high throughput phenotypic profiling using Cell Painting of the same chemicals in the same cell line. Altogether, this HTTr based chemical screen contributes to an accumulating pool of publicly available transcriptomic data relevant for chemical hazard evaluation and reinforces the utility of cell based molecular profiling methods in estimating chemical potency and predicting mechanism of action across a diverse set of chemicals.
Assuntos
Perfilação da Expressão Gênica , Ensaios de Triagem em Larga Escala , Transcriptoma , Humanos , Ensaios de Triagem em Larga Escala/métodos , Linhagem Celular Tumoral , Transcriptoma/efeitos dos fármacos , Perfilação da Expressão Gênica/métodos , Reprodutibilidade dos Testes , Relação Dose-Resposta a Droga , Medição de Risco , Osteossarcoma/genética , Osteossarcoma/patologiaRESUMO
The presence of numerous chemical contaminants from industrial, agricultural, and pharmaceutical sources in water supplies poses a potential risk to human and ecological health. Current chemical analyses suffer from limitations, including chemical coverage and high cost, and broad-coverage in vitro assays such as transcriptomics may further improve water quality monitoring by assessing a large range of possible effects. Here, we used high-throughput transcriptomics to assess the activity induced by field-derived water extracts in MCF7 breast carcinoma cells. Wastewater and surface water extracts induced the largest changes in expression among cell proliferation-related genes and neurological, estrogenic, and antibiotic pathways, whereas drinking and reclaimed water extracts that underwent advanced treatment showed substantially reduced bioactivity on both gene and pathway levels. Importantly, reclaimed water extracts induced fewer changes in gene expression than laboratory blanks, which reinforces previous conclusions based on targeted assays and improves confidence in bioassay-based monitoring of water quality.
Assuntos
Poluentes Químicos da Água , Purificação da Água , Humanos , Monitoramento Ambiental , Poluentes Químicos da Água/análise , Qualidade da Água , Perfilação da Expressão Gênica , BioensaioRESUMO
Tea plants are a perennial crop with significant economic value. Chlorophyll, a key factor in tea leaf color and photosynthetic efficiency, is affected by the photoperiod and usually exhibits diurnal and seasonal variations. In this study, high-throughput transcriptomic analysis was used to study the chlorophyll metabolism, under different photoperiods, of tea plants. We conducted a time-series sampling under a skeleton photoperiod (6L6D) and continuous light conditions (24 L), measuring the chlorophyll and carotenoid content at a photoperiod interval of 3 h (24 h). Transcriptome sequencing was performed at six time points across two light cycles, followed by bioinformatics analysis to identify and annotate the differentially expressed genes (DEGs) involved in chlorophyll metabolism. The results revealed distinct expression patterns of key genes in the chlorophyll biosynthetic pathway. The expression levels of CHLE (magnesium-protoporphyrin IX monomethyl ester cyclase gene), CHLP (geranylgeranyl reductase gene), CLH (chlorophyllase gene), and POR (cytochrome P450 oxidoreductase gene), encoding enzymes in chlorophyll synthesis, were increased under continuous light conditions (24 L). At 6L6D, the expression levels of CHLP1.1, POR1.1, and POR1.2 showed an oscillating trend. The expression levels of CHLP1.2 and CLH1.1 showed the same trend, they both decreased under light treatment and increased under dark treatment. Our findings provide potential insights into the molecular basis of how photoperiods regulate chlorophyll metabolism in tea plants.
Assuntos
Clorofila , Ritmo Circadiano , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Fotoperíodo , Transcriptoma , Clorofila/metabolismo , Ritmo Circadiano/genética , Camellia sinensis/genética , Camellia sinensis/metabolismo , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Sequenciamento de Nucleotídeos em Larga EscalaRESUMO
BACKGROUND: Prolonged exposure to toxic heavy metals leads to deleterious health outcomes including kidney injury. Metal exposure occurs through both environmental pathways including contamination of drinking water sources and from occupational hazards, including the military-unique risks from battlefield injuries resulting in retained metal fragments from bullets and blast debris. One of the key challenges to mitigate health effects in these scenarios is to detect early insult to target organs, such as the kidney, before irreversible damage occurs. METHODS: High-throughput transcriptomics (HTT) has been recently demonstrated to have high sensitivity and specificity as a rapid and cost-effective assay for detecting tissue toxicity. To better understand the molecular signature of early kidney damage, we performed RNA sequencing (RNA-seq) on renal tissue using a rat model of soft tissue-embedded metal exposure. We then performed small RNA-seq analysis on serum samples from the same animals to identify potential miRNA biomarkers of kidney damage. RESULTS: We found that metals, especially lead and depleted uranium, induce oxidative damage that mainly cause dysregulated mitochondrial gene expression. Utilizing publicly available single-cell RNA-seq datasets, we demonstrate that deep learning-based cell type decomposition effectively identified cells within the kidney that were affected by metal exposure. By combining random forest feature selection and statistical methods, we further identify miRNA-423 as a promising early systemic marker of kidney injury. CONCLUSION: Our data suggest that combining HTT and deep learning is a promising approach for identifying cell injury in kidney tissue. We propose miRNA-423 as a potential serum biomarker for early detection of kidney injury.
Assuntos
MicroRNAs , Transcriptoma , Ratos , Animais , Transcriptoma/genética , Rim , Perfilação da Expressão Gênica , MicroRNAs/genética , MicroRNAs/metabolismo , Biomarcadores/metabolismoRESUMO
To address the challenge of limited throughput with traditional toxicity testing, a newly developed high-throughput transcriptomics (HTT) platform, together with a 5-day in vivo rat model, offers an alternative approach to estimate chemical exposures and provide reasonable estimates of toxicological endpoints. This study contains an HTT analysis of 18 environmental chemicals with known liver toxicity. They were evaluated using male Sprague Dawley rats exposed to various concentrations daily for five consecutive days via oral gavage, with data collected on the sixth day. Here, we further explored the 5-day rat model to identify potential gene signatures that can differentiate between toxic and non-toxic liver responses and provide us with a potential histopathological endpoint of chemical exposure. We identified a distinct gene expression pattern that differentiated non-hepatotoxic compounds from hepatotoxic compounds in a dose-dependent manner, and an analysis of the significantly altered common genes indicated that toxic chemicals predominantly upregulated most of the genes and several pathways in amino acid and lipid metabolism. Finally, our liver injury module analysis revealed that several liver-toxic compounds showed similarities in the key injury phenotypes of cellular inflammation and proliferation, indicating potential molecular initiating processes that may lead to a specific end-stage liver disease.
Assuntos
Doença Hepática Terminal , Fígado , Ratos , Masculino , Animais , Ratos Sprague-Dawley , Fígado/patologia , Perfilação da Expressão Gênica , Testes de Toxicidade , Doença Hepática Terminal/patologiaRESUMO
BACKGROUND: The QuantiGene® Plex 2.0 platform (ThermoFisher Scientific) combines bDNA with the Luminex/xMAP magnetic bead capturing technology to assess differential gene expression in a compound exposure setting. This technology allows multiplexing in a single well of a 96 or 384 multi-well plate and can thus be used in high throughput drug discovery mode. Data interpretation follows a three-step normalization/transformation flow in which raw median fluorescent gene signals are transformed to fold change values with the use of proper housekeeping genes and negative controls. Clear instructions on how to assess the data quality and tools to perform this analysis in high throughput mode are, however, currently lacking. RESULTS: In this paper we introduce QGprofiler, an open source R based shiny application. QGprofiler allows for proper QuantiGene® Plex 2.0 assay optimization, choice of housekeeping genes and data pre-processing up to fold change, including appropriate QC metrics. In addition, QGprofiler allows for an Akaike information criterion based dose response fold change model selection and has a built-in tool to detect the cytotoxic potential of compounds evaluated in a high throughput screening campaign. CONCLUSION: QGprofiler is a user friendly, open source available R based shiny application, which is developed to support drug discovery campaigns. In this context, entire compound libraries/series can be tested in dose response against a gene signature of choice in search for new disease relevant chemical entities. QGprofiler is available at: https://qgprofiler.openanalytics.eu/app/QGprofiler.
Assuntos
Descoberta de Drogas/métodos , Perfilação da Expressão Gênica/métodos , SoftwareRESUMO
High-throughput transcriptomics (HTTr) uses gene expression profiling to characterize the biological activity of chemicals in in vitro cell-based test systems. As an extension of a previous study testing 44 chemicals, HTTr was used to screen an additional 1,751 unique chemicals from the EPA's ToxCast collection in MCF7 cells using 8 concentrations and an exposure duration of 6 h. We hypothesized that concentration-response modeling of signature scores could be used to identify putative molecular targets and cluster chemicals with similar bioactivity. Clustering and enrichment analyses were conducted based on signature catalog annotations and ToxPrint chemotypes to facilitate molecular target prediction and grouping of chemicals with similar bioactivity profiles. Enrichment analysis based on signature catalog annotation identified known mechanisms of action (MeOAs) associated with well-studied chemicals and generated putative MeOAs for other active chemicals. Chemicals with predicted MeOAs included those targeting estrogen receptor (ER), glucocorticoid receptor (GR), retinoic acid receptor (RAR), the NRF2/KEAP/ARE pathway, AP-1 activation, and others. Using reference chemicals for ER modulation, the study demonstrated that HTTr in MCF7 cells was able to stratify chemicals in terms of agonist potency, distinguish ER agonists from antagonists, and cluster chemicals with similar activities as predicted by the ToxCast ER Pathway model. Uniform manifold approximation and projection (UMAP) embedding of signature-level results identified novel ER modulators with no ToxCast ER Pathway model predictions. Finally, UMAP combined with ToxPrint chemotype enrichment was used to explore the biological activity of structurally related chemicals. The study demonstrates that HTTr can be used to inform chemical risk assessment by determining in vitro points of departure, predicting chemicals' MeOA and grouping chemicals with similar bioactivity profiles.
Assuntos
Perfilação da Expressão Gênica , Ensaios de Triagem em Larga Escala , Humanos , Células MCF-7 , Transcriptoma/efeitos dos fármacos , Análise por Conglomerados , Relação Dose-Resposta a DrogaRESUMO
Organophosphate esters (OPEs), used as flame retardants and plasticizers, are present ubiquitously in the environment. Previous studies suggest that exposure to OPEs is detrimental to female fertility in humans. However, no experimental information is available on the effects of OPE mixtures on ovarian granulosa cells, which play essential roles in female reproduction. We used high-content imaging to investigate the effects of environmentally relevant OPE mixtures on KGN human granulosa cell phenotypes. Perturbations to steroidogenesis were assessed using ELISA and qRT-PCR. A high-throughput transcriptomic approach, TempO-Seq, was used to identify transcriptional changes in a targeted panel of genes. Effects on lipid homeostasis were explored using a cholesterol assay and global lipidomic profiling. OPE mixtures altered multiple phenotypic features of KGN cells, with triaryl OPEs in the mixture showing higher potencies than other mixture components. The mixtures increased basal production of steroid hormones; this was mediated by significant changes in the expression of critical transcripts involved in steroidogenesis. Further, the total-OPE mixture disrupted cholesterol homeostasis and the composition of intracellular lipid droplets. Exposure to complex mixtures of OPEs, similar to those found in house dust, may adversely affect female reproductive health by altering a multitude of phenotypic and functional endpoints in granulosa cells. This study provides novel insights into the mechanisms of actions underlying the toxicity induced by OPEs and highlights the need to examine the effects of human relevant chemical mixtures.
Assuntos
Poeira , Ésteres , Retardadores de Chama , Células da Granulosa , Lipidômica , Organofosfatos , Fenótipo , Transcriptoma , Humanos , Feminino , Células da Granulosa/efeitos dos fármacos , Células da Granulosa/metabolismo , Transcriptoma/efeitos dos fármacos , Organofosfatos/toxicidade , Ésteres/toxicidade , Retardadores de Chama/toxicidade , Linhagem Celular , Metabolismo dos Lipídeos/efeitos dos fármacos , Plastificantes/toxicidade , Colesterol/metabolismoRESUMO
Per- and polyfluoroalkyl substances (PFAS) is a large compound class (n > 12,000) that is extensively present in food, drinking water, and aquatic environments. Reduced serum triglycerides and hepatosteatosis appear to be the common phenotypes for different PFAS chemicals. However, the hepatosteatosis potential of most PFAS chemicals remains largely unknown. This study aims to investigate PFAS-induced hepatosteatosis using in vitro high-throughput phenotype profiling (HTPP) and high-throughput transcriptomic (HTTr) data. We quantified the in vitro hepatosteatosis effects and mitochondrial damage using high-content imaging, curated the transcriptomic data from the Gene Expression Omnibus (GEO) database, and then calculated the point of departure (POD) values for HTPP phenotypes or HTTr transcripts, using the Bayesian benchmark dose modeling approach. Our results indicated that PFAS compounds with fully saturated C-F bonds, sulfur- and nitrogen-containing functional groups, and a fluorinated carbon chain length greater than 8 have the potential to produce biological effects consistent with hepatosteatosis. PFAS primarily induced hepatosteatosis via disturbance in lipid transport and storage. The potency rankings of PFAS compounds are highly concordant among in vitro HTPP, HTTr, and in vivo hepatosteatosis phenotypes (ρ = 0.60-0.73). In conclusion, integrating the information from in vitro HTPP and HTTr analyses can accurately project in vivo hepatosteatosis effects induced by PFAS compounds.
Assuntos
Fluorocarbonos , Perfilação da Expressão Gênica , Teorema de Bayes , Transcriptoma , Fenótipo , Fluorocarbonos/toxicidadeRESUMO
Animal toxicity testing is time and resource intensive, making it difficult to keep pace with the number of substances requiring assessment. Machine learning (ML) models that use chemical structure information and high-throughput experimental data can be helpful in predicting potential toxicity . However, much of the toxicity data used to train ML models is biased with an unequal balance of positives and negatives primarily since substances selected for in vivo testing are expected to elicit some toxicity effect. To investigate the impact this bias had on predictive performance, various sampling approaches were used to balance in vivo toxicity data as part of a supervised ML workflow to predict hepatotoxicity outcomes from chemical structure and/or targeted transcriptomic data. From the chronic, subchronic, developmental, multigenerational reproductive, and subacute repeat-dose testing toxicity outcomes with a minimum of 50 positive and 50 negative substances, 18 different study-toxicity outcome combinations were evaluated in up to 7 ML models. These included Artificial Neural Networks, Random Forests, Bernouilli Naïve Bayes, Gradient Boosting, and Support Vector classification algorithms which were compared with a local approach, Generalised Read-Across (GenRA), a similarity-weighted k-Nearest Neighbour (k-NN) method. The mean CV F1 performance for unbalanced data across all classifiers and descriptors for chronic liver effects was 0.735 (0.0395 SD). Mean CV F1 performance dropped to 0.639 (0.073 SD) with over-sampling approaches though the poorer performance of KNN approaches in some cases contributed to the observed decrease (mean CV F1 performance excluding KNN was 0.697 (0.072 SD)). With under-sampling approaches, the mean CV F1 was 0.523 (0.083 SD). For developmental liver effects, the mean CV F1 performance was much lower with 0.089 (0.111 SD) for unbalanced approaches and 0.149 (0.084 SD) for under-sampling. Over-sampling approaches led to an increase in mean CV F1 performance (0.234, (0.107 SD)) for developmental liver toxicity. Model performance was found to be dependent on dataset, model type, balancing approach and feature selection. Accordingly tailoring ML workflows for predicting toxicity should consider class imbalance and rely on simpler classifiers first.
RESUMO
Open access new approach methods (NAM) in the US EPA ToxCast program and NTP Integrated Chemical Environment (ICE) were used to investigate activities of four neurotoxic pesticides: endosulfan, fipronil, propyzamide and carbaryl. Concordance of in vivo regulatory points of departure (POD) adjusted for interspecies extrapolation (AdjPOD) to modelled human Administered Equivalent Dose (AEDHuman) was assessed using 3-compartment or Adult/Fetal PBTK in vitro to in vivo extrapolation. Model inputs were from Tier 1 (High throughput transcriptomics: HTTr, high throughput phenotypic profiling: HTPP) and Tier 2 (single target: ToxCast) assays. HTTr identified gene expression signatures associated with potential neurotoxicity for endosulfan, propyzamide and carbaryl in non-neuronal MCF-7 and HepaRG cells. The HTPP assay in U-2 OS cells detected potent effects on DNA endpoints for endosulfan and carbaryl, and mitochondria with fipronil (propyzamide was inactive). The most potent ToxCast assays were concordant with specific components of each chemical mode of action (MOA). Predictive adult IVIVE models produced fold differences (FD) < 10 between the AEDHuman and the measured in vivo AdjPOD. The 3-compartment model was concordant (i.e., smallest FD) for endosulfan, fipronil and carbaryl, and PBTK was concordant for propyzamide. The most potent AEDHuman predictions for each chemical showed HTTr, HTPP and ToxCast were mainly concordant with in vivo AdjPODs but assays were less concordant with MOAs. This was likely due to the cell types used for testing and/or lack of metabolic capabilities and pathways available in vivo. The Fetal PBTK model had larger FDs than adult models and was less predictive overall.
RESUMO
Despite the growing number of studies reporting potential risks associated with exposure to organophosphate esters (OPEs), their molecular mechanisms of action remain poorly defined. We used the high-throughput TempO-Seq™ platform to investigate the effects of frequently detected OPEs on the expression of â¼3000 environmentally responsive genes in KGN human ovarian granulosa cells. Cells were exposed for 48 h to one of five OPEs (0.1 to 50 µM): tris(methylphenyl) phosphate (TMPP), isopropylated triphenyl phosphate (IPPP), tert-butylphenyl diphenyl phosphate (BPDP), triphenyl phosphate (TPHP), or tris(2-butoxyethyl) phosphate (TBOEP). The sequencing data indicate that four OPEs induced transcriptional changes, whereas TBOEP had no effect within the concentration range tested. Multiple pathway databases were used to predict alterations in biological processes based on differentially expressed genes. At lower concentrations, inhibition of the cholesterol biosynthetic pathway was the predominant effect of OPEs; this was likely a consequence of intracellular cholesterol accumulation. At higher concentrations, BPDP and TPHP had distinct effects, primarily affecting pathways involved in cell cycle progression and other stress responses. Benchmark concentration (BMC) modelling revealed that BPDP had the lowest transcriptomic point of departure. However, in vitro to in vivo extrapolation modeling indicated that TMPP was bioactive at lower concentrations than the other OPEs. We conclude that these new approach methodologies provide information on the mechanism(s) underlying the effects of data-poor compounds and assist in the derivation of protective points of departure for use in chemical read-across and decision-making.
RESUMO
Concentrations at which global gene expression profiles in cells or animals exposed to a test substance start to differ significantly from those of controls have been proposed as an alternative point of departure for use in screening level hazard assessment. The present study describes pilot testing of a high throughput compatible transcriptomics assay with larval fathead minnows. One day post hatch fathead minnows were exposed to eleven different concentrations of three metals, three selective serotonin reuptake inhibitors, and four neonicotinoid-like compounds for 24 h and concentration response modeling was applied to whole body gene expression data. Transcriptomics-based points of departure (tPODs) were consistently lower than effect concentrations reported in apical endpoint studies in fish. However, larval fathead minnow-based tPODs were not always lower than concentrations reported to elicit apical toxicity in other aquatic organisms like crustaceans or insects. Random in silico subsampling of data from the pilot assays was used to evaluate various assay design and acceptance considerations such as transcriptome coverage, number of replicate individuals to sequence per treatment, and minimum number of differentially expressed genes to produce a reliable tPOD estimate. Results showed a strong association between the total number of genes for which a concentration response relationship could be derived and the overall variability in the resulting tPOD estimates. We conclude that, for our current assay design and analysis pipeline, tPODs based on fewer than 15 differentially expressed genes are likely to be unreliable for screening and that interindividual variability in gene expression profiles appears to be a more significant driver of tPOD variability than sample size alone. Results represent initial steps toward developing high throughput transcriptomics assays for use in ecological hazard screening.
RESUMO
High-throughput transcriptomics has advanced through the introduction of TempO-seq, a targeted alternative to traditional RNA-seq. TempO-seq platforms use 50 nucleotide probes, each specifically designed to target a known transcript, thus allowing for reduced sequencing depth per sample compared with RNA-seq without compromising the accuracy of results. Thus far, studies using the TempO-seq method have relied on existing tools for processing the resulting short read data. However, these tools were originally designed for other data types. While they have been used for processing of early TempO-seq data, they have not been systematically assessed for accuracy or compared to determine an optimal framework for processing and analyzing TempO-seq data. In this work, we re-analyze several publicly available TempO-seq data sets covering a range of experimental designs and use corresponding RNA-seq data sets as a gold standard to rigorously assess accuracy at multiple levels. We compare 6 aligners and 5 normalization methods across various accuracy and performance metrics. Our results demonstrate the overall robust accuracy of the TempO-seq platform, independent of data processing methods. Complex aligners and advanced normalization methods do not appear to have any general advantage over simpler methods when it comes to analyzing TempO-seq data. The reduced complexity of the sequencing space, and the fact that TempO-seq probes are all equal length, appears to reduce the need for elaborate bioinformatic or statistical methods used to address these factors in RNA-seq data.
RESUMO
High-throughput transcriptomics (HTTr) has the potential to support efforts to reduce or replace some animal tests. In past studies, we described a computational approach utilizing a gene expression biomarker consisting of 46 genes to predict estrogen receptor (ER) activity after chemical exposure in ER-positive human breast cancer cells including the MCF-7 cell line. We hypothesized that the biomarker model could identify ER activities of chemicals examined by Endocrine Disruptor Screening Program (EDSP) Tier 1 screening assays in which transcript profiles of the same chemicals were examined in MCF-7 cells. For the 62 chemicals examined including 5 chemicals examined in this study using RNA-Seq, the ER biomarker model accuracy was 1) 97% for in vitro reference chemicals, 2) 76-85% for guideline uterotrophic assays, and 3) 87-88% for guideline and nonguideline uterotrophic assays. For the same chemicals, these accuracies were similar or slightly better than those of the ToxCast ER model based on 18 in vitro assays. The performance of the ER biomarker model indicates that HTTr interpreted using the ER biomarker correctly identifies active and inactive ER reference chemicals. As part of the HTTr screening program the approach could rapidly identify chemicals with potential ER bioactivities for additional screening and testing.
Assuntos
Neoplasias da Mama , Disruptores Endócrinos , Animais , Biomarcadores , Neoplasias da Mama/genética , Feminino , Expressão Gênica , Ensaios de Triagem em Larga Escala , Humanos , Receptores de Estrogênio/genética , Receptores de Estrogênio/metabolismoRESUMO
BACKGROUND: MicroRNAs (miRNAs) are sought-after biomarkers of complex, polygenic diseases such as type 2 diabetes (T2D). Data-driven biocomputing provides robust and novel avenues for synthesizing evidence from individual miRNA seq studies. OBJECTIVE: To identify miRNA markers associated with T2D, via a data-driven, biocomputing approach on high throughput transcriptomics. MATERIALS AND METHODS: The pipeline consisted of five sequential steps using miRNA seq data retrieved from the National Center for Biotechnology Information Gene Expression Omnibus platform: systematic review; identification of differentially expressed miRNAs (DE-miRNAs); meta-analysis of DE-miRNAs; network analysis; and downstream analyses. Three normalization algorithms (trimmed mean of M-values; upper quartile; relative log expression) and two meta-analytic algorithms (robust rank aggregation; Fisher's method of p-value combining) were integrated into the pipeline. Network analysis was conducted on miRNet 2.0 while enrichment and over-representation analyses were conducted on miEAA 2.0. RESULTS: A total of 1256 DE-miRNAs (821 downregulated; 435 upregulated) were identified from 5 eligible miRNA seq datasets (3 circulatory; 1 adipose; 1 pancreatic). The meta-signature comprised 9 miRNAs (hsa-miR-15b-5p; hsa-miR-33b-5p; hsa-miR-106b-3p; hsa-miR-106b-5p; hsa-miR-146a-5p; hsa-miR-483-5p; hsa-miR-539-3p; hsa-miR-1260a; hsa-miR-4454), identified via the two meta-analysis approaches. Two hub nodes (hsa-miR-106b-5p; hsa-miR-15b-5p) with above-average degree and betweenness centralities in the miRNA-gene interactions network were identified. Downstream analyses revealed 5 highly conserved- (hsa-miR-33b-5p; hsa-miR-15b-5p; hsa-miR-106b-3p; hsa-miR-106b-5p; hsa-miR-146a-5p) and 7 highly confident- (hsa-miR-33b-5p; hsa-miR-15b-5p; hsa-miR-106b-3p; hsa-miR-106b-5p; hsa-miR-146a-5p; hsa-miR-483-5p; hsa-miR-539-3p) miRNAs. A total of 288 miRNA-disease associations were identified, in which 3 miRNAs (hsa-miR-15b-5p; hsa-miR-106b-3p; hsa-miR-146a-5p) were highly enriched. CONCLUSIONS: A meta-signature of DE-miRNAs associated with T2D was discovered via in-silico analyses and its pathobiological relevance was validated against corroboratory evidence from contemporary studies and downstream analyses. The miRNA meta-signature could be useful for guiding future studies on T2D. There may also be avenues for using the pipeline more broadly for evidence synthesis on other conditions using high throughput transcriptomics.
RESUMO
BACKGROUND: The advent of high-throughput transcriptomic screening technologies has resulted in a wealth of publicly available gene expression data associated with chemical treatments. From a regulatory perspective, data sets that cover a large chemical space and contain reference chemicals offer utility for the prediction of molecular initiating events associated with chemical exposure. Here, we integrate data from a large compendium of transcriptomic responses to chemical exposure with a comprehensive database of chemical-protein associations to train binary classifiers that predict mechanism(s) of action from transcriptomic responses. First, we linked reference chemicals present in the LINCS L1000 gene expression data collection to chemical identifiers in RefChemDB, a database of chemical-protein interactions. Next, we trained binary classifiers on MCF7 human breast cancer cell line derived gene expression profiles and chemical-protein labels using six classification algorithms to identify optimal analysis parameters. To validate classifier accuracy, we used holdout data sets, training-excluded reference chemicals, and empirical significance testing of null models derived from permuted chemical-protein associations. To identify classifiers that have variable predicting performance across training data derived from different cellular contexts, we trained a separate set of binary classifiers on the PC3 human prostate cancer cell line. RESULTS: We trained classifiers using expression data associated with chemical treatments linked to 51 molecular initiating events. This analysis identified and validated 9 high-performing classifiers with empirical p-values lower than 0.05 and internal accuracies ranging from 0.73 to 0.94 and holdout accuracies of 0.68 to 0.92. High-ranking predictions for training-excluded reference chemicals demonstrating that predictive accuracy extends beyond the set of chemicals used in classifier training. To explore differences in classifier performance as a function of training data cellular context, MCF7-trained classifier accuracies were compared to classifiers trained on the PC3 gene expression data for the same molecular initiating events. CONCLUSIONS: This methodology can offer insight in prioritizing candidate perturbagens of interest for targeted screens. This approach can also help guide the selection of relevant cellular contexts for screening classes of candidate perturbagens using cell line specific model performance.
RESUMO
Per- and polyfluoroalkyl substances (PFAS) are some of the most prominent organic contaminants in human blood. Although the toxicological implications of human exposure to perfluorooctane sulfonate (PFOS) and perfluorooctanoate (PFOA) are well established, data on lesser-understood PFAS are limited. New approach methodologies (NAMs) that apply bioinformatic tools to high-throughput data are being increasingly considered to inform risk assessment for data-poor chemicals. The aim of this study was to compare the potencies (ie, benchmark concentrations: BMCs) of PFAS in primary human liver microtissues (3D spheroids) using high-throughput transcriptional profiling. Gene expression changes were measured using TempO-seq, a templated, multiplexed RNA-sequencing platform. Spheroids were exposed for 1 or 10 days to increasing concentrations of 23 PFAS in 3 subgroups: carboxylates (PFCAs), sulfonates (PFSAs), and fluorotelomers and sulfonamides. PFCAs and PFSAs exhibited trends toward increased transcriptional potency with carbon chain-length. Specifically, longer-chain compounds (7-10 carbons) were more likely to induce changes in gene expression and have lower transcriptional BMCs. The combined high-throughput transcriptomic and bioinformatic analyses support the capability of NAMs to efficiently assess the effects of PFAS in liver microtissues. The data enable potency ranking of PFAS for human liver cell spheroid cytotoxicity and transcriptional changes, and assessment of in vitro transcriptomic points of departure. These data improve our understanding of the possible health effects of PFAS and will be used to inform read-across for human health risk assessment.
Assuntos
Ácidos Alcanossulfônicos , Fluorocarbonos , Ácidos Alcanossulfônicos/toxicidade , Ácidos Carboxílicos , Fluorocarbonos/toxicidade , Humanos , Fígado , TranscriptomaRESUMO
Read-across is a data gap filling technique utilized to predict the toxicity of a target chemical using data from similar analogues. Recent efforts such as Generalized Read-Across (GenRA) facilitate automated read-across predictions for untested chemicals. GenRA makes predictions of toxicity outcomes based on "neighboring" chemicals characterized by chemical and bioactivity fingerprints. Here we investigated the impact of biological similarities on neighborhood formation and read-across performance in predicting hazard (based on repeat-dose testing outcomes from US EPA ToxRefDB v2.0). We used targeted transcriptomic data on 93 genes for 1060 chemicals in HepaRG™ cells that measure nuclear receptor activation, xenobiotic metabolism, cellular stress, cell cycle progression, and apoptosis. Transcriptomic similarity between chemicals was calculated using binary hit-calls from concentration-response data for each gene. We evaluated GenRA performance in predicting ToxRefDB v2.0 hazard outcomes using the area under the Receiver Operating Characteristic (ROC) curve (AUC) for the baseline approach (chemical fingerprints) versus transcriptomic fingerprints and a combination of both (hybrid). For all endpoints, there were significant but only modest improvements in ROC AUC scores of 0.01 (2.1%) and 0.04 (7.3%) with transcriptomic and hybrid descriptors, respectively. However, for liver-specific toxicity endpoints, ROC AUC scores improved by 10% and 17% for transcriptomic and hybrid descriptors, respectively. Our findings suggest that using hybrid descriptors formed by combining chemical and targeted transcriptomic information can improve in vivo toxicity predictions in the right context.
RESUMO
Cell-based in vitro models coupled with high-throughput transcriptomics (HTTr) are increasingly utilized as alternative methods to animal-based toxicity testing. Here, using a panel of 14 chemicals with different risks of human drug-induced liver injury (DILI) and two dosing concentrations, we evaluated an HTTr platform comprised of collagen sandwich primary rat hepatocyte culture and the TempO-Seq surrogate S1500+ (ST) assay. First, the HTTr platform was found to exhibit high reproducibility between technical and biological replicates (r greater than 0.85). Connectivity mapping analysis further demonstrated a high level of inter-platform reproducibility between TempO-Seq data and Affymetrix GeneChip data from the Open TG-GATES project. Second, the TempO-Seq ST assay was shown to be a robust surrogate to the whole transcriptome (WT) assay in capturing chemical-induced changes in gene expression, as evident from correlation analysis, PCA and unsupervised hierarchical clustering. Gene set enrichment analysis (GSEA) using the Hallmark gene set collection also demonstrated consistency in enrichment scores between ST and WT assays. Lastly, unsupervised hierarchical clustering of hallmark enrichment scores broadly divided the samples into hepatotoxic, intermediate, and non-hepatotoxic groups. Xenobiotic metabolism, bile acid metabolism, apoptosis, p53 pathway, and coagulation were found to be the key hallmarks driving the clustering. Taken together, our results established the reproducibility and performance of collagen sandwich culture in combination with TempO-Seq S1500+ assay, and demonstrated the utility of GSEA using the hallmark gene set collection to identify potential hepatotoxicants for further validation.