Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Comput Struct Biotechnol J ; 21: 2160-2171, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37013005

RESUMEN

The cells of colorectal cancer (CRC) in their microenvironment experience constant stress, leading to dysregulated activity in the tumor niche. As a result, cancer cells acquire alternative pathways in response to the changing microenvironment, posing significant challenges for the design of effective cancer treatment strategies. While computational studies on high-throughput omics data have advanced our understanding of CRC subtypes, characterizing the heterogeneity of this disease remains remarkably complex. Here, we present a novel computational Pipeline for Characterizing Alternative Mechanisms (PCAM) based on biclustering to gain a more detailed understanding of cancer heterogeneity. Our application of PCAM to large-scale CRC transcriptomics datasets suggests that PCAM can generate a wealth of information leading to new biological understanding and predictive markers of alternative mechanisms. Our key findings include: 1) A comprehensive collection of alternative pathways in CRC, associated with biological and clinical factors. 2) Full annotation of detected alternative mechanisms, including their enrichment in known pathways and associations with various clinical outcomes. 3) A mechanistic relationship between known clinical subtypes and outcomes on a consensus map, visualized by the presence of alternative mechanisms. 4) Several potential novel alternative drug resistance mechanisms for Oxaliplatin, 5-Fluorouracil, and FOLFOX, some of which were validated on independent datasets. We believe that gaining a deeper understanding of alternative mechanisms is a critical step towards characterizing the heterogeneity of CRC. The hypotheses generated by PCAM, along with the comprehensive collection of biologically and clinically associated alternative pathways in CRC, could provide valuable insights into the underlying mechanisms driving cancer progression and drug resistance, which could aid in the development of more effective cancer therapies and guide experimental design towards more targeted and personalized treatment strategies. The computational pipeline of PCAM is available in GitHub (https://github.com/changwn/BC-CRC).

2.
Front Genet ; 13: 996941, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36276945

RESUMEN

Bi-clustering refers to the task of finding sub-matrices (indexed by a group of columns and a group of rows) within a matrix of data such that the elements of each sub-matrix (data and features) are related in a particular way, for instance, that they are similar with respect to some metric. In this paper, after analyzing the well-known Cheng and Church bi-clustering algorithm which has been proved to be an effective tool for mining co-expressed genes. However, Cheng and Church bi-clustering algorithm and summarizing its limitations (such as interference of random numbers in the greedy strategy; ignoring overlapping bi-clusters), we propose a novel enhancement of the adaptive bi-clustering algorithm, where a shielding complex sub-matrix is constructed to shield the bi-clusters that have been obtained and to discover the overlapping bi-clusters. In the shielding complex sub-matrix, the imaginary and the real parts are used to shield and extend the new bi-clusters, respectively, and to form a series of optimal bi-clusters. To assure that the obtained bi-clusters have no effect on the bi-clusters already produced, a unit impulse signal is introduced to adaptively detect and shield the constructed bi-clusters. Meanwhile, to effectively shield the null data (zero-size data), another unit impulse signal is set for adaptive detecting and shielding. In addition, we add a shielding factor to adjust the mean squared residue score of the rows (or columns), which contains the shielded data of the sub-matrix, to decide whether to retain them or not. We offer a thorough analysis of the developed scheme. The experimental results are in agreement with the theoretical analysis. The results obtained on a publicly available real microarray dataset show the enhancement of the bi-clusters performance thanks to the proposed method.

3.
Methods ; 207: 65-73, 2022 11.
Artículo en Inglés | MEDLINE | ID: mdl-36122881

RESUMEN

Abnormal co-occurrence medical visit behavior is a form of medical insurance fraud. Specifically, an organized gang of fraudsters hold multiple medical insurance cards and purchase similar drugs frequently at the same time and the same location in order to siphon off medical insurance funds. Conventional identification methods to identify such behaviors rely mainly on manual auditing, making it difficult to satisfy the needs of identifying the small number of fraudulent behaviors in the large-scale medical data. On the other hand, the existing single-view bi-clustering algorithms only consider the features of the time-location dimension while neglecting the similarities in prescriptions and neglecting the fact that fraudsters may belong to multiple gangs. Therefore, in this paper, we present a multi-view bi-clustering method for identifying abnormal co-occurrence medical visit behavioral patterns, which performs cluster analysis simultaneously on the large-scale, complex and diverse visiting record dimension and prescription dimension to identify bi-clusters with similar time-location features. The proposed method constructs a matrix view of patients and visit records as well as a matrix view of patients and prescriptions, while decomposing multiple data matrices into sparse row and column vectors to obtain a consistent patient population across views. Subsequently the proposed method identifies the corresponding abnormal co-occurrence medical visit behavior and may greatly facilitate the safe operations and the sustainability of medical insurance funds. The experimental results show that our proposed method leads to more efficient and more accurate identifications of abnormal co-occurrence medical visit behavior, demonstrating its high efficiency and effectiveness.


Asunto(s)
Algoritmos , Humanos , Análisis por Conglomerados
4.
Front Plant Sci ; 13: 860791, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35463453

RESUMEN

Although growing evidence shows that microRNA (miRNA) regulates plant growth and development, miRNA regulatory networks in plants are not well understood. Current experimental studies cannot characterize miRNA regulatory networks on a large scale. This information gap provides an excellent opportunity to employ computational methods for global analysis and generate valuable models and hypotheses. To address this opportunity, we collected miRNA-target interactions (MTIs) and used MTIs from Arabidopsis thaliana and Medicago truncatula to predict homologous MTIs in soybeans, resulting in 80,235 soybean MTIs in total. A multi-level iterative bi-clustering method was developed to identify 483 soybean miRNA-target regulatory modules (MTRMs). Furthermore, we collected soybean miRNA expression data and corresponding gene expression data in response to abiotic stresses. By clustering these data, 37 MTRMs related to abiotic stresses were identified, including stress-specific MTRMs and shared MTRMs. These MTRMs have gene ontology (GO) enrichment in resistance response, iron transport, positive growth regulation, etc. Our study predicts soybean MTRMs and miRNA-GO networks under different stresses, and provides miRNA targeting hypotheses for experimental analyses. The method can be applied to other biological processes and other plants to elucidate miRNA co-regulation mechanisms.

5.
Biochim Biophys Acta Mol Basis Dis ; 1868(6): 166398, 2022 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-35318125

RESUMEN

Massive accumulation of lipids is a characteristic of alcoholic liver disease. Excess of hepatic fat activates Kupffer cells (KCs), which affect disease progression. Yet, KCs contribute to the resolution and advancement of liver injury. Aim of the present study was to evaluate the effect of KC depletion on markers of liver injury and the hepatic lipidome in liver steatosis (Lieber-DeCarli diet, LDC, female mice, mixed C57BL/6J and DBA/2J background). LDC increased the number of dead hepatocytes without changing the mRNA levels of inflammatory cytokines in the liver. Animals fed LDC accumulated elevated levels of almost all lipid classes. KC ablation normalized phosphatidylcholine and phosphatidylinositol levels in LDC livers, but had no effect in the controls. A modest decline of trigylceride and diglyceride levels upon KC loss was observed in both groups. Serum aminotransferases and hepatic ceramide were elevated in all animals upon KC depletion, and in particular, cytotoxic very long-chain ceramides increased in the LDC livers. Meta-biclustering revealed that eight lipid species occurred in more than 40% of the biclusters, and four of them were very long-chain ceramides. KC loss was further associated with excess free cholesterol levels in LDC livers. Expression of inflammatory cytokines did, however, not increase in parallel. In summary, the current study described a function of KCs in hepatic ceramide and cholesterol metabolism in an animal model of LDC liver steatosis. High abundance of cytotoxic ceramides and free cholesterol predispose the liver to disease progression suggesting a protective role of KCs in alcoholic liver diseases.


Asunto(s)
Hígado Graso , Macrófagos del Hígado , Animales , Hígado Graso/metabolismo , Femenino , Macrófagos del Hígado/metabolismo , Ratones , Ratones Endogámicos C57BL , Ratones Endogámicos DBA
6.
Ann Transl Med ; 9(5): 388, 2021 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-33842609

RESUMEN

BACKGROUND: With the popularization of knee replacement surgery in the treatment of the advanced lesions of knee joint, the amount of knee revision surgery is increasing unceasingly. Meanwhile, the continuous introduction of new clinical concepts and new technology poses a challenge to researchers and surgeons. Our study aims to inform the future scientific research and clinical treatment, by investigating the hot spots and trends of the knee revision research field with the method of bibliometric analysis. METHODS: Publications on knee revision included in the database of Web of Science Core Collection (WoSCC) between 2000 and 2018 were reviewed and MeSH terms of them were extracted from PubMed. Online bibliometric analysis website (http://bibliometric.com/), two pieces of software called "CiteSpace" and "Bibliographic Item Co-Occurrence Matrix Builder" (BICOMB) were used to analyze the publications reviewed at quantitative level. Another piece of software called "gCLUTO", was used to investigate the hot spots with visualization techniques at qualitative level. RESULTS: A total of 906 publications were retrieved between 2000 and 2018. There is an increasing number of publications, from 15 in 2000 to 86 in 2018. Journal of Arthroplasty is the leading journal which has the most publications on knee revision. The United States has been the biggest contributor. Mayo Clinic became the leader among the institutions which have conducted correlational researches. David G. Lewallen, Robert L. Barrack and Michael A. Mont should be regarded as the scholars who have made outstanding contribution. Hot spots were summed up in six clusters, respectively, the solutions for infection, prostheses, the adverse effects, the surgical techniques, epidemiological characters, and the pathophysiology of the revision knee. CONCLUSIONS: We found a growing trend in knee revision research and extracted the most contributive researchers, institutions, countries, journals, and most-cited articles worldwide. The solutions for complications, surgical applications and analysis for epidemiological characters have been the hot spots. Multi-disciplinary integration is becoming the time-trend of hot spots. Minimally invasive and navigation are directions of revision surgery. They together constitute a solid foundation and set up a fingerpost for the future scientific research and clinical treatment.

7.
Small ; 16(36): e2000272, 2020 09.
Artículo en Inglés | MEDLINE | ID: mdl-32347014

RESUMEN

There is an urgent need for reliable toxicity assays to support the human health risk assessment of an ever increasing number of engineered nanomaterials (ENMs). Animal testing is not a suitable option for ENMs. Sensitive in vitro models and mechanism-based targeted in vitro assays that enable accurate prediction of in vivo responses are not yet available. In this proof-of-principle study, publicly available mouse lung transcriptomics data from studies investigating xenobiotic-induced lung diseases are used and a 17-gene biomarker panel (PFS17) applicable to the assessment of lung fibrosis is developed. The PFS17 is validated using a limited number of in vivo mouse lung transcriptomics datasets from studies investigating ENM-induced responses. In addition, an ex vivo precision-cut lung slice (PCLS) model is optimized for screening of potentially inflammogenic and pro-fibrotic ENMs. Using bleomycin and a multiwalled carbon nanotube, the practical application of the PCLS method as a sensitive alternative to whole animal tests to screen ENMs that may potentially induce inhalation toxicity is shown. Conditional to further optimization and validation, it is established that a combination of PFS17 and the ex vivo PCLS method will serve as a robust and sensitive approach to assess lung inflammation and fibrosis induced by ENMs.


Asunto(s)
Biomarcadores , Perfilación de la Expresión Génica , Nanoestructuras , Fibrosis Pulmonar , Toxicología , Animales , Biomarcadores/análisis , Perfilación de la Expresión Génica/métodos , Perfilación de la Expresión Génica/normas , Pulmón/efectos de los fármacos , Pulmón/fisiopatología , Ratones , Nanoestructuras/toxicidad , Fibrosis Pulmonar/inducido químicamente , Fibrosis Pulmonar/diagnóstico , Fibrosis Pulmonar/genética , Fibrosis Pulmonar/fisiopatología , Toxicología/métodos , Toxicología/tendencias , Transcriptoma
8.
Plant Biotechnol J ; 17(3): 580-593, 2019 03.
Artículo en Inglés | MEDLINE | ID: mdl-30133139

RESUMEN

Cell wall recalcitrance is the major challenge to improving saccharification efficiency in converting lignocellulose into biofuels. However, information regarding the transcriptional regulation of secondary cell wall biogenesis remains poor in switchgrass (Panicum virgatum), which has been selected as a biofuel crop in the United States. In this study, we present a combination of computational and experimental approaches to develop gene regulatory networks for lignin formation in switchgrass. To screen transcription factors (TFs) involved in lignin biosynthesis, we developed a modified method to perform co-expression network analysis using 14 lignin biosynthesis genes as bait (target) genes. The switchgrass lignin co-expression network was further extended by adding 14 TFs identified in this study, and seven TFs identified in previous studies, as bait genes. Six TFs (PvMYB58/63, PvMYB42/85, PvMYB4, PvWRKY12, PvSND2 and PvSWN2) were targeted to generate overexpressing and/or down-regulated transgenic switchgrass lines. The alteration of lignin content, cell wall composition and/or plant growth in the transgenic plants supported the role of the TFs in controlling secondary wall formation. RNA-seq analysis of four of the transgenic switchgrass lines revealed downstream target genes of the secondary wall-related TFs and crosstalk with other biological pathways. In vitro transactivation assays further confirmed the regulation of specific lignin pathway genes by four of the TFs. Our meta-analysis provides a hierarchical network of TFs and their potential target genes for future manipulation of secondary cell wall formation for lignin modification in switchgrass.


Asunto(s)
Redes Reguladoras de Genes/genética , Genes de Plantas/genética , Lignina/biosíntesis , Panicum/genética , Regulación de la Expresión Génica de las Plantas/genética , Genoma de Planta/genética , Panicum/metabolismo , Plantas Modificadas Genéticamente , Regiones Promotoras Genéticas/genética , Factores de Transcripción/genética
9.
Front Big Data ; 2: 27, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-33693350

RESUMEN

We consider multi-response and multi-task regression models, where the parameter matrix to be estimated is expected to have an unknown grouping structure. The groupings can be along tasks, or features, or both, the last one indicating a bi-cluster or "checkerboard" structure. Discovering this grouping structure along with parameter inference makes sense in several applications, such as multi-response Genome-Wide Association Studies (GWAS). By inferring this additional structure we can obtain valuable information on the underlying data mechanisms (e.g., relationships among genotypes and phenotypes in GWAS). In this paper, we propose two formulations to simultaneously learn the parameter matrix and its group structures, based on convex regularization penalties. We present optimization approaches to solve the resulting problems and provide numerical convergence guarantees. Extensive experiments demonstrate much better clustering quality compared to other methods, and our approaches are also validated on real datasets concerning phenotypes and genotypes of plant varieties.

10.
Methods ; 151: 12-20, 2018 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-29438828

RESUMEN

Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites. Here, we propose the use of a non-negative matrix factorization driven bi-clustering strategy for metabolic phenotyping data in order to discover subsets of interrelated metabolites that exhibit similar behaviour across subsets of samples. The proposed strategy incorporates bi-cross validation and statistical segmentation techniques to automatically determine the number and structure of bi-clusters. This alternative approach is in contrast to the widely used conventional clustering approaches that incorporate all molecular peaks for clustering in metabolic studies and require a priori specification of the number of clusters. We perform the comparative analysis of the proposed strategy with other bi-clustering approaches, which were developed in the context of genomics and transcriptomics research. We demonstrate the superior performance of the proposed bi-clustering strategy on both simulated (NMR) and real (MS) bacterial metabolic data.


Asunto(s)
Metaboloma , Metabolómica/métodos , Algoritmos , Análisis por Conglomerados , Perfilación de la Expresión Génica/métodos , Resonancia Magnética Nuclear Biomolecular
11.
Data Brief ; 15: 933-940, 2017 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-29159232

RESUMEN

This article contains data related to the research article 'Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials' (Williams and Halappanavar, 2015) [1]. The presence of diverse types of nanomaterials (NMs) in commerce has grown significantly in the past decade and as a result, human exposure to these materials in the environment is inevitable. The traditional toxicity testing approaches that are reliant on animals are both time- and cost- intensive; employing which, it is not possible to complete the challenging task of safety assessment of NMs currently on the market in a timely manner. Thus, there is an urgent need for comprehensive understanding of the biological behavior of NMs, and efficient toxicity screening tools that will enable the development of predictive toxicology paradigms suited to rapidly assessing the human health impacts of exposure to NMs. In an effort to predict the long term health impacts of acute exposure to NMs, in Williams and Halappanavar (2015) [1], we applied bi-clustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related bi-clusters showing similar gene expression profiles were identified. The identified bi-clusters were then used to conduct a gene set enrichment analysis on lung gene expression profiles derived from mice exposed to nano-titanium dioxide, carbon black or carbon nanotubes (nano-TiO2, CB and CNTs) to determine the disease significance of these data-driven gene sets. The results of the analysis correctly identified all NMs to be inflammogenic, and only CB and CNTs as potentially fibrogenic. Here, we elaborate on the details of the statistical methods and algorithms used to derive the disease relevant gene signatures. These details will enable other investigators to use the gene signature in future Gene Set Enrichment Analysis studies involving NMs or as features for clustering and classifying NMs of diverse properties.

12.
Methods Mol Biol ; 1375: 117-21, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-25971913

RESUMEN

The importance of semantic-based methods and algorithms for the analysis and management of biological data is growing for two main reasons. From a biological side, knowledge contained in ontologies is more and more accurate and complete, from a computational side, recent algorithms are using in a valuable way such knowledge. Here we focus on semantic-based management and analysis of protein interaction networks referring to all the approaches of analysis of protein-protein interaction data that uses knowledge encoded into biological ontologies. Semantic approaches for studying high-throughput data have been largely used in the past to mine genomic and expression data. Recently, the emergence of network approaches for investigating molecular machineries has stimulated in a parallel way the introduction of semantic-based techniques for analysis and management of network data. The application of these computational approaches to the study of microarray data can broad the application scenario of them and simultaneously can help the understanding of disease development and progress.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Genómica/métodos , Semántica , Análisis por Conglomerados , Minería de Datos/métodos , Bases de Datos Genéticas , Mapeo de Interacción de Proteínas/métodos
13.
Methods Mol Biol ; 1375: 91-103, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-26350227

RESUMEN

Mining microarray data to unearth interesting expression profile patterns for discovery of in silico biological knowledge is an emerging area of research in computational biology. A group of functionally related genes may have similar expression patterns under a set of conditions or at some time points. Biclustering is an important data mining tool that has been successfully used to analyze gene expression data for biologically significant cluster discovery. The purpose of this chapter is to introduce interesting patterns that may be observed in expression data and discuss the role of biclustering techniques in detecting interesting functional gene groups with similar expression patterns.


Asunto(s)
Análisis por Conglomerados , Biología Computacional/métodos , Perfilación de la Expresión Génica/métodos , Animales , Minería de Datos/métodos , Regulación de la Expresión Génica , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Reproducibilidad de los Resultados
14.
Bayesian Anal ; 8(4): 759-780, 2013 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26246865

RESUMEN

Histone modifications (HMs) play important roles in transcription through post-translational modifications. Combinations of HMs, known as chromatin signatures, encode specific messages for gene regulation. We therefore expect that inference on possible clustering of HMs and an annotation of genomic locations on the basis of such clustering can contribute new insights about the functions of regulatory elements and their relationships to combinations of HMs. We propose a nonparametric Bayesian local clustering Poisson model (NoB-LCP) to facilitate posterior inference on two-dimensional clustering of HMs and genomic locations. The NoB-LCP clusters HMs into HM sets and lets each HM set define its own clustering of genomic locations. Furthermore, it probabilistically excludes HMs and genomic locations that are irrelevant to clustering. By doing so, the proposed model effectively identifies important sets of HMs and groups regulatory elements with similar functionality based on HM patterns.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA