Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35914952

RESUMO

Low complexity regions are fragments of protein sequences composed of only a few types of amino acids. These regions frequently occur in proteins and can play an important role in their functions. However, scientists are mainly focused on regions characterized by high diversity of amino acid composition. Similarity between regions of protein sequences frequently reflect functional similarity between them. In this article, we discuss strengths and weaknesses of the similarity analysis of low complexity regions using BLAST, HHblits and CD-HIT. These methods are considered to be the gold standard in protein similarity analysis and were designed for comparison of high complexity regions. However, we lack specialized methods that could be used to compare the similarity of low complexity regions. Therefore, we investigated the existing methods in order to understand how they can be applied to compare such regions. Our results are supported by exploratory study, discussion of amino acid composition and biological roles of selected examples. We show that existing methods need improvements to efficiently search for similar low complexity regions. We suggest features that have to be re-designed specifically for comparing low complexity regions: scoring matrix, multiple sequence alignment, e-value, local alignment and clustering based on a set of representative sequences. Results of this analysis can either be used to improve existing methods or to create new methods for the similarity analysis of low complexity regions.


Assuntos
Aminoácidos , Proteínas , Algoritmos , Sequência de Aminoácidos , Aminoácidos/genética , Análise por Conglomerados , Proteínas/química , Proteínas/genética , Alinhamento de Sequência , Análise de Sequência de Proteína/métodos
2.
Bioinformatics ; 38(6): 1773-1775, 2022 03 04.
Artigo em Inglês | MEDLINE | ID: mdl-34954788

RESUMO

SUMMARY: Patient multi-omics datasets are often characterized by a high dimensionality; however, usually only a small fraction of the features is informative, that is change in their value is directly related to the disease outcome or patient survival. In medical sciences, in addition to a robust feature selection procedure, the ability to discover human-readable patterns in the analyzed data is also desirable. To address this need, we created MAINE-Multi-omics Analysis and Exploration. The unique functionality of MAINE is the ability to discover multidimensional dependencies between the selected multi-omics features and event outcome prediction as well as patient survival probability. Learned patterns are visualized in the form of interpretable decision/survival trees and rules. AVAILABILITY AND IMPLEMENTATION: MAINE is freely available at maine.ibemag.pl as an online web application. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Multiômica , Software , Humanos , Maine
3.
Brief Bioinform ; 21(2): 458-472, 2020 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-30698641

RESUMO

There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, and more generally the overlaps between different properties related to LCRs, using examples. We argue that statistical measures alone cannot capture all structural aspects of LCRs and recommend the combined usage of a variety of predictive tools and measurements. While the methodologies available to study LCRs are already very advanced, we foresee that a more comprehensive annotation of sequences in the databases will enable the improvement of predictions and a better understanding of the evolution and the connection between structure and function of LCRs. This will require the use of standards for the generation and exchange of data describing all aspects of LCRs. SHORT ABSTRACT: There are multiple definitions for low complexity regions (LCRs) in protein sequences. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, plus overlaps between different properties related to LCRs, using examples.


Assuntos
Proteínas/química , Algoritmos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Evolução Molecular , Conformação Proteica , Domínios Proteicos
4.
Nucleic Acids Res ; 48(W1): W77-W84, 2020 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-32421769

RESUMO

Low complexity regions (LCRs) in protein sequences are characterized by a less diverse amino acid composition compared to typically observed sequence diversity. Recent studies have shown that LCRs may co-occur with intrinsically disordered regions, are highly conserved in many organisms, and often play important roles in protein functions and in diseases. In previous decades, several methods have been developed to identify regions with LCRs or amino acid bias, but most of them as stand-alone applications and currently there is no web-based tool which allows users to explore LCRs in protein sequences with additional functional annotations. We aim to fill this gap by providing PlaToLoCo - PLAtform of TOols for LOw COmplexity-a meta-server that integrates and collects the output of five different state-of-the-art tools for discovering LCRs and provides functional annotations such as domain detection, transmembrane segment prediction, and calculation of amino acid frequencies. In addition, the union or intersection of the results of the search on a query sequence can be obtained. By developing the PlaToLoCo meta-server, we provide the community with a fast and easily accessible tool for the analysis of LCRs with additional information included to aid the interpretation of the results. The PlaToLoCo platform is available at: http://platoloco.aei.polsl.pl/.


Assuntos
Proteínas/química , Software , Aminoácidos/análise , Gráficos por Computador , Humanos , Proteínas de Membrana/química , Anotação de Sequência Molecular , Domínios Proteicos , Análise de Sequência de Proteína
5.
BMC Bioinformatics ; 22(1): 182, 2021 Apr 08.
Artigo em Inglês | MEDLINE | ID: mdl-33832440

RESUMO

BACKGROUND: The rapid spread of the COVID-19 demands immediate response from the scientific communities. Appropriate countermeasures mean thoughtful and educated choice of viral targets (epitopes). There are several articles that discuss such choices in the SARS-CoV-2 proteome, other focus on phylogenetic traits and history of the Coronaviridae genome/proteome. However none consider viral protein low complexity regions (LCRs). Recently we created the first methods that are able to compare such fragments. RESULTS: We show that five low complexity regions (LCRs) in three proteins (nsp3, S and N) encoded by the SARS-CoV-2 genome are highly similar to regions from human proteome. As many as 21 predicted T-cell epitopes and 27 predicted B-cell epitopes overlap with the five SARS-CoV-2 LCRs similar to human proteins. Interestingly, replication proteins encoded in the central part of viral RNA are devoid of LCRs. CONCLUSIONS: Similarity of SARS-CoV-2 LCRs to human proteins may have implications on the ability of the virus to counteract immune defenses. The vaccine targeted LCRs may potentially be ineffective or alternatively lead to autoimmune diseases development. These findings are crucial to the process of selection of new epitopes for drugs or vaccines which should omit such regions.


Assuntos
Proteoma , SARS-CoV-2/genética , Homologia de Sequência , Vacinas contra COVID-19 , Proteínas do Nucleocapsídeo de Coronavírus/imunologia , Epitopos de Linfócito B/imunologia , Epitopos de Linfócito T/imunologia , Humanos , Fosfoproteínas/imunologia , Filogenia , RNA Polimerase Dependente de RNA/imunologia , Fatores de Risco , Glicoproteína da Espícula de Coronavírus/imunologia , Proteínas não Estruturais Virais/imunologia
6.
Nucleic Acids Res ; 47(21): 10994-11006, 2019 12 02.
Artigo em Inglês | MEDLINE | ID: mdl-31584084

RESUMO

The widespread occurrence of repetitive stretches of DNA in genomes of organisms across the tree of life imposes fundamental challenges for sequencing, genome assembly, and automated annotation of genes and proteins. This multi-level problem can lead to errors in genome and protein databases that are often not recognized or acknowledged. As a consequence, end users working with sequences with repetitive regions are faced with 'ready-to-use' deposited data whose trustworthiness is difficult to determine, let alone to quantify. Here, we provide a review of the problems associated with tandem repeat sequences that originate from different stages during the sequencing-assembly-annotation-deposition workflow, and that may proliferate in public database repositories affecting all downstream analyses. As a case study, we provide examples of the Atlantic cod genome, whose sequencing and assembly were hindered by a particularly high prevalence of tandem repeats. We complement this case study with examples from other species, where mis-annotations and sequencing errors have propagated into protein databases. With this review, we aim to raise the awareness level within the community of database users, and alert scientists working in the underlying workflow of database creation that the data they omit or improperly assemble may well contain important biological information valuable to others.


Assuntos
DNA/genética , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Erro Científico Experimental , Sequências de Repetição em Tandem/genética , Animais , Gadus morhua/genética , Análise de Sequência de DNA
7.
J Am Chem Soc ; 141(42): 16817-16828, 2019 10 23.
Artigo em Inglês | MEDLINE | ID: mdl-31550880

RESUMO

Electrostatic interactions play important roles in the functional mechanisms exploited by intrinsically disordered proteins (IDPs). The atomic resolution description of long-range and local structural propensities that can both be crucial for the function of highly charged IDPs presents significant experimental challenges. Here, we investigate the conformational behavior of the δ subunit of RNA polymerase from Bacillus subtilis whose unfolded domain is highly charged, with 7 positively charged amino acids followed by 51 acidic amino acids. Using a specifically designed analytical strategy, we identify transient contacts between the two regions using a combination of NMR paramagnetic relaxation enhancements, residual dipolar couplings (RDCs), chemical shifts, and small-angle scattering. This strategy allows the resolution of long-range and local ensemble averaged structural contributions to the experimental RDCs, and reveals that the negatively charged segment folds back onto the positively charged strand, compacting the conformational sampling of the protein while remaining highly flexible in solution. Mutation of the positively charged region abrogates the long-range contact, leaving the disordered domain in an extended conformation, possibly due to local repulsion of like-charges along the chain. Remarkably, in vitro studies show that this mutation also has a significant effect on transcription activity, and results in diminished cell fitness of the mutated bacteria in vivo. This study highlights the importance of accurately describing electrostatic interactions for understanding the functional mechanisms of IDPs.


Assuntos
Bacillus subtilis/enzimologia , RNA Polimerases Dirigidas por DNA/química , RNA Polimerases Dirigidas por DNA/metabolismo , Subunidades Proteicas/química , Subunidades Proteicas/metabolismo , Eletricidade Estática , Sequência de Aminoácidos , Modelos Moleculares , Conformação Proteica
8.
Mol Reprod Dev ; 83(2): 144-8, 2016 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-26660717

RESUMO

Glyceraldehyde-3-phosphate dehydrogenase from human sperm (GAPDHS) provides energy to the sperm flagellum, and is therefore essential for sperm motility and male fertility. This isoform is distinct from somatic GAPDH, not only in being specific for the testis but also because it contains an additional amino-terminal region that encodes a proline-rich motif that is known to bind to the fibrous sheath of the sperm tail. By conducting a large-scale sequence comparison on low-complexity sequences available in databases, we identified a strong similarity between the proline-rich motif from GAPDHS and the proline-rich sequence from Ena/vasodilator-stimulated phosphoprotein-like (EVL), which is known to bind an SH3 domain of dynamin-binding protein (DNMBP). The putative binding partners of the proline-rich GAPDHS motif include SH3 domain-binding protein 4 (SH3BP4) and the IL2-inducible T-cell kinase/tyrosine-protein kinase ITK/TSK (ITK). This result implies that GAPDHS participates in specific signal-transduction pathways. Gene Ontology category-enrichment analysis showed several functional classes shared by both proteins, of which the most interesting ones are related to signal transduction and regulation of hydrolysis. Furthermore, a mutation of one EVL proline to leucine is known to cause colorectal cancer, suggesting that mutation of homologous amino acid residue in the GAPDHS motif may be functionally deleterious.


Assuntos
Gliceraldeído-3-Fosfato Desidrogenase (Fosforiladora) , Mutação de Sentido Incorreto , Cauda do Espermatozoide/enzimologia , Proteínas Adaptadoras de Transdução de Sinal/genética , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Substituição de Aminoácidos , Moléculas de Adesão Celular/genética , Moléculas de Adesão Celular/metabolismo , Gliceraldeído-3-Fosfato Desidrogenase (Fosforiladora)/genética , Gliceraldeído-3-Fosfato Desidrogenase (Fosforiladora)/metabolismo , Humanos , Leucina/genética , Leucina/metabolismo , Masculino , Prolina/genética , Prolina/metabolismo , Proteínas Tirosina Quinases/genética , Proteínas Tirosina Quinases/metabolismo , Transdução de Sinais , Domínios de Homologia de src/genética
9.
Molecules ; 19(11): 18558-73, 2014 Nov 13.
Artigo em Inglês | MEDLINE | ID: mdl-25401399

RESUMO

The need to find new EGFR inhibitors for use in combination with radiotherapy in the treatment of solid tumors has drawn our attention to compounds derived from genistein, a natural isoflavonoid. The antiproliferative potential of synthetic genistein derivatives used alone or in combination with ionizing radiation was evaluated in cancer cell lines using clonogenic assay. EGFR phosphorylation was assessed with western blotting. Genistein derivatives inhibited clonogenic growth of HCT 116 cancer cells additively or synergistically when used in combination with ionizing radiation, and decreased EGFR activation. Our preclinical evaluation of genistein-derived EGFR inhibitors suggests that these compounds are much more potent sensitizers of cells to radiation than the parent isoflavonoid, genistein and indicate that these compounds may be useful in the treatment of colon cancer with radiation therapy.


Assuntos
Neoplasias do Colo/terapia , Receptores ErbB/metabolismo , Genisteína/farmacologia , Glicosídeos/farmacologia , Proteínas de Neoplasias/metabolismo , Radiossensibilizantes/farmacologia , Anticarcinógenos/síntese química , Anticarcinógenos/química , Anticarcinógenos/farmacologia , Linhagem Celular Tumoral , Neoplasias do Colo/metabolismo , Neoplasias do Colo/patologia , Relação Dose-Resposta a Droga , Relação Dose-Resposta à Radiação , Genisteína/síntese química , Genisteína/química , Glicosídeos/síntese química , Glicosídeos/química , Humanos , Fosforilação/efeitos dos fármacos , Fosforilação/efeitos da radiação , Radiação Ionizante , Radiossensibilizantes/síntese química , Radiossensibilizantes/química
10.
Bioinform Adv ; 4(1): vbad187, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38239846

RESUMO

The Polish Bioinformatic Society (PTBI) Symposium convenes annually at leading Polish Universities, and in 2023, the Silesian University of Technology hosted participants from all over the world. The 15th PTBI Symposium, spanning a 3-day duration and divided into four scientific sessions, gathered around 100 participants and centered on research related to machine learning in biomedicine, RNA structure algorithms, next-generation sequencing methods, and microbiome analysis but was not limited to only those topics. The meeting also recognized outstanding research conducted by young scientists by awarding the best poster and best talk. Finally, the awards for the best PhD, MSc, and BSc thesis in bioinformatics defended in Poland were given. This report summarizes the key highlights and outcomes of the meeting.

11.
Nucleic Acids Res ; 39(Web Server issue): W293-301, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21715384

RESUMO

Genome-wide expression profiles obtained with the use of DNA microarray technology provide abundance of experimental data on biological and molecular processes. Such amount of data need to be further analyzed and interpreted in order to obtain biological conclusions on the basis of experimental results. The analysis requires a lot of experience and is usually time-consuming process. Thus, frequently various annotation databases are used to improve the whole process of analysis. Here, we present RuleGO--the web-based application that allows the user to describe gene groups on the basis of logical rules that include Gene Ontology (GO) terms in their premises. Presented application allows obtaining rules that reflect coappearance of GO-terms describing genes supported by the rules. The ontology level and number of coappearing GO-terms is adjusted in automatic manner. The user limits the space of possible solutions only. The RuleGO application is freely available at http://rulego.polsl.pl/.


Assuntos
Perfilação da Expressão Gênica , Software , Vocabulário Controlado , Algoritmos , Genes , Anotação de Sequência Molecular
12.
Front Microbiol ; 14: 1257002, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37808321

RESUMO

The rapid development of machine learning (ML) techniques has opened up the data-dense field of microbiome research for novel therapeutic, diagnostic, and prognostic applications targeting a wide range of disorders, which could substantially improve healthcare practices in the era of precision medicine. However, several challenges must be addressed to exploit the benefits of ML in this field fully. In particular, there is a need to establish "gold standard" protocols for conducting ML analysis experiments and improve interactions between microbiome researchers and ML experts. The Machine Learning Techniques in Human Microbiome Studies (ML4Microbiome) COST Action CA18131 is a European network established in 2019 to promote collaboration between discovery-oriented microbiome researchers and data-driven ML experts to optimize and standardize ML approaches for microbiome analysis. This perspective paper presents the key achievements of ML4Microbiome, which include identifying predictive and discriminatory 'omics' features, improving repeatability and comparability, developing automation procedures, and defining priority areas for the novel development of ML methods targeting the microbiome. The insights gained from ML4Microbiome will help to maximize the potential of ML in microbiome research and pave the way for new and improved healthcare practices.

13.
Front Mol Biosci ; 9: 828674, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35359602

RESUMO

Deficiency in a principal epidermal barrier protein, filaggrin (FLG), is associated with multiple allergic manifestations, including atopic dermatitis and contact allergy to nickel. Toxicity caused by dermal and respiratory exposures of the general population to nickel-containing objects and particles is a deleterious side effect of modern technologies. Its molecular mechanism may include the peptide bond hydrolysis in X1-S/T-c/p-H-c-X2 motifs by released Ni2+ ions. The goal of the study was to analyse the distribution of such cleavable motifs in the human proteome and examine FLG vulnerability of nickel hydrolysis. We performed a general bioinformatic study followed by biochemical and biological analysis of a single case, the FLG protein. FLG model peptides, the recombinant monomer domain human keratinocytes in vitro and human epidermis ex vivo were used. We also investigated if the products of filaggrin Ni2+-hydrolysis affect the activation profile of Langerhans cells. We found X1-S/T-c/p-H-c-X2 motifs in 40% of human proteins, with the highest abundance in those involved in the epidermal barrier function, including FLG. We confirmed the hydrolytic vulnerability and pH-dependent Ni2+-assisted cleavage of FLG-derived peptides and FLG monomer, using in vitro cell culture and ex-vivo epidermal sheets; the hydrolysis contributed to the pronounced reduction in FLG in all of the models studied. We also postulated that Ni-hydrolysis might dysregulate important immune responses. Ni2+-assisted cleavage of barrier proteins, including FLG, may contribute to clinical disease associated with nickel exposure.

15.
Bioorg Med Chem ; 19(1): 295-305, 2011 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-21129977

RESUMO

This paper describes the synthesis and antiproliferative activity of conjugates of genistein (1) and unsaturated pyranosides. Constructs linking genistein with a sugar moiety through an alkyl chain were obtained in a two-step synthesis: in a first step genistein was converted into an intermediate bearing an ω-hydroxyalkyl substituent, containing two, three or five carbon atoms, at position 7, while the second step involved Ferrier glycosylation reaction, employing glycals. Antiproliferative activity of several genistein derivatives was tested in cancer cell lines in vitro. The most potent derivative, Ram-3 inhibited the cell cycle, interacted with mitotic spindles and caused apoptotic cell death. Neither genistein nor the sugar alone were able to influence the mitotic spindle organization. Our results indicate, that conjugation of genistein with certain sugars may render the interaction of derivatives with new molecular targets.


Assuntos
Proliferação de Células/efeitos dos fármacos , Genisteína/farmacologia , Mitose/efeitos dos fármacos , Linhagem Celular Tumoral , Eletroforese em Gel de Ágar , Genisteína/síntese química , Humanos , Marcação In Situ das Extremidades Cortadas , Espectroscopia de Ressonância Magnética , Espectrometria de Massas por Ionização por Electrospray
16.
Sci Rep ; 11(1): 13580, 2021 06 30.
Artigo em Inglês | MEDLINE | ID: mdl-34193945

RESUMO

In the DECODE project, data were collected from 3,114 surveys filled by symptomatic patients RT-qPCR tested for SARS-CoV-2 in a single university centre in March-September 2020. The population demonstrated balanced sex and age with 759 SARS-CoV-2( +) patients. The most discriminative symptoms in SARS-CoV-2( +) patients at early infection stage were loss of taste/smell (OR = 3.33, p < 0.0001), body temperature above 38℃ (OR = 1.67, p < 0.0001), muscle aches (OR = 1.30, p = 0.0242), headache (OR = 1.27, p = 0.0405), cough (OR = 1.26, p = 0.0477). Dyspnea was more often reported among SARS-CoV-2(-) (OR = 0.55, p < 0.0001). Cough and dyspnea were 3.5 times more frequent among SARS-CoV-2(-) (OR = 0.28, p < 0.0001). Co-occurrence of cough, muscle aches, headache, loss of taste/smell (OR = 4.72, p = 0.0015) appeared significant, although co-occurrence of two symptoms only, cough and loss of smell or taste, means OR = 2.49 (p < 0.0001). Temperature > 38℃ with cough was most frequent in men (20%), while loss of taste/smell with cough in women (17%). For younger people, taste/smell impairment is sufficient to characterise infection, whereas in older patients co-occurrence of fever and cough is necessary. The presented study objectifies the single symptoms and interactions significance in COVID-19 diagnoses and demonstrates diverse symptomatology in patient groups.


Assuntos
COVID-19/diagnóstico , COVID-19/epidemiologia , Infecções Respiratórias/diagnóstico , Infecções Respiratórias/epidemiologia , SARS-CoV-2 , Avaliação de Sintomas/estatística & dados numéricos , Centros Médicos Acadêmicos/estatística & dados numéricos , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Ageusia/etiologia , COVID-19/complicações , Criança , Pré-Escolar , Tosse/etiologia , Diagnóstico Diferencial , Dispneia/etiologia , Feminino , Febre/etiologia , Cefaleia/etiologia , Humanos , Lactente , Masculino , Pessoa de Meia-Idade , Razão de Chances , Transtornos do Olfato/etiologia , Projetos Piloto , Polônia/epidemiologia , Infecções Respiratórias/complicações , Infecções Respiratórias/microbiologia , Inquéritos e Questionários , Avaliação de Sintomas/classificação , Adulto Jovem
17.
Mol Cancer Ther ; 20(5): 775-786, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33649105

RESUMO

The development of antimetastatic drugs is an urgent healthcare priority for patients with cancer, because metastasis is thought to account for around 90% of cancer deaths. Current antimetastatic treatment options are limited and often associated with poor long-term survival and systemic toxicities. Bcl3, a facilitator protein of the NF-κB family, is associated with poor prognosis in a range of tumor types. Bcl3 has been directly implicated in the metastasis of tumor cells, yet is well tolerated when constitutively deleted in murine models, making it a promising therapeutic target. Here, we describe the identification and characterization of the first small-molecule Bcl3 inhibitor, by using a virtual drug design and screening approach against a computational model of the Bcl3-NF-kB1(p50) protein-protein interaction. From selected virtual screening hits, one compound (JS6) showed potent intracellular Bcl3-inhibitory activity. JS6 treatment led to reductions in Bcl3-NF-kB1 binding, tumor colony formation, and cancer cell migration in vitro; and tumor stasis and antimetastatic activity in vivo, while being devoid of overt systemic toxicity. These results represent a successful application of in silico screening in the identification of protein-protein inhibitors for novel intracellular targets, and confirm Bcl3 as a potential antimetastatic target.


Assuntos
Proteína 3 do Linfoma de Células B/antagonistas & inibidores , Neoplasias/tratamento farmacológico , Animais , Linhagem Celular Tumoral , Humanos , Camundongos , Camundongos Nus , Modelos Moleculares
18.
Front Microbiol ; 12: 634511, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33737920

RESUMO

The number of microbiome-related studies has notably increased the availability of data on human microbiome composition and function. These studies provide the essential material to deeply explore host-microbiome associations and their relation to the development and progression of various complex diseases. Improved data-analytical tools are needed to exploit all information from these biological datasets, taking into account the peculiarities of microbiome data, i.e., compositional, heterogeneous and sparse nature of these datasets. The possibility of predicting host-phenotypes based on taxonomy-informed feature selection to establish an association between microbiome and predict disease states is beneficial for personalized medicine. In this regard, machine learning (ML) provides new insights into the development of models that can be used to predict outputs, such as classification and prediction in microbiology, infer host phenotypes to predict diseases and use microbial communities to stratify patients by their characterization of state-specific microbial signatures. Here we review the state-of-the-art ML methods and respective software applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on the application of ML in microbiome studies related to association and clinical use for diagnostics, prognostics, and therapeutics. Although the data presented here is more related to the bacterial community, many algorithms could be applied in general, regardless of the feature type. This literature and software review covering this broad topic is aligned with the scoping review methodology. The manual identification of data sources has been complemented with: (1) automated publication search through digital libraries of the three major publishers using natural language processing (NLP) Toolkit, and (2) an automated identification of relevant software repositories on GitHub and ranking of the related research papers relying on learning to rank approach.

19.
F1000Res ; 9: 1398, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33604028

RESUMO

Today, academic researchers benefit from the changes driven by digital technologies and the enormous growth of knowledge and data, on globalisation, enlargement of the scientific community, and the linkage between different scientific communities and the society. To fully benefit from this development, however, information needs to be shared openly and transparently. Digitalisation plays a major role here because it permeates all areas of business, science and society and is one of the key drivers for innovation and international cooperation. To address the resulting opportunities, the EU promotes the development and use of collaborative ways to produce and share knowledge and data as early as possible in the research process, but also to appropriately secure results with the European strategy for Open Science (OS). It is now widely recognised that making research results more accessible to all societal actors contributes to more effective and efficient science; it also serves as a boost for innovation in the public and private sectors. However  for research data to be findable, accessible, interoperable and reusable the use of standards is essential. At the metadata level, considerable efforts in standardisation have already been made (e.g. Data Management Plan and FAIR Principle etc.), whereas in context with the raw data these fundamental efforts are still fragmented and in some cases completely missing. The CHARME consortium, funded by the European Cooperation in Science and Technology (COST) Agency, has identified needs and gaps in the field of standardisation in the life sciences and also discussed potential hurdles for implementation of standards in current practice. Here, the authors suggest four measures in response to current challenges to ensure a high quality of life science research data and their re-usability for research and innovation.


Assuntos
Disciplinas das Ciências Biológicas , Confiança , Cooperação Internacional , Metadados , Qualidade de Vida
20.
Bioorg Med Chem Lett ; 19(17): 4939-43, 2009 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-19660945

RESUMO

Genistein, due to its recognized chemopreventive and antitumor potential, is a molecule of interest as a lead compound in drug design. While multiple molecular targets for genistein have been identified, so far neither for this isoflavonoid nor for its natural or synthetic derivatives disruption of microtubules and mitotic spindles has been reported. Here we describe such properties of the synthetic glycosidic derivative of genistein significantly more cytotoxic than genistein, 7-O-(2,3,4,6-tetra-O-acetyl-beta-D-galactopyranosyl)-(1-->4)-(6-O-acetyl-hex-2-ene-alpha-D-erythro-pyranosyl)genistein, shortly named G21. We found that G21 causes significant mitotic delay, frequent appearance of multipolar spindles, and alteration of the interphase microtubule array.


Assuntos
Antineoplásicos/química , Dissacarídeos/química , Genisteína/análogos & derivados , Glicosídeos/química , Microtúbulos/efeitos dos fármacos , Moduladores de Tubulina/química , Antineoplásicos/síntese química , Antineoplásicos/toxicidade , Linhagem Celular Tumoral , Dissacarídeos/síntese química , Dissacarídeos/toxicidade , Genisteína/síntese química , Genisteína/química , Genisteína/toxicidade , Glicosídeos/síntese química , Glicosídeos/toxicidade , Humanos , Microtúbulos/metabolismo , Moduladores de Tubulina/síntese química , Moduladores de Tubulina/toxicidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA