Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 40
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Hum Mol Genet ; 33(13): 1152-1163, 2024 Jun 21.
Article in English | MEDLINE | ID: mdl-38558123

ABSTRACT

Neanderthal and Denisovan hybridisation with modern humans has generated a non-random genomic distribution of introgressed regions, the result of drift and selection dynamics. Cross-species genomic incompatibility and more efficient removal of slightly deleterious archaic variants have been proposed as selection-based processes involved in the post-hybridisation purge of archaic introgressed regions. Both scenarios require the presence of functionally different alleles across Homo species onto which selection operated differently according to which populations hosted them, but only a few of these variants have been pinpointed so far. In order to identify functionally divergent archaic variants removed in humans, we focused on mitonuclear genes, which are underrepresented in the genomic landscape of archaic humans. We searched for non-synonymous, fixed, archaic-derived variants present in mitonuclear genes, rare or absent in human populations. We then compared the functional impact of archaic and human variants in the model organism Saccharomyces cerevisiae. Notably, a variant within the mitochondrial tyrosyl-tRNA synthetase 2 (YARS2) gene exhibited a significant decrease in respiratory activity and a substantial reduction of Cox2 levels, a proxy for mitochondrial protein biosynthesis, coupled with the accumulation of the YARS2 protein precursor and a lower amount of mature enzyme. Our work suggests that this variant is associated with mitochondrial functionality impairment, thus contributing to the purging of archaic introgression in YARS2. While different molecular mechanisms may have impacted other mitonuclear genes, our approach can be extended to the functional screening of mitonuclear genetic variants present across species and populations.


Subject(s)
Neanderthals , Saccharomyces cerevisiae , Humans , Saccharomyces cerevisiae/genetics , Neanderthals/genetics , Animals , Genetic Variation , Mitochondria/genetics , Mitochondria/metabolism , Alleles , Genetic Introgression , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism
2.
Bioinformatics ; 40(5)2024 May 02.
Article in English | MEDLINE | ID: mdl-38754097

ABSTRACT

MOTIVATION: Mutational signatures are a critical component in deciphering the genetic alterations that underlie cancer development and have become a valuable resource to understand the genomic changes during tumorigenesis. Therefore, it is essential to employ precise and accurate methods for their extraction to ensure that the underlying patterns are reliably identified and can be effectively utilized in new strategies for diagnosis, prognosis, and treatment of cancer patients. RESULTS: We present MUSE-XAE, a novel method for mutational signature extraction from cancer genomes using an explainable autoencoder. Our approach employs a hybrid architecture consisting of a nonlinear encoder that can capture nonlinear interactions among features, and a linear decoder which ensures the interpretability of the active signatures. We evaluated and compared MUSE-XAE with other available tools on both synthetic and real cancer datasets and demonstrated that it achieves superior performance in terms of precision and sensitivity in recovering mutational signature profiles. MUSE-XAE extracts highly discriminative mutational signature profiles by enhancing the classification of primary tumour types and subtypes in real world settings. This approach could facilitate further research in this area, with neural networks playing a critical role in advancing our understanding of cancer genomics. AVAILABILITY AND IMPLEMENTATION: MUSE-XAE software is freely available at https://github.com/compbiomed-unito/MUSE-XAE.


Subject(s)
Mutation , Neoplasms , Humans , Neoplasms/genetics , Algorithms , Software , Genomics/methods , Computational Biology/methods , Neural Networks, Computer
3.
Brief Bioinform ; 23(2)2022 03 10.
Article in English | MEDLINE | ID: mdl-35021190

ABSTRACT

Predicting the difference in thermodynamic stability between protein variants is crucial for protein design and understanding the genotype-phenotype relationships. So far, several computational tools have been created to address this task. Nevertheless, most of them have been trained or optimized on the same and 'all' available data, making a fair comparison unfeasible. Here, we introduce a novel dataset, collected and manually cleaned from the latest version of the ThermoMutDB database, consisting of 669 variants not included in the most widely used training datasets. The prediction performance and the ability to satisfy the antisymmetry property by considering both direct and reverse variants were evaluated across 21 different tools. The Pearson correlations of the tested tools were in the ranges of 0.21-0.5 and 0-0.45 for the direct and reverse variants, respectively. When both direct and reverse variants are considered, the antisymmetric methods perform better achieving a Pearson correlation in the range of 0.51-0.62. The tested methods seem relatively insensitive to the physiological conditions, performing well also on the variants measured with more extreme pH and temperature values. A common issue with all the tested methods is the compression of the $\Delta \Delta G$ predictions toward zero. Furthermore, the thermodynamic stability of the most significantly stabilizing variants was found to be more challenging to predict. This study is the most extensive comparisons of prediction methods using an entirely novel set of variants never tested before.


Subject(s)
Point Mutation , Proteins , Mutation , Protein Stability , Proteins/chemistry , Thermodynamics
4.
PLoS Comput Biol ; 19(9): e1011474, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37721960

ABSTRACT

Genetic markers (especially short tandem repeats or STRs) located on the X chromosome are a valuable resource to solve complex kinship cases in forensic genetics in addition or alternatively to autosomal STRs. Groups of tightly linked markers are combined into haplotypes, thus increasing the discriminating power of tests. However, this approach requires precise knowledge of the recombination rates between adjacent markers. The International Society of Forensic Genetics recommends that recombination rate estimation on the X chromosome is performed from pedigree genetic data while taking into account the confounding effect of mutations. However, implementations that satisfy these requirements have several drawbacks: they were never publicly released, they are very slow and/or need cluster-level hardware and strong computational expertise to use. In order to address these key concerns we developed Recombulator-X, a new open-source Python tool. The most challenging issue, namely the running time, was addressed with dynamic programming techniques to greatly reduce the computational complexity of the algorithm. Compared to the previous methods, Recombulator-X reduces the estimation times from weeks or months to less than one hour for typical datasets. Moreover, the estimation process, including preprocessing, has been streamlined and packaged into a simple command-line tool that can be run on a normal PC. Where previous approaches were limited to small panels of STR markers (up to 15), our tool can handle greater numbers (up to 100) of mixed STR and non-STR markers. In conclusion, Recombulator-X makes the estimation process much simpler, faster and accessible to researchers without a computational background, hopefully spurring increased adoption of best practices.

5.
Nucleic Acids Res ; 50(W1): W222-W227, 2022 07 05.
Article in English | MEDLINE | ID: mdl-35524565

ABSTRACT

Estimating the functional effect of single amino acid variants in proteins is fundamental for predicting the change in the thermodynamic stability, measured as the difference in the Gibbs free energy of unfolding, between the wild-type and the variant protein (ΔΔG). Here, we present the web-server of the DDGun method, which was previously developed for the ΔΔG prediction upon amino acid variants. DDGun is an untrained method based on basic features derived from evolutionary information. It is antisymmetric, as it predicts opposite ΔΔG values for direct (A → B) and reverse (B → A) single and multiple site variants. DDGun is available in two versions, one based on only sequence information and the other one based on sequence and structure information. Despite being untrained, DDGun reaches prediction performances comparable to those of trained methods. Here we make DDGun available as a web server. For the web server version, we updated the protein sequence database used for the computation of the evolutionary features, and we compiled two new data sets of protein variants to do a blind test of its performances. On these blind data sets of single and multiple site variants, DDGun confirms its prediction performance, reaching an average correlation coefficient between experimental and predicted ΔΔG of 0.45 and 0.49 for the sequence-based and structure-based versions, respectively. Besides being used for the prediction of ΔΔG, we suggest that DDGun should be adopted as a benchmark method to assess the predictive capabilities of newly developed methods. Releasing DDGun as a web-server, stand-alone program and docker image will facilitate the necessary process of method comparison to improve ΔΔG prediction.


Subject(s)
Amino Acids , Protein Stability , Proteins , Amino Acids/genetics , Computers , Databases, Protein , Proteins/genetics , Proteins/chemistry
6.
Clin Gastroenterol Hepatol ; 21(13): 3314-3321.e3, 2023 12.
Article in English | MEDLINE | ID: mdl-37149016

ABSTRACT

BACKGROUND AND AIMS: Nonalcoholic fatty liver disease (NAFLD) is a complex disease, resulting from the interplay between environmental determinants and genetic variations. Single nucleotide polymorphism rs738409 C>G in the PNPLA3 gene is associated with hepatic fibrosis and with higher risk of developing hepatocellular carcinoma. Here, we analyzed a longitudinal cohort of biopsy-proven NAFLD subjects with the aim to identify individuals in whom genetics may have a stronger impact on disease progression. METHODS: We retrospectively analyzed 756 consecutive, prospectively enrolled biopsy-proven NAFLD subjects from Italy, United Kingdom, and Spain who were followed for a median of 84 months (interquartile range, 65-109 months). We stratified the study cohort according to sex, body mass index (BMI)

Subject(s)
Carcinoma, Hepatocellular , Esophageal and Gastric Varices , Liver Neoplasms , Non-alcoholic Fatty Liver Disease , Humans , Female , Male , Middle Aged , Non-alcoholic Fatty Liver Disease/complications , Non-alcoholic Fatty Liver Disease/genetics , Non-alcoholic Fatty Liver Disease/epidemiology , Carcinoma, Hepatocellular/epidemiology , Carcinoma, Hepatocellular/genetics , Carcinoma, Hepatocellular/complications , Retrospective Studies , Esophageal and Gastric Varices/complications , Gastrointestinal Hemorrhage/complications , Genotype , Polymorphism, Single Nucleotide , Liver Neoplasms/epidemiology , Liver Neoplasms/genetics , Liver Neoplasms/complications , Genetic Predisposition to Disease
7.
Bioinformatics ; 38(9): 2617-2618, 2022 04 28.
Article in English | MEDLINE | ID: mdl-35199151

ABSTRACT

MOTIVATION: Many genomics applications require the computation of nucleotide coverage of a reference genome or the ability to determine how many reads map to a reference region. RESULTS: BamToCov is a toolkit for rapid and flexible coverage computation that relies on the most memory efficient algorithm and is designed for integration in pipelines, given its ability to read alignment files from streams. The tools in the suite can process sorted BAM or CRAM files, allowing the user to extract coverage information via different filtering approaches and to save the output in different formats (BED, Wig or counts). The BamToCov algorithm can also handle strand-specific and/or physical coverage analyses. AVAILABILITY AND IMPLEMENTATION: This program, accessory utilities and their documentation are freely available at https://github.com/telatin/BamToCov. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genomics , Software , Sequence Analysis, DNA , Algorithms , Documentation , High-Throughput Nucleotide Sequencing
8.
Gut ; 71(2): 382-390, 2022 02.
Article in English | MEDLINE | ID: mdl-33541866

ABSTRACT

OBJECTIVE: The full phenotypic expression of non-alcoholic fatty liver disease (NAFLD) in lean subjects is incompletely characterised. We aimed to investigate prevalence, characteristics and long-term prognosis of Caucasian lean subjects with NAFLD. DESIGN: The study cohort comprises 1339 biopsy-proven NAFLD subjects from four countries (Italy, UK, Spain and Australia), stratified into lean and non-lean (body mass index (BMI) 10 483 person-years), 4.7% of lean vs 7.7% of non-lean patients reported liver-related events (p=0.37). No difference in survival was observed compared with non-lean NAFLD (p=0.069). CONCLUSIONS: Caucasian lean subjects with NAFLD may progress to advanced liver disease, develop metabolic comorbidities and experience cardiovascular disease (CVD) as well as liver-related mortality, independent of longitudinal progression to obesity and PNPLA3 genotype. These patients represent one end of a wide spectrum of phenotypic expression of NAFLD where the disease manifests at lower overall BMI thresholds. LAY SUMMARY: NAFLD may affect and progress in both obese and lean individuals. Lean subjects are predominantly males, have a younger age at diagnosis and are more prevalent in some geographic areas. During the follow-up, lean subjects can develop hepatic and extrahepatic disease, including metabolic comorbidities, in the absence of weight gain. These patients represent one end of a wide spectrum of phenotypic expression of NAFLD.


Subject(s)
Non-alcoholic Fatty Liver Disease/complications , Thinness/complications , White People , Adult , Body Mass Index , Cohort Studies , Female , Humans , Male , Middle Aged , Non-alcoholic Fatty Liver Disease/mortality , Non-alcoholic Fatty Liver Disease/pathology , Prognosis , Survival Rate , Thinness/mortality , Thinness/pathology
9.
Mol Med ; 28(1): 39, 2022 04 01.
Article in English | MEDLINE | ID: mdl-35365098

ABSTRACT

BACKGROUND: Bladder cancer (BC) has the highest per-patient cost of all cancer types. Hence, we aim to develop a non-invasive, point-of-care tool for the diagnostic and molecular stratification of patients with BC based on combined microRNAs (miRNAs) and surface-enhanced Raman spectroscopy (SERS) profiling of urine. METHODS: Next-generation sequencing of the whole miRNome and SERS profiling were performed on urine samples collected from 15 patients with BC and 16 control subjects (CTRLs). A retrospective cohort (BC = 66 and CTRL = 50) and RT-qPCR were used to confirm the selected differently expressed miRNAs. Diagnostic accuracy was assessed using machine learning algorithms (logistic regression, naïve Bayes, and random forest), which were trained to discriminate between BC and CTRL, using as input either miRNAs, SERS, or both. The molecular stratification of BC based on miRNA and SERS profiling was performed to discriminate between high-grade and low-grade tumors and between luminal and basal types. RESULTS: Combining SERS data with three differentially expressed miRNAs (miR-34a-5p, miR-205-3p, miR-210-3p) yielded an Area Under the Curve (AUC) of 0.92 ± 0.06 in discriminating between BC and CTRL, an accuracy which was superior either to miRNAs (AUC = 0.84 ± 0.03) or SERS data (AUC = 0.84 ± 0.05) individually. When evaluating the classification accuracy for luminal and basal BC, the combination of miRNAs and SERS profiling averaged an AUC of 0.95 ± 0.03 across the three machine learning algorithms, again better than miRNA (AUC = 0.89 ± 0.04) or SERS (AUC = 0.92 ± 0.05) individually, although SERS alone performed better in terms of classification accuracy. CONCLUSION: miRNA profiling synergizes with SERS profiling for point-of-care diagnostic and molecular stratification of BC. By combining the two liquid biopsy methods, a clinically relevant tool that can aid BC patients is envisaged.


Subject(s)
MicroRNAs , Urinary Bladder Neoplasms , Bayes Theorem , Biomarkers, Tumor/genetics , Humans , Liquid Biopsy , MicroRNAs/genetics , Point-of-Care Systems , Retrospective Studies , Urinary Bladder Neoplasms/diagnosis , Urinary Bladder Neoplasms/genetics
10.
Hum Mutat ; 42(3): 272-289, 2021 03.
Article in English | MEDLINE | ID: mdl-33326653

ABSTRACT

To reconstruct the phenotypical and clinical implications of the Italian genetic structure, we thoroughly analyzed a whole-exome sequencing data set comprised of 1686 healthy Italian individuals. We found six previously unreported variants with remarkable frequency differences between Northern and Southern Italy in the HERC2, OR52R1, ADH1B, and THBS4 genes. We reported 36 clinically relevant variants (submitted as pathogenic, risk factors, or drug response in ClinVar) with significant frequency differences between Italy and Europe. We then explored putatively pathogenic variants in the Italian exome. On average, our Italian individuals carried 16.6 protein-truncating variants (PTVs), with 2.5% of the population having a PTV in one of the 59 American College of Medical Genetics (ACMG) actionable genes. Lastly, we looked for PTVs that are likely to cause Mendelian diseases. We found four heterozygous PTVs in haploinsufficient genes (KAT6A, PTCH1, and STXBP1) and three homozygous PTVs in genes causing recessive diseases (DPYD, FLG, and PYGM). Comparing frequencies from our data set to other public databases, like gnomAD, we showed the importance of population-specific databases for a more accurate assessment of variant pathogenicity. For this reason, we made aggregated frequencies from our data set publicly available as a tool for both clinicians and researchers (http://nigdb.cineca.it; NIG-ExIT).


Subject(s)
Exome , Genetic Variation , Europe , Exome/genetics , Humans , Italy , Exome Sequencing
11.
Int J Mol Sci ; 22(10)2021 May 18.
Article in English | MEDLINE | ID: mdl-34069990

ABSTRACT

The taxonomic composition of microbial communities can be assessed using universal marker amplicon sequencing. The most common taxonomic markers are the 16S rDNA for bacterial communities and the internal transcribed spacer (ITS) region for fungal communities, but various other markers are used for barcoding eukaryotes. A crucial step in the bioinformatic analysis of amplicon sequences is the identification of representative sequences. This can be achieved using a clustering approach or by denoising raw sequencing reads. DADA2 is a widely adopted algorithm, released as an R library, that denoises marker-specific amplicons from next-generation sequencing and produces a set of representative sequences referred to as 'Amplicon Sequence Variants' (ASV). Here, we present Dadaist2, a modular pipeline, providing a complete suite for the analysis that ranges from raw sequencing reads to the statistics of numerical ecology. Dadaist2 implements a new approach that is specifically optimised for amplicons with variable lengths, such as the fungal ITS. The pipeline focuses on streamlining the data flow from the command line to R, with multiple options for statistical analysis and plotting, both interactive and automatic.


Subject(s)
DNA Barcoding, Taxonomic/statistics & numerical data , Metagenomics/statistics & numerical data , Microbiota/genetics , Software , Algorithms , Cluster Analysis , Computational Biology/methods , Data Interpretation, Statistical , High-Throughput Nucleotide Sequencing , Metadata , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA
12.
Int J Mol Sci ; 22(18)2021 Sep 09.
Article in English | MEDLINE | ID: mdl-34575904

ABSTRACT

The KMT2A/AFF1 rearrangement is associated with an unfavorable prognosis in infant acute lymphocytic leukemia (ALL). Discordant ALL in monozygotic twins is uncommon and represents an attractive resource to evaluate intrauterine environment-genetic interplay in ALL. Mutational and epigenetic profiles were characterized for a discordant KMT2A/AFF1-rearranged infant monozygotic twin pair and their parents, and they were compared to three independent KMT2A/AFF1-positive ALL infants, in which the DNA methylation and gene expression profiles were investigated. A de novo Q61H NRAS mutation was detected in the affected twin at diagnosis and backtracked in both twins at birth. The KMT2A/AFF1 rearrangement was absent at birth in both twins. Genetic analyses conducted at birth gave more insights into the timing of the mutation hit. We identified correlations between DNA methylation and gene expression changes for 32 genes in the three independent affected versus remitted patients. The strongest correlations were observed for the RAB32, PDK4, CXCL3, RANBP17, and MACROD2 genes. This epigenetic signature could be a putative target for the development of novel epigenetic-based therapies and could help in explaining the molecular mechanisms characterizing ALL infants with KMT2A/AFF1 fusions.


Subject(s)
DNA-Binding Proteins/genetics , Epigenesis, Genetic , Gene Expression Regulation , Gene Rearrangement , Histone-Lysine N-Methyltransferase/genetics , Myeloid-Lymphoid Leukemia Protein/genetics , Transcriptional Elongation Factors/genetics , Twins, Monozygotic/genetics , Alleles , Computational Biology/methods , CpG Islands , DNA Methylation , Epigenomics/methods , Female , Genotype , Humans , Infant, Newborn , Male , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics , Exome Sequencing
13.
BMC Bioinformatics ; 18(1): 225, 2017 Apr 28.
Article in English | MEDLINE | ID: mdl-28454514

ABSTRACT

BACKGROUND: Whole genome and exome sequencing are contributing to the extraordinary progress in the study of human genetic variants. In this fast developing field, appropriate and easily accessible tools are required to facilitate data analysis. RESULTS: Here we describe QueryOR, a web platform suitable for searching among known candidate genes as well as for finding novel gene-disease associations. QueryOR combines several innovative features that make it comprehensive, flexible and easy to use. Instead of being designed on specific datasets, it works on a general XML schema specifying formats and criteria of each data source. Thanks to this flexibility, new criteria can be easily added for future expansion. Currently, up to 70 user-selectable criteria are available, including a wide range of gene and variant features. Moreover, rather than progressively discarding variants taking one criterion at a time, the prioritization is achieved by a global positive selection process that considers all transcript isoforms, thus producing reliable results. QueryOR is easy to use and its intuitive interface allows to handle different kinds of inheritance as well as features related to sharing variants in different patients. QueryOR is suitable for investigating single patients, families or cohorts. CONCLUSIONS: QueryOR is a comprehensive and flexible web platform eligible for an easy user-driven variant prioritization. It is freely available for academic institutions at http://queryor.cribi.unipd.it/ .


Subject(s)
Databases, Genetic , Genetic Variation , Software , Disease/genetics , Exome , Genome, Human , Humans , Internet
14.
Comput Biol Med ; 172: 108288, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38503094

ABSTRACT

Data sharing among different institutions represents one of the major challenges in developing distributed machine learning approaches, especially when data is sensitive, such as in medical applications. Federated learning is a possible solution, but requires fast communications and flawless security. Here, we propose SYNDSURV (SYNthetic Distributed SURVival), an alternative approach that simplifies the current state-of-the-art paradigm by allowing different centres to generate local simulated instances from real data and then gather them into a centralised hub, where an Artificial Intelligence (AI) model can learn in a standard way. The main advantage of this procedure is that it is model-agnostic, therefore prediction models can be directly applied in distributed applications without requiring particular adaptations as the current federated approaches do. To show the validity of our approach for medical applications, we tested it on a survival analysis task, offering a viable alternative to train AI models on distributed data. While federated learning has been mainly optimised for gradient-based approaches so far, our framework works with any predictive method, proving to be a comparable way of performing distributed learning without being too demanding towards each participating institute in terms of infrastructural requirements.


Subject(s)
Artificial Intelligence , Machine Learning , Survival Analysis
15.
Genes (Basel) ; 14(12)2023 12 17.
Article in English | MEDLINE | ID: mdl-38137050

ABSTRACT

Missense variation in genomes can affect protein structure stability and, in turn, the cell physiology behavior. Predicting the impact of those variations is relevant, and the best-performing computational tools exploit the protein structure information. However, most of the current protein sequence variants are unresolved, and comparative or ab initio tools can provide a structure. Here, we evaluate the impact of model structures, compared to experimental structures, on the predictors of protein stability changes upon single-point mutations, where no significant changes are expected between the original and the mutated structures. We show that there are substantial differences among the computational tools. Methods that rely on coarse-grained representation are less sensitive to the underlying protein structures. In contrast, tools that exploit more detailed molecular representations are sensible to structures generated from comparative modeling, even on single-residue substitutions.


Subject(s)
Computational Biology , Point Mutation , Computational Biology/methods , Proteins/metabolism , Protein Stability , Amino Acid Sequence
16.
Cancer Med ; 12(5): 5859-5873, 2023 03.
Article in English | MEDLINE | ID: mdl-36366788

ABSTRACT

BACKGROUND: The 8q24 locus is enriched in cancer-associated polymorphisms and, despite containing relatively few protein-coding genes, it hosts the MYC oncogene and other genetic elements connected to tumorigenesis, including microRNAs (miRNAs). Research on miRNAs may provide insights into the transcriptomic regulation of this multiple cancer-associated region. MATERIAL AND METHODS: We profiled all miRNAs located in the 8q24 region in 120 colorectal cancer (CRC) patients and 80 controls. miRNA profiling was performed on cancer/non-malignant adjacent mucosa, stool, and plasma extracellular vesicles (EVs), and the results validated with The Cancer Genome Atlas (TCGA) data. To verify if the 8q24-annotated miRNAs altered in CRC were dysregulated in other cancers and biofluids, we evaluated their levels in bladder cancer (BC) cases from the TCGA dataset and in urine and plasma EVs from a set of BC cases and healthy controls. RESULTS: Among the detected mature miRNAs in the region, 12 were altered between CRC and adjacent mucosa (adj. p < 0.05). Five and four miRNAs were confirmed as dysregulated in the CRC and BC TCGA dataset, respectively. A co-expression analysis of tumor/adjacent tissue data from the CRC group revealed a correlation between the dysregulated miRNAs and CRC-related genes (PVT1 and MYC) annotated in 8q24 region. miR-30d-5p and miR-151a-3p, altered in CRC tissue, were also dysregulated in stool of CRC patients and urine of BC cases, respectively. Functional enrichment of dysregulated miRNA target genes highlighted terms related to TP53-mediated cell cycle control. CONCLUSIONS: Altered expression of 8q24-annotated miRNAs may be relevant for the initiation and/or progression of cancer.


Subject(s)
Colorectal Neoplasms , MicroRNAs , Urinary Bladder Neoplasms , Humans , MicroRNAs/metabolism , Gene Expression Profiling/methods , Transcriptome , Urinary Bladder Neoplasms/genetics , Colorectal Neoplasms/pathology , Gene Expression Regulation, Neoplastic , Biomarkers, Tumor/metabolism
17.
Eur Radiol Exp ; 7(1): 79, 2023 12 12.
Article in English | MEDLINE | ID: mdl-38087079

ABSTRACT

BACKGROUND: Spleen stiffness measurement (SSM) performed by transient elastography at 100 Hz is a novel technology for the evaluation of portal hypertension in advanced chronic liver disease, but technical aspects are lacking. We aimed to evaluate the intraexamination variability of SSM and to determine the best transient elastography protocol for obtaining robust measurements to be used in clinical practice. METHODS: We analyzed 253 SSM exams with up to 20 scans for each examination, performed between April 2021 and June 2022. All SSM results were evaluated according to different protocols by dividing data into groups of n measurements (from 2 to 19). Considering as reference the median SSM values across all the 20 measurements, we calculated the distribution of the absolute deviations of each protocol from the reference median. This analysis was repeated 1,000 times by resampling the data. Distributions were also stratified by etiology (chronic liver disease versus clinically significant portal hypertension) and different SSM ranges: < 25 kPa, 25-75, and > 75 kPa. RESULTS: Overall, we observed that the spleen stiffness exam had less variability if it exceeded 12 measurements, i.e., absolute deviations ≤ 5 kPa at 95% confidence. For exams with higher SSM values (> 75 kPa), as seen in clinically significant portal hypertension, at least 15 measurements are highly recommendable. CONCLUSIONS: Fifteen scans per examination should be considered for each SSM exam performed at 100 Hz to achieve a low intraexamination variability within a reasonable time in clinical practice. RELEVANCE STATEMENT: Performing at least 15 scans per examination is recommended for 100 Hz SSM in order to achieve a low intraexamination variability, in particular for values > 75 kPa compatible with clinically significant portal hypertension. KEY POINTS: • Spleen stiffness measurement by transient elastography is used for stratification in patients with portal hypertension. • At 100 Hz, this method may have intraexamination variability. • A minimum of 15 scans per examination achieves a low intraexamination variability.


Subject(s)
Elasticity Imaging Techniques , Hypertension, Portal , Humans , Spleen/diagnostic imaging , Elasticity Imaging Techniques/methods , Hypertension, Portal/diagnostic imaging
18.
J Cachexia Sarcopenia Muscle ; 14(6): 2692-2702, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37743227

ABSTRACT

BACKGROUND: Small non-coding (snc)RNAs, including microRNAs and P-element induced wimpy testis (PIWI)-interacting-RNAs (piRNAs), crucially regulate gene expression in both physiological and pathological conditions. In particular, some muscle-specific microRNAs (myomiRs) have been involved in the pathogenesis of cancer-induced muscle wasting. The aims of the present study were (i) to profile sncRNAs in both skeletal muscle and plasma of gastrointestinal cancer patients and (ii) to investigate the association among differentially expressed sncRNAs and the level of muscularity at body composition analysis. METHODS: Surgical patients with gastrointestinal cancer or benign disease were recruited. Blood samples and muscle biopsies (rectus abdominis) were collected during surgery. Low muscularity patients were those at the lowest tertile of skeletal muscle index (SMI; CT-scan), whereas moderate/high muscularity patients were in the middle and highest SMI tertiles. SncRNAs in the muscle were assessed by RNAseq, circulating microRNAs were evaluated by qPCR. RESULTS: Cancer patients (n = 25; 13 females, 52%) showed a mean age of 71.6 ± 11.2 years, a median body weight loss of 4.2% and a mean BMI of 27.0 ± 3.2 kg/m2 . Control group (n = 15; 9 females, 60%) showed a mean age 58.1 ± 13.9 years and a mean BMI of 28.0 ± 4.3 kg/m2 . In cancer patients, the median L3-SMI (cm2 /m2 ) was 42.52 (34.42; 49.07). Males showed a median L3-SMI of 46.08 (41.17-51.79) and females a median L3-SMI of 40.77 (33.73-42.87). Moderate-high and low muscularity groups included 17 and 8 patients, respectively. As for circulating microRNAs, miR-21-5p and miR-133a-3p were up-regulated in patients compared with controls, whereas miR-15b-5p resulted down-regulated in the same comparison (about 30% of control values). Sample clustering by muscularity and sex revealed increased miR-133a-3p and miR-206 only in moderate-high muscularity males. SncRNA profiling in the muscle identified 373 microRNAs and 190 piRNAs (72.5% and 18.7% of raw reads, respectively). As for microRNAs, 10 were up-regulated, and 56 were down-regulated in cancer patients versus controls. Among the 24 dysregulated piRNAs, the majority were down-regulated, including the top two most expressed piRNAs in the muscle (piR-12790 and piR-2106). Network analysis on validated mRNA targets of down-regulated microRNAs revealed miR-15b-5p, miR-106a-5p and miR-106b-5p as main interactors of genes related to ubiquitin ligase/transferase activities. CONCLUSIONS: These results show dysregulation of both muscle microRNAs and piRNAs in cancer patients compared with controls, the former following a sex-specific pattern. Changes in circulating microRNAs are associated with the degree of muscularity rather than body weight loss.


Subject(s)
Circulating MicroRNA , Gastrointestinal Neoplasms , MicroRNAs , RNA, Small Untranslated , Male , Female , Humans , Middle Aged , Aged , Aged, 80 and over , Adult , RNA, Small Untranslated/genetics , Piwi-Interacting RNA , Gene Expression Profiling , MicroRNAs/metabolism , Weight Loss
19.
Artif Intell Med ; 142: 102588, 2023 08.
Article in English | MEDLINE | ID: mdl-37316101

ABSTRACT

BACKGROUND: Amyotrophic Lateral Sclerosis (ALS) is a fatal neurodegenerative disorder characterised by the progressive loss of motor neurons in the brain and spinal cord. The fact that ALS's disease course is highly heterogeneous, and its determinants not fully known, combined with ALS's relatively low prevalence, renders the successful application of artificial intelligence (AI) techniques particularly arduous. OBJECTIVE: This systematic review aims at identifying areas of agreement and unanswered questions regarding two notable applications of AI in ALS, namely the automatic, data-driven stratification of patients according to their phenotype, and the prediction of ALS progression. Differently from previous works, this review is focused on the methodological landscape of AI in ALS. METHODS: We conducted a systematic search of the Scopus and PubMed databases, looking for studies on data-driven stratification methods based on unsupervised techniques resulting in (A) automatic group discovery or (B) a transformation of the feature space allowing patient subgroups to be identified; and for studies on internally or externally validated methods for the prediction of ALS progression. We described the selected studies according to the following characteristics, when applicable: variables used, methodology, splitting criteria and number of groups, prediction outcomes, validation schemes, and metrics. RESULTS: Of the starting 1604 unique reports (2837 combined hits between Scopus and PubMed), 239 were selected for thorough screening, leading to the inclusion of 15 studies on patient stratification, 28 on prediction of ALS progression, and 6 on both stratification and prediction. In terms of variables used, most stratification and prediction studies included demographics and features derived from the ALSFRS or ALSFRS-R scores, which were also the main prediction targets. The most represented stratification methods were K-means, and hierarchical and expectation-maximisation clustering; while random forests, logistic regression, the Cox proportional hazard model, and various flavours of deep learning were the most widely used prediction methods. Predictive model validation was, albeit unexpectedly, quite rarely performed in absolute terms (leading to the exclusion of 78 eligible studies), with the overwhelming majority of included studies resorting to internal validation only. CONCLUSION: This systematic review highlighted a general agreement in terms of input variable selection for both stratification and prediction of ALS progression, and in terms of prediction targets. A striking lack of validated models emerged, as well as a general difficulty in reproducing many published studies, mainly due to the absence of the corresponding parameter lists. While deep learning seems promising for prediction applications, its superiority with respect to traditional methods has not been established; there is, instead, ample room for its application in the subfield of patient stratification. Finally, an open question remains on the role of new environmental and behavioural variables collected via novel, real-time sensors.


Subject(s)
Amyotrophic Lateral Sclerosis , Humans , Amyotrophic Lateral Sclerosis/diagnosis , Artificial Intelligence , Brain , Cluster Analysis , Databases, Factual
20.
Front Mol Biosci ; 9: 1075570, 2022.
Article in English | MEDLINE | ID: mdl-36685278

ABSTRACT

An open challenge of computational and experimental biology is understanding the impact of non-synonymous DNA variations on protein function and, subsequently, human health. The effects of these variants on protein stability can be measured as the difference in the free energy of unfolding (ΔΔG) between the mutated structure of the protein and its wild-type form. Throughout the years, bioinformaticians have developed a wide variety of tools and approaches to predict the ΔΔG. Although the performance of these tools is highly variable, overall they are less accurate in predicting ΔΔG stabilizing variations rather than the destabilizing ones. Here, we analyze the possible reasons for this difference by focusing on the relationship between experimentally-measured ΔΔG and seven protein properties on three widely-used datasets (S2648, VariBench, Ssym) and a recently introduced one (S669). These properties include protein structural information, different physical properties and statistical potentials. We found that two highly used input features, i.e., hydrophobicity and the Blosum62 substitution matrix, show a performance close to random choice when trying to separate stabilizing variants from either neutral or destabilizing ones. We then speculate that, since destabilizing variations are the most abundant class in the available datasets, the overall performance of the methods is higher when including features that improve the prediction for the destabilizing variants at the expense of the stabilizing ones. These findings highlight the need of designing predictive methods able to exploit also input features highly correlated with the stabilizing variants. New tools should also be tested on a not-artificially balanced dataset, reporting the performance on all the three classes (i.e., stabilizing, neutral and destabilizing variants) and not only the overall results.

SELECTION OF CITATIONS
SEARCH DETAIL