RESUMO
Identifying early and non-invasive biomarkers to detect individuals in the earliest stages of the Alzheimer's disease continuum is crucial. As a result, electrophysiology and plasma biomarkers are emerging as great candidates in this pursuit due to their low invasiveness. This is the first magnetoencephalography study to assess the relationship between minimum spanning tree parameters, an alternative to overcome the comparability and thresholding problem issues characteristic of conventional brain network analyses, and plasma phosphorylated tau231 levels in unimpaired individuals, with different risk levels of Alzheimer's disease. Seventy-six individuals with available magnetoencephalography recordings and phosphorylated tau231 plasma determination were included. The minimum spanning tree for the theta, alpha and beta bands for each subject was obtained, and the leaf fraction, tree hierarchy and diameter were calculated. To study the relationship between these topological parameters and phosphorylated tau231, we performed correlation analyses, for the whole sample and considering the two risk sub-groups separately. Increasing concentrations of phosphorylated tau231 were associated with greater leaf fraction and tree hierarchy values, along with lower diameter values, for the alpha and theta frequency bands. These results emerged for the whole sample and the higher risk group, but not for the lower risk group. Our results indicate that the network topology of cognitively unimpaired individuals with elevated plasma phosphorylated tau231 levels, a marker of Alzheimer's disease pathology and amyloid-ß accumulation, is already altered, shifting towards a more integrated network increasing its vulnerability and hub-dependency, mostly in the alpha band. This is indicated by increases in leaf fraction and tree hierarchy, along with reductions in diameter. These results match the initial trajectory proposed by theoretical models of disease progression and network disruption and suggest that changes in brain function and organization begin early on.
RESUMO
Background: Gene set analysis methods have played a major role in generating biological interpretations from omics data such as gene expression datasets. However, most methods focus on detecting homogenous pattern changes in mean expression and methods detecting pattern changes in variance remain poorly explored. While a few studies attempted to use gene-level variance analysis, such approach remains under-utilized. When comparing two phenotypes, gene sets with distinct changes in subgroups under one phenotype are overlooked by available methods although they reflect meaningful biological differences between two phenotypes. Multivariate sample-level variance analysis methods are needed to detect such pattern changes. Results: We use ranking schemes based on minimum spanning tree to generalize the Cramer-Von Mises and Anderson-Darling univariate statistics into multivariate gene set analysis methods to detect differential sample variance or mean. We characterize these methods in addition to two methods developed earlier using simulation results with different parameters. We apply the developed methods to microarray gene expression dataset of prednisolone-resistant and prednisolone-sensitive children diagnosed with B-lineage acute lymphoblastic leukemia and bulk RNA-sequencing gene expression dataset of benign hyperplastic polyps and potentially malignant sessile serrated adenoma/polyps. One or both of the two compared phenotypes in each of these datasets have distinct molecular subtypes that contribute to heterogeneous differences. Our results show that methods designed to detect differential sample variance are able to detect specific hallmark signaling pathways associated with the two compared phenotypes as documented in available literature. Conclusions: The results in this study demonstrate the usefulness of methods designed to detect differential sample variance in providing biological interpretations when biologically relevant but heterogeneous changes between two phenotypes are prevalent in specific signaling pathways. Software implementation of the developed methods is available with detailed documentation from Bioconductor package GSAR. The available methods are applicable to gene expression datasets in a normalized matrix form and could be used with other omics datasets in a normalized matrix form with available collection of feature sets.
RESUMO
It was recently shown that a large class of phylogenetic networks, the 'labellable' networks, is in bijection with the set of 'expanding' covers of finite sets. In this paper, we show how several prominent classes of phylogenetic networks can be characterised purely in terms of properties of their associated covers. These classes include the tree-based, tree-child, orchard, tree-sibling, and normal networks. In the opposite direction, we give an example of how a restriction on the set of expanding covers can define a new class of networks, which we call 'spinal' phylogenetic networks.
Assuntos
Algoritmos , Modelos Genéticos , Humanos , FilogeniaRESUMO
Cluster routing is a critical routing approach in wireless sensor networks (WSNs). However, the uneven distribution of selected cluster head nodes and impractical data transmission paths can result in uneven depletion of network energy. For this purpose, we introduce a new routing strategy for clustered wireless sensor networks that utilizes an improved beluga whale optimization algorithm, called tCBWO-DPR. In the selection process of cluster heads, we introduce a new excitation function to evaluate and select more suitable candidate cluster heads by establishing the correlation between the energy of node and the positional relationship of nodes. In addition, the beluga whale optimization (BWO) algorithm has been improved by incorporating the cosine factor and t-distribution to enhance its local and global search capabilities, as well as to improve its convergence speed and ability. For the data transmission path, we use Prim's algorithm to construct a spanning tree and introduce DPR for determining the optimal route between cluster heads based on the correlation distances of cluster heads. This effectively shortens the data transmission path and enhances network stability. Simulation results show that the improved beluga whale optimization based algorithm can effectively improve the survival cycle and reduce the average energy consumption of the network.
RESUMO
Background: Increasing research has examined the factors related to smartphone use disorder. However, limited research has explored its neural basis. Aims: We aimed to examine the relationship between the topology of the resting-state electroencephalography (rs-EEG) brain network and smartphone use disorder using minimum spanning tree analysis. Furthermore, we examined how negative emotions mediate this relationship. Methods: This study included 113 young, healthy adults (mean age = 20.87 years, 46.9% males). Results: The results showed that the alpha- and delta-band kappas and delta-band leaf fraction were positively correlated with smartphone use disorder. In contrast, the alpha-band diameter was negatively correlated with smartphone use disorder. Negative emotions fully mediated the relationship between alpha-band kappa and alpha-band diameter and smartphone use disorder. Furthermore, negative emotions partially mediated the relationship between delta-band kappa and smartphone use disorder. The findings suggest that excessive scale-free alpha- and delta-band brain networks contribute to the emergence of smartphone use disorder. In addition, the findings also demonstrate that negative emotions and smartphone use disorder share the same neural basis. Negative emotions play a mediating role in the association between topological deviations and smartphone use disorder. Discussion: To the best of our knowledge, this is the first study to examine the neural basis of smartphone use disorder from the perspective of the topology of the rs-EEG brain network. Therefore, neuromodulation may be a potential intervention for smartphone use disorder.
Assuntos
Encéfalo , Smartphone , Masculino , Adulto , Humanos , Adulto Jovem , Feminino , Eletroencefalografia , Mapeamento Encefálico , EmoçõesRESUMO
The Laplacian spectrum significantly contributes the study of the structural features of non-regular networks. Actually, it emphasizes the interaction among the network eigenvalues and their structural properties. Let Pn(Pn') represent the pentagonal-derivation cylinder (Möbius) network. In this article, based on the decomposition techniques of the Laplacian characteristic polynomial, we initially determine that the Laplacian spectra of Pn contain the eigenvalues of matrices LR and LS. Furthermore, using the relationship among the coefficients and roots of these two matrices, explicit calculations of the Kirchhoff index and spanning trees of Pn are determined. The relationship between the Wiener and Kirchhoff indices of Pn is also established.
RESUMO
We prove that for recurrent, reversible graphs, the following conditions are equivalent: (a) existence and uniqueness of the potential kernel, (b) existence and uniqueness of harmonic measure from infinity, (c) a new anchored Harnack inequality, and (d) one-endedness of the wired uniform spanning tree. In particular this gives a proof of the anchored (and in fact also elliptic) Harnack inequality on the UIPT. This also complements and strengthens some results of Benjamini et al. (Ann Probab 29(1):1-65, 2001). Furthermore, we make progress towards a conjecture of Aldous and Lyons by proving that these conditions are fulfilled for strictly subdiffusive recurrent unimodular graphs. Finally, we discuss the behaviour of the random walk conditioned to never return to the origin, which is well defined as a consequence of our results.
RESUMO
The diagnosis of bipolar disorders (BD) mainly depends on the clinical history and behavior observation, while only using clinical tools often limits the diagnosis accuracy. The study aimed to create a novel BD diagnosis framework using multilayer modularity in the dynamic minimum spanning tree (MST). We collected 45 un-medicated BD patients and 47 healthy controls (HC). The sliding window approach was utilized to construct dynamic MST via resting-state functional magnetic resonance imaging (fMRI) data. Firstly, we used three null models to explore the effectiveness of multilayer modularity in dynamic MST. Furthermore, the module allegiance exacted from dynamic MST was applied to train a classifier to discriminate BD patients. Finally, we explored the influence of the FC estimator and MST scale on the performance of the model. The findings indicated that multilayer modularity in the dynamic MST was not a random process in the human brain. And the model achieved an accuracy of 83.70% for identifying BD patients. In addition, we found the default mode network, subcortical network (SubC), and attention network played a key role in the classification. These findings suggested that the multilayer modularity in dynamic MST could highlight the difference between HC and BD patients, which opened up a new diagnostic tool for BD patients. Supplementary Information: The online version contains supplementary material available at 10.1007/s11571-022-09907-x.
RESUMO
Minimum spanning tree (MST)-based clustering algorithms are widely used to detect clusters with diverse densities and irregular shapes. However, most algorithms require the entire dataset to construct an MST, which leads to significant computational overhead. To alleviate this issue, our proposed algorithm R-MST utilizes representative points instead of all sample points for constructing MST. Additionally, based on the density and nearest neighbor distance, we improved the representative point selection strategy to enhance the uniform distribution of representative points in sparse areas, enabling the algorithm to perform well on datasets with varying densities. Furthermore, traditional methods for eliminating inconsistent edges generally require prior knowledge about the number of clusters, which is not always readily available in practical applications. Therefore, we propose an adaptive method that employs mutual neighbors to identify inconsistent edges and determine the optimal number of clusters automatically. The experimental results indicate that the R-MST algorithm not only improves the efficiency of clustering but also enhances its accuracy.
RESUMO
We investigate the topology of sectoral returns in the US stock market using minimum spanning tree (MST) analysis. We examine four distinct time periods: the full period, the Global Financial Crisis (GFC), the COVID-19 pandemic, and the Russia-Ukraine war period. By comparing the static results across these periods, we identify differences in the network structure. Additionally, a rolling window analysis is conducted to explore the time-varying nature of the MST. We employ a TVP-VAR based connectedness framework to ensure a robust analysis of the sectoral return linkages. Our main findings are summarized as follows: First, the structure of the MST varies in different periods, with distinct crisis period structures. During the GFC, the industrial sector dominated clustering, whereas COVID-19 affected the financial, IT, and industrial sectors. The Russia-Ukraine war period showed clustering centered on materials, except in the industrial sector. These varying structures may explain the different characteristics of each crisis. Second, both static and rolling window analyses highlight the significance of the industrial sector in the US stock market. Third, the utilities sector exhibits the lowest centrality measures, indicating its minimal importance and lack of relationships with other industries. These findings provide valuable insights into the interrelationships among industries in the US stock market. Market participants can leverage these findings to enhance their understanding and improve their portfolio management. By utilizing this information, investors can develop optimal diversification strategies to maximize returns and minimize risk.
RESUMO
As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers and require prior knowledge, such as the outlier proportion. To address this problem to some extent, this article proposes an adaptive mini-minimum spanning tree-based outlier detection (MMOD) method, which utilizes a novel distance measure by scaling the Euclidean distance. For datasets containing different densities and taking on different shapes, our method can identify outliers without prior knowledge of outlier percentages. The results on both real-world medical data corpora and intuitive synthetic datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art methods.
RESUMO
This article proposes a benchmark instance generator for the Hop-Constrained Minimum Spanning Tree problem, the Delay-Constrained Minimum Spanning Tree problem, and their bi-objective variants. The generator is developed in C++ and does not uses external libraries, being understandable, easy-to-read, and easy-to-use. Furthermore, the generator employs five parameters that makes possible to generate personalized benchmark instances for these problems. We also describe 640 benchmark instances that were previously used in computational experiments in the literature. Lastly, we include raw results obtained from computational experiments with the described benchmark instances. We hope that the data introduced in this article can foster the development and the evaluation of new algorithms for solving constrained minimum spanning tree problems.
RESUMO
Background and Objectives: This study aimed to investigate the causes of continuous deep fluctuations in the absolute lymphocyte count (ALC) in an untreated patient with Chronic Lymphocytic Leukemia (CLL), who has had a favorable prognosis since the time of diagnosis. Up until now, the patient has voluntarily chosen to adopt a predominantly vegetarian and fruitarian diet, along with prolonged periods of total fasting (ranging from 4 to 39 days) each year. Materials and Methods: For this purpose, we decided to analyze the whole transcriptome profiling of peripheral blood (PB) CD19+ cells from the patient (#1) at different time-points vs. the same cells of five other untreated CLL patients who followed a varied diet. Consequently, the CLL patients were categorized as follows: the 1st group comprised patient #1 at 20 different time-points (16 time-points during nutrition and 4 time-points during fasting), whereas the 2nd group included only one time point for each of the patients (#2, #3, #4, #5, and #6) as they followed a varied diet. We performed microarray experiments using a powerful tool, the Affymetrix Human Clariom™ D Pico Assay, to generate high-fidelity biomarker signatures. Statistical analysis was employed to identify differentially expressed genes and to perform sample clustering. Results: The lymphocytosis trend in patient #1 showed recurring fluctuations since the time of diagnosis. Interestingly, we observed that approximately 4-6 weeks after the conclusion of fasting periods, the absolute lymphocyte count was reduced by about half. The gene expression profiling analysis revealed that nine genes were statistically differently expressed between the 1st group and the 2nd group. Specifically, IGLC3, RPS26, CHPT1, and PCDH9 were under expressed in the 1st group compared to the 2nd group of CLL patients. Conversely, IGHV3-43, IGKV3D-20, PLEKHA1, CYBB, and GABRB2 were over-expressed in the 1st group when compared to the 2nd group of CLL patients. Furthermore, clustering analysis validated that all the samples from patient #1 clustered together, showing clear separation from the samples of the other CLL patients. Conclusions: This study unveiled a small gene expression signature consisting of nine genes that distinguished an untreated CLL patient who followed prolonged periods of total fasting, maintaining a gradual growth trend of lymphocytosis, compared to five untreated CLL patients with a varied diet. Future investigations focusing on patient #1 could potentially shed light on the role of prolonged periodic fasting and the implication of this specific gene signature in sustaining the lymphocytosis trend and the favorable course of the disease.
Assuntos
Jejum , Leucemia Linfocítica Crônica de Células B , Transcriptoma , Humanos , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Idoso de 80 Anos ou mais , Análise por Conglomerados , Dieta Vegetariana , Leucemia Linfocítica Crônica de Células B/genética , LinfocitoseRESUMO
Here, we report a new multi-optical maps scaffolder (MOMS) aiming at utilizing complementary information among optical maps labelled by distinct enzymes. This pipeline was designed for data structure organization, scaffolding by path traversal, gap-filling and molecule reuse of optical maps. Our testing showed that this pipeline has uncapped enzyme tolerance in scaffolding. This means that there are no inbuilt limits as to the number of maps generated by different enzymes that can be utilized by MOMS. For the genome assembly of the human GM12878 cell line, MOMS significantly improved the contiguity and completeness with an up to 144-fold increase of scaffold N50 compared with initial assemblies. Benchmarking on the genomes of human and O. sativa showed that MOMS is more effective and robust compared with other optical-map-based scaffolders. We believe this pipeline will contribute to high-fidelity chromosome assembly and chromosome-level evolutionary analysis.
Assuntos
Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de DNARESUMO
Executive functioning (EF) is a higher order cognitive process that is thought to depend on a network organization facilitating integration across subnetworks, in the context of which the central role of the fronto-parietal network (FPN) has been described across imaging and neurophysiological modalities. However, the potentially complementary unimodal information on the relevance of the FPN for EF has not yet been integrated. We employ a multilayer framework to allow for integration of different modalities into one 'network of networks.' We used diffusion MRI, resting-state functional MRI, MEG, and neuropsychological data obtained from 33 healthy adults to construct modality-specific single-layer networks as well as a single multilayer network per participant. We computed single-layer and multilayer eigenvector centrality of the FPN as a measure of integration in this network and examined their associations with EF. We found that higher multilayer FPN centrality, but not single-layer FPN centrality, was related to better EF. We did not find a statistically significant change in explained variance in EF when using the multilayer approach as compared to the single-layer measures. Overall, our results show the importance of FPN integration for EF and underline the promise of the multilayer framework toward better understanding cognitive functioning.
RESUMO
Electroconvulsive therapy (ECT) is an interventional technique capable of highly effective neuromodulation in major depressive disorder (MDD), but its antidepressant mechanism remains unclear. By recording the resting-state electroencephalogram (RS-EEG) of 19 MDD patients before and after ECT, we analyzed the modulation effect of ECT on the resting-state brain functional network of MDD patients from multiple perspectives: estimating spontaneous EEG activity power spectral density (PSD) using Welch algorithm; constructing brain functional network based on imaginary part coherence (iCoh) and calculate functional connectivity; using minimum spanning tree theory to explore the topological characteristics of brain functional network. The results show that PSD, functional connectivity, and topology in multiple frequency bands were significantly changed after ECT in MDD patients. The results of this study reveal that ECT changes the brain activity of MDD patients, which provides an important reference in the clinical treatment and mechanism analysis of MDD.
Assuntos
Transtorno Depressivo Maior , Eletroconvulsoterapia , Humanos , Transtorno Depressivo Maior/terapia , Encéfalo , Algoritmos , EletroencefalografiaRESUMO
We contribute to the efficient approximation of the Pareto-set for the classical NP-hard multi-objective minimum spanning tree problem (moMST) adopting evolutionary computation. More precisely, by building upon preliminary work, we analyse the neighborhood structure of Pareto-optimal spanning trees and design several highly biased sub-graph-based mutation operators founded on the gained insights. In a nutshell, these operators replace (un)connected sub-trees of candidate solutions with locally optimal sub-trees. The latter (biased) step is realized by applying Kruskal's single-objective MST algorithm to a weighted sum scalarization of a sub-graph. We prove runtime complexity results for the introduced operators and investigate the desirable Pareto-beneficial property. This property states that mutants cannot be dominated by their parent. Moreover, we perform an extensive experimental benchmark study to showcase the operator's practical suitability. Our results confirm that the subgraph based operators beat baseline algorithms from the literature even with severely restricted computational budget in terms of function evaluations on four different classes of complete graphs with different shapes of the Pareto-front.
RESUMO
We present RabbitTClust, a fast and memory-efficient genome clustering tool based on sketch-based distance estimation. Our approach enables efficient processing of large-scale datasets by combining dimensionality reduction techniques with streaming and parallelization on modern multi-core platforms. 113,674 complete bacterial genome sequences from RefSeq, 455 GB in FASTA format, can be clustered within less than 6 min and 1,009,738 GenBank assembled bacterial genomes, 4.0 TB in FASTA format, within only 34 min on a 128-core workstation. Our results further identify 1269 redundant genomes, with identical nucleotide content, in the RefSeq bacterial genomes database.
Assuntos
Genoma , Software , Bases de Dados de Ácidos Nucleicos , Análise por Conglomerados , Bactérias , Algoritmos , Genoma BacterianoRESUMO
An instance of the non-preemptive tree packing problem consists of an undirected graph G = ( V , E ) together with a weight w(e) for every edge e ∈ E . The goal is to activate every edge e for some time interval of length w(e), such that the activated edges keep G connected for the longest possible overall time. We derive a variety of results on this problem. The problem is strongly NP-hard even on graphs of treewidth 2, and it does not allow a polynomial time approximation scheme (unless P=NP). Furthermore, we discuss the performance of a simple greedy algorithm, and we construct and analyze a number of parameterized and exact algorithms.
RESUMO
A resource optimization methodology is proposed for application in long range wide area networks (LoRaWANs). Using variable neighborhood search (VNS) and a minimum-cost spanning tree algorithm, it reduces the implementation and the maintenance costs of such low power networks. Performance evaluations were conducted in LoRaWANs with LoRa repeaters to increase coverage, in scenario where the number and the location of the repeaters are determined by the VNS metaheuristic. Parameters such as spread factor (SF), bandwidth and transmission power were adjusted to minimize the network's total energy per useful bit (Ebit) and the total data collection time. The importance of the SF in the trade-off between (Ebit) and time on-air is evaluated, considering a device scaling factor. Simulation results, obtained after model adjustments with experimental data, show that, in networks with few associated devices, there is a preference for small values of SF aiming at reduction of Ebit. The usage of large SF's becomes relevant when reach extensions are required. The results also demonstrate that, for networks with high number of nodes, the scaling of devices over time become relevant in the fitness function, forcing an equal distribution of time slots per SF to avoid discrepancies in the time data collection.