RESUMEN
INTRODUCTION: Epilepsy is a disease characterized by an excessive discharge in neurons generally provoked without any external stimulus, known as convulsions. About 2 million people are diagnosed each year in the world. This process is carried out by a neurological doctor using an electroencephalogram (EEG), which is lengthy. METHOD: To optimize these processes and make them more efficient, we have resorted to innovative artificial intelligence methods essential in classifying EEG signals. For this, comparing traditional models, such as machine learning or deep learning, with cutting-edge models, in this case, using Capsule-Net architectures and Transformer Encoder, has a crucial role in finding the most accurate model and helping the doctor to have a faster diagnosis. RESULT: In this paper, a comparison was made between different models for binary and multiclass classification of the epileptic seizure detection database, achieving a binary accuracy of 99.92% with the Capsule-Net model and a multiclass accuracy with the Transformer Encoder model of 87.30%. CONCLUSION: Artificial intelligence is essential in diagnosing pathology. The comparison between models is helpful as it helps to discard those that are not efficient. State-of-the-art models overshadow conventional models, but data processing also plays an essential role in evaluating the higher accuracy of the models.
Asunto(s)
Inteligencia Artificial , Epilepsia , Humanos , Epilepsia/diagnóstico , Convulsiones/diagnóstico , Algoritmos , Aprendizaje Automático , ElectroencefalografíaRESUMEN
BACKGROUND: Rhizosphere microorganisms play a crucial role in plant health and development. Plant root exudates (PRE) are a complex mixture of organic molecules and provide nutritional and signaling information to rhizosphere microorganisms. Burkholderiaceae species are non-abundant in the rhizosphere but exhibit a wide range of plant-growth-promoting and plant-health-protection effects. Most of these plant-associated microorganisms have been studied in isolation under laboratory conditions, whereas in nature, they interact in competition or cooperation with each other. To improve our understanding of the factors driving growth dynamics of low-abundant bacterial species in the rhizosphere, we hypothesized that the growth and survival of four Burkholderiaceae strains (Paraburkholderia phytofirmans PsJN, Cupriavidus metallidurans CH34, C. pinatubonensis JMP134 and C. taiwanensis LMG19424) in Arabidopsis thaliana PRE is affected by the presence of each other. RESULTS: Differential growth abilities of each strain were found depending on plant age and whether PRE was obtained after growth on N limitation conditions. The best-adapted strain to grow in PRE was P. phytofirmans PsJN, with C. pinatubonensis JMP134 growing better than the other two Cupriavidus strains. Individual strain behavior changed when they succeeded in combinations. Clustering analysis showed that the 4-member co-culture grouped with one of the best-adapted strains, either P. phytofirmans PsJN or C. pinatubonensis JMP134, depending on the PRE used. Sequential transference experiments showed that the behavior of the 4-member co-culture relies on the type of PRE provided for growth. CONCLUSIONS: The results suggest that individual strain behavior changed when they grew in combinations of two, three, or four members, and those changes are determined first by the inherent characteristics of each strain and secondly by the environment.
Asunto(s)
Arabidopsis , Burkholderia , Burkholderiaceae , Arabidopsis/microbiología , Mezclas Complejas , Exudados y Transudados , Estado Nutricional , PlantasRESUMEN
BACKGROUND: When designing a treatment in orthodontics, especially for children and teenagers, it is crucial to be aware of the changes that occur throughout facial growth because the rate and direction of growth can greatly affect the necessity of using different treatment mechanics. This paper presents a Bayesian network approach for facial biotype classification to classify patients' biotypes into Dolichofacial (long and narrow face), Brachyfacial (short and wide face), and an intermediate kind called Mesofacial, we develop a novel learning technique for tree augmented Naive Bayes (TAN) for this purpose. RESULTS: The proposed method, on average, outperformed all the other models based on accuracy, precision, recall, [Formula: see text], and kappa, for the particular dataset analyzed. Moreover, the proposed method presented the lowest dispersion, making this model more stable and robust against different runs. CONCLUSIONS: The proposed method obtained high accuracy values compared to other competitive classifiers. When analyzing a resulting Bayesian network, many of the interactions shown in the network had an orthodontic interpretation. For orthodontists, the Bayesian network classifier can be a helpful decision-making tool.
Asunto(s)
Algoritmos , Concienciación , Niño , Adolescente , Humanos , Teorema de BayesRESUMEN
The prognostics and health management disciplines provide an efficient solution to improve a system's durability, taking advantage of its lifespan in functionality before a failure appears. Prognostics are performed to estimate the system or subsystem's remaining useful life (RUL). This estimation can be used as a supply in decision-making within maintenance plans and procedures. This work focuses on prognostics by developing a recurrent neural network and a forecasting method called Prophet to measure the performance quality in RUL estimation. We apply this approach to degradation signals, which do not need to be monotonical. Finally, we test our system using data from new generation telescopes in real-world applications.
Asunto(s)
Análisis de Falla de Equipo , Redes Neurales de la Computación , Análisis de Falla de Equipo/métodosRESUMEN
BACKGROUND: An important process for plant survival is the immune system. The induced systemic resistance (ISR) triggered by beneficial microbes is an important cost-effective defense mechanism by which plants are primed to an eventual pathogen attack. Defense mechanisms such as ISR depend on an accurate and context-specific regulation of gene expression. Interactions between genes and their products give rise to complex circuits known as gene regulatory networks (GRNs). Here, we explore the regulatory mechanism of the ISR defense response triggered by the beneficial bacterium Paraburkholderia phytofirmans PsJN in Arabidopsis thaliana plants infected with Pseudomonas syringae DC3000. To achieve this, a GRN underlying the ISR response was inferred using gene expression time-series data of certain defense-related genes, differential evolution, and threshold Boolean networks. RESULTS: One thousand threshold Boolean networks were inferred that met the restriction of the desired dynamics. From these networks, a consensus network was obtained that helped to find plausible interactions between the genes. A representative network was selected from the consensus network and biological restrictions were applied to it. The dynamics of the selected network showed that the largest attractor, a limit cycle of length 3, represents the final stage of the defense response (12, 18, and 24 h). Also, the structural robustness of the GRN was studied through the networks' attractors. CONCLUSIONS: A computational intelligence approach was designed to reconstruct a GRN underlying the ISR defense response in plants using gene expression time-series data of A. thaliana colonized by P. phytofirmans PsJN and subsequently infected with P. syringae DC3000. Using differential evolution, 1000 GRNs from time-series data were successfully inferred. Through the study of the network dynamics of the selected GRN, it can be concluded that it is structurally robust since three mutations were necessary to completely disarm the Boolean trajectory that represents the biological data. The proposed method to reconstruct GRNs is general and can be used to infer other biologically relevant networks to formulate new biological hypotheses.
Asunto(s)
Arabidopsis/genética , Arabidopsis/microbiología , Resistencia a la Enfermedad/genética , Redes Reguladoras de Genes , Burkholderiaceae/fisiología , Pseudomonas syringaeRESUMEN
In intertidal marine crustaceans, phenotypic variation in physiological and life-history traits is pervasive along latitudinal clines. However, organisms have complex phenotypes, and their traits do not vary independently but rather interact differentially between them, effect that is caused by genetic and/or environmental forces. We evaluated the geographic variation in phenotypic integration of three marine crab species that inhabit different vertical thermal microhabitats of the intertidal zone. We studied seven populations of each species along a latitudinal gradient that spans more than 3000â¯km of the Chilean coast. Specifically we measured nine physiological traits that are highly related to thermal physiology. Of the nine traits, we selected four that contributed significantly to the observed geographical variation among populations; this variation was then evaluated using mixed linear models and an integrative approach employing machine learning. The results indicate that patterns of physiological variation depend on species vertical microhabitat, which may be subject to chronic or acute environmental variation. The species that inhabit the high- intertidal sites (i.e., exposed to chronic variation) better tolerated thermal stress compared with populations that inhabit the lower intertidal. While those in the low-intertidal only face conditions of acute thermal variation, using to a greater extent the plasticity to face these events. Our main results reflect that (1) species that inhabit the high-intertidal maintain a greater integration between their physiological traits and present lower plasticity than those that inhabit the low-intertidal. (2) Inverse relationship that exists between phenotypic plasticity and phenotypic integration of the physiological traits identified, which could help optimize energy resources. In general, the study of multiple physiological traits provides a more accurate picture of how the thermal traits of organisms vary along temperature gradients especially when exposed to conditions close to tolerance limits.
Asunto(s)
Aclimatación , Temperatura Corporal , Crustáceos/fisiología , Ecotipo , Animales , Crustáceos/genética , Aprendizaje AutomáticoRESUMEN
Quorum-sensing systems play important roles in host colonization and host establishment of Burkholderiales species. Beneficial Paraburkholderia species share a conserved quorum-sensing (QS) system, designated BraI/R, that controls different phenotypes. In this context, the plant growth-promoting bacterium Paraburkholderia phytofirmans PsJN possesses two different homoserine lactone QS systems BpI.1/R.1 and BpI.2/R.2 (BraI/R-like QS system). The BpI.1/R.1 QS system was previously reported to be important to colonize and produce beneficial effects in Arabidopsis thaliana plants. Here, we analyzed the temporal variations of the QS gene transcript levels in the wild-type strain colonizing plant roots. The gene expression patterns showed relevant differences in both QS systems compared with the wild-type strain in the unplanted control treatment. The gene expression data were used to reconstruct a regulatory network model of QS systems in P. phytofirmans PsJN, using a Boolean network model. Also, we examined the phenotypic traits and transcript levels of genes involved in QS systems, using P. phytofirmans mutants in homoserine lactone synthases genes. We observed that the BpI.1/R.1 QS system regulates biofilm formation production in strain PsJN and this phenotype was associated with the lower expression of a specific extracytoplasmic function sigma factor ecf26.1 gene (implicated in biofilm formation) in the bpI.1 mutant strain.
Asunto(s)
Arabidopsis/crecimiento & desarrollo , Biopelículas , Burkholderia/genética , Raíces de Plantas/crecimiento & desarrollo , Percepción de Quorum/genética , 4-Butirolactona/análogos & derivados , 4-Butirolactona/metabolismo , Arabidopsis/microbiología , Burkholderia/metabolismo , Burkholderia/fisiología , Perfilación de la Expresión Génica/métodos , Regulación Bacteriana de la Expresión Génica , Redes Reguladoras de Genes , Interacciones Huésped-Patógeno , Modelos Genéticos , Mutación , Raíces de Plantas/microbiologíaRESUMEN
BACKGROUND: Interactions between genes and their products give rise to complex circuits known as gene regulatory networks (GRN) that enable cells to process information and respond to external stimuli. Several important processes for life, depend of an accurate and context-specific regulation of gene expression, such as the cell cycle, which can be analyzed through its GRN, where deregulation can lead to cancer in animals or a directed regulation could be applied for biotechnological processes using yeast. An approach to study the robustness of GRN is through the neutral space. In this paper, we explore the neutral space of a Schizosaccharomyces pombe (fission yeast) cell cycle network through an evolution strategy to generate a neutral graph, composed of Boolean regulatory networks that share the same state sequences of the fission yeast cell cycle. RESULTS: Through simulations it was found that in the generated neutral graph, the functional networks that are not in the wildtype connected component have in general a Hamming distance more than 3 with the wildtype, and more than 10 between the other disconnected functional networks. Significant differences were found between the functional networks in the connected component of the wildtype network and the rest of the network, not only at a topological level, but also at the state space level, where significant differences in the distribution of the basin of attraction for the G1 fixed point was found for deterministic updating schemes. CONCLUSIONS: In general, functional networks in the wildtype network connected component, can mutate up to no more than 3 times, then they reach a point of no return where the networks leave the connected component of the wildtype. The proposed method to construct a neutral graph is general and can be used to explore the neutral space of other biologically interesting networks, and also formulate new biological hypotheses studying the functional networks in the wildtype network connected component.
Asunto(s)
Ciclo Celular/fisiología , Quinasas Ciclina-Dependientes/metabolismo , Redes Reguladoras de Genes/fisiología , Modelos Biológicos , Schizosaccharomyces/fisiología , Proteínas de Ciclo Celular/metabolismo , Biología Computacional , Gráficos por Computador , Simulación por Computador , Fase G1/fisiología , Redes Neurales de la Computación , Schizosaccharomyces/genéticaRESUMEN
Dementia arises from various brain-affecting diseases and injuries, with Alzheimer's disease being the most prevalent, impacting around 55 million people globally. Current clinical diagnosis often relies on biomarkers indicative of Alzheimer's distinctive features. Electroencephalography (EEG) serves as a cost-effective, user-friendly, and safe biomarker for early Alzheimer's detection. This study utilizes EEG signals processed with Short-Time Fourier Transform (STFT) to generate spectrograms, facilitating visualization of EEG signal properties. Leveraging the Brainlat database, we propose SpectroCVT-Net, a novel convolutional vision transformer architecture incorporating channel attention mechanisms. SpectroCVT-Net integrates convolutional and attention mechanisms to capture local and global dependencies within spectrograms. Comprising feature extraction and classification stages, the model enhances Alzheimer's disease classification accuracy compared to transfer learning methods, achieving 92.59 ± 2.3% accuracy across Alzheimer's, healthy controls, and behavioral variant frontotemporal dementia (bvFTD). This article introduces a new architecture and evaluates its efficacy with unconventional data for Alzheimer's diagnosis, contributing: SpectroCVT-Net, tailored for EEG spectrogram classification without reliance on transfer learning; a convolutional vision transformer (CVT) module in the classification stage, integrating local feature extraction with attention heads for global context analysis; Grad-CAM analysis for network decision insight, identifying critical layers, frequencies, and electrodes influencing classification; and enhanced interpretability through spectrograms, illuminating key brain wave contributions to Alzheimer's, frontotemporal dementia, and healthy control classifications, potentially aiding clinical diagnosis and management.
Asunto(s)
Enfermedad de Alzheimer , Electroencefalografía , Procesamiento de Señales Asistido por Computador , Enfermedad de Alzheimer/clasificación , Enfermedad de Alzheimer/fisiopatología , Enfermedad de Alzheimer/diagnóstico , Humanos , Electroencefalografía/métodos , Masculino , Femenino , Bases de Datos Factuales , AncianoRESUMEN
Analyzing all the deterministic dynamics of a Boolean regulatory network is a difficult problem since it grows exponentially with the number of nodes. In this paper, we present mathematical and computational tools for analyzing the complete deterministic dynamics of Boolean regulatory networks. For this, the notion of alliance is introduced, which is a subconfiguration of states that remains fixed regardless of the values of the other nodes. Also, equivalent classes are considered, which are sets of updating schedules which have the same dynamics. Using these techniques, we analyze two yeast cell cycle models. Results show the effectiveness of the proposed tools for analyzing update robustness as well as the discovery of new information related to the attractors of the yeast cell cycle models considering all the possible deterministic dynamics, which previously have only been studied considering the parallel updating scheme.
Asunto(s)
Puntos de Control del Ciclo Celular , Saccharomycetales/citología , Schizosaccharomyces/citología , Algoritmos , Conceptos Matemáticos , Modelos Biológicos , Biología de SistemasRESUMEN
We explore the data-parallel acceleration of physics-informed machine learning (PIML) schemes, with a focus on physics-informed neural networks (PINNs) for multiple graphics processing units (GPUs) architectures. In order to develop scale-robust and high-throughput PIML models for sophisticated applications which may require a large number of training points (e.g., involving complex and high-dimensional domains, non-linear operators or multi-physics), we detail a novel protocol based on h-analysis and data-parallel acceleration through the Horovod training framework. The protocol is backed by new convergence bounds for the generalization error and the train-test gap. We show that the acceleration is straightforward to implement, does not compromise training, and proves to be highly efficient and controllable, paving the way towards generic scale-robust PIML. Extensive numerical experiments with increasing complexity illustrate its robustness and consistency, offering a wide range of possibilities for real-world simulations.
RESUMEN
An evolutionary computation framework to learn binary threshold networks is presented. Inspired by the recent trend of binary neural networks, where weights and activation thresholds are represented using 1 and -1 such that they can be stored in 1-bit instead of full precision, we explore this approach for gene regulatory network modeling. We test our method by inferring binary threshold networks of two regulatory network models: Quorum sensing systems in bacterium Paraburkholderia phytofirmans PsJN and the fission yeast cell-cycle. We considered differential evolution and particle swarm optimization for the simulations. Results for weights having only 1 and -1 values, and different activation thresholds are presented. Full binary threshold networks were found with minimum error (2 bits), whereas when the binary restriction is relaxed for the activation thresholds, networks with 0 bit error were found.
Asunto(s)
Redes Reguladoras de Genes , Redes Neurales de la Computación , Redes Reguladoras de Genes/genética , Percepción de Quorum/genética , AlgoritmosRESUMEN
Several artificial intelligence algorithms have been developed for COVID-19-related topics. One that has been common is the COVID-19 diagnosis using chest X-rays, where the eagerness to obtain early results has triggered the construction of a series of datasets where bias management has not been thorough from the point of view of patient information, capture conditions, class imbalance, and careless mixtures of multiple datasets. This paper analyses 19 datasets of COVID-19 chest X-ray images, identifying potential biases. Moreover, computational experiments were conducted using one of the most popular datasets in this domain, which obtains a 96.19% of classification accuracy on the complete dataset. Nevertheless, when evaluated with the ethical tool Aequitas, it fails on all the metrics. Ethical tools enhanced with some distribution and image quality considerations are the keys to developing or choosing a dataset with fewer bias issues. We aim to provide broad research on dataset problems, tools, and suggestions for future dataset developments and COVID-19 applications using chest X-ray images.
Asunto(s)
Inteligencia Artificial , COVID-19 , Humanos , Prueba de COVID-19 , Rayos X , SesgoRESUMEN
Alzheimer's disease (AD) is a progressive type of dementia characterized by loss of memory and other cognitive abilities, including speech. Since AD is a progressive disease, detection in the early stages is essential for the appropriate care of the patient throughout its development, going from asymptomatic to a stage known as mild cognitive impairment (MCI), and then progressing to dementia and severe dementia; is worth mentioning that everyone suffers from cognitive impairment to some degree as we age, but the relevant task here is to identify which people are most likely to develop AD. Along with cognitive tests, evaluation of the brain morphology is the primary tool for AD diagnosis, where atrophy and loss of volume of the frontotemporal lobe are common features in patients who suffer from the disease. Regarding medical imaging techniques, magnetic resonance imaging (MRI) scans are one of the methods used by specialists to assess brain morphology. Recently, with the rise of deep learning (DL) and its successful implementation in medical imaging applications, it is of growing interest in the research community to develop computer-aided diagnosis systems that can help physicians to detect this disease, especially in the early stages where macroscopic changes are not so easily identified. This article presents a DL-based approach to classifying MRI scans in the different stages of AD, using a curated set of images from Alzheimer's Disease Neuroimaging Initiative and Open Access Series of Imaging Studies databases. Our methodology involves image pre-processing using FreeSurfer, spatial data-augmentation operations, such as rotation, flip, and random zoom during training, and state-of-the-art 3D convolutional neural networks such as EfficientNet, DenseNet, and a custom siamese network, as well as the relatively new approach of vision transformer architecture. With this approach, the best detection percentage among all four architectures was around 89% for AD vs. Control, 80% for Late MCI vs. Control, 66% for MCI vs. Control, and 67% for Early MCI vs. Control.
RESUMEN
Introduction: Older adults are at a higher risk of severe illness and death from COVID-19. This vulnerability increases in those who live in long-term care facilities due to overcrowding, greater physical dependence, and contact with health workers. Evidence on the impact of the pandemic on these establishments in lowand middle-income countries has been scant. This study aims to determine the seroprevalence of SARS-CoV-2 in older people residing in long-term care facilities and estimate the impact of infection after the first wave of the pandemic. Methods: A cross-sectional design with 2099 residents in three regions of Chile was carried out between September and November 2020. Measurement of antibodies was performed with a rapid test. The impact of SARS-CoV-2 infection was estimated with seropositive residents, those who had a history of positive polymerase chain reaction tests, and those who died from COVID-19. Bivariate analysis with the region, sex, age, history of COVID-19, physical dependence, and serological results were performed. In addition, we performed a correlation analysis between the seroprevalence of the centers by the municipality and the rate of confirmed cases. Results: The seroprevalence of SARS-CoV-2 antibodies in the three regions was 14.7% (95% confidence interval: 13.2 to 16.3%), the infection impact was 46.4%, and the fatality rate was 19.6%. A significant correlation was found between the seroprevalence of older adults residing in long-term care facilities and the cumulative incidence by municipalities. Conclusions: The seroprevalence of older adults residing in long-term care facilities was higher than the general population. The high impact of infection among this population at the end of the first wave of the COVID-19 pandemic is similar to other countries. The centers' environment is directly related to COVID-19 infection. Morbidity and mortality monitoring systems should be implemented promptly to establish prevention and control measures.
Introducción: Las personas mayores tienen más riesgo de enfermar gravemente y fallecer por COVID-19. Esta vulnerabilidad aumenta en quienes viven en establecimientos de larga estadía, debido a hacinamiento, mayor dependencia física y contacto con los trabajadores. La evidencia sobre el impacto de la pandemia de estos establecimientos en países de medianos y bajos ingresos ha sido escasa. El objetivo es determinar la seroprevalencia de la infección por SARS-CoV-2 en personas mayores que residen en establecimientos de larga estadía. Así como estimar el impacto global de la infección después de la primera ola de la pandemia. Métodos: Diseño transversal con 2099 residentes en tres regiones de Chile, realizado entre septiembre y noviembre 2020. Anticuerpos fueron medidos con test rápido contra SARS-CoV-2. Se estimó el impacto de la infección con los residentes seropositivos, los residentes con antecedentes de reacción en cadena de la polimerasa de transcripción inversa positiva, y residentes que murieron por COVID-19. Análisis bivariado entre el resultado serológico y región, sexo, edad, antecedentes de COVID-19 y dependencia física fueron realizados. Además, realizamos un análisis de correlación entre la seroprevalencia de los centros por municipio y la tasa acumulada de casos confirmados. Resultados: La seroprevalencia de anticuerpos en las tres regiones fue 14,7% (intervalo de confianza del 95%: 13,2 a 16,3%). El impacto real de la infección se estimó en 46,4% y la tasa de letalidad en 19,6%. La seroprevalencia de los residentes de los centros por comuna se correlacionó positiva y significativamente con la frecuencia de la enfermedad a nivel comunal. Conclusiones: Seroprevalencia superior a la de la población general, observándose un alto impacto de la infección por COVID-19 al final de la primera ola de la pandemia. El lugar en el que se encuentran los establecimientos está directamente relacionado con la tasa de seroprevalencia en ellos. Sistemas de vigilancia epidemiológica deben aplicarse con prontitud para establecer medidas de prevención y control.
Asunto(s)
COVID-19 , Anciano , COVID-19/epidemiología , Chile/epidemiología , Estudios Transversales , Humanos , Cuidados a Largo Plazo , Pandemias , SARS-CoV-2 , Estudios SeroepidemiológicosRESUMEN
Plant defense responses to biotic stresses are complex biological processes, all governed by sophisticated molecular regulations. Induced systemic resistance (ISR) is one of these defense mechanisms where beneficial bacteria or fungi prime plants to resist pathogens or pest attacks. In ISR, the defense arsenal in plants remains dormant and it is only triggered by an infection, allowing a better allocation of plant resources. Our group recently described that the well-known beneficial bacterium Paraburkholderia phytofirmans PsJN is able to induce Arabidopsis thaliana resistance to Pseudomonas syringae pv. tomato (Pst) DC3000 through ISR, and that ethylene, jasmonate and salicylic acid are involved in this protection. Nevertheless, the molecular networks governing this beneficial interaction remain unknown. To tackle this issue, we analyzed the temporal changes in the transcriptome of PsJN-inoculated plants before and after being infected with Pst DC3000. These data were used to perform a gene network analysis to identify highly connected transcription factors. Before the pathogen challenge, the strain PsJN regulated 405 genes (corresponding to 1.8% of the analyzed genome). PsJN-inoculated plants presented a faster and stronger transcriptional response at 1-hour post infection (hpi) compared with the non-inoculated plants, which presented the highest transcriptional changes at 24 hpi. A principal component analysis showed that PsJN-induced plant responses to the pathogen could be differentiated from those induced by the pathogen itself. Forty-eight transcription factors were regulated by PsJN at 1 hpi, and a system biology analysis revealed a network with four clusters. Within these clusters LHY, WRKY28, MYB31 and RRTF1 are highly connected transcription factors, which could act as hub regulators in this interaction. Concordantly with our previous results, these clusters are related to jasmonate, ethylene, salicylic, acid and ROS pathways. These results indicate that a rapid and specific response of PsJN-inoculated plants to the virulent DC3000 strain could be the pivotal element in the protection mechanism.
Asunto(s)
Arabidopsis/genética , Burkholderiaceae/fisiología , Regulación de la Expresión Génica de las Plantas/inmunología , Enfermedades de las Plantas/genética , Pseudomonas syringae/patogenicidad , Factores de Transcripción/genética , Arabidopsis/inmunología , Arabidopsis/microbiología , Ciclopentanos/inmunología , Ciclopentanos/metabolismo , Resistencia a la Enfermedad/genética , Etilenos/inmunología , Etilenos/metabolismo , Perfilación de la Expresión Génica , Redes Reguladoras de Genes/inmunología , Oxilipinas/inmunología , Oxilipinas/metabolismo , Enfermedades de las Plantas/inmunología , Enfermedades de las Plantas/microbiología , Reguladores del Crecimiento de las Plantas/inmunología , Reguladores del Crecimiento de las Plantas/metabolismo , Inmunidad de la Planta/genética , Análisis de Componente Principal , Pseudomonas syringae/crecimiento & desarrollo , Ácido Salicílico/inmunología , Ácido Salicílico/metabolismo , Factores de Transcripción/inmunología , Transcriptoma/inmunologíaRESUMEN
In the first decades of the 20th century, political actors diagnosed the incubation of a crisis in the Chilean schooling process. Low rates of enrollment, literacy, and attendance, inefficiency in the use of resources, poverty, and a reduced number of schools were the main factors explaining the crisis. As a response, the Law on Compulsory Primary Education, considering mandatory for children between 6 and 14 years old to attend any school for at least four years, was passed in 1920. Using data from Censuses of the Republic of Chile from 1920 and 1930, reports of the Ministry of Justice, the Ministry of Education, and the Statistical Yearbooks between 1895 and 1930, we apply machine learning techniques (clustering and decision trees) to assess the impact of this law on the Chilean schooling process between 1920 and 1930. We conclude that the law had a positive impact on the schooling indicators in this period. Even though it did not overcome the differences between urban and rural zones, it brought about a general improvement of the schooling process and a more efficient use of resources and infrastructure in both big urban centers and small-urban and rural zones, thereby managing the so-called crisis of the Republic.
Asunto(s)
Educación/historia , Aprendizaje Automático , Algoritmos , Chile , Análisis por Conglomerados , Bases de Datos como Asunto , Árboles de Decisión , Geografía , Historia del Siglo XX , Análisis de Componente Principal , Reproducibilidad de los ResultadosRESUMEN
Background: Long-read sequencing technologies are the ultimate solution for genome repeats, allowing near reference-level reconstructions of large genomes. However, long-read de novo assembly pipelines are computationally intense and require a considerable amount of coverage, thereby hindering their broad application to the assembly of large genomes. Alternatively, hybrid assembly methods that combine short- and long-read sequencing technologies can reduce the time and cost required to produce de novo assemblies of large genomes. Results: Here, we propose a new method, called Fast-SG, that uses a new ultrafast alignment-free algorithm specifically designed for constructing a scaffolding graph using light-weight data structures. Fast-SG can construct the graph from either short or long reads. This allows the reuse of efficient algorithms designed for short-read data and permits the definition of novel modular hybrid assembly pipelines. Using comprehensive standard datasets and benchmarks, we show how Fast-SG outperforms the state-of-the-art short-read aligners when building the scaffoldinggraph and can be used to extract linking information from either raw or error-corrected long reads. We also show how a hybrid assembly approach using Fast-SG with shallow long-read coverage (5X) and moderate computational resources can produce long-range and accurate reconstructions of the genomes of Arabidopsis thaliana (Ler-0) and human (NA12878). Conclusions: Fast-SG opens a door to achieve accurate hybrid long-range reconstructions of large genomes with low effort, high portability, and low cost.