Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 11 de 11
Filter
1.
Proc Natl Acad Sci U S A ; 117(22): 12411-12418, 2020 06 02.
Article in English | MEDLINE | ID: mdl-32430323

ABSTRACT

Genetic factors and socioeconomic status (SES) inequalities play a large role in educational attainment, and both have been associated with variations in brain structure and cognition. However, genetics and SES are correlated, and no prior study has assessed their neural associations independently. Here we used a polygenic score for educational attainment (EduYears-PGS), as well as SES, in a longitudinal study of 551 adolescents to tease apart genetic and environmental associations with brain development and cognition. Subjects received a structural MRI scan at ages 14 and 19. At both time points, they performed three working memory (WM) tasks. SES and EduYears-PGS were correlated (r = 0.27) and had both common and independent associations with brain structure and cognition. Specifically, lower SES was related to less total cortical surface area and lower WM. EduYears-PGS was also related to total cortical surface area, but in addition had a regional association with surface area in the right parietal lobe, a region related to nonverbal cognitive functions, including mathematics, spatial cognition, and WM. SES, but not EduYears-PGS, was related to a change in total cortical surface area from age 14 to 19. This study demonstrates a regional association of EduYears-PGS and the independent prediction of SES with cognitive function and brain development. It suggests that the SES inequalities, in particular parental education, are related to global aspects of cortical development, and exert a persistent influence on brain development during adolescence.


Subject(s)
Brain/growth & development , Cognition , Educational Status , Academic Success , Adolescent , Adult , Brain/diagnostic imaging , Brain/physiology , Female , Humans , Longitudinal Studies , Magnetic Resonance Imaging , Male , Memory, Short-Term , Multifactorial Inheritance , Social Class , Young Adult
2.
PLoS Comput Biol ; 12(5): e1004871, 2016 05.
Article in English | MEDLINE | ID: mdl-27177143

ABSTRACT

By integrating Haar wavelets with Hidden Markov Models, we achieve drastically reduced running times for Bayesian inference using Forward-Backward Gibbs sampling. We show that this improves detection of genomic copy number variants (CNV) in array CGH experiments compared to the state-of-the-art, including standard Gibbs sampling. The method concentrates computational effort on chromosomal segments which are difficult to call, by dynamically and adaptively recomputing consecutive blocks of observations likely to share a copy number. This makes routine diagnostic use and re-analysis of legacy data collections feasible; to this end, we also propose an effective automatic prior. An open source software implementation of our method is available at http://schlieplab.org/Software/HaMMLET/ (DOI: 10.5281/zenodo.46262). This paper was selected for oral presentation at RECOMB 2016, and an abstract is published in the conference proceedings.


Subject(s)
Comparative Genomic Hybridization/statistics & numerical data , DNA Copy Number Variations , Models, Genetic , Bayes Theorem , Breast Neoplasms/genetics , Cell Line , Computational Biology , Computer Simulation , Data Compression , Female , Genome, Human , Humans , Markov Chains , Software
3.
Bioinformatics ; 28(18): i325-i332, 2012 Sep 15.
Article in English | MEDLINE | ID: mdl-22962448

ABSTRACT

MOTIVATION: Mapping billions of reads from next generation sequencing experiments to reference genomes is a crucial task, which can require hundreds of hours of running time on a single CPU even for the fastest known implementations. Traditional approaches have difficulties dealing with matches of large edit distance, particularly in the presence of frequent or large insertions and deletions (indels). This is a serious obstacle both in determining the spectrum and abundance of genetic variations and in personal genomics. RESULTS: For the first time, we adopt the approximate string matching paradigm of geometric embedding to read mapping, thus rephrasing it to nearest neighbor queries in a q-gram frequency vector space. Using the L(1) distance between frequency vectors has the benefit of providing lower bounds for an edit distance with affine gap costs. Using a cache-oblivious kd-tree, we realize running times, which match the state-of-the-art. Additionally, running time and memory requirements are about constant for read lengths between 100 and 1000 bp. We provide a first proof-of-concept that geometric embedding is a promising paradigm for read mapping and that L(1) distance might serve to detect structural variations. TreQ, our initial implementation of that concept, performs more accurate than many popular read mappers over a wide range of structural variants. AVAILABILITY AND IMPLEMENTATION: TreQ will be released under the GNU Public License (GPL), and precomputed genome indices will be provided for download at http://treq.sf.net. CONTACT: pavelm@cs.rutgers.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , INDEL Mutation , Sequence Analysis, DNA/methods , Chromosome Mapping , Genetic Variation , Genome, Human , Genomics/methods , Humans , Nucleotides/chemistry
4.
NPJ Sci Learn ; 6(1): 16, 2021 Jun 02.
Article in English | MEDLINE | ID: mdl-34078902

ABSTRACT

The interplay of genetic and environmental factors behind cognitive development has preoccupied multiple fields of science and sparked heated debates over the decades. Here we tested the hypothesis that developmental genes rely heavily on cognitive challenges-as opposed to natural maturation. Starting with a polygenic score (cogPGS) that previously explained variation in cognitive performance in adults, we estimated its effect in 344 children and adolescents (mean age of 12 years old, ranging from 6 to 25) who showed changes in working memory (WM) in two distinct samples: (1) a developmental sample showing significant WM gains after 2 years of typical, age-related development, and (2) a training sample showing significant, experimentally-induced WM gains after 25 days of an intense WM training. We found that the same genetic factor, cogPGS, significantly explained the amount of WM gain in both samples. And there was no interaction of cogPGS with sample, suggesting that those genetic factors are neutral to whether the WM gains came from development or training. These results represent evidence that cognitive challenges are a central piece in the gene-environment interplay during cognitive development. We believe our study sheds new light on previous findings of interindividual differences in education (rich-get-richer and compensation effects), brain plasticity in children, and the heritability increase of intelligence across the lifespan.

5.
PeerJ ; 8: e8225, 2020.
Article in English | MEDLINE | ID: mdl-32025365

ABSTRACT

Natural history museums are unique spaces for interdisciplinary research and educational innovation. Through extensive exhibits and public programming and by hosting rich communities of amateurs, students, and researchers at all stages of their careers, they can provide a place-based window to focus on integration of science and discovery, as well as a locus for community engagement. At the same time, like a synthesis radio telescope, when joined together through emerging digital resources, the global community of museums (the 'Global Museum') is more than the sum of its parts, allowing insights and answers to diverse biological, environmental, and societal questions at the global scale, across eons of time, and spanning vast diversity across the Tree of Life. We argue that, whereas natural history collections and museums began with a focus on describing the diversity and peculiarities of species on Earth, they are now increasingly leveraged in new ways that significantly expand their impact and relevance. These new directions include the possibility to ask new, often interdisciplinary questions in basic and applied science, such as in biomimetic design, and by contributing to solutions to climate change, global health and food security challenges. As institutions, they have long been incubators for cutting-edge research in biology while simultaneously providing core infrastructure for research on present and future societal needs. Here we explore how the intersection between pressing issues in environmental and human health and rapid technological innovation have reinforced the relevance of museum collections. We do this by providing examples as food for thought for both the broader academic community and museum scientists on the evolving role of museums. We also identify challenges to the realization of the full potential of natural history collections and the Global Museum to science and society and discuss the critical need to grow these collections. We then focus on mapping and modelling of museum data (including place-based approaches and discovery), and explore the main projects, platforms and databases enabling this growth. Finally, we aim to improve relevant protocols for the long-term storage of specimens and tissues, ensuring proper connection with tomorrow's technologies and hence further increasing the relevance of natural history museums.

6.
Algorithms Mol Biol ; 14: 20, 2019.
Article in English | MEDLINE | ID: mdl-31572486

ABSTRACT

BACKGROUND: Full Bayesian inference for detecting copy number variants (CNV) from whole-genome sequencing (WGS) data is still largely infeasible due to computational demands. A recently introduced approach to perform Forward-Backward Gibbs sampling using dynamic Haar wavelet compression has alleviated issues of convergence and, to some extent, speed. Yet, the problem remains challenging in practice. RESULTS: In this paper, we propose an improved algorithmic framework for this approach. We provide new space-efficient data structures to query sufficient statistics in logarithmic time, based on a linear-time, in-place transform of the data, which also improves on the compression ratio. We also propose a new approach to efficiently store and update marginal state counts obtained from the Gibbs sampler. CONCLUSIONS: Using this approach, we discover several CNV candidates in two rat populations divergently selected for tame and aggressive behavior, consistent with earlier results concerning the domestication syndrome as well as experimental observations. Computationally, we observe a 29.5-fold decrease in memory, an average 5.8-fold speedup, as well as a 191-fold decrease in minor page faults. We also observe that metrics varied greatly in the old implementation, but not the new one. We conjecture that this is due to the better compression scheme. The fully Bayesian segmentation of the entire WGS data set required 3.5 min and 1.24 GB of memory, and can hence be performed on a commodity laptop.

7.
PeerJ ; 7: e6399, 2019.
Article in English | MEDLINE | ID: mdl-30783571

ABSTRACT

Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress.

8.
Methods Mol Biol ; 1833: 83-93, 2018.
Article in English | MEDLINE | ID: mdl-30039365

ABSTRACT

CNV detection requires a high-quality segmentation of genomic data. In many WGS experiments, sample and control are sequenced together in a multiplexed fashion using DNA barcoding for economic reasons. Using the differential read depth of these two conditions cancels out systematic additive errors. Due to this detrending, the resulting data is appropriate for inference using a hidden Markov model (HMM), arguably one of the principal models for labeled segmentation. However, while the usual frequentist approaches such as Baum-Welch are problematic for several reasons, they are often preferred to Bayesian HMM inference, which normally requires prohibitively long running times and exceeds a typical user's computational resources on a genome scale data. HaMMLET solves this problem using a dynamic wavelet compression scheme, which makes Bayesian segmentation of WGS data feasible on standard consumer hardware.


Subject(s)
DNA Barcoding, Taxonomic/methods , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA/methods , Markov Chains
9.
Ann N Y Acad Sci ; 1257: 142-51, 2012 Jun.
Article in English | MEDLINE | ID: mdl-22671600

ABSTRACT

In epithelia and endothelia, overall resistance (TER) is determined by all ion-conductive structures, such as membrane channels, tight junctions, and the intercellular space, whereas the epithelial capacitance is due to the hydrophobic phase of the plasma membrane. Impedance means alternating current resistance and, in contrast to ohmic resistance, takes into account that, e.g., capacitors become increasingly conductive with increasing frequency. Impedance spectroscopy uses the association of the capacitance with the transcellular pathway to distinguish between this capacitive pathway and purely conductive components (tight junctions, subepithelium). In detail, one-path impedance spectroscopy distinguishes the resistance of the epithelium from the resistance of subepithelial tissues. Beyond that, two-path impedance spectroscopy allows for the separation of paracellular resistance (governed by tight junctional properties) from transcellular resistance (determined by conductive structures residing in the cell membranes). The present paper reviews the basic principles of these techniques, some historic milestones, as well as recent developments in epithelial physiology.


Subject(s)
Cell Membrane/physiology , Dielectric Spectroscopy/methods , Epithelium/physiology , Tight Junctions/physiology , Dielectric Spectroscopy/history , Electric Impedance , History, 20th Century , Humans
10.
Article in English | MEDLINE | ID: mdl-21282868

ABSTRACT

Interactions of protein domains control essential cellular processes. Thus, inferring the evolutionary histories of multidomain proteins in the context of their families can provide rewarding insights into protein function. However, methods to infer these histories are challenged by the complexity of macroevolutionary events. Here, we address this challenge by describing an algorithm that computes a novel network-like structure, called plexus, which represents the evolution of domains and their combinations. Finally, we demonstrate the performance of this algorithm with empirical data sets.


Subject(s)
Computational Biology/methods , Evolution, Molecular , Models, Biological , Proteins/chemistry , Proteins/genetics , Algorithms , Phylogeny , Protein Structure, Tertiary
11.
PLoS Curr ; 3: RRN1240, 2011 Jun 09.
Article in English | MEDLINE | ID: mdl-21686311

ABSTRACT

Phylogenies of multi-domain proteins have to incorporate macro-evolutionary events, which dramatically increases the complexity of their construction.We present an application to infer ancestral multi-domain proteins given a species tree and domain phylogenies. As the individual domain phylogenies are often incongruent, we provide diagnostics for the identification and reconciliation of implausible topologies. We implement and extend a suggested algorithmic approach by Behzadi and Vingron (2006).

SELECTION OF CITATIONS
SEARCH DETAIL