Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 77
Filter
1.
J R Stat Soc Series B Stat Methodol ; 86(3): 694-713, 2024 Jul.
Article in English | MEDLINE | ID: mdl-39005888

ABSTRACT

Quantifying the association between components of multivariate random curves is of general interest and is a ubiquitous and basic problem that can be addressed with functional data analysis. An important application is the problem of assessing functional connectivity based on functional magnetic resonance imaging (fMRI), where one aims to determine the similarity of fMRI time courses that are recorded on anatomically separated brain regions. In the functional brain connectivity literature, the static temporal Pearson correlation has been the prevailing measure for functional connectivity. However, recent research has revealed temporally changing patterns of functional connectivity, leading to the study of dynamic functional connectivity. This motivates new similarity measures for pairs of random curves that reflect the dynamic features of functional similarity. Specifically, we introduce gradient synchronization measures in a general setting. These similarity measures are based on the concordance and discordance of the gradients between paired smooth random functions. Asymptotic normality of the proposed estimates is obtained under regularity conditions. We illustrate the proposed synchronization measures via simulations and an application to resting-state fMRI signals from the Alzheimer's Disease Neuroimaging Initiative and they are found to improve discrimination between subjects with different disease status.

2.
Aging Cell ; 23(4): e14080, 2024 04.
Article in English | MEDLINE | ID: mdl-38268242

ABSTRACT

The relationship between the early-age activity of Mediterranean fruit flies (medflies) or other fruit flies and their lifespan has not been much studied, in contrast to the connections between lifespan and diet, sexual signaling, and reproduction. The objective of this study is to assess intra-day and day-to-day activity profiles of female Mediterranean fruit flies and their role as biomarker of longevity as well as to explore the relationships between these activity profiles, diet, and age-at-death throughout the lifespan. We use advanced statistical methods from functional data analysis (FDA). Three distinct patterns of activity variations in early-age activity profiles can be distinguished. A low-caloric diet is associated with a delayed activity peak, while a high-caloric diet is linked with an earlier activity peak. We find that age-at-death of individual medflies is connected to their activity profiles in early life. An increased risk of mortality is associated with increased activity in early age, as well as with a higher contrast between daytime and nighttime activity. Conversely, medflies are more likely to have a longer lifespan when they are fed a medium-caloric diet and when their daily activity is more evenly distributed across the early-age span and between daytime and nighttime. The before-death activity profile of medflies displays two characteristic before-death patterns, where one pattern is characterized by slowly declining daily activity and the other by a sudden decline in activity that is followed by death.


Subject(s)
Ceratitis capitata , Longevity , Animals , Female , Aging , Reproduction , Drosophila , Biomarkers
3.
Obesity (Silver Spring) ; 32(1): 156-165, 2024 Jan.
Article in English | MEDLINE | ID: mdl-37817330

ABSTRACT

OBJECTIVE: Common obesity-associated genetic variants at the fat mass and obesity-associated (FTO) locus have been associated with appetitive behaviors and altered structure and function of frontostriatal brain regions. The authors aimed to investigate the influence of FTO variation on frontostriatal appetite circuits in early life. METHODS: Data were drawn from RESONANCE, a longitudinal study of early brain development. Growth trajectories of nucleus accumbens and frontal lobe volumes, as well as total gray matter and white matter volume, by risk allele (AA) carrier status on FTO single-nucleotide polymorphism rs9939609 were examined in 228 children (102 female, 126 male) using magnetic resonance imaging assessments obtained from infancy through middle childhood. The authors fit functional concurrent regression models with brain volume outcomes over age as functional responses, and FTO genotype, sex, BMI z score, and maternal education were included as predictors. RESULTS: Bootstrap pointwise 95% CI for regression coefficient functions in the functional concurrent regression models showed that the AA group versus the group with no risk allele (TT) had greater nucleus accumbens volume (adjusted for total brain volume) in the interval of 750 to 2250 days (2-6 years). CONCLUSIONS: These findings suggest that common genetic risk for obesity is associated with differences in early development of brain reward circuitry and argue for investigating dynamic relationships among genotype, brain, behavior, and weight throughout development.


Subject(s)
Obesity , Polymorphism, Single Nucleotide , Humans , Male , Child , Female , Longitudinal Studies , Obesity/genetics , Obesity/complications , Risk Factors , Genotype , Brain/diagnostic imaging , Alpha-Ketoglutarate-Dependent Dioxygenase FTO/genetics , Body Mass Index , Genetic Predisposition to Disease
4.
J Appl Stat ; 50(11-12): 2294-2309, 2023.
Article in English | MEDLINE | ID: mdl-37529574

ABSTRACT

The study of events distributed over time which can be quantified as point processes has attracted much interest over the years due to its wide range of applications. It has recently gained new relevance due to the COVID-19 case and death processes associated with SARS-CoV-2 that characterize the COVID-19 pandemic and are observed across different countries. It is of interest to study the behavior of these point processes and how they may be related to covariates such as mobility restrictions, gross domestic product per capita, and fraction of population of older age. As infections and deaths in a region are intrinsically events that arrive at random times, a point process approach is natural for this setting. We adopt techniques for conditional functional point processes that target point processes as responses with vector covariates as predictors, to study the interaction and optimal transport between case and death processes and doubling times conditional on covariates.

5.
J R Stat Soc Series B Stat Methodol ; 85(3): 1012-1033, 2023 Jul.
Article in English | MEDLINE | ID: mdl-37521164

ABSTRACT

Series of univariate distributions indexed by equally spaced time points are ubiquitous in applications and their analysis constitutes one of the challenges of the emerging field of distributional data analysis. To quantify such distributional time series, we propose a class of intrinsic autoregressive models that operate in the space of optimal transport maps. The autoregressive transport models that we introduce here are based on regressing optimal transport maps on each other, where predictors can be transport maps from an overall barycenter to a current distribution or transport maps between past consecutive distributions of the distributional time series. Autoregressive transport models and their associated distributional regression models specify the link between predictor and response transport maps by moving along geodesics in Wasserstein space. These models emerge as natural extensions of the classical autoregressive models in Euclidean space. Unique stationary solutions of autoregressive transport models are shown to exist under a geometric moment contraction condition of Wu & Shao [(2004) Limit theorems for iterated random functions. Journal of Applied Probability 41, 425-436)], using properties of iterated random functions. We also discuss an extension to a varying coefficient model for first-order autoregressive transport models. In addition to simulations, the proposed models are illustrated with distributional time series of house prices across U.S. counties and annual summer temperature distributions.

6.
bioRxiv ; 2023 Jun 09.
Article in English | MEDLINE | ID: mdl-37333100

ABSTRACT

The relationship between the early age activity of Mediterranean fruit flies or other fruit flies and their lifespan has not been much studied, in contrast to the connections between lifespan and diet, sexual signaling and reproduction. The objective of this study is to assess intraday and day-to-day activity profiles of female Mediterranean fruit flies and their role as biomarker of longevity as well as to explore the relationships between these activity profiles, diet and age-at-death throughout the lifespan. Three distinct patterns of activity variations in early age activity profiles can be distinguished. A low-caloric diet is associated with a delayed activity peak, while a high-caloric diet is linked with an earlier activity peak. We find that age-at-death of individual medflies is connected to their activity profiles in early life. An increased risk of mortality is associated with increased activity in early age, as well as with a higher contrast between daytime and nighttime activity. Conversely, medflies are more likely to have a longer lifespan when they are fed a medium caloric diet and when their daily activity is more evenly distributed across the early age span and between daytime and nighttime. The before-death activity profile of medflies displays two characteristic before-death patterns, where one pattern is characterized by slowly declining daily activity and the other by a sudden decline in activity that is followed by death.

7.
Biometrics ; 79(4): 3345-3358, 2023 12.
Article in English | MEDLINE | ID: mdl-36877941

ABSTRACT

Multivariate functional data present theoretical and practical complications that are not found in univariate functional data. One of these is a situation where the component functions of multivariate functional data are positive and are subject to mutual time warping. That is, the component processes exhibit a common shape but are subject to systematic phase variation across their domains in addition to subject-specific time warping, where each subject has its own internal clock. This motivates a novel model for multivariate functional data that connect such mutual time warping to a latent-deformation-based framework by exploiting a novel time-warping separability assumption. This separability assumption allows for meaningful interpretation and dimension reduction. The resulting latent deformation model is shown to be well suited to represent commonly encountered functional vector data. The proposed approach combines a random amplitude factor for each component with population-based registration across the components of a multivariate functional data vector and includes a latent population function, which corresponds to a common underlying trajectory. We propose estimators for all components of the model, enabling implementation of the proposed data-based representation for multivariate functional data and downstream analyses such as Fréchet regression. Rates of convergence are established when curves are fully observed or observed with measurement error. The usefulness of the model, interpretations, and practical aspects are illustrated in simulations and with application to multivariate human growth curves and multivariate environmental pollution data.


Subject(s)
Time , Humans
8.
Hum Brain Mapp ; 44(8): 3168-3179, 2023 06 01.
Article in English | MEDLINE | ID: mdl-36896867

ABSTRACT

Brain growth in early childhood is reflected in the evolution of proportional cerebrospinal fluid volumes (pCSF), grey matter (pGM), and white matter (pWM). We study brain development as reflected in the relative fractions of these three tissues for a cohort of 388 children that were longitudinally followed between the ages of 18 and 96 months. We introduce statistical methodology (Riemannian Principal Analysis through Conditional Expectation, RPACE) that addresses major challenges that are of general interest for the analysis of longitudinal neuroimaging data, including the sparsity of the longitudinal observations over time and the compositional structure of the relative brain volumes. Applying the RPACE methodology, we find that longitudinal growth as reflected by tissue composition differs significantly for children of mothers with higher and lower maternal education levels.


Subject(s)
Brain , White Matter , Female , Humans , Child , Child, Preschool , Infant , Brain/diagnostic imaging , Gray Matter/diagnostic imaging , White Matter/diagnostic imaging , Educational Status , Neuroimaging , Magnetic Resonance Imaging , Longitudinal Studies
9.
Sci Rep ; 13(1): 2984, 2023 02 20.
Article in English | MEDLINE | ID: mdl-36804963

ABSTRACT

The maturation of regional brain volumes from birth to preadolescence is a critical developmental process that underlies emerging brain structural connectivity and function. Regulated by genes and environment, the coordinated growth of different brain regions plays an important role in cognitive development. Current knowledge about structural network evolution is limited, partly due to the sparse and irregular nature of most longitudinal neuroimaging data. In particular, it is unknown how factors such as mother's education or sex of the child impact the structural network evolution. To address this issue, we propose a method to construct evolving structural networks and study how the evolving connections among brain regions as reflected at the network level are related to maternal education and biological sex of the child and also how they are associated with cognitive development. Our methodology is based on applying local Fréchet regression to longitudinal neuroimaging data acquired from the RESONANCE cohort, a cohort of healthy children (245 females and 309 males) ranging in age from 9 weeks to 10 years. Our findings reveal that sustained highly coordinated volume growth across brain regions is associated with lower maternal education and lower cognitive development. This suggests that higher neurocognitive performance levels in children are associated with increased variability of regional growth patterns as children age.


Subject(s)
Magnetic Resonance Imaging , Mothers , Male , Female , Humans , Child , Child, Preschool , Brain/diagnostic imaging , Cognition , Educational Status
10.
PLoS One ; 17(7): e0269598, 2022.
Article in English | MEDLINE | ID: mdl-35802688

ABSTRACT

We study the U.S. Supreme Court dynamics by analyzing the temporal evolution of the underlying policy positions of the Supreme Court Justices as reflected by their actual voting data, using functional data analysis methods. The proposed fully flexible nonparametric method makes it possible to dissect the time-dynamics of policy positions at the level of individual Justices, as well as providing a comprehensive view of the ideology evolution over the history of Supreme Court since its establishment. In addition to quantifying individual Justice's policy positions, we uncover average changes over time and also the major patterns of change over time. Additionally, our approach allows for representing highly complex dynamic trajectories by a few principal components which complements other models of analyzing and predicting court behavior.


Subject(s)
Data Analysis , Social Justice , Politics , Supreme Court Decisions , United States
11.
J Math Anal Appl ; 514(2): 125677, 2022 Oct 15.
Article in English | MEDLINE | ID: mdl-34642503

ABSTRACT

Delay differential equations form the underpinning of many complex dynamical systems. The forward problem of solving random differential equations with delay has received increasing attention in recent years. Motivated by the challenge to predict the COVID-19 caseload trajectories for individual states in the U.S., we target here the inverse problem. Given a sample of observed random trajectories obeying an unknown random differential equation model with delay, we use a functional data analysis framework to learn the model parameters that govern the underlying dynamics from the data. We show the existence and uniqueness of the analytical solutions of the population delay random differential equation model when one has discrete time delays in the functional concurrent regression model and also for a second scenario where one has a delay continuum or distributed delay. The latter involves a functional linear regression model with history index. The derivative of the process of interest is modeled using the process itself as predictor and also other functional predictors with predictor-specific delayed impacts. This dynamics learning approach is shown to be well suited to model the growth rate of COVID-19 for the states that are part of the U.S., by pooling information from the individual states, using the case process and concurrently observed economic and mobility data as predictors.

12.
Neuroimage ; 237: 118079, 2021 08 15.
Article in English | MEDLINE | ID: mdl-34000395

ABSTRACT

Early childhood is a period marked by rapid brain growth accompanied by cognitive and motor development. However, it remains unclear how early developmental skills relate to neuroanatomical growth across time with no growth quantile trajectories of typical brain development currently available to place and compare individual neuroanatomical development. Even though longitudinal neuroimaging data have become more common, they are often sparse, making dynamic analyses at subject level a challenging task. Using the Principal Analysis through Conditional Expectation (PACE) approach geared towards sparse longitudinal data, we investigate the evolution of gray matter, white matter and cerebrospinal fluid volumes in a cohort of 446 children between the ages of 1 and 120 months. For each child, we calculate their dynamic age-varying association between the growing brain and scores that assess cognitive functioning, applying the functional varying coefficient model. Using local Fréchet regression, we construct age-varying growth percentiles to reveal the evolution of brain development across the population. To further demonstrate its utility, we apply PACE to predict individual trajectories of brain development.


Subject(s)
Brain , Child Development/physiology , Image Processing, Computer-Assisted/methods , Models, Theoretical , Neuroimaging/methods , Brain/anatomy & histology , Brain/diagnostic imaging , Brain/physiology , Child , Child, Preschool , Connectome , Female , Humans , Infant , Longitudinal Studies , Magnetic Resonance Imaging , Magnetic Resonance Spectroscopy , Male
13.
Biometrics ; 77(4): 1328-1341, 2021 12.
Article in English | MEDLINE | ID: mdl-33034049

ABSTRACT

Modern data collection often entails longitudinal repeated measurements that assume values on a Riemannian manifold. Analyzing such longitudinal Riemannian data is challenging, because of both the sparsity of the observations and the nonlinear manifold constraint. Addressing this challenge, we propose an intrinsic functional principal component analysis for longitudinal Riemannian data. Information is pooled across subjects by estimating the mean curve with local Fréchet regression and smoothing the covariance structure of the linearized data on tangent spaces around the mean. Dimension reduction and imputation of the manifold-valued trajectories are achieved by utilizing the leading principal components and applying best linear unbiased prediction. We show that the proposed mean and covariance function estimates achieve state-of-the-art convergence rates. For illustration, we study the development of brain connectivity in a longitudinal cohort of Alzheimer's disease and normal participants by modeling the connectivity on the manifold of symmetric positive definite matrices with the affine-invariant metric. In a second illustration for irregularly recorded longitudinal emotion compositional data for unemployed workers, we show that the proposed method leads to nicely interpretable eigenfunctions and principal component scores. Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative database.


Subject(s)
Algorithms , Alzheimer Disease , Alzheimer Disease/diagnostic imaging , Databases, Factual , Humans , Neuroimaging
14.
Sci Rep ; 10(1): 21040, 2020 12 03.
Article in English | MEDLINE | ID: mdl-33273598

ABSTRACT

We apply tools from functional data analysis to model cumulative trajectories of COVID-19 cases across countries, establishing a framework for quantifying and comparing cases and deaths across countries longitudinally. It emerges that a country's trajectory during an initial first month "priming period" largely determines how the situation unfolds subsequently. We also propose a method for forecasting case counts, which takes advantage of the common, latent information in the entire sample of curves, instead of just the history of a single country. Our framework facilitates to quantify the effects of demographic covariates and social mobility on doubling rates and case fatality rates through a time-varying regression model. Decreased workplace mobility is associated with lower doubling rates with a roughly 2 week delay, and case fatality rates exhibit a positive feedback pattern.


Subject(s)
COVID-19/epidemiology , Pandemics/statistics & numerical data , Forecasting/methods , Humans , Models, Statistical , Risk Factors
15.
Hum Brain Mapp ; 40(14): 4130-4145, 2019 10 01.
Article in English | MEDLINE | ID: mdl-31187920

ABSTRACT

From birth to 5 years of age, brain structure matures and evolves alongside emerging cognitive and behavioral abilities. In relating concurrent cognitive functioning and measures of brain structure, a major challenge that has impeded prior investigation of their time-dynamic relationships is the sparse and irregular nature of most longitudinal neuroimaging data. We demonstrate how this problem can be addressed by applying functional concurrent regression models (FCRMs) to longitudinal cognitive and neuroimaging data. The application of FCRM in neuroimaging is illustrated with longitudinal neuroimaging and cognitive data acquired from a large cohort (n = 210) of healthy children, 2-48 months of age. Quantifying white matter myelination by using myelin water fraction (MWF) as imaging metric derived from MRI scans, application of this methodology reveals an early period (200-500 days) during which whole brain and regional white matter structure, as quantified by MWF, is positively associated with cognitive ability, while we found no such association for whole brain white matter volume. Adjusting for baseline covariates including socioeconomic status as measured by maternal education (SES-ME), infant feeding practice, gender, and birth weight further reveals an increasing association between SES-ME and cognitive development with child age. These results shed new light on the emerging patterns of brain and cognitive development, indicating that FCRM provides a useful tool for investigating these evolving relationships.


Subject(s)
Brain/growth & development , Child Development/physiology , Cognition/physiology , White Matter/growth & development , Brain/physiology , Child, Preschool , Female , Humans , Infant , Longitudinal Studies , Magnetic Resonance Imaging , Male , White Matter/physiology
16.
G3 (Bethesda) ; 9(3): 841-853, 2019 03 07.
Article in English | MEDLINE | ID: mdl-30670607

ABSTRACT

Numerous quantitative trait loci (QTL) have been mapped in tetraploid and hexaploid wheat and wheat relatives, mostly with simple sequence repeat (SSR) or single nucleotide polymorphism (SNP) markers. To conduct meta-analysis of QTL requires projecting them onto a common genomic framework, either a consensus genetic map or genomic sequence. The latter strategy is pursued here. Of 774 QTL mapped in wheat and wheat relatives found in the literature, 585 (75.6%) were successfully projected onto the Aegilops tauschii pseudomolecules. QTL mapped with SNP markers were more successfully projected (92.2%) than those mapped with SSR markers (66.2%). The QTL were not distributed homogeneously along chromosome arms. Their frequencies increased in the proximal-to-distal direction but declined in the most distal regions and were weakly correlated with recombination rates along the chromosome arms. Databases for projected SSR markers and QTL were constructed and incorporated into the Ae. tauschii JBrowse. To facilitate meta-QTL analysis, eight clusters of QTL were used to estimate standard deviations ([Formula: see text]) of independently mapped QTL projected onto the Ae. tauschii genome sequence. The standard deviations [Formula: see text] were modeled as an exponential decay function of recombination rates along the Ae. tauschii chromosomes. We implemented four hypothesis tests for determining the membership of query QTL. The hypothesis tests and estimation procedure for [Formula: see text] were implemented in a web portal for meta-analysis of projected QTL. Twenty-one QTL for Fusarium head blight resistance mapped on wheat chromosomes 3A, 3B, and 3D were analyzed to illustrate the use of the portal for meta-QTL analyses.


Subject(s)
Aegilops/genetics , Genome, Plant , Quantitative Trait Loci , Sequence Analysis, DNA , Triticum/genetics , Data Analysis , Disease Resistance/genetics , Fusariosis , Genomics , Meta-Analysis as Topic , Microsatellite Repeats , Plant Diseases , Polymorphism, Single Nucleotide , Polyploidy
17.
Brain Connect ; 9(1): 37-47, 2019 02.
Article in English | MEDLINE | ID: mdl-30265561

ABSTRACT

The use of correlation densities is introduced to quantify and provide visual interpretation for intraregional functional connectivity in the brain. For each brain region, pairwise correlations are computed between a seed voxel and other gray matter voxels within the region, and the distribution of the ensemble of these correlation values is represented as a probability density, the correlation density. The correlation density can be estimated by kernel smoothing. It provides an intuitive and comprehensive representation of subject-specific functional connectivity strength at the local level for each region. To address the challenge of interpreting and utilizing this rich connectivity information when multiple regions are considered, methods from functional data analysis are implemented, including a recently developed method of dimensionality reduction specifically tailored to the analysis of probability distributions. To illustrate the utility of these methods in neuroimaging, experiments were carried out to identify the associations between local functional connectivity and a battery of neurocognitive scores. These experiments demonstrate that correlation densities facilitate the discovery and interpretation of specific region-score associations.


Subject(s)
Brain/physiology , Connectome/methods , Magnetic Resonance Imaging , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/physiopathology , Brain/diagnostic imaging , Cognition/physiology , Data Interpretation, Statistical , Humans , Image Processing, Computer-Assisted/methods , Neural Pathways/diagnostic imaging , Neural Pathways/physiology
18.
Brain Struct Funct ; 224(2): 535-551, 2019 Mar.
Article in English | MEDLINE | ID: mdl-30392094

ABSTRACT

The maturation of the myelinated white matter throughout childhood is a critical developmental process that underlies emerging connectivity and brain function. In response to genetic influences and neuronal activities, myelination helps establish the mature neural networks that support cognitive and behavioral skills. The emergence and refinement of brain networks, traditionally investigated using functional imaging data, can also be interrogated using longitudinal structural imaging data. However, few studies of structural network development throughout infancy and early childhood have been presented, likely owing to the sparse and irregular nature of most longitudinal neuroimaging data, which complicates dynamic analysis. Here, we overcome this limitation and investigate through concurrent correlation the co-development of white matter myelination and volume, and structural network development of white matter myelination between brain regions as a function of age, using statistically well-supported methods. We show that the concurrent correlation of white matter myelination and volume is overall positive and reaches a peak at 580 days. Brain regions are found to differ in overall magnitudes and patterns of time-varying association throughout early childhood. We introduce time-dynamic developmental networks based on temporal similarity of association patterns in the levels of myelination across brain regions. These networks reflect groups of brain regions that share similar patterns of evolving intra-regional connectivity, as evidenced by levels of myelination, are biologically interpretable and provide novel visualizations of brain development. Comparing the constructed networks between different maternal education groups, we found that children with higher and lower maternal education differ significantly in the overall magnitude of the time-dynamic correlations.


Subject(s)
Brain/diagnostic imaging , Child Development/physiology , Nerve Fibers, Myelinated , Neuroimaging , White Matter/diagnostic imaging , Brain/growth & development , Child, Preschool , Female , Humans , Infant , Magnetic Resonance Imaging , Male , White Matter/growth & development
19.
PLoS One ; 13(11): e0207073, 2018.
Article in English | MEDLINE | ID: mdl-30419052

ABSTRACT

For longitudinal studies with multivariate observations, we propose statistical methods to identify clusters of archetypal subjects by using techniques from functional data analysis and to relate longitudinal patterns to outcomes. We demonstrate how this approach can be applied to examine associations between multiple time-varying exposures and subsequent health outcomes, where the former are recorded sparsely and irregularly in time, with emphasis on the utility of multiple longitudinal observations in the framework of dimension reduction techniques. In applications to children's growth data, we investigate archetypes of infant growth patterns and identify subgroups that are related to cognitive development in childhood. Specifically, "Stunting" and "Faltering" time-dynamic patterns of head circumference, body length and weight in the first 12 months are associated with lower levels of long-term cognitive development in comparison to "Generally Large" and "Catch-up" growth. Our findings provide evidence for the statistical association between multivariate growth patterns in infancy and long-term cognitive development.


Subject(s)
Cognition/physiology , Growth and Development/physiology , Principal Component Analysis , Intelligence , Longitudinal Studies , Models, Statistical , Multivariate Analysis , Risk , Time Factors
20.
Genetics ; 210(3): 1039-1051, 2018 11.
Article in English | MEDLINE | ID: mdl-30158124

ABSTRACT

Long terminal repeat-retrotransposons (LTR-RTs) are a major component of all flowering plant genomes. To analyze the time dynamics of LTR-RTs, we modeled the insertion rates of the 35 most abundant LTR-RT families in the genome of Aegilops tauschii, one of the progenitors of wheat. Our model of insertion rate (birth) takes into account random variation in LTR divergence and the deletion rate (death) of LTR-RTs. Modeling the death rate is crucial because ignoring it would underestimate insertion rates in the distant past. We rejected the hypothesis of constancy of insertion rates for all 35 families and showed by simulations that our hypothesis test controlled the false-positive rate. LTR-RT insertions peaked from 0.064 to 2.39 MYA across the 35 families. Among other effects, the average age of elements within a family was negatively associated with recombination rate along a chromosome, with proximity to the closest gene, and weakly associated with the proximity to its 5' end. Elements within a family that were near genes colinear with genes in the genome of tetraploid emmer wheat tended to be younger than those near noncolinear genes. We discuss these associations in the context of genome evolution and stability of genome sizes in the tribe Triticeae. We demonstrate the general utility of our models by analyzing the two most abundant LTR-RT families in Arabidopsis lyrata, and show that these families differed in their insertion dynamics. Our estimation methods are available in the R package TE on CRAN.


Subject(s)
Aegilops/genetics , Retroelements/genetics , Terminal Repeat Sequences/genetics , Genome, Plant/genetics , Genomics
SELECTION OF CITATIONS
SEARCH DETAIL
...