Search | Virtual Health Library

1.

Human lower leg muscles grow asynchronously.

Chow, Brian V Y; Morgan, Catherine; Rae, Caroline; Warton, David I; Novak, Iona; Davies, Suzanne; Lancaster, Ann; Popovic, Gordana C; Rizzo, Rodrigo R N; Rizzo, Claudia Y; Kyriagis, Maria; Herbert, Robert D; Bolsterlee, Bart.

J Anat ; 244(3): 476-485, 2024 03.

Article in English | MEDLINE | ID: mdl-37917014

ABSTRACT

Muscle volume must increase substantially during childhood growth to generate the power required to propel the growing body. One unresolved but fundamental question about childhood muscle growth is whether muscles grow at equal rates; that is, if muscles grow in synchrony with each other. In this study, we used magnetic resonance imaging (MRI) and advances in artificial intelligence methods (deep learning) for medical image segmentation to investigate whether human lower leg muscles grow in synchrony. Muscle volumes were measured in 10 lower leg muscles in 208 typically developing children (eight infants aged less than 3 months and 200 children aged 5 to 15 years). We tested the hypothesis that human lower leg muscles grow synchronously by investigating whether the volume of individual lower leg muscles, expressed as a proportion of total lower leg muscle volume, remains constant with age. There were substantial age-related changes in the relative volume of most muscles in both boys and girls (p < 0.001). This was most evident between birth and five years of age but was still evident after five years. The medial gastrocnemius and soleus muscles, the largest muscles in infancy, grew faster than other muscles in the first five years. The findings demonstrate that muscles in the human lower leg grow asynchronously. This finding may assist early detection of atypical growth and allow targeted muscle-specific interventions to improve the quality of life, particularly for children with neuromotor conditions such as cerebral palsy.

Subject(s)

Artificial Intelligence , Leg , Male , Child , Female , Humans , Child, Preschool , Quality of Life , Muscle, Skeletal/pathology , Lower Extremity , Magnetic Resonance Imaging/methods

2.

Selecting the model for multiple imputation of missing data: Just use an IC!

Noghrehchi, Firouzeh; Stoklosa, Jakub; Penev, Spiridon; Warton, David I.

Stat Med ; 40(10): 2467-2497, 2021 05 10.

Article in English | MEDLINE | ID: mdl-33629367

ABSTRACT

Multiple imputation and maximum likelihood estimation (via the expectation-maximization algorithm) are two well-known methods readily used for analyzing data with missing values. While these two methods are often considered as being distinct from one another, multiple imputation (when using improper imputation) is actually equivalent to a stochastic expectation-maximization approximation to the likelihood. In this article, we exploit this key result to show that familiar likelihood-based approaches to model selection, such as Akaike's information criterion (AIC) and the Bayesian information criterion (BIC), can be used to choose the imputation model that best fits the observed data. Poor choice of imputation model is known to bias inference, and while sensitivity analysis has often been used to explore the implications of different imputation models, we show that the data can be used to choose an appropriate imputation model via conventional model selection tools. We show that BIC can be consistent for selecting the correct imputation model in the presence of missing data. We verify these results empirically through simulation studies, and demonstrate their practicality on two classical missing data examples. An interesting result we saw in simulations was that not only can parameter estimates be biased by misspecifying the imputation model, but also by overfitting the imputation model. This emphasizes the importance of using model selection not just to choose the appropriate type of imputation model, but also to decide on the appropriate level of imputation model complexity.

Subject(s)

Algorithms , Bayes Theorem , Bias , Computer Simulation , Humans , Likelihood Functions

3.

Modeling recreational fishing intensity in a complex urbanised estuary.

Griffin, Kingsley J; Hedge, Luke H; Warton, David I; Astles, Karen L; Johnston, Emma L.

J Environ Manage ; 279: 111529, 2021 Feb 01.

Article in English | MEDLINE | ID: mdl-33246754

ABSTRACT

Urbanised estuaries, ports and harbours are often utilised for recreational purposes, notably recreational angling. Yet there has been little quantitative assessment of the footprint and intensity of these activities at scales suitable for spatial management. Urban and industrialised estuaries have previously been considered as having low conservation value, perhaps due to issues with contamination and disturbance. Studies in recent decades have demonstrated that many of these systems are still highly biodiverse and of high value to local residents. As a response, urbanised estuaries are now being considered by coastal spatial management initiatives, where assessments of recreational use in these areas can help avoid 'user-environmental' and 'user-user' conflict. The models of these activities need to be developed at a scale relevant to governments and regulatory authorities, but the few human-use models that do exist integrate fishing intensity to a regional or even continental scale; too large to capture the fine scale variation inherent in complex urban fisheries. Species Distribution Modeling (SDM) is a tool commonly used to assess drivers of species range, but can be applied to models of recreational fishing in complex environments, at a scale relevant to regulatory bodies. Using point-data from 573 visual surveys with recently developed Poisson point process models, we examine the recreational fishery in Australia's busiest estuarine port, Sydney Harbour. We demonstrate the utility of these models for understanding the distribution of boat and shore-based fishers, and the effects of a range of temporally static (geographical) and dynamic (weather) predictors on these distributions.

Subject(s)

Conservation of Natural Resources , Estuaries , Biodiversity , Fisheries , Humans , Recreation

4.

Why you cannot transform your way out of trouble for small counts.

Warton, David I.

Biometrics ; 74(1): 362-368, 2018 03.

Article in English | MEDLINE | ID: mdl-28504821

ABSTRACT

While data transformation is a common strategy to satisfy linear modeling assumptions, a theoretical result is used to show that transformation cannot reasonably be expected to stabilize variances for small counts. Under broad assumptions, as counts get smaller, it is shown that the variance becomes proportional to the mean under monotonic transformations g(·) that satisfy g(0)=0, excepting a few pathological cases. A suggested rule-of-thumb is that if many predicted counts are less than one then data transformation cannot reasonably be expected to stabilize variances, even for a well-chosen transformation. This result has clear implications for the analysis of counts as often implemented in the applied sciences, but particularly for multivariate analysis in ecology. Multivariate discrete data are often collected in ecology, typically with a large proportion of zeros, and it is currently widespread to use methods of analysis that do not account for differences in variance across observations nor across responses. Simulations demonstrate that failure to account for the mean-variance relationship can have particularly severe consequences in this context, and also in the univariate context if the sampling design is unbalanced.

Subject(s)

Data Interpretation, Statistical , Models, Statistical , Ecology , Linear Models , Multivariate Analysis

5.

Order selection and sparsity in latent variable models via the ordered factor LASSO.

Hui, Francis K C; Tanaka, Emi; Warton, David I.

Biometrics ; 74(4): 1311-1319, 2018 12.

Article in English | MEDLINE | ID: mdl-29750847

ABSTRACT

Generalized linear latent variable models (GLLVMs) offer a general framework for flexibly analyzing data involving multiple responses. When fitting such models, two of the major challenges are selecting the order, that is, the number of factors, and an appropriate structure for the loading matrix, typically a sparse structure. Motivated by the application of GLLVMs to study marine species assemblages in the Southern Ocean, we propose the Ordered Factor LASSO or OFAL penalty for order selection and achieving sparsity in GLLVMs. The OFAL penalty is the first penalty developed specifically for order selection in latent variable models, and achieves this by using a hierarchically structured group LASSO type penalty to shrink entire columns of the loading matrix to zero, while ensuring that non-zero loadings are concentrated on the lower-order factors. Simultaneously, individual element sparsity is achieved through the use of an adaptive LASSO. In conjunction with using an information criterion which promotes aggressive shrinkage, simulation shows that the OFAL penalty performs strongly compared with standard methods and penalties for order selection, achieving sparsity, and prediction in GLLVMs. Applying the OFAL penalty to the Southern Ocean marine species dataset suggests the available environmental predictors explain roughly half of the total covariation between species, thus leading to a smaller number of latent variables and increased sparsity in the loading matrix compared to a model without any covariates.

Subject(s)

Biometry/methods , Factor Analysis, Statistical , Animals , Aquatic Organisms , Computer Simulation/statistics & numerical data , Likelihood Functions , Oceans and Seas

6.

Fast forward selection for generalized estimating equations with a large number of predictor variables.

Stoklosa, Jakub; Gibb, Heloise; Warton, David I.

Biometrics ; 70(1): 110-20, 2014 Mar.

Article in English | MEDLINE | ID: mdl-24350717

ABSTRACT

We propose a new variable selection criterion designed for use with forward selection algorithms; the score information criterion (SIC). The proposed criterion is based on score statistics which incorporate correlated response data. The main advantage of the SIC is that it is much faster to compute than existing model selection criteria when the number of predictor variables added to a model is large, this is because SIC can be computed for all candidate models without actually fitting them. A second advantage is that it incorporates the correlation between variables into its quasi-likelihood, leading to more desirable properties than competing selection criteria. Consistency and prediction properties are shown for the SIC. We conduct simulation studies to evaluate the selection and prediction performances, and compare these, as well as computational times, with some well-known variable selection criteria. We apply the SIC on a real data set collected on arthropods by considering variable selection on a large number of interactions terms consisting of species traits and environmental covariates.

Subject(s)

Algorithms , Data Interpretation, Statistical , Likelihood Functions , Longitudinal Studies/methods , Models, Statistical , Animals , Arthropods/growth & development , Australia , Computer Simulation , Ecosystem

7.

Novel community data in ecology-properties and prospects.

Hartig, Florian; Abrego, Nerea; Bush, Alex; Chase, Jonathan M; Guillera-Arroita, Gurutzeta; Leibold, Mathew A; Ovaskainen, Otso; Pellissier, Loïc; Pichler, Maximilian; Poggiato, Giovanni; Pollock, Laura; Si-Moussi, Sara; Thuiller, Wilfried; Viana, Duarte S; Warton, David I; Zurell, Damaris; Yu, Douglas W.

Trends Ecol Evol ; 39(3): 280-293, 2024 03.

Article in English | MEDLINE | ID: mdl-37949795

ABSTRACT

New technologies for monitoring biodiversity such as environmental (e)DNA, passive acoustic monitoring, and optical sensors promise to generate automated spatiotemporal community observations at unprecedented scales and resolutions. Here, we introduce 'novel community data' as an umbrella term for these data. We review the emerging field around novel community data, focusing on new ecological questions that could be addressed; the analytical tools available or needed to make best use of these data; and the potential implications of these developments for policy and conservation. We conclude that novel community data offer many opportunities to advance our understanding of fundamental ecological processes, including community assembly, biotic interactions, micro- and macroevolution, and overall ecosystem functioning.

Subject(s)

Biodiversity , Ecosystem , DNA , Policy

8.

To mix or not to mix: comparing the predictive performance of mixture models vs. separate species distribution models.

Hui, Francis K C; Warton, David I; Foster, Scott D; Dunstan, Piers K.

Ecology ; 94(9): 1913-9, 2013 Sep.

Article in English | MEDLINE | ID: mdl-24279262

ABSTRACT

Species distribution models (SDMs) are an important tool for studying the patterns of species across environmental and geographic space. For community data, a common approach involves fitting an SDM to each species separately, although the large number of models makes interpretation difficult and fails to exploit any similarities between individual species responses. A recently proposed alternative that can potentially overcome these difficulties is species archetype models (SAMs), a model-based approach that clusters species based on their environmental response. In this paper, we compare the predictive performance of SAMs against separate SDMs using a number of multi-species data sets. Results show that SAMs improve model accuracy and discriminatory capacity compared to separate SDMs. This is achieved by borrowing strength from common species having higher information content. Moreover, the improvement increases as the species become rarer.

Subject(s)

Models, Biological , Animals , Computer Simulation , Demography , Species Specificity , Temperature

9.

Equivalence of MAXENT and Poisson point process models for species distribution modeling in ecology.

Renner, Ian W; Warton, David I.

Biometrics ; 69(1): 274-81, 2013 Mar.

Article in English | MEDLINE | ID: mdl-23379623

ABSTRACT

Modeling the spatial distribution of a species is a fundamental problem in ecology. A number of modeling methods have been developed, an extremely popular one being MAXENT, a maximum entropy modeling approach. In this article, we show that MAXENT is equivalent to a Poisson regression model and hence is related to a Poisson point process model, differing only in the intercept term, which is scale-dependent in MAXENT. We illustrate a number of improvements to MAXENT that follow from these relations. In particular, a point process model approach facilitates methods for choosing the appropriate spatial resolution, assessing model adequacy, and choosing the LASSO penalty parameter, all currently unavailable to MAXENT. The equivalence result represents a significant step in the unification of the species distribution modeling literature.

Subject(s)

Ecology/methods , Ecosystem , Models, Biological , Models, Statistical , Animals , Eucalyptus/growth & development , New South Wales , Software

10.

A general algorithm for error-in-variables regression modelling using Monte Carlo expectation maximization.

Stoklosa, Jakub; Hwang, Wen-Han; Warton, David I.

PLoS One ; 18(4): e0283798, 2023.

Article in English | MEDLINE | ID: mdl-37011065

ABSTRACT

In regression modelling, measurement error models are often needed to correct for uncertainty arising from measurements of covariates/predictor variables. The literature on measurement error (or errors-in-variables) modelling is plentiful, however, general algorithms and software for maximum likelihood estimation of models with measurement error are not as readily available, in a form that they can be used by applied researchers without relatively advanced statistical expertise. In this study, we develop a novel algorithm for measurement error modelling, which could in principle take any regression model fitted by maximum likelihood, or penalised likelihood, and extend it to account for uncertainty in covariates. This is achieved by exploiting an interesting property of the Monte Carlo Expectation-Maximization (MCEM) algorithm, namely that it can be expressed as an iteratively reweighted maximisation of complete data likelihoods (formed by imputing the missing values). Thus we can take any regression model for which we have an algorithm for (penalised) likelihood estimation when covariates are error-free, nest it within our proposed iteratively reweighted MCEM algorithm, and thus account for uncertainty in covariates. The approach is demonstrated on examples involving generalized linear models, point process models, generalized additive models and capture-recapture models. Because the proposed method uses maximum (penalised) likelihood, it inherits advantageous optimality and inferential properties, as illustrated by simulation. We also study the model robustness of some violations in predictor distributional assumptions. Software is provided as the refitME package on R, whose key function behaves like a refit() function, taking a fitted regression model object and re-fitting with a pre-specified amount of measurement error.

Subject(s)

Algorithms , Motivation , Likelihood Functions , Linear Models , Computer Simulation , Monte Carlo Method , Models, Statistical

11.

Leaf economics fundamentals explained by optimality principles.

Wang, Han; Prentice, I Colin; Wright, Ian J; Warton, David I; Qiao, Shengchao; Xu, Xiangtao; Zhou, Jian; Kikuzawa, Kihachiro; Stenseth, Nils Chr.

Sci Adv ; 9(3): eadd5667, 2023 Jan 18.

Article in English | MEDLINE | ID: mdl-36652527

ABSTRACT

The life span of leaves increases with their mass per unit area (LMA). It is unclear why. Here, we show that this empirical generalization (the foundation of the worldwide leaf economics spectrum) is a consequence of natural selection, maximizing average net carbon gain over the leaf life cycle. Analyzing two large leaf trait datasets, we show that evergreen and deciduous species with diverse construction costs (assumed proportional to LMA) are selected by light, temperature, and growing-season length in different, but predictable, ways. We quantitatively explain the observed divergent latitudinal trends in evergreen and deciduous LMA and show how local distributions of LMA arise by selection under different environmental conditions acting on the species pool. These results illustrate how optimality principles can underpin a new theory for plant geography and terrestrial carbon dynamics.

12.

Generalized Matrix Factorization: efficient algorithms for fitting generalized linear latent variable models to large data arrays.

Kidzinski, Lukasz; Hui, Francis K C; Warton, David I; Hastie, Trevor.

J Mach Learn Res ; 232022 Nov.

Article in English | MEDLINE | ID: mdl-37102181

ABSTRACT

Unmeasured or latent variables are often the cause of correlations between multivariate measurements, which are studied in a variety of fields such as psychology, ecology, and medicine. For Gaussian measurements, there are classical tools such as factor analysis or principal component analysis with a well-established theory and fast algorithms. Generalized Linear Latent Variable models (GLLVMs) generalize such factor models to non-Gaussian responses. However, current algorithms for estimating model parameters in GLLVMs require intensive computation and do not scale to large datasets with thousands of observational units or responses. In this article, we propose a new approach for fitting GLLVMs to high-dimensional datasets, based on approximating the model using penalized quasi-likelihood and then using a Newton method and Fisher scoring to learn the model parameters. Computationally, our method is noticeably faster and more stable, enabling GLLVM fits to much larger matrices than previously possible. We apply our method on a dataset of 48,000 observational units with over 2,000 observed species in each unit and find that most of the variability can be explained with a handful of factors. We publish an easy-to-use implementation of our proposed fitting algorithm.

13.

Global patterns of leaf mechanical properties.

Onoda, Yusuke; Westoby, Mark; Adler, Peter B; Choong, Amy M F; Clissold, Fiona J; Cornelissen, Johannes H C; Díaz, Sandra; Dominy, Nathaniel J; Elgart, Alison; Enrico, Lucas; Fine, Paul V A; Howard, Jerome J; Jalili, Adel; Kitajima, Kaoru; Kurokawa, Hiroko; McArthur, Clare; Lucas, Peter W; Markesteijn, Lars; Pérez-Harguindeguy, Natalia; Poorter, Lourens; Richards, Lora; Santiago, Louis S; Sosinski, Enio E; Van Bael, Sunshine A; Warton, David I; Wright, Ian J; Wright, S Joseph; Yamashita, Nayuta.

Ecol Lett ; 14(3): 301-12, 2011 Mar.

Article in English | MEDLINE | ID: mdl-21265976

ABSTRACT

Leaf mechanical properties strongly influence leaf lifespan, plant-herbivore interactions, litter decomposition and nutrient cycling, but global patterns in their interspecific variation and underlying mechanisms remain poorly understood. We synthesize data across the three major measurement methods, permitting the first global analyses of leaf mechanics and associated traits, for 2819 species from 90 sites worldwide. Key measures of leaf mechanical resistance varied c. 500-800-fold among species. Contrary to a long-standing hypothesis, tropical leaves were not mechanically more resistant than temperate leaves. Leaf mechanical resistance was modestly related to rainfall and local light environment. By partitioning leaf mechanical resistance into three different components we discovered that toughness per density contributed a surprisingly large fraction to variation in mechanical resistance, larger than the fractions contributed by lamina thickness and tissue density. Higher toughness per density was associated with long leaf lifespan especially in forest understory. Seldom appreciated in the past, toughness per density is a key factor in leaf mechanical resistance, which itself influences plant-animal interactions and ecosystem functions across the globe.

Subject(s)

Biomechanical Phenomena , Plant Leaves/anatomy & histology , Stress, Mechanical , Light , Plant Leaves/physiology , Plant Physiological Phenomena , Plants/anatomy & histology , Rain , Tropical Climate

14.

Putting plant resistance traits on the map: a test of the idea that plants are better defended at lower latitudes.

Moles, Angela T; Wallis, Ian R; Foley, William J; Warton, David I; Stegen, James C; Bisigato, Alejandro J; Cella-Pizarro, Lucrecia; Clark, Connie J; Cohen, Philippe S; Cornwell, William K; Edwards, Will; Ejrnaes, Rasmus; Gonzales-Ojeda, Therany; Graae, Bente J; Hay, Gregory; Lumbwe, Fainess C; Magaña-Rodríguez, Benjamín; Moore, Ben D; Peri, Pablo L; Poulsen, John R; Veldtman, Ruan; von Zeipel, Hugo; Andrew, Nigel R; Boulter, Sarah L; Borer, Elizabeth T; Campón, Florencia Fernández; Coll, Moshe; Farji-Brener, Alejandro G; De Gabriel, Jane; Jurado, Enrique; Kyhn, Line A; Low, Bill; Mulder, Christa P H; Reardon-Smith, Kathryn; Rodríguez-Velázquez, Jorge; Seabloom, Eric W; Vesk, Peter A; van Cauter, An; Waldram, Matthew S; Zheng, Zheng; Blendinger, Pedro G; Enquist, Brian J; Facelli, Jose M; Knight, Tiffany; Majer, Jonathan D; Martínez-Ramos, Miguel; McQuillan, Peter; Prior, Lynda D.

New Phytol ; 191(3): 777-788, 2011 Aug.

Article in English | MEDLINE | ID: mdl-21539574

ABSTRACT

â¢ It has long been believed that plant species from the tropics have higher levels of traits associated with resistance to herbivores than do species from higher latitudes. A meta-analysis recently showed that the published literature does not support this theory. However, the idea has never been tested using data gathered with consistent methods from a wide range of latitudes. â¢ We quantified the relationship between latitude and a broad range of chemical and physical traits across 301 species from 75 sites world-wide. â¢ Six putative resistance traits, including tannins, the concentration of lipids (an indicator of oils, waxes and resins), and leaf toughness were greater in high-latitude species. Six traits, including cyanide production and the presence of spines, were unrelated to latitude. Only ash content (an indicator of inorganic substances such as calcium oxalates and phytoliths) and the properties of species with delayed greening were higher in the tropics. â¢ Our results do not support the hypothesis that tropical plants have higher levels of resistance traits than do plants from higher latitudes. If anything, plants have higher resistance toward the poles. The greater resistance traits of high-latitude species might be explained by the greater cost of losing a given amount of leaf tissue in low-productivity environments.

Subject(s)

Plant Diseases/immunology , Plant Leaves/immunology , Plants/immunology , Animals , Cyanides/analysis , Environment , Geography , Lipids/analysis , Phenotype , Plant Immunity , Plant Leaves/anatomy & histology , Plant Leaves/chemistry , Plants/anatomy & histology , Plants/chemistry , Species Specificity , Tannins/analysis

15.

The arcsine is asinine: the analysis of proportions in ecology.

Warton, David I; Hui, Francis K C.

Ecology ; 92(1): 3-10, 2011 Jan.

Article in English | MEDLINE | ID: mdl-21560670

ABSTRACT

The arcsine square root transformation has long been standard procedure when analyzing proportional data in ecology, with applications in data sets containing binomial and non-binomial response variables. Here, we argue that the arcsine transform should not be used in either circumstance. For binomial data, logistic regression has greater interpretability and higher power than analyses of transformed data. However, it is important to check the data for additional unexplained variation, i.e., overdispersion, and to account for it via the inclusion of random effects in the model if found. For non-binomial data, the arcsine transform is undesirable on the grounds of interpretability, and because it can produce nonsensical predictions. The logit transformation is proposed as an alternative approach to address these issues. Examples are presented in both cases to illustrate these advantages, comparing various methods of analyzing proportions including untransformed, arcsine- and logit-transformed linear models and logistic regression (with or without random effects). Simulations demonstrate that logistic regression usually provides a gain in power over other methods.

Subject(s)

Ecology/methods , Ecosystem , Models, Biological , Statistics as Topic , Computer Simulation

16.

Regularized sandwich estimators for analysis of high-dimensional data using generalized estimating equations.

Warton, David I.

Biometrics ; 67(1): 116-23, 2011 Mar.

Article in English | MEDLINE | ID: mdl-20528857

ABSTRACT

A modification of generalized estimating equations (GEEs) methodology is proposed for hypothesis testing of high-dimensional data, with particular interest in multivariate abundance data in ecology, an important application of interest in thousands of environmental science studies. Such data are typically counts characterized by high dimensionality (in the sense that cluster size exceeds number of clusters, n>K) and over-dispersion relative to the Poisson distribution. Usual GEE methods cannot be applied in this setting primarily because sandwich estimators become numerically unstable as n increases. We propose instead using a regularized sandwich estimator that assumes a common correlation matrix R, and shrinks the sample estimate of R toward the working correlation matrix to improve its numerical stability. It is shown via theory and simulation that this substantially improves the power of Wald statistics when cluster size is not small. We apply the proposed approach to study the effects of nutrient addition on nematode communities, and in doing so discuss important issues in implementation, such as using statistics that have good properties when parameter estimates approach the boundary (), and using resampling to enable valid inference that is robust to high dimensionality and to possible model misspecification.

Subject(s)

Algorithms , Biometry/methods , Data Interpretation, Statistical , Ecosystem , Models, Statistical , Nematoda/growth & development , Population Growth , Animals , Computer Simulation

17.

Robust estimation and inference for bivariate line-fitting in allometry.

Taskinen, Sara; Warton, David I.

Biom J ; 53(4): 652-72, 2011 Jul.

Article in English | MEDLINE | ID: mdl-21681982

ABSTRACT

In allometry, bivariate techniques related to principal component analysis are often used in place of linear regression, and primary interest is in making inferences about the slope. We demonstrate that the current inferential methods are not robust to bivariate contamination, and consider four robust alternatives to the current methods -- a novel sandwich estimator approach, using robust covariance matrices derived via an influence function approach, Huber's M-estimator and the fast-and-robust bootstrap. Simulations demonstrate that Huber's M-estimators are highly efficient and robust against bivariate contamination, and when combined with the fast-and-robust bootstrap, we can make accurate inferences even from small samples.

Subject(s)

Biostatistics/methods , Analysis of Variance , Body Size , Probability

18.

Efficient estimation of generalized linear latent variable models.

Niku, Jenni; Brooks, Wesley; Herliansyah, Riki; Hui, Francis K C; Taskinen, Sara; Warton, David I.

PLoS One ; 14(5): e0216129, 2019.

Article in English | MEDLINE | ID: mdl-31042745

ABSTRACT

Generalized linear latent variable models (GLLVM) are popular tools for modeling multivariate, correlated responses. Such data are often encountered, for instance, in ecological studies, where presence-absences, counts, or biomass of interacting species are collected from a set of sites. Until very recently, the main challenge in fitting GLLVMs has been the lack of computationally efficient estimation methods. For likelihood based estimation, several closed form approximations for the marginal likelihood of GLLVMs have been proposed, but their efficient implementations have been lacking in the literature. To fill this gap, we show in this paper how to obtain computationally convenient estimation algorithms based on a combination of either the Laplace approximation method or variational approximation method, and automatic optimization techniques implemented in R software. An extensive set of simulation studies is used to assess the performances of different methods, from which it is shown that the variational approximation method used in conjunction with automatic optimization offers a powerful tool for estimation.

Subject(s)

Linear Models , Multivariate Analysis , Algorithms , Computer Simulation , Data Interpretation, Statistical , Likelihood Functions , Software

19.

The PIT-trap-A "model-free" bootstrap procedure for inference about regression models with discrete, multivariate responses.

Warton, David I; Thibaut, Loïc; Wang, Yi Alice.

PLoS One ; 12(7): e0181790, 2017.

Article in English | MEDLINE | ID: mdl-28738071

ABSTRACT

Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstrapping)-common examples including logistic or Poisson regression and generalizations to handle clustered or multivariate data, such as generalised estimating equations. We propose a bootstrap method based on probability integral transform (PIT-) residuals, which we call the PIT-trap, which assumes data come from some marginal distribution F of known parametric form. This method can be understood as a type of "model-free bootstrap", adapted to the problem of discrete and highly multivariate data. PIT-residuals have the key property that they are (asymptotically) pivotal. The PIT-trap thus inherits the key property, not afforded by any other residual resampling approach, that the marginal distribution of data can be preserved under PIT-trapping. This in turn enables the derivation of some standard bootstrap properties, including second-order correctness of pivotal PIT-trap test statistics. In multivariate data, bootstrapping rows of PIT-residuals affords the property that it preserves correlation in data without the need for it to be modelled, a key point of difference as compared to a parametric bootstrap. The proposed method is illustrated on an example involving multivariate abundance data in ecology, and demonstrated via simulation to have improved properties as compared to competing resampling methods.

Subject(s)

Models, Statistical , Multivariate Analysis , Statistical Distributions , Computer Simulation , Ecology/methods , Probability , Research Design

20.

Evidence at hand: Diversity, functional implications, and locomotor prediction in intrinsic hand proportions of diprotodontian marsupials.

Weisbecker, Vera; Warton, David I.

J Morphol ; 267(12): 1469-85, 2006 Dec.

Article in English | MEDLINE | ID: mdl-17103390

ABSTRACT

Knowledge about the diversity, locomotor adaptations, and evolution of the marsupial forelimb is limited, resulting in an underrepresentation of marsupials in comparative anatomical literature on mammalian forelimb anatomy. This study investigated hand proportions in the diverse marsupial order Diprotodontia. Fifty-two measurements of 95 specimens representing 47 species, as well as 6 non-diprotodontian specimens, were explored using principal components analysis (PCA). Bootstrapping was used to assess the reliability of the loadings. Phylogenetically independent contrasts and phylogenetic ANOVA were used to test for correlation with size and functional adaptation of forelimbs for locomotor habit, scored as arboreal vs. terrestrial. Analysis of first principal component (PC1) scores revealed significant differences between arboreal and terrestrial species, and was related to relative slenderness of their phalangeal elements. Both locomotor groups displayed allometry along PC1 scores, but with different intercepts such that PC1 discriminated between the two locomotor habits almost completely. PC2 separated some higher-level clades and burrowing species. Analysis of locomotor predictors commonly applied by palaeontologists indicates that ratios between proximal and intermediate phalanges were unsuitable as predictors of arboreality/terrestriality, but the phalangeal index was more effective. From PCA results, a phalangeal slenderness ratio was developed which proved to be a useful discriminator, suggesting that a single unallocated phalanx can be used for an impression of locomotor mode in fossils. Most Diprotodontia are laterally paraxonic or ectaxonic, with the exception of digging species whose hands are medially paraxonic. Our results complement those of studies on placental mammals, suggesting that the demands of arboreality, terrestriality, or frequent digging on intrinsic hand proportions are met with similar anatomical adaptations in marsupials.

Subject(s)

Adaptation, Physiological , Forelimb/anatomy & histology , Marsupialia/anatomy & histology , Animals , Biomechanical Phenomena , Forelimb/physiology , Locomotion , Marsupialia/physiology

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL