RESUMO
Diagnosis and prognosis of cancer are informed by the architecture inherent in cancer patient tissue sections. This architecture is typically identified by pathologists, yet advances in computational image analysis facilitate quantitative assessment of this structure. In this article, we develop a spatial point process approach to describe patterns in cell distribution within tissue samples taken from colorectal cancer (CRC) patients. In particular, our approach is centered on the Palm intensity function. This leads to taking an approximate-likelihood technique in fitting point processes models. We consider two Neyman-Scott point processes and a void process, fitting these point process models to the CRC patient data. We find that the parameter estimates of these models may be used to quantify the spatial arrangement of cells. Importantly, we observe characteristic differences in the spatial arrangement of cells between patients who died from CRC and those alive at follow up.
Assuntos
Neoplasias Colorretais/diagnóstico , Neoplasias Colorretais/patologia , Algoritmos , Interpretação Estatística de Dados , Humanos , PrognósticoRESUMO
Accurate estimation of tree biomass is necessary to provide realistic values of the carbon stored in the terrestrial biosphere. A recognized source of errors in tree aboveground biomass (AGB) estimation is introduced when individual tree height values (H) are not directly measured but estimated from diameter at breast height (DBH) using allometric equations. In this paper, we evaluate the performance of 12 alternative DBH : H equations and compare their effects on AGB estimation for three tropical forests that occur in contrasting climatic and altitudinal zones. We found that fitting a three-parameter Weibull function using data collected locally generated the lowest errors and bias in H estimation, and that equations fitted to these data were more accurate than equations with parameters derived from the literature. For computing AGB, the introduced error values differed notably among DBH : H allometric equations, and in most cases showed a clear bias that resulted in either over- or under-estimation of AGB. Fitting the three-parameter Weibull function minimized errors in AGB estimates in our study and we recommend its widespread adoption for carbon stock estimation. We conclude that many previous studies are likely to present biased estimates of AGB due to the method of H estimation.
Assuntos
Árvores , Clima Tropical , Biomassa , Carbono , FlorestasRESUMO
Spatial capture-recapture (SCR) is now routinely used for estimating abundance and density of wildlife populations. A standard SCR model includes sub-models for the distribution of individual activity centers (ACs) and for individual detections conditional on the locations of these ACs. Both sub-models can be expressed as point processes taking place in continuous space, but there is a lack of accessible and efficient tools to fit such models in a Bayesian paradigm. Here, we describe a set of custom functions and distributions to achieve this. Our work allows for more efficient model fitting with spatial covariates on population density, offers the option to fit SCR models using the semi-complete data likelihood (SCDL) approach instead of data augmentation, and better reflects the spatially continuous detection process in SCR studies that use area searches. In addition, the SCDL approach is more efficient than data augmentation for simple SCR models while losing its advantages for more complicated models that account for spatial variation in either population density or detection. We present the model formulation, test it with simulations, quantify computational efficiency gains, and conclude with a real-life example using non-invasive genetic sampling data for an elusive large carnivore, the wolverine (Gulo gulo) in Norway.
Assuntos
Animais Selvagens , Animais , Teorema de Bayes , Probabilidade , Densidade Demográfica , NoruegaRESUMO
As the COVID-19 pandemic continues to threaten various regions around the world, obtaining accurate and reliable COVID-19 data is crucial for governments and local communities aiming at rigorously assessing the extent and magnitude of the virus spread and deploying efficient interventions. Using data reported between January and February 2020 in China, we compared counts of COVID-19 from near-real-time spatially disaggregated data (city level) with fine-spatial scale predictions from a Bayesian downscaling regression model applied to a reference province-level data set. The results highlight discrepancies in the counts of coronavirus-infected cases at the district level and identify districts that may require further investigation.
RESUMO
The 2010 biodiversity target agreed by signatories to the Convention on Biological Diversity directed the attention of conservation professionals toward the development of indicators with which to measure changes in biological diversity at the global scale. We considered why global biodiversity indicators are needed, what characteristics successful global indicators have, and how existing indicators perform. Because monitoring could absorb a large proportion of funds available for conservation, we believe indicators should be linked explicitly to monitoring objectives and decisions about which monitoring schemes deserve funding should be informed by predictions of the value of such schemes to decision making. We suggest that raising awareness among the public and policy makers, auditing management actions, and informing policy choices are the most important global monitoring objectives. Using four well-developed indicators of biological diversity (extent of forests, coverage of protected areas, Living Planet Index, Red List Index) as examples, we analyzed the characteristics needed for indicators to meet these objectives. We recommend that conservation professionals improve on existing indicators by eliminating spatial biases in data availability, fill gaps in information about ecosystems other than forests, and improve understanding of the way indicators respond to policy changes. Monitoring is not an end in itself, and we believe it is vital that the ultimate objectives of global monitoring of biological diversity inform development of new indicators.
Assuntos
Biodiversidade , Conservação dos Recursos Naturais/tendências , Animais , Espécies em Perigo de ExtinçãoRESUMO
OBJECTIVES: To model the risk of COVID-19 mortality in British care homes conditional on the community level risk. METHODS: A two stage modeling process ("doubly latent") which includes a Besag York Mollie model (BYM) and a Log Gaussian Cox Process. The BYM is adopted so as to estimate the community level risks. These are incorporated in the Log Gaussian Cox Process to estimate the impact of these risks on that in care homes. RESULTS: For an increase in the risk at the community level, the number of COVID-19 related deaths in the associated care home would be increased by exp (0.833), 2. This is based on a simulated dataset. In the context of COVID-19 related deaths, this study has illustrated the estimation of the risk to care homes in the presence of background community risk. This approach will be useful in facilitating the identification of the most vulnerable care homes and in predicting risk to new care homes. CONCLUSIONS: The modeling of two latent processes have been shown to be successfully facilitated by the use of the BYM and Log Gaussian Cox Process Models. Community COVID-19 risks impact on that of the care homes embedded in these communities.
Assuntos
COVID-19/epidemiologia , Casas de Saúde/estatística & dados numéricos , Características de Residência , Geografia , Humanos , Modelos Estatísticos , Fatores de RiscoRESUMO
Species distribution models (SDMs) are now being widely used in ecology for management and conservation purposes across terrestrial, freshwater, and marine realms. The increasing interest in SDMs has drawn the attention of ecologists to spatial models and, in particular, to geostatistical models, which are used to associate observations of species occurrence or abundance with environmental covariates in a finite number of locations in order to predict where (and how much of) a species is likely to be present in unsampled locations. Standard geostatistical methodology assumes that the choice of sampling locations is independent of the values of the variable of interest. However, in natural environments, due to practical limitations related to time and financial constraints, this theoretical assumption is often violated. In fact, data commonly derive from opportunistic sampling (e.g., whale or bird watching), in which observers tend to look for a specific species in areas where they expect to find it. These are examples of what is referred to as preferential sampling, which can lead to biased predictions of the distribution of the species. The aim of this study is to discuss a SDM that addresses this problem and that it is more computationally efficient than existing MCMC methods. From a statistical point of view, we interpret the data as a marked point pattern, where the sampling locations form a point pattern and the measurements taken in those locations (i.e., species abundance or occurrence) are the associated marks. Inference and prediction of species distribution is performed using a Bayesian approach, and integrated nested Laplace approximation (INLA) methodology and software are used for model fitting to minimize the computational burden. We show that abundance is highly overestimated at low abundance locations when preferential sampling effects not accounted for, in both a simulated example and a practical application using fishery data. This highlights that ecologists should be aware of the potential bias resulting from preferential sampling and account for it in a model when a survey is based on non-randomized and/or non-systematic sampling.
RESUMO
A common approach to molecular characterisation of microbial communities in natural environments is the amplification of small subunit (SSU) rRNA genes or genes encoding enzymes essential for a particular ecosystem function. A range of 'fingerprinting' techniques are available for the analysis of amplification products of both types of gene enabling quantitative or semi-quantitative analysis of relative abundances of different community members, and facilitating analysis of communities from large numbers of samples, including replicates. Statistical models that have been applied in this context suffer from a number of unavoidable limitations, including lack of distinction between closely adjacent bands or peaks, particularly when these differ significantly in intensity or size. Current approaches to the analysis of banding structures derived from gels are typically based on standard multivariate analysis methods such as principal component analysis (PCA) which do not consider structure of DGGE gels but treat the intensity of each band as independent from the other bands, ignoring local neighbourhood structures. This paper assesses whether a new statistical analytical technique, based on functional data analysis (FDA) methods, improves the discriminatory ability of molecular fingerprinting techniques. The approach regards band intensities as a mathematical function of the location on the gel and explicitly includes neighbourhood structure in the analysis. A simulation study clearly reveals the weaknesses of the standard PCA approach as opposed to the FDA approach, which is then used to analyse experimental DGGE data.