Pesquisa | Portal Regional da BVS

1.

The Relationship Between Rates of Cannabis Use and Covid-19 Infection Rates During the Pandemic: An Analysis of Canada's National Cannabis Survey.

Cullen, Greggory; Cristiano, Nick; Walters, David; Hathaway, Andrew; Wrathall, Meghan; Wadsworth, Elle.

Subst Use Misuse ; : 1-9, 2024 Sep 16.

Artigo em Inglês | MEDLINE | ID: mdl-39282898

RESUMO

Background: The well-documented relationship between mental health and substance use is corroborated by recent research on the impacts of the Covid-19 pandemic on cannabis use behavior. Social isolation, anxiety, depression, stress, and boredom are all linked to the greater prevalence of cannabis and other substance use. Objectives: To better understand the relationship between infection rates in Canada and cannabis use behavior, this research examines the prevalence and frequency of cannabis use across health regions in all 10 provinces at the height of the pandemic. Methods: Our analyses linked data from the National Cannabis Survey with Covid-19 case rates and cannabis availability through legal retail outlets at the end of 2020, 2 years after cannabis legalization came into effect. Hierarchical generalized linear models were employed, controlling for age, gender, SES, mental health, the number of cannabis stores per square kilometer, and prevalence of cannabis use in each health region prior to the pandemic. Results: Even after controlling for other predictors, our models show that those residing where infection rates are higher are more likely to use cannabis and use it more often. Conclusions: The findings of this study support investing in better-targeted harm reduction measures in areas hit hardest by the pandemic to address contributing societal conditions. The implications are noteworthy for drug policy observers in North America and other global jurisdictions pursuing evidence-based public health approaches to regulating cannabis and other substance use.

2.

Joint modeling of an outcome variable and integrated omics datasets using GLM-PO2PLS.

Gu, Zhujie; Uh, Hae-Won; Houwing-Duistermaat, Jeanine; El Bouhaddani, Said.

J Appl Stat ; 51(13): 2627-2651, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-39290359

RESUMO

In many studies of human diseases, multiple omics datasets are measured. Typically, these omics datasets are studied one by one with the disease, thus the relationship between omics is overlooked. Modeling the joint part of multiple omics and its association to the outcome disease will provide insights into the complex molecular base of the disease. Several dimension reduction methods which jointly model multiple omics and two-stage approaches that model the omics and outcome in separate steps are available. Holistic one-stage models for both omics and outcome are lacking. In this article, we propose a novel one-stage method that jointly models an outcome variable with omics. We establish the model identifiability and develop EM algorithms to obtain maximum likelihood estimators of the parameters for normally and Bernoulli distributed outcomes. Test statistics are proposed to infer the association between the outcome and omics, and their asymptotic distributions are derived. Extensive simulation studies are conducted to evaluate the proposed model. The method is illustrated by modeling Down syndrome as outcome and methylation and glycomics as omics datasets. Here we show that our model provides more insight by jointly considering methylation and glycomics.

3.

Methodology for the generation of normative data for the U.S. adult Spanish-speaking population: A Bayesian approach.

Rivera, Diego; Forte, Anabel; Olabarrieta-Landa, Laiene; Perrin, Paul B; Arango-Lasprilla, Juan Carlos.

NeuroRehabilitation ; 55(2): 155-167, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-39302390

RESUMO

BACKGROUND: Hispanics are the largest growing ethnic minority group in the U.S. Despite significant progress in providing norms for this population, updated normative data are essential. OBJECTIVE: To present the methodology for a study generating normative neuropsychological test data for Spanish-speaking adults living in the U.S. using Bayesian inference as a novel approach. METHODS: The sample consisted of 253 healthy adults from eight U.S. regions, with individuals originating from a diverse array of Latin American countries. To participate, individuals must have met the following criteria: were between 18 and 80 years of age, had lived in the U.S. for at least 1 year, self-identified Spanish as their dominant language, had at least one year of formal education, were able to read and write in Spanish at the time of evaluation, scored≥23 on the Mini-Mental State Examination, <10 on the Patient Health Questionnaire- 9, and <10 on the Generalized Anxiety Disorder scale. Participants completed 12 neuropsychological tests. Reliability statistics and norms were calculated for all tests. CONCLUSION: This is the first normative study for Spanish-speaking adults in the U.S. that uses Bayesian linear or generalized linear regression models for generating norms in neuropsychology, implementing sociocultural measures as possible covariates.

Assuntos

Teorema de Bayes , Hispânico ou Latino , Testes Neuropsicológicos , Humanos , Adulto , Masculino , Feminino , Pessoa de Meia-Idade , Estados Unidos , Idoso , Testes Neuropsicológicos/estatística & dados numéricos , Testes Neuropsicológicos/normas , Adulto Jovem , Valores de Referência , Adolescente , Idoso de 80 Anos ou mais , Idioma , Reprodutibilidade dos Testes

4.

Integrated analysis of remote sensing with meteorological and health data for allergic rhinitis forecasting in Tianjin.

Guo, Yu-Di; Wang, Yuan; Fan, Wen-Yan; Li, Gen.

Int J Biometeorol ; 2024 Aug 06.

Artigo em Inglês | MEDLINE | ID: mdl-39105775

RESUMO

Long time series of vegetation monitoring can be carried out by remote sensing data, the level of urban greening is objectively described, and the spatial characteristics of plant pollen are indirectly understood. Pollen is the main allergen in patients with seasonal allergic rhinitis. Meteorological factors affect the release and diffusion of pollen. Therefore, studying of the complex relationship between meteorological factors and allergic rhinitis is essential for effective prevention and treatment of the disease. In this study, we leverage remote sensing data for a comprehensive decade-long analysis of urban greening in Tianjin, which exhibits an annual increase in vegetative cover of 0.51 per annum, focusing on its impact on allergic rhinitis through changes in pollen distribution. Utilizing high-resolution imagery, we quantify changes in urban Fractional Vegetation Coverage (FVC) and its correlation with pollen types and allergic rhinitis cases. Our analysis reveals a significant correlation between FVC trends and pollen concentrations, with a surprising value of 0.71, highlighting the influence of urban greenery on allergenic pollen levels. We establish a robust connection between the seasonal patterns of pollen outbreaks and allergic rhinitis consultations, with a noticeable increase in consultations during high pollen seasons. our findings indicate a higher allergenic potential of herbaceous compared to woody vegetation. This nuanced understanding underscores the importance of pollen sensitivity, alongside concentration, in driving allergic rhinitis incidents. Utilizing a Generalized Linear Model, significant features influencing the number of visits for allergic rhinitis (P < 0.05) were identified. Both GLM and LSTM models were employed to forecast the visitation volumes for rhinitis during the spring and summer-autumn of 2022. Upon validation, it was found that the R² values between the simulated and actual values for both GLM and LSTM models surpassed the 95% confidence threshold. Moreover, the R² values for the summer-autumn seasons (GLM: 0.56, LSTM: 0.72) were higher than those for spring (GLM: 0.22, LSTM: 0.47). Comparing the errors between the simulated and actual values of GLM and LSTM models, LSTM exhibited higher simulation precision in both spring and summer-autumn seasons, demonstrating superior simulation performance. Overall, our study pioneers the integration of remote sensing with meteorological and health data for allergic rhinitis forecasting. This integrative approach provides valuable insights for public health planning, particularly in urban settings, and lays the groundwork for advanced, location-specific allergenic pollen forecasting and mitigation strategies.

5.

The spike-and-slab lasso and scalable algorithm to accommodate multinomial outcomes in variable selection problems.

Leach, Justin M; Yi, Nengjun; Aban, Inmaculada.

J Appl Stat ; 51(11): 2039-2061, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-39157266

RESUMO

Spike-and-slab prior distributions are used to impose variable selection in Bayesian regression-style problems with many possible predictors. These priors are a mixture of two zero-centered distributions with differing variances, resulting in different shrinkage levels on parameter estimates based on whether they are relevant to the outcome. The spike-and-slab lasso assigns mixtures of double exponential distributions as priors for the parameters. This framework was initially developed for linear models, later developed for generalized linear models, and shown to perform well in scenarios requiring sparse solutions. Standard formulations of generalized linear models cannot immediately accommodate categorical outcomes with > 2 categories, i.e. multinomial outcomes, and require modifications to model specification and parameter estimation. Such modifications are relatively straightforward in a Classical setting but require additional theoretical and computational considerations in Bayesian settings, which can depend on the choice of prior distributions for the parameters of interest. While previous developments of the spike-and-slab lasso focused on continuous, count, and/or binary outcomes, we generalize the spike-and-slab lasso to accommodate multinomial outcomes, developing both the theoretical basis for the model and an expectation-maximization algorithm to fit the model. To our knowledge, this is the first generalization of the spike-and-slab lasso to allow for multinomial outcomes.

6.

Spatial distribution and risk assessment of microplastics in surface waters of the St. Lawrence Estuary.

Kelly, Noreen E.

Sci Total Environ ; 946: 174324, 2024 Oct 10.

Artigo em Inglês | MEDLINE | ID: mdl-38960195

RESUMO

Development of effective prevention and mitigation strategies for marine plastic pollution requires a better understanding of the pathways and transport mechanisms of plastic waste. Yet the role of estuaries as a key interface between riverine inputs of plastic pollution and delivery to receiving marine environments remains poorly understood. This study quantified the concentration and distribution of microplastics (MPs) (50-3200 µm) in surface waters of the St. Lawrence Estuary (SLE) in eastern Canada. Microplastics were identified and enumerated based on particle morphology, colour, and size class. Fourier Transform Infrared (FTIR) spectroscopy was used on a subset of particles to identify polymers. Generalized linear models (Gamma distribution with log-link) examined the relationship between MP concentrations and oceanographic variables and anthropogenic sources. Finally, a risk assessment model, using MP concentrations and chemical hazards based on polymer types, estimated the MP pollution risk to ecosystem health. Mean surface MP concentration in the SLE was 120 ± 42 SD particles m-3; MP concentrations were highest in the fluvial section and lowest in the Northwest Gulf of St. Lawrence. However, MP concentrations exhibited high heterogeneity along the length and width of the SLE. Microplastics were elevated at stations located closer to wastewater treatment plant outflows and downstream sites with more agricultural land. Black, blue, and transparent fibers and fragments ≤250 µm were most commonly encountered. Predominant polymer types included polyethylene terephthalate, regenerated cellulose, polyethylene, and alkyds. While the overall risk to ecosystem health in the entire estuary was considered low, several stations, particularly near urban centres were at high or very high risk. This study provides new insights into the quantification and distribution of MPs and first estimates of the risk of MP pollution to ecosystem health in one of the world's largest estuaries.

7.

Simultaneous Inference of Multiple Binary Endpoints in Biomedical Research: Small Sample Properties of Multiple Marginal Models and a Resampling Approach.

Budig, Sören; Jung, Klaus; Hasler, Mario; Schaarschmidt, Frank.

Biom J ; 66(5): e202300197, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38953619

RESUMO

In biomedical research, the simultaneous inference of multiple binary endpoints may be of interest. In such cases, an appropriate multiplicity adjustment is required that controls the family-wise error rate, which represents the probability of making incorrect test decisions. In this paper, we investigate two approaches that perform single-step p $p$ -value adjustments that also take into account the possible correlation between endpoints. A rather novel and flexible approach known as multiple marginal models is considered, which is based on stacking of the parameter estimates of the marginal models and deriving their joint asymptotic distribution. We also investigate a nonparametric vector-based resampling approach, and we compare both approaches with the Bonferroni method by examining the family-wise error rate and power for different parameter settings, including low proportions and small sample sizes. The results show that the resampling-based approach consistently outperforms the other methods in terms of power, while still controlling the family-wise error rate. The multiple marginal models approach, on the other hand, shows a more conservative behavior. However, it offers more versatility in application, allowing for more complex models or straightforward computation of simultaneous confidence intervals. The practical application of the methods is demonstrated using a toxicological dataset from the National Toxicology Program.

Assuntos

Pesquisa Biomédica , Biometria , Modelos Estatísticos , Biometria/métodos , Pesquisa Biomédica/métodos , Tamanho da Amostra , Determinação de Ponto Final , Humanos

8.

Generalized nonlinearity in animal ecology: Research, review, and recommendations.

Heit, David R; Ortiz-Calo, Waldemar; Poisson, Mairi K P; Butler, Andrew R; Moll, Remington J.

Ecol Evol ; 14(7): e11387, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38994210

RESUMO

Generalized linear models (GLMs) are an integral tool in ecology. Like general linear models, GLMs assume linearity, which entails a linear relationship between independent and dependent variables. However, because this assumption acts on the link rather than the natural scale in GLMs, it is more easily overlooked. We reviewed recent ecological literature to quantify the use of linearity. We then used two case studies to confront the linearity assumption via two GLMs fit to empirical data. In the first case study we compared GLMs to generalized additive models (GAMs) fit to mammal relative abundance data. In the second case study we tested for linearity in occupancy models using passerine point-count data. We reviewed 162 studies published in the last 5 years in five leading ecology journals and found less than 15% reported testing for linearity. These studies used transformations and GAMs more often than they reported a linearity test. In the first case study, GAMs strongly out-performed GLMs as measured by AIC in modeling relative abundance, and GAMs helped uncover nonlinear responses of carnivore species to landscape development. In the second case study, 14% of species-specific models failed a formal statistical test for linearity. We also found that differences between linear and nonlinear (i.e., those with a transformed independent variable) model predictions were similar for some species but not for others, with implications for inference and conservation decision-making. Our review suggests that reporting tests for linearity are rare in recent studies employing GLMs. Our case studies show how formally comparing models that allow for nonlinear relationships between the dependent and independent variables has the potential to impact inference, generate new hypotheses, and alter conservation implications. We conclude by suggesting that ecological studies report tests for linearity and use formal methods to address linearity assumption violations in GLMs.

9.

Identifying Key Genes Involved in Axillary Lymph Node Metastasis in Breast Cancer Using Advanced RNA-Seq Analysis: A Methodological Approach with GLMQL and MAS.

Rezapour, Mostafa; Wesolowski, Robert; Gurcan, Metin Nafi.

Int J Mol Sci ; 25(13)2024 Jul 03.

Artigo em Inglês | MEDLINE | ID: mdl-39000413

RESUMO

Our study aims to address the methodological challenges frequently encountered in RNA-Seq data analysis within cancer studies. Specifically, it enhances the identification of key genes involved in axillary lymph node metastasis (ALNM) in breast cancer. We employ Generalized Linear Models with Quasi-Likelihood (GLMQLs) to manage the inherently discrete and overdispersed nature of RNA-Seq data, marking a significant improvement over conventional methods such as the t-test, which assumes a normal distribution and equal variances across samples. We utilize the Trimmed Mean of M-values (TMMs) method for normalization to address library-specific compositional differences effectively. Our study focuses on a distinct cohort of 104 untreated patients from the TCGA Breast Invasive Carcinoma (BRCA) dataset to maintain an untainted genetic profile, thereby providing more accurate insights into the genetic underpinnings of lymph node metastasis. This strategic selection paves the way for developing early intervention strategies and targeted therapies. Our analysis is exclusively dedicated to protein-coding genes, enriched by the Magnitude Altitude Scoring (MAS) system, which rigorously identifies key genes that could serve as predictors in developing an ALNM predictive model. Our novel approach has pinpointed several genes significantly linked to ALNM in breast cancer, offering vital insights into the molecular dynamics of cancer development and metastasis. These genes, including ERBB2, CCNA1, FOXC2, LEFTY2, VTN, ACKR3, and PTGS2, are involved in key processes like apoptosis, epithelial-mesenchymal transition, angiogenesis, response to hypoxia, and KRAS signaling pathways, which are crucial for tumor virulence and the spread of metastases. Moreover, the approach has also emphasized the importance of the small proline-rich protein family (SPRR), including SPRR2B, SPRR2E, and SPRR2D, recognized for their significant involvement in cancer-related pathways and their potential as therapeutic targets. Important transcripts such as H3C10, H1-2, PADI4, and others have been highlighted as critical in modulating the chromatin structure and gene expression, fundamental for the progression and spread of cancer.

Assuntos

Neoplasias da Mama , Regulação Neoplásica da Expressão Gênica , Metástase Linfática , Humanos , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Metástase Linfática/genética , Feminino , RNA-Seq/métodos , Perfilação da Expressão Gênica/métodos , Linfonodos/patologia , Axila , Biomarcadores Tumorais/genética , Análise de Sequência de RNA/métodos

10.

Radiotherapy toxicity prediction using knowledge-constrained generalized linear model.

Hu, Jiuyun; Fatyga, Mirek; Liu, Wei; Schild, Steven E; Wong, William W; Vora, Sujay A; Li, Jing.

IISE Trans Healthc Syst Eng ; 14(2): 130-140, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-39055377

RESUMO

Radiation therapy (RT) is a frontline approach to treating cancer. While the target of radiation dose delivery is the tumor, there is an inevitable spill of dose to nearby normal organs causing complications. This phenomenon is known as radiotherapy toxicity. To predict the outcome of the toxicity, statistical models can be built based on dosimetric variables received by the normal organ at risk (OAR), known as Normal Tissue Complication Probability (NTCP) models. To tackle the challenge of the high dimensionality of dosimetric variables and limited clinical sample sizes, statistical models with variable selection techniques are viable choices. However, existing variable selection techniques are data-driven and do not integrate medical domain knowledge into the model formulation. We propose a knowledge-constrained generalized linear model (KC-GLM). KC-GLM includes a new mathematical formulation to translate three pieces of domain knowledge into non-negativity, monotonicity, and adjacent similarity constraints on the model coefficients. We further propose an equivalent transformation of the KC-GLM formulation, which makes it possible to solve the model coefficients using existing optimization solvers. Furthermore, we compare KC-GLM and several well-known variable selection techniques via a simulation study and on two real datasets of prostate cancer and lung cancer, respectively. These experiments show that KC-GLM selects variables with better interpretability, avoids producing counter-intuitive and misleading results, and has better prediction accuracy.

11.

Unsupervised Single-Cell Clustering with Asymmetric Within-Sample Transformation and Per-Cluster Supervised Features Selection.

Pagnotta, Stefano Maria.

Methods Mol Biol ; 2812: 155-168, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-39068361

RESUMO

This chapter shows applying the Asymmetric Within-Sample Transformation to single-cell RNA-Seq data matched with a previous dropout imputation. The asymmetric transformation is a special winsorization that flattens low-expressed intensities and preserves highly expressed gene levels. Before a standard hierarchical clustering algorithm, an intermediate step removes noninformative genes according to a threshold applied to a per-gene entropy estimate. Following the clustering, a time-intensive algorithm is shown to uncover the molecular features associated with each cluster. This step implements a resampling algorithm to generate a random baseline to measure up/downregulated significant genes. To this aim, we adopt a GLM model as implemented in DESeq2 package. We render the results in graphical mode. While the tools are standard heat maps, we introduce some data scaling to clarify the results' reliability.

Assuntos

Algoritmos , Análise de Célula Única , Análise de Célula Única/métodos , Análise por Conglomerados , Humanos , Perfilação da Expressão Gênica/métodos , Software , Biologia Computacional/métodos , RNA-Seq/métodos

12.

Analysis of gene expression dynamics and differential expression in viral infections using generalized linear models and quasi-likelihood methods.

Rezapour, Mostafa; Walker, Stephen J; Ornelles, David A; McNutt, Patrick M; Atala, Anthony; Gurcan, Metin Nafi.

Front Microbiol ; 15: 1342328, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38655085

RESUMO

Introduction: Our study undertakes a detailed exploration of gene expression dynamics within human lung organ tissue equivalents (OTEs) in response to Influenza A virus (IAV), Human metapneumovirus (MPV), and Parainfluenza virus type 3 (PIV3) infections. Through the analysis of RNA-Seq data from 19,671 genes, we aim to identify differentially expressed genes under various infection conditions, elucidating the complexities of virus-host interactions. Methods: We employ Generalized Linear Models (GLMs) with Quasi-Likelihood (QL) F-tests (GLMQL) and introduce the novel Magnitude-Altitude Score (MAS) and Relaxed Magnitude-Altitude Score (RMAS) algorithms to navigate the intricate landscape of RNA-Seq data. This approach facilitates the precise identification of potential biomarkers, highlighting the host's reliance on innate immune mechanisms. Our comprehensive methodological framework includes RNA extraction, library preparation, sequencing, and Gene Ontology (GO) enrichment analysis to interpret the biological significance of our findings. Results: The differential expression analysis unveils significant changes in gene expression triggered by IAV, MPV, and PIV3 infections. The MAS and RMAS algorithms enable focused identification of biomarkers, revealing a consistent activation of interferon-stimulated genes (e.g., IFIT1, IFIT2, IFIT3, OAS1) across all viruses. Our GO analysis provides deep insights into the host's defense mechanisms and viral strategies exploiting host cellular functions. Notably, changes in cellular structures, such as cilium assembly and mitochondrial ribosome assembly, indicate a strategic shift in cellular priorities. The precision of our methodology is validated by a 92% mean accuracy in classifying respiratory virus infections using multinomial logistic regression, demonstrating the superior efficacy of our approach over traditional methods. Discussion: This study highlights the intricate interplay between viral infections and host gene expression, underscoring the need for targeted therapeutic interventions. The stability and reliability of the MAS/RMAS ranking method, even under stringent statistical corrections, and the critical importance of adequate sample size for biomarker reliability are significant findings. Our comprehensive analysis not only advances our understanding of the host's response to viral infections but also sets a new benchmark for the identification of biomarkers, paving the way for the development of effective diagnostic and therapeutic strategies.

13.

Overcoming denominator problems in refugee settings with fragmented electronic records for health and immigration data: a prediction-based approach.

Erdmann, Stella; Jahn, Rosa; Rohleder, Sven; Bozorgmehr, Kayvan.

BMC Med Res Methodol ; 24(1): 81, 2024 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-38561661

RESUMO

BACKGROUND: Epidemiological studies in refugee settings are often challenged by the denominator problem, i.e. lack of population at risk data. We develop an empirical approach to address this problem by assessing relationships between occupancy data in refugee centres, number of refugee patients in walk-in clinics, and diseases of the digestive system. METHODS: Individual-level patient data from a primary care surveillance system (PriCarenet) was matched with occupancy data retrieved from immigration authorities. The three relationships were analysed using regression models, considering age, sex, and type of centre. Then predictions for the respective data category not available in each of the relationships were made. Twenty-one German on-site health care facilities in state-level registration and reception centres participated in the study, covering the time period from November 2017 to July 2021. RESULTS: 445 observations ("centre-months") for patient data from electronic health records (EHR, 230 mean walk-in clinics visiting refugee patients per month and centre; standard deviation sd: 202) of a total of 47.617 refugee patients were available, 215 for occupancy data (OCC, mean occupancy of 348 residents, sd: 287), 147 for both (matched), leaving 270 observations without occupancy (EHR-unmatched) and 40 without patient data (OCC-unmatched). The incidence of diseases of the digestive system, using patients as denominators in the different sub-data sets were 9.2% (sd: 5.9) in EHR, 8.8% (sd: 5.1) when matched, 9.6% (sd: 6.4) in EHR- and 12% (sd 2.9) in OCC-unmatched. Using the available or predicted occupancy as denominator yielded average incidence estimates (per centre and month) of 4.7% (sd: 3.2) in matched data, 4.8% (sd: 3.3) in EHR- and 7.4% (sd: 2.7) in OCC-unmatched. CONCLUSIONS: By modelling the ratio between patient and occupancy numbers in refugee centres depending on sex and age, as well as on the total number of patients or occupancy, the denominator problem in health monitoring systems could be mitigated. The approach helped to estimate the missing component of the denominator, and to compare disease frequency across time and refugee centres more accurately using an empirically grounded prediction of disease frequency based on demographic and centre typology. This avoided over-estimation of disease frequency as opposed to the use of patients as denominators.

Assuntos

Refugiados , Humanos , Registros Eletrônicos de Saúde , Emigração e Imigração , Fatores de Risco , Eletrônica

14.

Flexible multi-step hypothesis testing of human ECoG data using cluster-based permutation tests with GLMEs.

König, Seth D; Safo, Sandra; Miller, Kai; Herman, Alexander B; Darrow, David P.

Neuroimage ; 290: 120557, 2024 Apr 15.

Artigo em Inglês | MEDLINE | ID: mdl-38423264

RESUMO

BACKGROUND: Time series analysis is critical for understanding brain signals and their relationship to behavior and cognition. Cluster-based permutation tests (CBPT) are commonly used to analyze a variety of electrophysiological signals including EEG, MEG, ECoG, and sEEG data without a priori assumptions about specific temporal effects. However, two major limitations of CBPT include the inability to directly analyze experiments with multiple fixed effects and the inability to account for random effects (e.g. variability across subjects). Here, we propose a flexible multi-step hypothesis testing strategy using CBPT with Linear Mixed Effects Models (LMEs) and Generalized Linear Mixed Effects Models (GLMEs) that can be applied to a wide range of experimental designs and data types. METHODS: We first evaluate the statistical robustness of LMEs and GLMEs using simulated data distributions. Second, we apply a multi-step hypothesis testing strategy to analyze ERPs and broadband power signals extracted from human ECoG recordings collected during a simple image viewing experiment with image category and novelty as fixed effects. Third, we assess the statistical power differences between analyzing signals with CBPT using LMEs compared to CBPT using separate t-tests run on each fixed effect through simulations that emulate broadband power signals. Finally, we apply CBPT using GLMEs to high-gamma burst data to demonstrate the extension of the proposed method to the analysis of nonlinear data. RESULTS: First, we found that LMEs and GLMEs are robust statistical models. In simple simulations LMEs produced highly congruent results with other appropriately applied linear statistical models, but LMEs outperformed many linear statistical models in the analysis of "suboptimal" data and maintained power better than analyzing individual fixed effects with separate t-tests. GLMEs also performed similarly to other nonlinear statistical models. Second, in real world human ECoG data, LMEs performed at least as well as separate t-tests when applied to predefined time windows or when used in conjunction with CBPT. Additionally, fixed effects time courses extracted with CBPT using LMEs from group-level models of pseudo-populations replicated latency effects found in individual category-selective channels. Third, analysis of simulated broadband power signals demonstrated that CBPT using LMEs was superior to CBPT using separate t-tests in identifying time windows with significant fixed effects especially for small effect sizes. Lastly, the analysis of high-gamma burst data using CBPT with GLMEs produced results consistent with CBPT using LMEs applied to broadband power data. CONCLUSIONS: We propose a general approach for statistical analysis of electrophysiological data using CBPT in conjunction with LMEs and GLMEs. We demonstrate that this method is robust for experiments with multiple fixed effects and applicable to the analysis of linear and nonlinear data. Our methodology maximizes the statistical power available in a dataset across multiple experimental variables while accounting for hierarchical random effects and controlling FWER across fixed effects. This approach substantially improves power leading to better reproducibility. Additionally, CBPT using LMEs and GLMEs can be used to analyze individual channels or pseudo-population data for the comparison of functional or anatomical groups of data.

Assuntos

Encéfalo , Projetos de Pesquisa , Humanos , Reprodutibilidade dos Testes , Encéfalo/fisiologia , Modelos Estatísticos , Modelos Lineares

15.

Prediction of body weight of Curraleiro Pé-Duro cattle based on morphometric measurements.

Rocha-Silva, Mérik; Britto, Fábio Barros; da Silva, Dinnara Layza Souza; Oliveira do O, Alan; da Silva, Leiliane Alves Soares; de Oliveira, Max Brandão; de Araújo, Cláudio Vieira; Carvalho, Geraldo Magela Cortes; Sarmento, José Lindenberg Rocha.

Trop Anim Health Prod ; 56(1): 42, 2024 Jan 12.

Artigo em Inglês | MEDLINE | ID: mdl-38214742

RESUMO

Cattle weight development is highly correlated with some body measurements. Based on the relationship between morphometric measurements and body mass, our aim was to develop regression equations to estimate the body weight of Curraleiro Pé-Duro (CPD) cattle to be used in farms that lack access to weighting scales. Data from 1023 animals from four farms on withers height (WH), body length (BL), body score (BS), heart girth (HG), permanent teeth (PT), scrotal perimeter (SP), and live weight were used. The animals were classified into five categories depending on age and/or sex: newborns (NB), calves, weaned animals, cows, and bulls. The best models are GLM with Gamma, Gamma, inverse Gaussian, Gaussian, and Gamma distributions for NB, calves, weaned animals, cows, and bulls, respectively. Predictive modeling for bulls was the best performing overall, with a correlation of 0.97 between the estimated by the model and the obtained with a weighting scale. For NB, calves, weaned animals, and cows, the correlation (r) was 0.85, 0.90, 0.95, and 0.87, respectively. The evaluated models are adequate to be used as a technical solution to estimate weight in a cattle production system.

Assuntos

Peso ao Nascer , Feminino , Animais , Bovinos , Masculino , Fazendas , Desmame , Peso Corporal

16.

deMULTIplex2: robust sample demultiplexing for scRNA-seq.

Zhu, Qin; Conrad, Daniel N; Gartner, Zev J.

Genome Biol ; 25(1): 37, 2024 01 30.

Artigo em Inglês | MEDLINE | ID: mdl-38291503

RESUMO

Sample multiplexing enables pooled analysis during single-cell RNA sequencing workflows, thereby increasing throughput and reducing batch effects. A challenge for all multiplexing techniques is to link sample-specific barcodes with cell-specific barcodes, then demultiplex sample identity post-sequencing. However, existing demultiplexing tools fail under many real-world conditions where barcode cross-contamination is an issue. We therefore developed deMULTIplex2, an algorithm inspired by a mechanistic model of barcode cross-contamination. deMULTIplex2 employs generalized linear models and expectation-maximization to probabilistically determine the sample identity of each cell. Benchmarking reveals superior performance across various experimental conditions, particularly on large or noisy datasets with unbalanced sample compositions.

Assuntos

Análise de Célula Única , Análise da Expressão Gênica de Célula Única , Análise de Célula Única/métodos , Algoritmos , Análise de Sequência de RNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos

17.

MRI log file analysis for workflow improvement.

Petroianu, Larissa P G; Li, Lun; Mieloszyk, Rebecca J; Mastrangelo, Christina M; Stapleton, Shawn; Hall, Christopher.

Curr Probl Diagn Radiol ; 53(2): 192-200, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-37951726

RESUMO

Magnetic Resonance Imaging (MRI) is an important diagnostic scanning tool for the detection and monitoring of specific diseases and conditions. However, the equipment cost, maintenance and specialty training of the technologists make the examination expensive. Consequently, unnecessary scanner time caused by poor scheduling, repeated sequences, aborted sequences, scanner idleness, or capture of non-diagnostic or low-value sequences is an opportunity to reduce costs and increase efficiency. This paper analyzes data collected from log files on 29 scanners over several years. 'Wasted' time is defined and key performance indicators (KPIs) are identified. A decrease in exam duration results when actively modifying and monitoring the number of sequences that comprise the exam card for a protocol.

Assuntos

Eficiência , Imageamento por Ressonância Magnética , Humanos , Fluxo de Trabalho , Imageamento por Ressonância Magnética/métodos

18.

Inverse probability of treatment weighting with generalized linear outcome models for doubly robust estimation.

Gabriel, Erin E; Sachs, Michael C; Martinussen, Torben; Waernbaum, Ingeborg; Goetghebeur, Els; Vansteelandt, Stijn; Sjölander, Arvid.

Stat Med ; 43(3): 534-547, 2024 02 10.

Artigo em Inglês | MEDLINE | ID: mdl-38096856

RESUMO

There are now many options for doubly robust estimation; however, there is a concerning trend in the applied literature to believe that the combination of a propensity score and an adjusted outcome model automatically results in a doubly robust estimator and/or to misuse more complex established doubly robust estimators. A simple alternative, canonical link generalized linear models (GLM) fit via inverse probability of treatment (propensity score) weighted maximum likelihood estimation followed by standardization (the g $$ g $$ -formula) for the average causal effect, is a doubly robust estimation method. Our aim is for the reader not just to be able to use this method, which we refer to as IPTW GLM, for doubly robust estimation, but to fully understand why it has the doubly robust property. For this reason, we define clearly, and in multiple ways, all concepts needed to understand the method and why it is doubly robust. In addition, we want to make very clear that the mere combination of propensity score weighting and an adjusted outcome model does not generally result in a doubly robust estimator. Finally, we hope to dispel the misconception that one can adjust for residual confounding remaining after propensity score weighting by adjusting in the outcome model for what remains 'unbalanced' even when using doubly robust estimators. We provide R code for our simulations and real open-source data examples that can be followed step-by-step to use and hopefully understand the IPTW GLM method. We also compare to a much better-known but still simple doubly robust estimator.

Assuntos

Modelos Estatísticos , Humanos , Simulação por Computador , Interpretação Estatística de Dados , Probabilidade , Pontuação de Propensão , Modelos Lineares

19.

Species and seasonality can affect recent trends in beak and feather disease virus prevalence in captive psittacine birds.

Saechin, A; Suksai, P; Sariya, L; Mongkolphan, C; Tangsudjai, S.

Acta Trop ; 249: 107071, 2024 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-37956820

RESUMO

Beak and feather disease virus (BFDV) is globally distributed in psittacine birds. BFDV is considered a key threat to biodiversity because it has the ability to transmit and shift between host species. Data from captive psittacine birds can help to identify potential risk factors for viral transmission management. Generalized Linear Models (GLM) were used to examine the association of sample type, species, and season on the prevalence of BFDV in captive exotic birds in Thailand. In this study, the overall prevalence of BFDV was 8.2 %, with 346 of 4243 birds being positive. The prevalence in feather samples (12.1 %) and pooled (dried blood and feather) samples (15.4 %) was higher than that in the dried blood samples (4.8 %). A GLM test revealed that the sample type, species, and season were significant factors influencing the prevalence of BFDV. Based on the model, two species (blue-eyed cockatoo; Cacatua ophthalmica, and ring-necked parakeet; Psittacula krameri) were associated with higher BFDV prevalence. By studying the seasonal BFDV prevalence, we can gather important insights into the environmental factors that contribute to its spread. The higher prevalence observed during the wet season suggest a possible affect between BFDV prevalence and environmental factors such as heavy rainfall and humidity. In conclusion, our analysis of the trends in BFDV prevalence offers valuable insights into the prevalence or distribution of BFDV in the studied population. By monitoring BFDV prevalence, identifying high-risk species, and understanding seasonal patterns, we can develop targeted management approaches to control the spread of the virus. This information is crucial for mitigating the impact of BFDV on aviculture.

Assuntos

Doenças das Aves , Infecções por Circoviridae , Circovirus , Papagaios , Animais , Circovirus/genética , Prevalência , Infecções por Circoviridae/epidemiologia , Infecções por Circoviridae/veterinária , Doenças das Aves/epidemiologia , DNA Viral , Reação em Cadeia da Polimerase/veterinária , Filogenia

20.

Examination of the Effect of Task Complexity and Coping Capacity on Driving Risk: A Cross-Country and Transportation Mode Comparative Study.

Roussou, Stella; Garefalakis, Thodoris; Michelaraki, Eva; Katrakazas, Christos; Adnan, Muhammad; Khattak, Muhammad Wisal; Brijs, Tom; Yannis, George.

Sensors (Basel) ; 23(24)2023 Dec 07.

Artigo em Inglês | MEDLINE | ID: mdl-38139509

RESUMO

The i-DREAMS project established a 'Safety Tolerance Zone (STZ)' to maintain operators within safe boundaries through real-time and post-trip interventions, based on the crucial role of the human element in driving behavior. This paper aims to model the inter-relationship among driving task complexity, operator and vehicle coping capacity, and crash risk. Towards that aim, data from 80 drivers, who participated in a naturalistic driving experiment carried out in three countries (i.e., Belgium, Germany, and Portugal), resulting in a dataset of approximately 19,000 trips were collected and analyzed. The exploratory analysis included the development of Generalized Linear Models (GLMs) and the choice of the most appropriate variables associated with the latent variables "task complexity" and "coping capacity" that are to be estimated from the various indicators. In addition, Structural Equation Models (SEMs) were used to explore how the model variables were interrelated, allowing for both direct and indirect relationships to be modeled. Comparisons on the performance of such models, as well as a discussion on behaviors and driving patterns across different countries and transport modes, were also provided. The findings revealed a positive relationship between task complexity and coping capacity, indicating that as the difficulty of the driving task increased, the driver's coping capacity increased accordingly, (i.e., higher ability to manage and adapt to the challenges posed by more complex tasks). The integrated treatment of task complexity, coping capacity, and risk can improve the behavior and safety of all travelers, through the unobtrusive and seamless monitoring of behavior. Thus, authorities should utilize a data system oriented towards collecting key driving insights on population level to plan mobility and safety interventions, develop incentives for road users, optimize enforcement, and enhance community building for safe traveling.

Assuntos

Condução de Veículo , Humanos , Acidentes de Trânsito/prevenção & controle , Capacidades de Enfrentamento , Viagem , Modelos Lineares

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA