ABSTRACT
Proprioception tells the brain the state of the body based on distributed sensory neurons. Yet, the principles that govern proprioceptive processing are poorly understood. Here, we employ a task-driven modeling approach to investigate the neural code of proprioceptive neurons in cuneate nucleus (CN) and somatosensory cortex area 2 (S1). We simulated muscle spindle signals through musculoskeletal modeling and generated a large-scale movement repertoire to train neural networks based on 16 hypotheses, each representing different computational goals. We found that the emerging, task-optimized internal representations generalize from synthetic data to predict neural dynamics in CN and S1 of primates. Computational tasks that aim to predict the limb position and velocity were the best at predicting the neural activity in both areas. Since task optimization develops representations that better predict neural activity during active than passive movements, we postulate that neural activity in the CN and S1 is top-down modulated during goal-directed movements.
Subject(s)
Neurons , Proprioception , Animals , Proprioception/physiology , Neurons/physiology , Brain/physiology , Movement/physiology , Primates , Neural Networks, ComputerABSTRACT
Today's genomics workflows typically require alignment to a reference sequence, which limits discovery. We introduce a unifying paradigm, SPLASH (Statistically Primary aLignment Agnostic Sequence Homing), which directly analyzes raw sequencing data, using a statistical test to detect a signature of regulation: sample-specific sequence variation. SPLASH detects many types of variation and can be efficiently run at scale. We show that SPLASH identifies complex mutation patterns in SARS-CoV-2, discovers regulated RNA isoforms at the single-cell level, detects the vast sequence diversity of adaptive immune receptors, and uncovers biology in non-model organisms undocumented in their reference genomes: geographic and seasonal variation and diatom association in eelgrass, an oceanic plant impacted by climate change, and tissue-specific transcripts in octopus. SPLASH is a unifying approach to genomic analysis that enables expansive discovery without metadata or references.
Subject(s)
Algorithms , Genomics , Genome , Sequence Analysis, RNA , Humans , HLA Antigens/genetics , Single-Cell AnalysisABSTRACT
Each year, the American Cancer Society estimates the numbers of new cancer cases and deaths in the United States and compiles the most recent data on population-based cancer occurrence and outcomes using incidence data collected by central cancer registries (through 2020) and mortality data collected by the National Center for Health Statistics (through 2021). In 2024, 2,001,140 new cancer cases and 611,720 cancer deaths are projected to occur in the United States. Cancer mortality continued to decline through 2021, averting over 4 million deaths since 1991 because of reductions in smoking, earlier detection for some cancers, and improved treatment options in both the adjuvant and metastatic settings. However, these gains are threatened by increasing incidence for 6 of the top 10 cancers. Incidence rates increased during 2015-2019 by 0.6%-1% annually for breast, pancreas, and uterine corpus cancers and by 2%-3% annually for prostate, liver (female), kidney, and human papillomavirus-associated oral cancers and for melanoma. Incidence rates also increased by 1%-2% annually for cervical (ages 30-44 years) and colorectal cancers (ages <55 years) in young adults. Colorectal cancer was the fourth-leading cause of cancer death in both men and women younger than 50 years in the late-1990s but is now first in men and second in women. Progress is also hampered by wide persistent cancer disparities; compared to White people, mortality rates are two-fold higher for prostate, stomach and uterine corpus cancers in Black people and for liver, stomach, and kidney cancers in Native American people. Continued national progress will require increased investment in cancer prevention and access to equitable treatment, especially among American Indian and Alaska Native and Black individuals.
Subject(s)
Melanoma , Neoplasms , Male , Young Adult , Humans , Female , United States/epidemiology , Neoplasms/epidemiology , Neoplasms/therapy , Registries , Incidence , Smoking , WhiteABSTRACT
Adaptation is an essential feature of auditory neurons, which reduces their responses to unchanging and recurring sounds and allows their response properties to be matched to the constantly changing statistics of sounds that reach the ears. As a consequence, processing in the auditory system highlights novel or unpredictable sounds and produces an efficient representation of the vast range of sounds that animals can perceive by continually adjusting the sensitivity and, to a lesser extent, the tuning properties of neurons to the most commonly encountered stimulus values. Together with attentional modulation, adaptation to sound statistics also helps to generate neural representations of sound that are tolerant to background noise and therefore plays a vital role in auditory scene analysis. In this review, we consider the diverse forms of adaptation that are found in the auditory system in terms of the processing levels at which they arise, the underlying neural mechanisms, and their impact on neural coding and perception. We also ask what the dynamics of adaptation, which can occur over multiple timescales, reveal about the statistical properties of the environment. Finally, we examine how adaptation to sound statistics is influenced by learning and experience and changes as a result of aging and hearing loss.
Subject(s)
Auditory Cortex , Animals , Acoustic Stimulation , Auditory Cortex/physiology , Auditory Perception/physiology , Noise , Adaptation, Physiological/physiologyABSTRACT
Each year, the American Cancer Society estimates the numbers of new cancer cases and deaths in the United States and compiles the most recent data on population-based cancer occurrence and outcomes using incidence data collected by central cancer registries and mortality data collected by the National Center for Health Statistics. In 2023, 1,958,310 new cancer cases and 609,820 cancer deaths are projected to occur in the United States. Cancer incidence increased for prostate cancer by 3% annually from 2014 through 2019 after two decades of decline, translating to an additional 99,000 new cases; otherwise, however, incidence trends were more favorable in men compared to women. For example, lung cancer in women decreased at one half the pace of men (1.1% vs. 2.6% annually) from 2015 through 2019, and breast and uterine corpus cancers continued to increase, as did liver cancer and melanoma, both of which stabilized in men aged 50 years and older and declined in younger men. However, a 65% drop in cervical cancer incidence during 2012 through 2019 among women in their early 20s, the first cohort to receive the human papillomavirus vaccine, foreshadows steep reductions in the burden of human papillomavirus-associated cancers, the majority of which occur in women. Despite the pandemic, and in contrast with other leading causes of death, the cancer death rate continued to decline from 2019 to 2020 (by 1.5%), contributing to a 33% overall reduction since 1991 and an estimated 3.8 million deaths averted. This progress increasingly reflects advances in treatment, which are particularly evident in the rapid declines in mortality (approximately 2% annually during 2016 through 2020) for leukemia, melanoma, and kidney cancer, despite stable/increasing incidence, and accelerated declines for lung cancer. In summary, although cancer mortality rates continue to decline, future progress may be attenuated by rising incidence for breast, prostate, and uterine corpus cancers, which also happen to have the largest racial disparities in mortality.
Subject(s)
Lung Neoplasms , Melanoma , Multiple Endocrine Neoplasia Type 1 , Neoplasms , Male , Humans , Female , United States/epidemiology , Middle Aged , Aged , Neoplasms/epidemiology , Registries , Incidence , Racial Groups , Lung Neoplasms/epidemiologyABSTRACT
The number of cancer survivors continues to increase in the United States due to the growth and aging of the population as well as advances in early detection and treatment. To assist the public health community in better serving these individuals, the American Cancer Society and the National Cancer Institute collaborate triennially to estimate cancer prevalence in the United States using incidence and survival data from the Surveillance, Epidemiology, and End Results cancer registries, vital statistics from the Centers for Disease Control and Prevention's National Center for Health Statistics, and population projections from the US Census Bureau. Current treatment patterns based on information in the National Cancer Database are presented for the most prevalent cancer types by race, and cancer-related and treatment-related side-effects are also briefly described. More than 18 million Americans (8.3 million males and 9.7 million females) with a history of cancer were alive on January 1, 2022. The 3 most prevalent cancers are prostate (3,523,230), melanoma of the skin (760,640), and colon and rectum (726,450) among males and breast (4,055,770), uterine corpus (891,560), and thyroid (823,800) among females. More than one-half (53%) of survivors were diagnosed within the past 10 years, and two-thirds (67%) were aged 65 years or older. One of the largest racial disparities in treatment is for rectal cancer, for which 41% of Black patients with stage I disease receive proctectomy or proctocolectomy compared to 66% of White patients. Surgical receipt is also substantially lower among Black patients with non-small cell lung cancer, 49% for stages I-II and 16% for stage III versus 55% and 22% for White patients, respectively. These treatment disparities are exacerbated by the fact that Black patients continue to be less likely to be diagnosed with stage I disease than White patients for most cancers, with some of the largest disparities for female breast (53% vs 68%) and endometrial (59% vs 73%). Although there are a growing number of tools that can assist patients, caregivers, and clinicians in navigating the various phases of cancer survivorship, further evidence-based strategies and equitable access to available resources are needed to mitigate disparities for communities of color and optimize care for people with a history of cancer. CA Cancer J Clin. 2022;72:409-436.
Subject(s)
Carcinoma, Non-Small-Cell Lung , Lung Neoplasms , American Cancer Society , Female , Humans , Male , National Cancer Institute (U.S.) , Survivorship , United States/epidemiologyABSTRACT
Each year, the American Cancer Society estimates the numbers of new cancer cases and deaths in the United States and compiles the most recent data on population-based cancer occurrence and outcomes. Incidence data (through 2018) were collected by the Surveillance, Epidemiology, and End Results program; the National Program of Cancer Registries; and the North American Association of Central Cancer Registries. Mortality data (through 2019) were collected by the National Center for Health Statistics. In 2022, 1,918,030 new cancer cases and 609,360 cancer deaths are projected to occur in the United States, including approximately 350 deaths per day from lung cancer, the leading cause of cancer death. Incidence during 2014 through 2018 continued a slow increase for female breast cancer (by 0.5% annually) and remained stable for prostate cancer, despite a 4% to 6% annual increase for advanced disease since 2011. Consequently, the proportion of prostate cancer diagnosed at a distant stage increased from 3.9% to 8.2% over the past decade. In contrast, lung cancer incidence continued to decline steeply for advanced disease while rates for localized-stage increased suddenly by 4.5% annually, contributing to gains both in the proportion of localized-stage diagnoses (from 17% in 2004 to 28% in 2018) and 3-year relative survival (from 21% to 31%). Mortality patterns reflect incidence trends, with declines accelerating for lung cancer, slowing for breast cancer, and stabilizing for prostate cancer. In summary, progress has stagnated for breast and prostate cancers but strengthened for lung cancer, coinciding with changes in medical practice related to cancer screening and/or treatment. More targeted cancer control interventions and investment in improved early detection and treatment would facilitate reductions in cancer mortality.
Subject(s)
Breast Neoplasms/epidemiology , Early Detection of Cancer/statistics & numerical data , Lung Neoplasms/epidemiology , Prostatic Neoplasms/epidemiology , American Cancer Society , Breast Neoplasms/diagnosis , Early Detection of Cancer/trends , Female , Humans , Incidence , Lung Neoplasms/diagnosis , Male , Neoplasm Staging , Prostatic Neoplasms/diagnosis , SEER Program/statistics & numerical data , Survival Rate , United States/epidemiologyABSTRACT
African American/Black individuals have a disproportionate cancer burden, including the highest mortality and the lowest survival of any racial/ethnic group for most cancers. Every 3 years, the American Cancer Society estimates the number of new cancer cases and deaths for Black people in the United States and compiles the most recent data on cancer incidence (herein through 2018), mortality (through 2019), survival, screening, and risk factors using population-based data from the National Cancer Institute and the Centers for Disease Control and Prevention. In 2022, there will be approximately 224,080 new cancer cases and 73,680 cancer deaths among Black people in the United States. During the most recent 5-year period, Black men had a 6% higher incidence rate but 19% higher mortality than White men overall, including an approximately 2-fold higher risk of death from myeloma, stomach cancer, and prostate cancer. The overall cancer mortality disparity is narrowing between Black and White men because of a steeper drop in Black men for lung and prostate cancers. However, the decline in prostate cancer mortality in Black men slowed from 5% annually during 2010 through 2014 to 1.3% during 2015 through 2019, likely reflecting the 5% annual increase in advanced-stage diagnoses since 2012. Black women have an 8% lower incidence rate than White women but a 12% higher mortality; further, mortality rates are 2-fold higher for endometrial cancer and 41% higher for breast cancer despite similar or lower incidence rates. The wide breast cancer disparity reflects both later stage diagnosis (57% localized stage vs 67% in White women) and lower 5-year survival overall (82% vs 92%, respectively) and for every stage of disease (eg, 20% vs 30%, respectively, for distant stage). Breast cancer surpassed lung cancer as the leading cause of cancer death among Black women in 2019. Targeted interventions are needed to reduce stark cancer inequalities in the Black community.
Subject(s)
Breast Neoplasms , Prostatic Neoplasms , Black or African American , American Cancer Society , Female , Humans , Male , National Cancer Institute (U.S.) , United States/epidemiologyABSTRACT
As acquiring bigger data becomes easier in experimental brain science, computational and statistical brain science must achieve similar advances to fully capitalize on these data. Tackling these problems will benefit from a more explicit and concerted effort to work together. Specifically, brain science can be further democratized by harnessing the power of community-driven tools, which both are built by and benefit from many different people with different backgrounds and expertise. This perspective can be applied across modalities and scales and enables collaborations across previously siloed communities.
Subject(s)
Big Data , Brain/physiology , Computational Biology , Nerve Net/physiology , Animals , Computational Biology/methods , Databases, Genetic , Gene Expression/physiology , HumansABSTRACT
The Hispanic/Latino population is the second largest racial/ethnic group in the continental United States and Hawaii, accounting for 18% (60.6 million) of the total population. An additional 3 million Hispanic Americans live in Puerto Rico. Every 3 years, the American Cancer Society reports on cancer occurrence, risk factors, and screening for Hispanic individuals in the United States using the most recent population-based data. An estimated 176,600 new cancer cases and 46,500 cancer deaths will occur among Hispanic individuals in the continental United States and Hawaii in 2021. Compared to non-Hispanic Whites (NHWs), Hispanic men and women had 25%-30% lower incidence (2014-2018) and mortality (2015-2019) rates for all cancers combined and lower rates for the most common cancers, although this gap is diminishing. For example, the colorectal cancer (CRC) incidence rate ratio for Hispanic compared with NHW individuals narrowed from 0.75 (95% CI, 0.73-0.78) in 1995 to 0.91 (95% CI, 0.89-0.93) in 2018, reflecting delayed declines in CRC rates among Hispanic individuals in part because of slower uptake of screening. In contrast, Hispanic individuals have higher rates of infection-related cancers, including approximately two-fold higher incidence of liver and stomach cancer. Cervical cancer incidence is 32% higher among Hispanic women in the continental US and Hawaii and 78% higher among women in Puerto Rico compared to NHW women, yet is largely preventable through screening. Less access to care may be similarly reflected in the low prevalence of localized-stage breast cancer among Hispanic women, 59% versus 67% among NHW women. Evidence-based strategies for decreasing the cancer burden among the Hispanic population include the use of culturally appropriate lay health advisors and patient navigators and targeted, community-based intervention programs to facilitate access to screening and promote healthy behaviors. In addition, the impact of the COVID-19 pandemic on cancer trends and disparities in the Hispanic population should be closely monitored.
Subject(s)
Early Detection of Cancer/statistics & numerical data , Health Services Accessibility/statistics & numerical data , Hispanic or Latino/statistics & numerical data , Neoplasms/ethnology , Adolescent , Adult , Aged , Female , Humans , Incidence , Male , Middle Aged , Neoplasms/mortality , Neoplasms/prevention & control , Puerto Rico/epidemiology , Risk Factors , Survival Rate , United States/epidemiology , White People/statistics & numerical data , Young AdultABSTRACT
Each year, the American Cancer Society estimates the numbers of new cancer cases and deaths in the United States and compiles the most recent data on population-based cancer occurrence. Incidence data (through 2017) were collected by the Surveillance, Epidemiology, and End Results Program; the National Program of Cancer Registries; and the North American Association of Central Cancer Registries. Mortality data (through 2018) were collected by the National Center for Health Statistics. In 2021, 1,898,160 new cancer cases and 608,570 cancer deaths are projected to occur in the United States. After increasing for most of the 20th century, the cancer death rate has fallen continuously from its peak in 1991 through 2018, for a total decline of 31%, because of reductions in smoking and improvements in early detection and treatment. This translates to 3.2 million fewer cancer deaths than would have occurred if peak rates had persisted. Long-term declines in mortality for the 4 leading cancers have halted for prostate cancer and slowed for breast and colorectal cancers, but accelerated for lung cancer, which accounted for almost one-half of the total mortality decline from 2014 to 2018. The pace of the annual decline in lung cancer mortality doubled from 3.1% during 2009 through 2013 to 5.5% during 2014 through 2018 in men, from 1.8% to 4.4% in women, and from 2.4% to 5% overall. This trend coincides with steady declines in incidence (2.2%-2.3%) but rapid gains in survival specifically for nonsmall cell lung cancer (NSCLC). For example, NSCLC 2-year relative survival increased from 34% for persons diagnosed during 2009 through 2010 to 42% during 2015 through 2016, including absolute increases of 5% to 6% for every stage of diagnosis; survival for small cell lung cancer remained at 14% to 15%. Improved treatment accelerated progress against lung cancer and drove a record drop in overall cancer mortality, despite slowing momentum for other common cancers.
Subject(s)
Mortality/trends , Neoplasms/epidemiology , SEER Program/statistics & numerical data , American Cancer Society , Humans , Incidence , Neoplasms/therapy , United States/epidemiologyABSTRACT
Mendelian randomization (MR), which utilizes genetic variants as instrumental variables (IVs), has gained popularity as a method for causal inference between phenotypes using genetic data. While efforts have been made to relax IV assumptions and develop new methods for causal inference in the presence of invalid IVs due to confounding, the reliability of MR methods in real-world applications remains uncertain. Instead of using simulated datasets, we conducted a benchmark study evaluating 16 two-sample summary-level MR methods using real-world genetic datasets to provide guidelines for the best practices. Our study focused on the following crucial aspects: type I error control in the presence of various confounding scenarios (e.g., population stratification, pleiotropy, and family-level confounders like assortative mating), the accuracy of causal effect estimates, replicability, and power. By comprehensively evaluating the performance of compared methods over one thousand exposure-outcome trait pairs, our study not only provides valuable insights into the performance and limitations of the compared methods but also offers practical guidance for researchers to choose appropriate MR methods for causal inference.
Subject(s)
Benchmarking , Genome-Wide Association Study , Mendelian Randomization Analysis , Mendelian Randomization Analysis/methods , Humans , Genome-Wide Association Study/methods , Phenotype , Genetic Variation , Causality , Polymorphism, Single Nucleotide , Models, GeneticABSTRACT
Both trio and population designs are popular study designs for identifying risk genetic variants in genome-wide association studies (GWASs). The trio design, as a family-based design, is robust to confounding due to population structure, whereas the population design is often more powerful due to larger sample sizes. Here, we propose KnockoffHybrid, a knockoff-based statistical method for hybrid analysis of both the trio and population designs. KnockoffHybrid provides a unified framework that brings together the advantages of both designs and produces powerful hybrid analysis while controlling the false discovery rate (FDR) in the presence of linkage disequilibrium and population structure. Furthermore, KnockoffHybrid has the flexibility to leverage different types of summary statistics for hybrid analyses, including expression quantitative trait loci (eQTL) and GWAS summary statistics. We demonstrate in simulations that KnockoffHybrid offers power gains over non-hybrid methods for the trio and population designs with the same number of cases while controlling the FDR with complex correlation among variants and population structure among subjects. In hybrid analyses of three trio cohorts for autism spectrum disorders (ASDs) from the Autism Speaks MSSNG, Autism Sequencing Consortium, and Autism Genome Project with GWAS summary statistics from the iPSYCH project and eQTL summary statistics from the MetaBrain project, KnockoffHybrid outperforms conventional methods by replicating several known risk genes for ASDs and identifying additional associations with variants in other genes, including the PRAME family genes involved in axon guidance and which may act as common targets for human speech/language evolution and related disorders.
Subject(s)
Autism Spectrum Disorder , Genome-Wide Association Study , Linkage Disequilibrium , Quantitative Trait Loci , Genome-Wide Association Study/methods , Humans , Autism Spectrum Disorder/genetics , Genetic Predisposition to Disease , Polymorphism, Single Nucleotide , Computer Simulation , Models, GeneticABSTRACT
Each year, the American Cancer Society estimates the numbers of new cancer cases and deaths that will occur in the United States and compiles the most recent data on population-based cancer occurrence. Incidence data (through 2016) were collected by the Surveillance, Epidemiology, and End Results Program; the National Program of Cancer Registries; and the North American Association of Central Cancer Registries. Mortality data (through 2017) were collected by the National Center for Health Statistics. In 2020, 1,806,590 new cancer cases and 606,520 cancer deaths are projected to occur in the United States. The cancer death rate rose until 1991, then fell continuously through 2017, resulting in an overall decline of 29% that translates into an estimated 2.9 million fewer cancer deaths than would have occurred if peak rates had persisted. This progress is driven by long-term declines in death rates for the 4 leading cancers (lung, colorectal, breast, prostate); however, over the past decade (2008-2017), reductions slowed for female breast and colorectal cancers, and halted for prostate cancer. In contrast, declines accelerated for lung cancer, from 3% annually during 2008 through 2013 to 5% during 2013 through 2017 in men and from 2% to almost 4% in women, spurring the largest ever single-year drop in overall cancer mortality of 2.2% from 2016 to 2017. Yet lung cancer still caused more deaths in 2017 than breast, prostate, colorectal, and brain cancers combined. Recent mortality declines were also dramatic for melanoma of the skin in the wake of US Food and Drug Administration approval of new therapies for metastatic disease, escalating to 7% annually during 2013 through 2017 from 1% during 2006 through 2010 in men and women aged 50 to 64 years and from 2% to 3% in those aged 20 to 49 years; annual declines of 5% to 6% in individuals aged 65 years and older are particularly striking because rates in this age group were increasing prior to 2013. It is also notable that long-term rapid increases in liver cancer mortality have attenuated in women and stabilized in men. In summary, slowing momentum for some cancers amenable to early detection is juxtaposed with notable gains for other common cancers.
Subject(s)
American Cancer Society , Neoplasms/epidemiology , Registries , SEER Program/statistics & numerical data , Adult , Aged , Female , Humans , Incidence , Male , Middle Aged , Survival Rate/trends , United States/epidemiology , Young AdultABSTRACT
Epidemiologic associations estimated from observational data are often confounded by genetics due to pervasive pleiotropy among complex traits. Many studies either neglect genetic confounding altogether or rely on adjusting for polygenic scores (PGS) in regression analysis. In this study, we unveil that the commonly employed PGS approach is inadequate for removing genetic confounding due to measurement error and model misspecification. To tackle this challenge, we introduce PENGUIN, a principled framework for polygenic genetic confounding control based on variance component estimation. In addition, we present extensions of this approach that can estimate genetically unconfounded associations using GWAS summary statistics alone as input and between multiple generations of study samples. Through simulations, we demonstrate superior statistical properties of PENGUIN compared to the existing approaches. Applying our method to multiple population cohorts, we reveal and remove substantial genetic confounding in the associations of educational attainment with various complex traits and between parental and offspring education. Our results show that PENGUIN is an effective solution for genetic confounding control in observational data analysis with broad applications in future epidemiologic association studies.
Subject(s)
Genome-Wide Association Study , Multifactorial Inheritance , Multifactorial Inheritance/genetics , Humans , Genome-Wide Association Study/methods , Models, Genetic , Confounding Factors, EpidemiologicABSTRACT
Localized deformation and randomly shaped imperfections are salient features of buckling-type instabilities in thin-walled load-bearing structures. However, it is generally agreed that their complex interactions in response to mechanical loading are not yet sufficiently understood, as evidenced by buckling-induced catastrophic failures which continue to today. This study investigates how the intimate coupling between localization mechanisms and geometric imperfections combine to determine the statistics of the pressure required to buckle (the illustrative example of) a hemispherical shell. The geometric imperfections, in the form of a surface, are defined by a random field generated over the nominally hemispherical shell geometry, and the probability distribution of the buckling pressure is computed via stochastic finite element analysis. Monte-Carlo simulations are performed for a wide range of the shell's radius to thickness ratio, as well as the correlation length of the spatial distribution of the imperfection. The results show that over this range, the buckling pressure is captured by the Weibull distribution. In addition, the analyses of the deformation patterns observed during the simulations provide insights into the effects of certain characteristic lengths on the local buckling that triggers global instability. In light of the simulation results, a probabilistic model is developed for the statistics of the buckling load that reveals how the dimensionless radius plays a dual role which remained hidden in previous deterministic analyses. The implications of the present model for reliability-based design of shell structures are discussed.
ABSTRACT
RNA decay is a crucial mechanism for regulating gene expression in response to environmental stresses. In bacteria, RNA-binding proteins (RBPs) are known to be involved in posttranscriptional regulation, but their global impact on RNA half-lives has not been extensively studied. To shed light on the role of the major RBPs ProQ and CspC/E in maintaining RNA stability, we performed RNA sequencing of Salmonella enterica over a time course following treatment with the transcription initiation inhibitor rifampicin (RIF-seq) in the presence and absence of these RBPs. We developed a hierarchical Bayesian model that corrects for confounding factors in rifampicin RNA stability assays and enables us to identify differentially decaying transcripts transcriptome-wide. Our analysis revealed that the median RNA half-life in Salmonella in early stationary phase is less than 1 min, a third of previous estimates. We found that over half of the 500 most long-lived transcripts are bound by at least one major RBP, suggesting a general role for RBPs in shaping the transcriptome. Integrating differential stability estimates with cross-linking and immunoprecipitation followed by RNA sequencing (CLIP-seq) revealed that approximately 30% of transcripts with ProQ binding sites and more than 40% with CspC/E binding sites in coding or 3' untranslated regions decay differentially in the absence of the respective RBP. Analysis of differentially destabilized transcripts identified a role for ProQ in the oxidative stress response. Our findings provide insights into posttranscriptional regulation by ProQ and CspC/E, and the importance of RBPs in regulating gene expression.
Subject(s)
Gene Expression Profiling , Rifampin , Bayes Theorem , Half-Life , Transcriptome , RNA-Binding Proteins/metabolism , RNA/metabolism , Salmonella/metabolism , RNA Stability/geneticsABSTRACT
The heritability explained by local ancestry markers in an admixed population (hγ2) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of hγ2 can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA hγ2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of â¼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe hËγ2 in the 20 phenotypes range from 0.0025 to 0.033 (mean hËγ2 = 0.012 ± 9.2 × 10-4), which translates to hË2 ranging from 0.062 to 0.85 (mean hË2 = 0.30 ± 0.023). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.
Subject(s)
Black or African American , Genetics, Population , Humans , Chromosome Mapping , Phenotype , Polymorphism, Single Nucleotide/geneticsABSTRACT
Multivariate analysis is becoming central in studies investigating high-throughput molecular data, yet, some important features of these data are seldom explored. Here, we present MANOCCA (Multivariate Analysis of Conditional CovAriance), a powerful method to test for the effect of a predictor on the covariance matrix of a multivariate outcome. The proposed test is by construction orthogonal to tests based on the mean and variance and is able to capture effects that are missed by both approaches. We first compare the performances of MANOCCA with existing correlation-based methods and show that MANOCCA is the only test correctly calibrated in simulation mimicking omics data. We then investigate the impact of reducing the dimensionality of the data using principal component analysis when the sample size is smaller than the number of pairwise covariance terms analysed. We show that, in many realistic scenarios, the maximum power can be achieved with a limited number of components. Finally, we apply MANOCCA to 1000 healthy individuals from the Milieu Interieur cohort, to assess the effect of health, lifestyle and genetic factors on the covariance of two sets of phenotypes, blood biomarkers and flow cytometry-based immune phenotypes. Our analyses identify significant associations between multiple factors and the covariance of both omics data.
Subject(s)
Principal Component Analysis , Humans , Multivariate Analysis , Computational Biology/methods , Phenotype , Algorithms , Genomics/methods , Biomarkers/blood , Computer SimulationABSTRACT
Batch effects introduce significant variability into high-dimensional data, complicating accurate analysis and leading to potentially misleading conclusions if not adequately addressed. Despite technological and algorithmic advancements in biomedical research, effectively managing batch effects remains a complex challenge requiring comprehensive considerations. This paper underscores the necessity of a flexible and holistic approach for selecting batch effect correction algorithms (BECAs), advocating for proper BECA evaluations and consideration of artificial intelligence-based strategies. We also discuss key challenges in batch effect correction, including the importance of uncovering hidden batch factors and understanding the impact of design imbalance, missing values, and aggressive correction. Our aim is to provide researchers with a robust framework for effective batch effects management and enhancing the reliability of high-dimensional data analyses.