RESUMEN
Understanding population health disparities is an essential component of equitable precision health efforts. Epidemiology research often relies on definitions of race and ethnicity, but these population labels may not adequately capture disease burdens and environmental factors impacting specific sub-populations. Here, we propose a framework for repurposing data from electronic health records (EHRs) in concert with genomic data to explore the demographic ties that can impact disease burdens. Using data from a diverse biobank in New York City, we identified 17 communities sharing recent genetic ancestry. We observed 1,177 health outcomes that were statistically associated with a specific group and demonstrated significant differences in the segregation of genetic variants contributing to Mendelian diseases. We also demonstrated that fine-scale population structure can impact the prediction of complex disease risk within groups. This work reinforces the utility of linking genomic data to EHRs and provides a framework toward fine-scale monitoring of population health.
Asunto(s)
Etnicidad/genética , Salud Poblacional , Bases de Datos Genéticas , Registros Electrónicos de Salud , Genómica , Humanos , AutoinformeRESUMEN
Current Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) using short-read sequencing strategies resolve expressed Ab transcripts with limited resolution of the C region. In this article, we present the near-full-length AIRR-seq (FLAIRR-seq) method that uses targeted amplification by 5' RACE, combined with single-molecule, real-time sequencing to generate highly accurate (99.99%) human Ab H chain transcripts. FLAIRR-seq was benchmarked by comparing H chain V (IGHV), D (IGHD), and J (IGHJ) gene usage, complementarity-determining region 3 length, and somatic hypermutation to matched datasets generated with standard 5' RACE AIRR-seq using short-read sequencing and full-length isoform sequencing. Together, these data demonstrate robust FLAIRR-seq performance using RNA samples derived from PBMCs, purified B cells, and whole blood, which recapitulated results generated by commonly used methods, while additionally resolving H chain gene features not documented in IMGT at the time of submission. FLAIRR-seq data provide, for the first time, to our knowledge, simultaneous single-molecule characterization of IGHV, IGHD, IGHJ, and IGHC region genes and alleles, allele-resolved subisotype definition, and high-resolution identification of class switch recombination within a clonal lineage. In conjunction with genomic sequencing and genotyping of IGHC genes, FLAIRR-seq of the IgM and IgG repertoires from 10 individuals resulted in the identification of 32 unique IGHC alleles, 28 (87%) of which were previously uncharacterized. Together, these data demonstrate the capabilities of FLAIRR-seq to characterize IGHV, IGHD, IGHJ, and IGHC gene diversity for the most comprehensive view of bulk-expressed Ab repertoires to date.
Asunto(s)
Regiones Determinantes de Complementariedad , Humanos , Regiones Determinantes de Complementariedad/genética , Secuencia de BasesRESUMEN
Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p = 5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights.
Asunto(s)
COVID-19 , Exoma , Humanos , Exoma/genética , Estudio de Asociación del Genoma Completo , COVID-19/genética , Predisposición Genética a la Enfermedad , Receptor Toll-Like 7/genética , SARS-CoV-2/genéticaRESUMEN
Insulin resistance (IR) precedes the development of type 2 diabetes (T2D) and increases cardiovascular disease risk. Although genome wide association studies (GWAS) have uncovered new loci associated with T2D, their contribution to explain the mechanisms leading to decreased insulin sensitivity has been very limited. Thus, new approaches are necessary to explore the genetic architecture of insulin resistance. To that end, we generated an iPSC library across the spectrum of insulin sensitivity in humans. RNA-seq based analysis of 310 induced pluripotent stem cell (iPSC) clones derived from 100 individuals allowed us to identify differentially expressed genes between insulin resistant and sensitive iPSC lines. Analysis of the co-expression architecture uncovered several insulin sensitivity-relevant gene sub-networks, and predictive network modeling identified a set of key driver genes that regulate these co-expression modules. Functional validation in human adipocytes and skeletal muscle cells (SKMCs) confirmed the relevance of the key driver candidate genes for insulin responsiveness.
Asunto(s)
Redes Reguladoras de Genes , Células Madre Pluripotentes Inducidas/metabolismo , Resistencia a la Insulina/genética , Insulina/metabolismo , HumanosRESUMEN
BACKGROUND: COVID-19 has infected millions of people worldwide and is responsible for several hundred thousand fatalities. The COVID-19 pandemic has necessitated thoughtful resource allocation and early identification of high-risk patients. However, effective methods to meet these needs are lacking. OBJECTIVE: The aims of this study were to analyze the electronic health records (EHRs) of patients who tested positive for COVID-19 and were admitted to hospitals in the Mount Sinai Health System in New York City; to develop machine learning models for making predictions about the hospital course of the patients over clinically meaningful time horizons based on patient characteristics at admission; and to assess the performance of these models at multiple hospitals and time points. METHODS: We used Extreme Gradient Boosting (XGBoost) and baseline comparator models to predict in-hospital mortality and critical events at time windows of 3, 5, 7, and 10 days from admission. Our study population included harmonized EHR data from five hospitals in New York City for 4098 COVID-19-positive patients admitted from March 15 to May 22, 2020. The models were first trained on patients from a single hospital (n=1514) before or on May 1, externally validated on patients from four other hospitals (n=2201) before or on May 1, and prospectively validated on all patients after May 1 (n=383). Finally, we established model interpretability to identify and rank variables that drive model predictions. RESULTS: Upon cross-validation, the XGBoost classifier outperformed baseline models, with an area under the receiver operating characteristic curve (AUC-ROC) for mortality of 0.89 at 3 days, 0.85 at 5 and 7 days, and 0.84 at 10 days. XGBoost also performed well for critical event prediction, with an AUC-ROC of 0.80 at 3 days, 0.79 at 5 days, 0.80 at 7 days, and 0.81 at 10 days. In external validation, XGBoost achieved an AUC-ROC of 0.88 at 3 days, 0.86 at 5 days, 0.86 at 7 days, and 0.84 at 10 days for mortality prediction. Similarly, the unimputed XGBoost model achieved an AUC-ROC of 0.78 at 3 days, 0.79 at 5 days, 0.80 at 7 days, and 0.81 at 10 days. Trends in performance on prospective validation sets were similar. At 7 days, acute kidney injury on admission, elevated LDH, tachypnea, and hyperglycemia were the strongest drivers of critical event prediction, while higher age, anion gap, and C-reactive protein were the strongest drivers of mortality prediction. CONCLUSIONS: We externally and prospectively trained and validated machine learning models for mortality and critical events for patients with COVID-19 at different time horizons. These models identified at-risk patients and uncovered underlying relationships that predicted outcomes.
Asunto(s)
Infecciones por Coronavirus/diagnóstico , Infecciones por Coronavirus/mortalidad , Aprendizaje Automático/normas , Neumonía Viral/diagnóstico , Neumonía Viral/mortalidad , Lesión Renal Aguda/epidemiología , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Betacoronavirus , COVID-19 , Estudios de Cohortes , Registros Electrónicos de Salud , Femenino , Mortalidad Hospitalaria , Hospitalización/estadística & datos numéricos , Hospitales , Humanos , Masculino , Persona de Mediana Edad , Ciudad de Nueva York/epidemiología , Pandemias , Pronóstico , Curva ROC , Medición de Riesgo/métodos , Medición de Riesgo/normas , SARS-CoV-2 , Adulto JovenRESUMEN
Our hypothesis is that changes in gene and protein expression are crucial to the development of late-onset Alzheimer’s disease. Previously we examined how DNA alleles control downstream expression of RNA transcripts and how those relationships are changed in late-onset Alzheimer’s disease. We have now examined how proteins are incorporated into networks in two separate series and evaluated our outputs in two different cell lines. Our pipeline included the following steps: (i) predicting expression quantitative trait loci; (ii) determining differential expression; (iii) analysing networks of transcript and peptide relationships; and (iv) validating effects in two separate cell lines. We performed all our analysis in two separate brain series to validate effects. Our two series included 345 samples in the first set (177 controls, 168 cases; age range 65–105; 58% female; KRONOSII cohort) and 409 samples in the replicate set (153 controls, 141 cases, 115 mild cognitive impairment; age range 66–107; 63% female; RUSH cohort). Our top target is heat shock protein family A member 2 (HSPA2), which was identified as a key driver in our two datasets. HSPA2 was validated in two cell lines, with overexpression driving further elevation of amyloid-β40 and amyloid-β42 levels in APP mutant cells, as well as significant elevation of microtubule associated protein tau and phosphorylated-tau in a modified neuroglioma line. This work further demonstrates that studying changes in gene and protein expression is crucial to understanding late onset disease and further nominates HSPA2 as a specific key regulator of late-onset Alzheimer’s disease processes.10.1093/brain/awy215_video1awy215media15824729224001.
Asunto(s)
Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/fisiopatología , Proteínas HSP70 de Choque Térmico/fisiología , Anciano , Anciano de 80 o más Años , Enfermedad de Alzheimer/metabolismo , Encéfalo/metabolismo , Mapeo Encefálico/métodos , Línea Celular , Femenino , Perfilación de la Expresión Génica/métodos , Células HEK293 , Proteínas HSP70 de Choque Térmico/genética , Proteínas HSP70 de Choque Térmico/metabolismo , Humanos , Masculino , Red Nerviosa/fisiopatología , Procesamiento Proteico-Postraduccional , ARN/análisis , ARN/metabolismo , Transcriptoma/genéticaRESUMEN
Assigning cancer patients to the most effective treatments requires an understanding of the molecular basis of their disease. While DNA-based molecular profiling approaches have flourished over the past several years to transform our understanding of driver pathways across a broad range of tumors, a systematic characterization of key driver pathways based on RNA data has not been undertaken. Here we introduce a new approach for predicting the status of driver cancer pathways based on signature functions derived from RNA sequencing data. To identify the driver cancer pathways of interest, we mined DNA variant data from TCGA and nominated driver alterations in seven major cancer pathways in breast, ovarian and colon cancer tumors. The activation status of these driver pathways were then characterized using RNA sequencing data by constructing classification signature functions in training datasets and then testing the accuracy of the signatures in test datasets. The signature functions differentiate well tumors with nominated pathway activation from tumors with no signs of activation: average AUC equals to 0.83. Our results confirm that driver genomic alterations are distinctively displayed at the transcriptional level and that the transcriptional signatures can generally provide an alternative to DNA sequencing methods in detecting specific driver pathways.
Asunto(s)
Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Genómica , Neoplasias/genética , Transcriptoma , Algoritmos , Biomarcadores de Tumor , Biología Computacional/métodos , Bases de Datos Genéticas , Femenino , Marcadores Genéticos , Genómica/métodos , Humanos , Aprendizaje Automático , Masculino , Neoplasias/metabolismo , Transducción de SeñalRESUMEN
MOTIVATION: Underrepresentation of racial groups represents an important challenge and major gap in phenomics research. Most of the current human phenomics research is based primarily on European populations; hence it is an important challenge to expand it to consider other population groups. One approach is to utilize data from EMR databases that contain patient data from diverse demographics and ancestries. The implications of this racial underrepresentation of data can be profound regarding effects on the healthcare delivery and actionability. To the best of our knowledge, our work is the first attempt to perform comparative, population-scale analyses of disease networks across three different populations, namely Caucasian (EA), African American (AA) and Hispanic/Latino (HL). RESULTS: We compared susceptibility profiles and temporal connectivity patterns for 1988 diseases and 37 282 disease pairs represented in a clinical population of 1 025 573 patients. Accordingly, we revealed appreciable differences in disease susceptibility, temporal patterns, network structure and underlying disease connections between EA, AA and HL populations. We found 2158 significantly comorbid diseases for the EA cohort, 3265 for AA and 672 for HL. We further outlined key disease pair associations unique to each population as well as categorical enrichments of these pairs. Finally, we identified 51 key 'hub' diseases that are the focal points in the race-centric networks and of particular clinical importance. Incorporating race-specific disease comorbidity patterns will produce a more accurate and complete picture of the disease landscape overall and could support more precise understanding of disease relationships and patient management towards improved clinical outcomes. CONTACTS: rong.chen@mssm.edu or joel.dudley@mssm.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Registros Electrónicos de Salud , Negro o Afroamericano , Bases de Datos Factuales , Hispánicos o Latinos , Humanos , Población BlancaRESUMEN
Both obesity and being underweight have been associated with increased mortality. Underweight, defined as a body mass index (BMI) ≤ 18.5 kg per m(2) in adults and ≤ -2 standard deviations from the mean in children, is the main sign of a series of heterogeneous clinical conditions including failure to thrive, feeding and eating disorder and/or anorexia nervosa. In contrast to obesity, few genetic variants underlying these clinical conditions have been reported. We previously showed that hemizygosity of a â¼600-kilobase (kb) region on the short arm of chromosome 16 causes a highly penetrant form of obesity that is often associated with hyperphagia and intellectual disabilities. Here we show that the corresponding reciprocal duplication is associated with being underweight. We identified 138 duplication carriers (including 132 novel cases and 108 unrelated carriers) from individuals clinically referred for developmental or intellectual disabilities (DD/ID) or psychiatric disorders, or recruited from population-based cohorts. These carriers show significantly reduced postnatal weight and BMI. Half of the boys younger than five years are underweight with a probable diagnosis of failure to thrive, whereas adult duplication carriers have an 8.3-fold increased risk of being clinically underweight. We observe a trend towards increased severity in males, as well as a depletion of male carriers among non-medically ascertained cases. These features are associated with an unusually high frequency of selective and restrictive eating behaviours and a significant reduction in head circumference. Each of the observed phenotypes is the converse of one reported in carriers of deletions at this locus. The phenotypes correlate with changes in transcript levels for genes mapping within the duplication but not in flanking regions. The reciprocal impact of these 16p11.2 copy-number variants indicates that severe obesity and being underweight could have mirror aetiologies, possibly through contrasting effects on energy balance.
Asunto(s)
Índice de Masa Corporal , Cromosomas Humanos Par 16/genética , Dosificación de Gen/genética , Obesidad/genética , Fenotipo , Delgadez/genética , Adolescente , Adulto , Anciano , Envejecimiento , Estatura/genética , Estudios de Casos y Controles , Niño , Preescolar , Estudios de Cohortes , Hibridación Genómica Comparativa , Discapacidades del Desarrollo/genética , Metabolismo Energético/genética , Europa (Continente) , Femenino , Duplicación de Gen/genética , Perfilación de la Expresión Génica , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Cabeza/anatomía & histología , Heterocigoto , Humanos , Lactante , Recién Nacido , Masculino , Trastornos Mentales/genética , Persona de Mediana Edad , Mutación/genética , América del Norte , ARN Mensajero/análisis , ARN Mensajero/genética , Eliminación de Secuencia/genética , Transcripción Genética , Adulto JovenRESUMEN
BACKGROUND: It has recently become possible to rapidly and accurately detect epigenetic signatures in bacterial genomes using third generation sequencing data. Monitoring the speed at which a single polymerase inserts a base in the read strand enables one to infer whether a modification is present at that specific site on the template strand. These sites can be challenging to detect in the absence of high coverage and reliable reference genomes. METHODS: Here we provide a new method for detecting epigenetic motifs in bacteria on datasets with low-coverage, with incomplete references, and with mixed samples (i.e. metagenomic data). Our approach treats motif inference as a kmer comparison problem. First, genomes (or contigs) are deconstructed into kmers. Then, native genome-wide distributions of interpulse durations (IPDs) for kmers are compared with corresponding whole genome amplified (WGA, modification free) IPD distributions using log likelihood ratios. Finally, kmers are ranked and greedily selected by iteratively correcting for sequences within a particular kmer's neighborhood. CONCLUSIONS: Our method can detect multiple types of modifications, even at very low-coverage and in the presence of mixed genomes. Additionally, we are able to predict modified motifs when genomes with "neighbor" modified motifs exist within the sample. Lastly, we show that these motifs can provide an alternative source of information by which to cluster metagenomics contigs and that iterative refinement on these clustered contigs can further improve both sensitivity and specificity of motif detection. AVAILABILITY: https://github.com/alibashir/EMMCKmer.
Asunto(s)
Bacterias/genética , Epigénesis Genética , Genoma Bacteriano , Metagenómica/métodos , Motivos de Nucleótidos , Algoritmos , Secuencia de Bases , Simulación por Computador , Modelos GenéticosRESUMEN
Introduction: Dual specificity protein phosphatase 6 (DUSP6) was recently identified as a key hub gene in a causal VGF gene network that regulates late-onset Alzheimer's disease (AD). Importantly, decreased DUSP6 levels are correlated with an increased clinical dementia rating (CDR) in human subjects, and DUSP6 levels are additionally decreased in the 5xFAD amyloidopathy mouse model. Methods: To investigate the role of DUSP6 in AD, we stereotactically injected AAV5-DUSP6 or AAV5-GFP (control) into the dorsal hippocampus (dHc) of both female and male 5xFAD or wild type mice, to induce overexpression of DUSP6 or GFP. Results: Barnes maze testing indicated that DUSP6 overexpression in the dHc of 5xFAD mice improved memory deficits and was associated with reduced amyloid plaque load, Aß1-40 and Aß1-42 levels, and amyloid precursor protein processing enzyme BACE1, in male but not in female mice. Microglial activation, which was increased in 5xFAD mice, was significantly reduced by dHc DUSP6 overexpression in both males and females, as was the number of "microglial clusters," which correlated with reduced amyloid plaque size. Transcriptomic profiling of female 5xFAD hippocampus revealed upregulation of inflammatory and extracellular signal-regulated kinase pathways, while dHc DUSP6 overexpression in female 5xFAD mice downregulated a subset of genes in these pathways. Gene ontology analysis of DEGs (p < 0.05) identified a greater number of synaptic pathways that were regulated by DUSP6 overexpression in male compared to female 5xFAD. Discussion: In summary, DUSP6 overexpression in dHc reduced amyloid deposition and memory deficits in male but not female 5xFAD mice, whereas reduced neuroinflammation and microglial activation were observed in both males and females, suggesting that DUSP6-induced reduction of microglial activation did not contribute to sex-dependent improvement in memory deficits. The sex-dependent regulation of synaptic pathways by DUSP6 overexpression, however, correlated with the improvement of spatial memory deficits in male but not female 5xFAD.
RESUMEN
Single-nucleus RNA sequencing (snRNA-seq) is often used to define gene expression patterns characteristic of brain cell types as well as to identify cell type specific gene expression signatures of neurological and mental illnesses in postmortem human brains. As methods to obtain brain tissue from living individuals emerge, it is essential to characterize gene expression differences associated with tissue originating from either living or postmortem subjects using snRNA-seq, and to assess whether and how such differences may impact snRNA-seq studies of brain tissue. To address this, human prefrontal cortex single nuclei gene expression was generated and compared between 31 samples from living individuals and 21 postmortem samples. The same cell types were consistently identified in living and postmortem nuclei, though for each cell type, a large proportion of genes were differentially expressed between samples from postmortem and living individuals. Notably, estimation of cell type proportions by cell type deconvolution of pseudo-bulk data was found to be more accurate in samples from living individuals. To allow for future integration of living and postmortem brain gene expression, a model was developed that quantifies from gene expression data the probability a human brain tissue sample was obtained postmortem. These probabilities are established as a means to statistically account for the gene expression differences between samples from living and postmortem individuals. Together, the results presented here provide a deep characterization of both differences between snRNA-seq derived from samples from living and postmortem individuals, as well as qualify and account for their effect on common analyses performed on this type of data.
RESUMEN
Long Covid is a debilitating condition of unknown etiology. We performed multimodal proteomics analyses of blood serum from COVID-19 patients followed up to 12 months after confirmed severe acute respiratory syndrome coronavirus 2 infection. Analysis of >6500 proteins in 268 longitudinal samples revealed dysregulated activation of the complement system, an innate immune protection and homeostasis mechanism, in individuals experiencing Long Covid. Thus, active Long Covid was characterized by terminal complement system dysregulation and ongoing activation of the alternative and classical complement pathways, the latter associated with increased antibody titers against several herpesviruses possibly stimulating this pathway. Moreover, markers of hemolysis, tissue injury, platelet activation, and monocyte-platelet aggregates were increased in Long Covid. Machine learning confirmed complement and thromboinflammatory proteins as top biomarkers, warranting diagnostic and therapeutic interrogation of these systems.
Asunto(s)
Activación de Complemento , Proteínas del Sistema Complemento , Síndrome Post Agudo de COVID-19 , Proteoma , Tromboinflamación , Humanos , Proteínas del Sistema Complemento/análisis , Proteínas del Sistema Complemento/metabolismo , Síndrome Post Agudo de COVID-19/sangre , Síndrome Post Agudo de COVID-19/complicaciones , Síndrome Post Agudo de COVID-19/inmunología , Tromboinflamación/sangre , Tromboinflamación/inmunología , Biomarcadores/sangre , Proteómica , Masculino , Femenino , Adulto Joven , Adulto , Persona de Mediana Edad , AncianoRESUMEN
Adenosine-to-inosine (A-to-I) editing is a prevalent post-transcriptional RNA modification within the brain. Yet, most research has relied on postmortem samples, assuming it is an accurate representation of RNA biology in the living brain. We challenge this assumption by comparing A-to-I editing between postmortem and living prefrontal cortical tissues. Major differences were found, with over 70,000 A-to-I sites showing higher editing levels in postmortem tissues. Increased A-to-I editing in postmortem tissues is linked to higher ADAR and ADARB1 expression, is more pronounced in non-neuronal cells, and indicative of postmortem activation of inflammation and hypoxia. Higher A-to-I editing in living tissues marks sites that are evolutionarily preserved, synaptic, developmentally timed, and disrupted in neurological conditions. Common genetic variants were also found to differentially affect A-to-I editing levels in living versus postmortem tissues. Collectively, these discoveries offer more nuanced and accurate insights into the regulatory mechanisms of RNA editing in the human brain.
Asunto(s)
Adenosina Desaminasa , Adenosina , Autopsia , Encéfalo , Inosina , Edición de ARN , Proteínas de Unión al ARN , Humanos , Adenosina/metabolismo , Adenosina Desaminasa/metabolismo , Adenosina Desaminasa/genética , Encéfalo/metabolismo , Inosina/metabolismo , Inosina/genética , Proteínas de Unión al ARN/metabolismo , Proteínas de Unión al ARN/genética , Corteza Prefrontal/metabolismo , Cambios Post Mortem , MasculinoRESUMEN
Adenosine-to-inosine (A-to-I) editing is a prevalent post-transcriptional RNA modification within the brain. Yet, most research has relied on postmortem samples, assuming it is an accurate representation of RNA biology in the living brain. We challenge this assumption by comparing A-to-I editing between postmortem and living prefrontal cortical tissues. Major differences were found, with over 70,000 A-to-I sites showing higher editing levels in postmortem tissues. Increased A-to-I editing in postmortem tissues is linked to higher ADAR1 and ADARB1 expression, is more pronounced in non-neuronal cells, and indicative of postmortem activation of inflammation and hypoxia. Higher A-to-I editing in living tissues marks sites that are evolutionarily preserved, synaptic, developmentally timed, and disrupted in neurological conditions. Common genetic variants were also found to differentially affect A-to-I editing levels in living versus postmortem tissues. Collectively, these discoveries illuminate the nuanced functions and intricate regulatory mechanisms of RNA editing within the human brain.
RESUMEN
BACKGROUND: The recurrent ~600 kb 16p11.2 BP4-BP5 deletion is among the most frequent known genetic aetiologies of autism spectrum disorder (ASD) and related neurodevelopmental disorders. OBJECTIVE: To define the medical, neuropsychological, and behavioural phenotypes in carriers of this deletion. METHODS: We collected clinical data on 285 deletion carriers and performed detailed evaluations on 72 carriers and 68 intrafamilial non-carrier controls. RESULTS: When compared to intrafamilial controls, full scale intelligence quotient (FSIQ) is two standard deviations lower in carriers, and there is no difference between carriers referred for neurodevelopmental disorders and carriers identified through cascade family testing. Verbal IQ (mean 74) is lower than non-verbal IQ (mean 83) and a majority of carriers require speech therapy. Over 80% of individuals exhibit psychiatric disorders including ASD, which is present in 15% of the paediatric carriers. Increase in head circumference (HC) during infancy is similar to the HC and brain growth patterns observed in idiopathic ASD. Obesity, a major comorbidity present in 50% of the carriers by the age of 7 years, does not correlate with FSIQ or any behavioural trait. Seizures are present in 24% of carriers and occur independently of other symptoms. Malformations are infrequently found, confirming only a few of the previously reported associations. CONCLUSIONS: The 16p11.2 deletion impacts in a quantitative and independent manner FSIQ, behaviour and body mass index, possibly through direct influences on neural circuitry. Although non-specific, these features are clinically significant and reproducible. Lastly, this study demonstrates the necessity of studying large patient cohorts ascertained through multiple methods to characterise the clinical consequences of rare variants involved in common diseases.
Asunto(s)
Trastornos Generalizados del Desarrollo Infantil/genética , Deleción Cromosómica , Cromosomas Humanos Par 16 , Discapacidades del Desarrollo/genética , Fenotipo , Adolescente , Adulto , Índice de Masa Corporal , Niño , Trastornos Generalizados del Desarrollo Infantil/diagnóstico , Discapacidades del Desarrollo/diagnóstico , Femenino , Orden Génico , Heterocigoto , Humanos , Pruebas de Inteligencia , Masculino , Síndrome , Adulto JovenRESUMEN
Calcium has a pivotal role in biological functions, and serum calcium levels have been associated with numerous disorders of bone and mineral metabolism, as well as with cardiovascular mortality. Here we report results from a genome-wide association study of serum calcium, integrating data from four independent cohorts including a total of 12,865 individuals of European and Indian Asian descent. Our meta-analysis shows that serum calcium is associated with SNPs in or near the calcium-sensing receptor (CASR) gene on 3q13. The top hit with a p-value of 6.3 x 10(-37) is rs1801725, a missense variant, explaining 1.26% of the variance in serum calcium. This SNP had the strongest association in individuals of European descent, while for individuals of Indian Asian descent the top hit was rs17251221 (p = 1.1 x 10(-21)), a SNP in strong linkage disequilibrium with rs1801725. The strongest locus in CASR was shown to replicate in an independent Icelandic cohort of 4,126 individuals (p = 1.02 x 10(-4)). This genome-wide meta-analysis shows that common CASR variants modulate serum calcium levels in the adult general population, which confirms previous results in some candidate gene studies of the CASR locus. This study highlights the key role of CASR in calcium regulation.
Asunto(s)
Calcio/sangre , Polimorfismo de Nucleótido Simple , Receptores Sensibles al Calcio/genética , Genoma Humano , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Humanos , Población Blanca/estadística & datos numéricosRESUMEN
Background: Dual specificity protein phosphatase 6 (DUSP6) was recently identified as a key hub gene in a causal network that regulates late-onset Alzheimer's disease. Importantly, decreased DUSP6 levels are correlated with an increased clinical dementia rating in human subjects, and DUSP6 levels are additionally decreased in the 5xFAD amyloidopathy mouse model. Methods: AAV5-DUSP6 or AAV5-GFP (control) were stereotactically injected into the dorsal hippocampus (dHc) of female and male 5xFAD or wild type mice to overexpress DUSP6 or GFP. Spatial learning memory of these mice was assessed in the Barnes maze, after which hippocampal tissues were isolated for downstream analysis. Results: Barnes maze testing indicated that DUSP6 overexpression in the dHc of 5xFAD mice improved memory deficits and was associated with reduced amyloid plaque load, Aß 1-40 and Aß 1-42 levels, and amyloid precursor protein processing enzyme BACE1, in male but not in female mice. Microglial activation and microgliosis, which are increased in 5xFAD mice, were significantly reduced by dHc DUSP6 overexpression in both males and females. Transcriptomic profiling of female 5xFAD hippocampus revealed upregulated expression of genes involved in inflammatory and extracellular signal-regulated kinase (ERK) pathways, while dHc DUSP6 overexpression in female 5xFAD mice downregulated a subset of genes in these pathways. A limited number of differentially expressed genes (DEGs) (FDR<0.05) were identified in male mice; gene ontology analysis of DEGs (p<0.05) identified a greater number of synaptic pathways that were regulated by DUSP6 overexpression in male compared to female 5xFAD. Notably, the msh homeobox 3 gene, Msx3 , previously shown to regulate microglial M1/M2 polarization and reduce neuroinflammation, was one of the most robustly upregulated genes in female and male wild type and 5xFAD mice overexpressing DUSP6. Conclusions: In summary, our data indicate that DUSP6 overexpression in dHc reduced amyloid deposition and memory deficits in male but not female 5xFAD mice, whereas reduced neuroinflammation and microglial activation were observed in both males and females. The sex-dependent regulation of synaptic pathways by DUSP6 overexpression, however, correlated with the improvement of spatial memory deficits in male but not female 5xFAD.
RESUMEN
Post-acute sequelae of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection are debilitating, clinically heterogeneous and of unknown molecular etiology. A transcriptome-wide investigation was performed in 165 acutely infected hospitalized individuals who were followed clinically into the post-acute period. Distinct gene expression signatures of post-acute sequelae were already present in whole blood during acute infection, with innate and adaptive immune cells implicated in different symptoms. Two clusters of sequelae exhibited divergent plasma-cell-associated gene expression patterns. In one cluster, sequelae associated with higher expression of immunoglobulin-related genes in an anti-spike antibody titer-dependent manner. In the other, sequelae associated independently of these titers with lower expression of immunoglobulin-related genes, indicating lower non-specific antibody production in individuals with these sequelae. This relationship between lower total immunoglobulins and sequelae was validated in an external cohort. Altogether, multiple etiologies of post-acute sequelae were already detectable during SARS-CoV-2 infection, directly linking these sequelae with the acute host response to the virus and providing early insights into their development.