RESUMO
As SARS-CoV-2 continues to spread and evolve, detecting emerging variants early is critical for public health interventions. Inferring lineage prevalence by clinical testing is infeasible at scale, especially in areas with limited resources, participation, or testing and/or sequencing capacity, which can also introduce biases1-3. SARS-CoV-2 RNA concentration in wastewater successfully tracks regional infection dynamics and provides less biased abundance estimates than clinical testing4,5. Tracking virus genomic sequences in wastewater would improve community prevalence estimates and detect emerging variants. However, two factors limit wastewater-based genomic surveillance: low-quality sequence data and inability to estimate relative lineage abundance in mixed samples. Here we resolve these critical issues to perform a high-resolution, 295-day wastewater and clinical sequencing effort, in the controlled environment of a large university campus and the broader context of the surrounding county. We developed and deployed improved virus concentration protocols and deconvolution software that fully resolve multiple virus strains from wastewater. We detected emerging variants of concern up to 14 days earlier in wastewater samples, and identified multiple instances of virus spread not captured by clinical genomic surveillance. Our study provides a scalable solution for wastewater genomic surveillance that allows early detection of SARS-CoV-2 variants and identification of cryptic transmission.
Assuntos
COVID-19 , SARS-CoV-2 , Vigilância Epidemiológica Baseada em Águas Residuárias , Águas Residuárias , COVID-19/epidemiologia , COVID-19/transmissão , COVID-19/virologia , Humanos , RNA Viral/análise , RNA Viral/genética , SARS-CoV-2/classificação , SARS-CoV-2/genética , SARS-CoV-2/isolamento & purificação , Análise de Sequência de RNA , Águas Residuárias/virologiaRESUMO
MOTIVATION: Phylogenetic placement of a query sequence on a backbone tree is increasingly used across biomedical sciences to identify the content of a sample from its DNA content. The accuracy of such analyses depends on the density of the backbone tree, making it crucial that placement methods scale to very large trees. Moreover, a new paradigm has been recently proposed to place sequences on the species tree using single-gene data. The goal is to better characterize the samples and to enable combined analyses of marker-gene (e.g., 16S rRNA gene amplicon) and genome-wide data. The recent method DEPP enables performing such analyses using metric learning. However, metric learning is hampered by a need to compute and save a quadratically growing matrix of pairwise distances during training. Thus, the training phase of DEPP does not scale to more than roughly 10 000 backbone species, a problem that we faced when trying to use our recently released Greengenes2 (GG2) reference tree containing 331 270 species. RESULTS: This paper explores divide-and-conquer for training ensembles of DEPP models, culminating in a method called C-DEPP. While divide-and-conquer has been extensively used in phylogenetics, applying divide-and-conquer to data-hungry machine-learning methods needs nuance. C-DEPP uses carefully crafted techniques to enable quasi-linear scaling while maintaining accuracy. C-DEPP enables placing 20 million 16S fragments on the GG2 reference tree in 41 h of computation. AVAILABILITY AND IMPLEMENTATION: The dataset and C-DEPP software are freely available at https://github.com/yueyujiang/dataset_cdepp/.
Assuntos
Filogenia , Algoritmos , RNA Ribossômico 16S/genética , Software , Biologia Computacional/métodos , Aprendizado de Máquina , Análise de Sequência de DNA/métodosRESUMO
To understand the transmissibility and spread of infectious diseases, epidemiologists turn to estimates of the instantaneous reproduction number. While many estimation approaches exist, their utility may be limited. Challenges of surveillance data collection, model assumptions that are unverifiable with data alone, and computationally inefficient frameworks are critical limitations for many existing approaches. We propose a discrete spline-based approach that solves a convex optimization problem-Poisson trend filtering-using the proximal Newton method. It produces a locally adaptive estimator for instantaneous reproduction number estimation with heterogeneous smoothness. Our methodology remains accurate even under some process misspecifications and is computationally efficient, even for large-scale data. The implementation is easily accessible in a lightweight R package rtestim.
Assuntos
Algoritmos , Número Básico de Reprodução , Humanos , Biologia Computacional/métodos , Doenças Transmissíveis/epidemiologia , Simulação por Computador , Software , Modelos Epidemiológicos , Distribuição de Poisson , Modelos EstatísticosRESUMO
The number of publicly available microbiome samples is continually growing. As data set size increases, bottlenecks arise in standard analytical pipelines. Faith's phylogenetic diversity (Faith's PD) is a highly utilized phylogenetic alpha diversity metric that has thus far failed to effectively scale to trees with millions of vertices. Stacked Faith's phylogenetic diversity (SFPhD) enables calculation of this widely adopted diversity metric at a much larger scale by implementing a computationally efficient algorithm. The algorithm reduces the amount of computational resources required, resulting in more accessible software with a reduced carbon footprint, as compared to previous approaches. The new algorithm produces identical results to the previous method. We further demonstrate that the phylogenetic aspect of Faith's PD provides increased power in detecting diversity differences between younger and older populations in the FINRISK study's metagenomic data.
Assuntos
Microbiota , Microbiota/genética , FilogeniaRESUMO
High-throughput amplicon sequencing of large genomic regions remains challenging for short-read technologies. Here, we report a high-throughput amplicon sequencing approach combining unique molecular identifiers (UMIs) with Oxford Nanopore Technologies (ONT) or Pacific Biosciences circular consensus sequencing, yielding high-accuracy single-molecule consensus sequences of large genomic regions. We applied our approach to sequence ribosomal RNA operon amplicons (~4,500 bp) and genomic sequences (>10,000 bp) of reference microbial communities in which we observed a chimera rate <0.02%. To reach a mean UMI consensus error rate <0.01%, a UMI read coverage of 15× (ONT R10.3), 25× (ONT R9.4.1) and 3× (Pacific Biosciences circular consensus sequencing) is needed, which provides a mean error rate of 0.0042%, 0.0041% and 0.0007%, respectively.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Microbiota , Nanoporos , Fluxo de TrabalhoRESUMO
Short-term forecasts of traditional streams from public health reporting (such as cases, hospitalizations, and deaths) are a key input to public health decision-making during a pandemic. Since early 2020, our research group has worked with data partners to collect, curate, and make publicly available numerous real-time COVID-19 indicators, providing multiple views of pandemic activity in the United States. This paper studies the utility of five such indicators-derived from deidentified medical insurance claims, self-reported symptoms from online surveys, and COVID-related Google search activity-from a forecasting perspective. For each indicator, we ask whether its inclusion in an autoregressive (AR) model leads to improved predictive accuracy relative to the same model excluding it. Such an AR model, without external features, is already competitive with many top COVID-19 forecasting models in use today. Our analysis reveals that 1) inclusion of each of these five indicators improves on the overall predictive accuracy of the AR model; 2) predictive gains are in general most pronounced during times in which COVID cases are trending in "flat" or "down" directions; and 3) one indicator, based on Google searches, seems to be particularly helpful during "up" trends.
Assuntos
COVID-19/epidemiologia , Indicadores Básicos de Saúde , Modelos Estatísticos , Métodos Epidemiológicos , Previsões , Humanos , Internet/estatística & dados numéricos , Inquéritos e Questionários , Estados Unidos/epidemiologiaRESUMO
The COVID-19 pandemic presented enormous data challenges in the United States. Policy makers, epidemiological modelers, and health researchers all require up-to-date data on the pandemic and relevant public behavior, ideally at fine spatial and temporal resolution. The COVIDcast API is our attempt to fill this need: Operational since April 2020, it provides open access to both traditional public health surveillance signals (cases, deaths, and hospitalizations) and many auxiliary indicators of COVID-19 activity, such as signals extracted from deidentified medical claims data, massive online surveys, cell phone mobility data, and internet search trends. These are available at a fine geographic resolution (mostly at the county level) and are updated daily. The COVIDcast API also tracks all revisions to historical data, allowing modelers to account for the frequent revisions and backfill that are common for many public health data sources. All of the data are available in a common format through the API and accompanying R and Python software packages. This paper describes the data sources and signals, and provides examples demonstrating that the auxiliary signals in the COVIDcast API present information relevant to tracking COVID activity, augmenting traditional public health reporting and empowering research and decision-making.
Assuntos
COVID-19/epidemiologia , Bases de Dados Factuais , Indicadores Básicos de Saúde , Assistência Ambulatorial/tendências , Métodos Epidemiológicos , Humanos , Internet/estatística & dados numéricos , Distanciamento Físico , Inquéritos e Questionários , Viagem , Estados Unidos/epidemiologiaRESUMO
Functional MRI (fMRI) data may be contaminated by artifacts arising from a myriad of sources, including subject head motion, respiration, heartbeat, scanner drift, and thermal noise. These artifacts cause deviations from common distributional assumptions, introduce spatial and temporal outliers, and reduce the signal-to-noise ratio of the data-all of which can have negative consequences for the accuracy and power of downstream statistical analysis. Scrubbing is a technique for excluding fMRI volumes thought to be contaminated by artifacts and generally comes in two flavors. Motion scrubbing based on subject head motion-derived measures is popular but suffers from a number of drawbacks, among them the need to choose a threshold, a lack of generalizability to multiband acquisitions, and high rates of censoring of individual volumes and entire subjects. Alternatively, data-driven scrubbing methods like DVARS are based on observed noise in the processed fMRI timeseries and may avoid some of these issues. Here we propose "projection scrubbing", a novel data-driven scrubbing method based on a statistical outlier detection framework and strategic dimension reduction, including independent component analysis (ICA), to isolate artifactual variation. We undertake a comprehensive comparison of motion scrubbing with data-driven projection scrubbing and DVARS. We argue that an appropriate metric for the success of scrubbing is maximal data retention subject to reasonable performance on typical benchmarks such as the validity, reliability, and identifiability of functional connectivity. We find that stringent motion scrubbing yields worsened validity, worsened reliability, and produced small improvements to fingerprinting. Meanwhile, data-driven scrubbing methods tend to yield greater improvements to fingerprinting while not generally worsening validity or reliability. Importantly, however, data-driven scrubbing excludes a fraction of the number of volumes or entire sessions compared to motion scrubbing. The ability of data-driven fMRI scrubbing to improve data retention without negatively impacting the quality of downstream analysis has major implications for sample sizes in population neuroscience research.
Assuntos
Processamento de Imagem Assistida por Computador , Imageamento por Ressonância Magnética , Humanos , Imageamento por Ressonância Magnética/métodos , Reprodutibilidade dos Testes , Processamento de Imagem Assistida por Computador/métodos , Artefatos , Movimento (Física) , Encéfalo/diagnóstico por imagem , Mapeamento Encefálico/métodosRESUMO
BACKGROUND: The Dietary Guidelines for Americans advises on dietary intake to meet nutritional needs, promote health, and prevent diseases. Diet affects the intestinal microbiota and is increasingly linked to health. It is vital to investigate the relationships between diet quality and the microbiota to better understand the impact of nutrition on human health. OBJECTIVES: This study aimed to investigate the differences in fecal microbiota composition in adults from the American Gut Project based on their adherence to the Dietary Guidelines for Americans. METHODS: This study was a cross-sectional analysis of the 16S sequencing and food frequency data of a subset of adults (n = 432; age = 18-60 y; 65% female, 89% white) participating in the crowdsourced American Gut Project. The Healthy Eating Index-2015 assessed the compliance with Dietary Guideline recommendations. The cohort was divided into tertiles based on Healthy Eating Index-2015 scores, and differences in taxonomic abundances and diversity were compared between high and low scorers. RESULTS: The mean Total Score for low-scoring adults (58.1 ± 5.4) was comparable with the reported score of the average American adult (56.7). High scorers for the Total Score and components related to vegetables, grains, and dairy had greater alpha diversity than low scorers. High scorers in the fatty acid component had a lower alpha diversity than low scorers (95% CI: 0.35, 1.85). A positive log-fold difference in abundance of plant carbohydrate-metabolizing taxa in the families Lachnospiraceae and Ruminococcaceae was observed in high-scoring tertiles for Total Score, vegetable, fruit, and grain components (Benjamini-Hochberg; q < 0.05). CONCLUSIONS: Adults with greater compliance to the Dietary Guidelines demonstrated higher diversity in their fecal microbiota and greater abundance of bacteria capable of metabolizing complex carbohydrates, providing evidence on how Dietary Guidelines support the gut microbiota.
Assuntos
Promoção da Saúde , Microbiota , Humanos , Adulto , Estados Unidos , Adolescente , Adulto Jovem , Pessoa de Meia-Idade , Estudos Transversais , Dieta , Verduras , RNA Ribossômico 16SRESUMO
Untargeted mass spectrometry is employed to detect small molecules in complex biospecimens, generating data that are difficult to interpret. We developed Qemistree, a data exploration strategy based on the hierarchical organization of molecular fingerprints predicted from fragmentation spectra. Qemistree allows mass spectrometry data to be represented in the context of sample metadata and chemical ontologies. By expressing molecular relationships as a tree, we can apply ecological tools that are designed to analyze and visualize the relatedness of DNA sequences to metabolomics data. Here we demonstrate the use of tree-guided data exploration tools to compare metabolomics samples across different experimental conditions such as chromatographic shifts. Additionally, we leverage a tree representation to visualize chemical diversity in a heterogeneous collection of samples. The Qemistree software pipeline is freely available to the microbiome and metabolomics communities in the form of a QIIME2 plugin, and a global natural products social molecular networking workflow.
Assuntos
Espectrometria de Massas/métodos , Metabolômica , Algoritmos , Análise por Conglomerados , DNA/química , Impressões Digitais de DNA , Bases de Dados Factuais , Ecologia , Análise de Alimentos , Microbiota , Análise Multivariada , Software , Espectrometria de Massas em Tandem , Fluxo de TrabalhoRESUMO
This corrects the article DOI: 10.1038/nature23889.
RESUMO
The characterization of baseline microbial and functional diversity in the human microbiome has enabled studies of microbiome-related disease, diversity, biogeography, and molecular function. The National Institutes of Health Human Microbiome Project has provided one of the broadest such characterizations so far. Here we introduce a second wave of data from the study, comprising 1,631 new metagenomes (2,355 total) targeting diverse body sites with multiple time points in 265 individuals. We applied updated profiling and assembly methods to provide new characterizations of microbiome personalization. Strain identification revealed subspecies clades specific to body sites; it also quantified species with phylogenetic diversity under-represented in isolate genomes. Body-wide functional profiling classified pathways into universal, human-enriched, and body site-enriched subsets. Finally, temporal analysis decomposed microbial variation into rapidly variable, moderately variable, and stable subsets. This study furthers our knowledge of baseline human microbial diversity and enables an understanding of personalized microbiome function and dynamics.
Assuntos
Microbiota/fisiologia , Filogenia , Conjuntos de Dados como Assunto , Humanos , Metagenoma/genética , Metagenoma/fisiologia , Microbiota/genética , Anotação de Sequência Molecular , National Institutes of Health (U.S.) , Especificidade de Órgãos , Análise Espaço-Temporal , Fatores de Tempo , Estados UnidosRESUMO
Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.
Assuntos
Biodiversidade , Planeta Terra , Microbiota/genética , Animais , Archaea/genética , Archaea/isolamento & purificação , Bactérias/genética , Bactérias/isolamento & purificação , Ecologia/métodos , Dosagem de Genes , Mapeamento Geográfico , Humanos , Plantas/microbiologia , RNA Ribossômico 16S/análise , RNA Ribossômico 16S/genéticaRESUMO
Australia is no different to any other country in that information technology (IT) and the speed of its advancement has a huge impact on health care and therefore on health libraries. Australian health librarians are valuable members of health care teams and strive to integrate services and resources across hospitals. This article looks at the role Australian health libraries play in the broader health information landscape and the importance of information governance and health informatics as a tenet of the work undertaken by libraries. Of particular focus in this is the Health Libraries Australia/Telstra Health Digital Health Innovation Award, offered annually, to help focus on particular technological challenges. Three cases studies are explored demonstrating impact on the systematic review process, inter-library loan system automation and a room booking service. Also discussed are the ongoing professional development opportunities, which help upskill the Australian health library workforce. Australian health libraries also face many challenges with piecemeal IT systems across the nation, resulting in lost opportunities. Also, many Australian health services do not have a qualified librarian on staff, which undermines information governance. However, resiliency shines through with strong professional health library networks working to challenge the status quo in an effort to improve the application of health informatics.
Assuntos
Bibliotecários , Bibliotecas Médicas , Bibliotecas , Informática Médica , Humanos , Austrália , TecnologiaRESUMO
BACKGROUND & AIMS: Fecal microbiota transplantation (FMT) is used commonly for treatment of Clostridioides difficile infections (CDIs), although prospective safety data are limited and real-world FMT practice and outcomes are not well described. The FMT National Registry was designed to assess FMT methods and both safety and effectiveness outcomes from North American FMT providers. METHODS: Patients undergoing FMT in clinical practices across North America were eligible. Participating investigators enter de-identified data into an online platform, including FMT protocol, baseline patient characteristics, CDI cure and recurrence, and short and long-term safety outcomes. RESULTS: Of the first 259 participants enrolled at 20 sites, 222 had completed short-term follow-up at 1 month and 123 had follow-up to 6 months; 171 (66%) were female. All FMTs were done for CDI and 249 (96%) used an unknown donor (eg, stool bank). One-month cure occurred in 200 patients (90%); of these, 197 (98%) received only 1 FMT. Among 112 patients with initial cure who were followed to 6 months, 4 (4%) had CDI recurrence. Severe symptoms reported within 1-month of FMT included diarrhea (n = 5 [2%]) and abdominal pain (n = 4 [2%]); 3 patients (1%) had hospitalizations possibly related to FMT. At 6 months, new diagnoses of irritable bowel syndrome were made in 2 patients (1%) and inflammatory bowel disease in 2 patients (1%). CONCLUSIONS: This prospective real-world study demonstrated high effectiveness of FMT for CDI with a good safety profile. Assessment of new conditions at long-term follow-up is planned as this registry grows and will be important for determining the full safety profile of FMT.
Assuntos
Infecções por Clostridium/terapia , Transplante de Microbiota Fecal , Doenças Inflamatórias Intestinais/terapia , Síndrome do Intestino Irritável/terapia , Sistema de Registros , Adolescente , Adulto , Clostridioides difficile , Humanos , Pessoa de Meia-Idade , Estudos Prospectivos , Resultado do Tratamento , Estados Unidos , Adulto JovemRESUMO
The gut microbiome plays an integral role in health and disease, and diet is a major driver of its composition, diversity, and functional capacity. Given the dynamic development of the gut microbiome in infants and children, it is critical to address two major questions: (a) Can diet modify the composition, diversity, or function of the gut microbiome, and (b) will such modification affect functional/clinical outcomes including immune function, cognitive development, and overall health? We synthesize the evidence on the effect of nutritional interventions on the gut microbiome in infants and children across 26 studies. Findings indicate the need to study older children, assess the whole intestinal tract, and harmonize methods and interpretation of findings, which are critical for informing meaningful clinical and public health practice. These findings are relevant for precision health, may help identify windows of opportunity for intervention, and may inform the design and delivery of such interventions.
Assuntos
Microbioma Gastrointestinal , Adolescente , Criança , Dieta , Humanos , Lactente , IntestinosRESUMO
Multi-omic insights into microbiome function and composition typically advance one study at a time. However, in order for relationships across studies to be fully understood, data must be aggregated into meta-analyses. This makes it possible to generate new hypotheses by finding features that are reproducible across biospecimens and data layers. Qiita dramatically accelerates such integration tasks in a web-based microbiome-comparison platform, which we demonstrate with Human Microbiome Project and Integrative Human Microbiome Project (iHMP) data.
Assuntos
Biologia Computacional/métodos , Internet , Metagenômica , Microbiota , Software , Humanos , Interface Usuário-ComputadorRESUMO
We apply the Job Demands-Resources model to explicate how two contextual factors (nondiscriminatory leadership behavior and cohesion) may equip subordinates to benefit from the leadership style of goal-focused leadership (GFL), a predominant leadership style in the military context. We predict that only when GFL is delivered in conjunction with nondiscriminatory leadership behaviors in a cohesive workgroup (which, we theorize, combine to create a resource-rich environment), subordinates may experience the lowest levels of exhaustion. We tested our hypothesis in two independent samples of uniformed United States Department of Defense personnel deployed in non-combat zones, and results are fully supportive. We add to recent efforts to expand the nomological network of GFL, pinpointing situational factors that may equip subordinates to experience lower (rather than higher) exhaustion when working with a goal-focused leader. In doing so, we also contribute to theory on diversity and stress, and we suggest practical applications for leadership across a range of hierarchical contexts, including the military and other large organizations. In all, our work may help inform the proper balance of leadership and workgroup factors, which determine the optimal context in which individuals can be equipped to benefit from GFL.
RESUMO
Over the past few years, microbiome research has dramatically reshaped our understanding of human biology. New insights range from an enhanced understanding of how microbes mediate digestion and disease processes (e.g., in inflammatory bowel disease) to surprising associations with Parkinson's disease, autism, and depression. In this review, we describe how new generations of sequencing technology, analytical advances coupled to new software capabilities, and the integration of animal model data have led to these new discoveries. We also discuss the prospects for integrating studies of the microbiome, metabolome, and immune system, with the goal of elucidating mechanisms that govern their interactions. This systems-level understanding will change how we think about ourselves as organisms.
Assuntos
Sistema Imunitário , Metaboloma , Metagenoma , Microbiota/genética , Análise de Sequência de DNA , Animais , HumanosRESUMO
The number of rTOF patients who survive into adulthood is steadily rising, with currently more than 90% reaching the third decade of life. However, rTOF patients are not cured, but rather have a lifelong increased risk for cardiac and non-cardiac complications. Heart failure is recognized as a significant complication. Its occurrence is strongly associated with adverse outcome. Unfortunately, conventional concepts of heart failure may not be directly applicable in this patient group. This article presents a review of the current knowledge on HF in rTOF patients, including incidence and prevalence, the most common mechanisms of heart failure, i.e., valvular pathologies, shunt lesions, left atrial hypertension, primary left heart and right heart failure, arrhythmias, and coronary artery disease. In addition, we will review information regarding extracardiac complications, risk factors for the development of heart failure, clinical impact and prognosis, and assessment possibilities, particularly of the right ventricle, as well as management strategies. We explore potential future concepts that may stimulate further research into this field.