Results 1 - 20 of 82
1.
Bioinformatics ; 2024 May 09.
Article in English | MEDLINE | ID: mdl-38724243

ABSTRACT

MOTIVATION: Since 2016, the number of microbial species with available reference genomes in NCBI has more than tripled. Multiple genome alignment, the process of identifying nucleotides across multiple genomes which share a common ancestor, is used as the input to numerous downstream comparative analysis methods. Parsnp is one of the few multiple genome alignment methods able to scale to the current era of genomic data; however, there has been no major release since its initial release in 2014. RESULTS: To address this gap, we developed Parsnp v2, which significantly improves on its original release. Parsnp v2 provides users with more control over executions of the program, allowing Parsnp to be better tailored for different use-cases. We introduce a partitioning option to Parsnp, which allows the input to be broken up into multiple parallel alignment processes which are then combined into a final alignment. The partitioning option can reduce memory usage by over 4x and reduce runtime by over 2x, all while maintaining a precise core-genome alignment. The partitioning workflow is also less susceptible to complications caused by assembly artifacts and minor variation, as alignment anchors only need to be conserved within their partition and not across the entire input set. We highlight the performance on datasets involving thousands of bacterial and viral genomes. AVAILABILITY: Parsnp v2 is available at https://github.com/marbl/parsnp.
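
The partitioning idea described in this abstract can be illustrated with a minimal Python sketch. It is not Parsnp's implementation or command-line interface: the function name `partition_genomes`, the partition size, and the file names are hypothetical, and the sketch only shows how an input set might be split into groups that can be aligned independently.

```python
from typing import List, Sequence


def partition_genomes(paths: Sequence[str], partition_size: int) -> List[List[str]]:
    """Split a list of genome files into consecutive groups of at most partition_size."""
    if partition_size < 1:
        raise ValueError("partition_size must be at least 1")
    return [
        list(paths[i:i + partition_size])
        for i in range(0, len(paths), partition_size)
    ]


# Hypothetical input: seven genome assemblies split into partitions of three.
genomes = [f"genome_{i}.fna" for i in range(7)]
partitions = partition_genomes(genomes, 3)
```

Each partition could then be aligned in a separate process and the per-partition alignments combined afterwards. How Parsnp v2 combines them is not described in the abstract and is not modeled here.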

2.
Sci Total Environ ; 931: 172683, 2024 Jun 25.
Article in English | MEDLINE | ID: mdl-38663617

ABSTRACT

Wastewater monitoring is an efficient and effective way to surveil for various pathogens in communities. This is especially beneficial in areas of high transmission, such as preK-12 schools, where infections may otherwise go unreported. In this work, we apply wastewater disease surveillance using school and community wastewater from across Houston, Texas to monitor three major enteric viruses: astrovirus, sapovirus genogroup GI, and group A rotavirus. We present the results of a 10-week study that included the analysis of 164 wastewater samples for astrovirus, rotavirus, and sapovirus in 10 preK-12 schools, 6 wastewater treatment plants, and 2 lift stations using newly designed RT-ddPCR assays. We show that the RT-ddPCR assays were able to detect astrovirus, rotavirus, and sapovirus in school, lift station, and wastewater treatment plant (WWTP) wastewater, and that a positive detection of a virus in a school sample was paired with a positive detection of the same virus at a downstream lift station or wastewater treatment plant over 97 % of the time. Additionally, we show how wastewater detections of rotavirus in schools and WWTPs were significantly associated with citywide viral intestinal infections. School wastewater can play a role in the monitoring of enteric viruses and in the detection of outbreaks, potentially allowing public health officials to quickly implement mitigation strategies to prevent viral spread into surrounding communities.


Subject(s)
Rotavirus; Sapovirus; Schools; Wastewater; Wastewater/virology; Sapovirus/isolation & purification; Rotavirus/isolation & purification; Texas; Environmental Monitoring/methods; Humans; Mamastrovirus/isolation & purification
3.
Res Sq ; 2024 Apr 02.
Article in English | MEDLINE | ID: mdl-38645104

ABSTRACT

Background: Recent studies have shed light on the potential role of gut dysbiosis in shaping traumatic brain injury (TBI) outcomes. Changes in the levels and types of Lactobacillus bacteria present might impact the immune system disturbances, neuroinflammatory responses, anxiety and depressive-like behaviors, and compromised neuroprotection mechanisms triggered by TBI. Objective: This study aimed to investigate the effects of a daily pan-probiotic (PP) mixture in drinking water containing strains of Lactobacillus plantarum, L. reuteri, L. helveticus, L. fermentum, L. rhamnosus, L. gasseri, and L. casei, administered for either two or seven weeks before inducing TBI on both male and female mice. Methods: Mice were subjected to controlled cortical impact (CCI) injury. Short-chain fatty acids (SCFAs) analysis was performed for metabolite measurements. The taxonomic profiles of murine fecal samples were evaluated using 16S rRNA V1-V3 sequencing analysis. Histological analyses were used to assess neuroinflammation and gut changes post-TBI, while behavioral tests were conducted to evaluate sensorimotor and cognitive functions. Results: Our findings suggest that PP administration modulates the diversity and composition of the microbiome and increases the levels of SCFAs in a sex-dependent manner. We also observed a reduction of lesion volume, cell death, and microglial and macrophage activation after PP treatment following TBI in male mice. Furthermore, PP-treated mice show motor function improvements and decreases in anxiety and depressive-like behaviors. Conclusion: Our findings suggest that PP administration can mitigate neuroinflammation and ameliorate motor and anxiety and depressive-like behavior deficits following TBI. These results underscore the potential of probiotic interventions as a viable therapeutic strategy to address TBI-induced impairments, emphasizing the need for gender-specific treatment approaches.

4.
Nat Methods ; 2024 Apr 30.
Article in English | MEDLINE | ID: mdl-38689099

ABSTRACT

Long-read sequencing has recently transformed metagenomics, enhancing strain-level pathogen characterization, enabling accurate and complete metagenome-assembled genomes, and improving microbiome taxonomic classification and profiling. These advancements are due not only to improvements in sequencing accuracy but also to rapidly changing analysis methods. In this Review, we explore long-read sequencing's profound impact on metagenomics, focusing on computational pipelines for genome assembly, taxonomic characterization and variant detection, to summarize recent advancements in the field and provide an overview of available analytical methods to fully leverage long reads. We provide insights into the advantages and disadvantages of long reads over short reads and their evolution from the early days of long-read sequencing to their recent impact on metagenomics and clinical diagnostics. We further point out remaining challenges for the field such as the integration of methylation signals in sub-strain analysis and the lack of benchmarks.

5.
J Hazard Mater ; 469: 133939, 2024 May 05.
Article in English | MEDLINE | ID: mdl-38490149

ABSTRACT

Wastewater surveillance is a powerful tool to assess the risks associated with antibiotic resistance in communities. One challenge is selecting which analytical tool to deploy to measure risk indicators, such as antibiotic resistance genes (ARGs) and their respective bacterial hosts. Although metagenomics is frequently used for analyzing ARGs, few studies have compared the performance of long-read and short-read metagenomics in identifying which bacteria harbor ARGs in wastewater. Furthermore, for ARG host detection, untargeted metagenomics has not been compared to targeted methods such as epicPCR. Here, we 1) evaluated long-read and short-read metagenomics as well as epicPCR for detecting ARG hosts in wastewater, and 2) investigated the host range of ARGs across the wastewater treatment plant (WWTP) to evaluate host proliferation. Results showed that long-read metagenomics revealed a wider range of ARG hosts than short-read metagenomics. Nonetheless, the ARG host range detected by long-read metagenomics represented only a subset of the hosts detected by epicPCR. The ARG-host linkages across the influent and effluent of the WWTP were characterized. Results showed that the ARG-host phylum linkages were relatively consistent across the WWTP, whereas new ARG-host species linkages appeared in the WWTP effluent. The ARG-host linkages of several clinically relevant species found in the effluent were identified.


Subject(s)
Anti-Bacterial Agents; Wastewater; Anti-Bacterial Agents/pharmacology; Genes, Bacterial; Wastewater-Based Epidemiological Monitoring; Bacteria/genetics; Drug Resistance, Bacterial/genetics; Metagenomics/methods
6.
Curr Protoc ; 4(3): e978, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38511467

ABSTRACT

16S rRNA targeted amplicon sequencing is an established standard for elucidating microbial community composition. While high-throughput short-read sequencing can capture only a portion of the 16S rRNA gene due to its limited read length, third-generation sequencing can read the 16S rRNA gene in its entirety and thus provide more precise taxonomic classification. Here, we present a protocol for generating full-length 16S rRNA sequences with Oxford Nanopore Technologies (ONT) and a microbial community profile with Emu. We select Emu for analyzing ONT sequences as it leverages information from the entire community to overcome errors due to incomplete reference databases and hardware limitations to ultimately obtain species-level resolution. This pipeline provides a low-cost solution for characterizing microbiome composition by exploiting real-time, long-read ONT sequencing and tailored software for accurate characterization of microbial communities. © 2024 Wiley Periodicals LLC. Basic Protocol: Microbial community profiling with Emu. Support Protocol 1: Full-length 16S rRNA microbial sequences with Oxford Nanopore Technologies sequencing platform. Support Protocol 2: Building a custom reference database for Emu.


Subject(s)
Dromaiidae; Microbiota; Animals; RNA, Ribosomal, 16S/genetics; Dromaiidae/genetics; Bacteria/genetics; Sequence Analysis, DNA/methods; Microbiota/genetics
7.
bioRxiv ; 2024 Jan 31.
Article in English | MEDLINE | ID: mdl-38352342

ABSTRACT

Motivation: Since 2016, the number of microbial species with available reference genomes in NCBI has more than tripled. Multiple genome alignment, the process of identifying nucleotides across multiple genomes which share a common ancestor, is used as the input to numerous downstream comparative analysis methods. Parsnp is one of the few multiple genome alignment methods able to scale to the current era of genomic data; however, there has been no major release since its initial release in 2014. Results: To address this gap, we developed Parsnp v2, which significantly improves on its original release. Parsnp v2 provides users with more control over executions of the program, allowing Parsnp to be better tailored for different use-cases. We introduce a partitioning option to Parsnp, which allows the input to be broken up into multiple parallel alignment processes which are then combined into a final alignment. The partitioning option can reduce memory usage by over 4x and reduce runtime by over 2x, all while maintaining a precise core-genome alignment. The partitioning workflow is also less susceptible to complications caused by assembly artifacts and minor variation, as alignment anchors only need to be conserved within their partition and not across the entire input set. We highlight the performance on datasets involving thousands of bacterial and viral genomes. Availability: Parsnp is available at https://github.com/marbl/parsnp.

8.
bioRxiv ; 2024 Jan 30.
Article in English | MEDLINE | ID: mdl-38352454

ABSTRACT

Bacterial genome dynamics are vital for understanding the mechanisms underlying microbial adaptation, growth, and their broader impact on host phenotype. Structural variants (SVs), genomic alterations of 10 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to the absence of clear reference genomes and the presence of mixed strains. In response, our proposed method, rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by using a single metagenome coassembly graph constructed from all samples in a series. The log fold change in graph coverage between subsequent samples is then calculated to call SVs that are thriving or declining throughout the series. We show that rhea outperforms existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, an advantage that is particularly noticeable as the simulated reads diverge from reference genomes and strain diversity increases. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between subsequent time and temperature samples, suggesting host advantage. Our innovative approach leverages raw read patterns rather than references or MAGs to include all sequencing reads in analysis, and thus provides versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial genome dynamics.
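
The log fold change step mentioned in this abstract can be sketched as follows. This is an illustration only and not rhea's code: the node names, the coverage values, and the pseudocount of 1.0 (added to avoid dividing by zero) are assumptions, and no calling threshold is implied.

```python
import math
from typing import Dict, List


def log_fold_changes(coverage: List[float], pseudocount: float = 1.0) -> List[float]:
    """Return the log2 fold change in coverage between each pair of subsequent samples."""
    return [
        math.log2((later + pseudocount) / (earlier + pseudocount))
        for earlier, later in zip(coverage, coverage[1:])
    ]


# Hypothetical coverage of two graph nodes across a three-sample series.
node_coverage: Dict[str, List[float]] = {
    "node_a": [3.0, 7.0, 15.0],  # coverage rising through the series
    "node_b": [15.0, 7.0, 3.0],  # coverage falling through the series
}
changes = {node: log_fold_changes(cov) for node, cov in node_coverage.items()}
```

Positive values mark sequence whose coverage grows from one sample to the next, and negative values mark sequence whose coverage shrinks, which matches the "thriving or declining" wording of the abstract.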

9.
Microb Genom ; 10(2), 2024 Feb.
Article in English | MEDLINE | ID: mdl-38407244

ABSTRACT

Despite the notable clinical impact, recent molecular epidemiology regarding third-generation-cephalosporin-resistant (3GC-R) Klebsiella pneumoniae in the USA remains limited. We performed whole-genome sequencing of 3GC-R K. pneumoniae bacteraemia isolates collected from March 2016 to May 2022 at a tertiary care cancer centre in Houston, TX, USA, using Illumina and Oxford Nanopore Technologies platforms. A comprehensive comparative genomic analysis was performed to dissect population structure, transmission dynamics and pan-genomic signatures of our 3GC-R K. pneumoniae population. Of the 178 3GC-R K. pneumoniae bacteraemias that occurred during our study time frame, we were able to analyse 153 (86 %) bacteraemia isolates, 126 initial and 27 recurrent isolates. While isolates belonging to the widely prevalent clonal group (CG) 258 were rarely observed, the predominant CG, 307, accounted for 37 (29 %) index isolates and displayed a significant correlation (Pearson correlation test P value=0.03) with the annual frequency of 3GC-R K. pneumoniae bacteraemia. Interestingly, only 11 % (4/37) of CG307 isolates belonged to the commonly detected 'Texas-specific' clade that has been observed in previous Texas-based K. pneumoniae antimicrobial-resistance surveillance studies. We found that nearly half of our CG307 isolates (n=18) belonged to a novel, monophyletic CG307 sub-clade characterized by the chromosomally encoded blaSHV-205 and unique accessory genome content. This CG307 sub-clade was detected in various regions of the USA, with genome sequences from 24 additional strains becoming recently available in the National Center for Biotechnology Information (NCBI) SRA database. Collectively, this study underscores the emergence and dissemination of a distinct CG307 sub-clade that is a prevalent cause of 3GC-R K. pneumoniae bacteraemia among cancer patients seen in Houston, TX, and has recently been isolated throughout the USA.


Subject(s)
Bacteremia; Klebsiella pneumoniae; Humans; United States/epidemiology; Klebsiella pneumoniae/genetics; Bacteremia/epidemiology; Comparative Genomic Hybridization; Databases, Factual; Cephalosporins
10.
J Clin Invest ; 134(2), 2024 Jan 16.
Article in English | MEDLINE | ID: mdl-37962956

ABSTRACT

Targeted metagenomic sequencing is an emerging strategy to survey disease-specific microbiome biomarkers for clinical diagnosis and prognosis. However, this approach often yields inconsistent or conflicting results owing to inadequate study power and sequencing bias. We introduce Taxa4Meta, a bioinformatics pipeline explicitly designed to compensate for technical and demographic bias. We designed and validated Taxa4Meta for accurate taxonomic profiling of 16S rRNA amplicon data acquired from different sequencing strategies. Taxa4Meta offers significant potential in identifying clinical dysbiotic features that can reliably predict human disease, validated comprehensively via reanalysis of individual patient 16S data sets. We leveraged the power of Taxa4Meta's pan-microbiome profiling to generate 16S-based classifiers that exhibited excellent utility for stratification of diarrheal patients with Clostridioides difficile infection, irritable bowel syndrome, or inflammatory bowel diseases, which represent common misdiagnoses and pose significant challenges for clinical management. We believe that Taxa4Meta represents a new "best practices" approach to individual microbiome surveys that can be used to define gut dysbiosis at a population-scale level.


Subject(s)
Gastrointestinal Microbiome; Microbiota; Humans; Dysbiosis; RNA, Ribosomal, 16S/genetics; Diarrhea/genetics
11.
Pac Symp Biocomput ; 29: 506-520, 2024.
Article in English | MEDLINE | ID: mdl-38160303

ABSTRACT

The microbes present in the human gastrointestinal tract are regularly linked to human health and disease outcomes. Thanks to technological and methodological advances in recent years, metagenomic sequencing data, and computational methods designed to analyze metagenomic data, have contributed to improved understanding of the link between the human gut microbiome and disease. However, while numerous methods have been recently developed to extract quantitative and qualitative results from host-associated microbiome data, improved computational tools are still needed to track microbiome dynamics with short-read sequencing data. Previously we have proposed KOMB as a de novo tool for identifying copy number variations in metagenomes for characterizing microbial genome dynamics in response to perturbations. In this work, we present KombOver (KO), which includes four key contributions with respect to our previous work: (i) it scales to large microbiome study cohorts, (ii) it includes both k-core and K-truss based analysis, (iii) we provide the foundation of a theoretical understanding of the relation between various graph-based metagenome representations, and (iv) we provide an improved user experience with easier-to-run code and more descriptive outputs/results. To highlight the aforementioned benefits, we applied KO to nearly 1000 human microbiome samples, requiring less than 10 minutes and 10 GB RAM per sample to process these data. Furthermore, we highlight how graph-based approaches such as k-core and K-truss can be informative for pinpointing microbial community dynamics within a myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) cohort. KO is open source and available for download/use at: https://github.com/treangenlab/komb.
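
For readers unfamiliar with the k-core analysis named in this abstract, the following is a minimal Python sketch of the standard peeling procedure on a small undirected graph. It is not taken from KombOver: the function name `k_core` and the toy graph are hypothetical, and the sketch assumes a symmetric adjacency map.

```python
from typing import Dict, Set


def k_core(adjacency: Dict[str, Set[str]], k: int) -> Set[str]:
    """Return the nodes of the k-core: the largest subgraph in which every node has degree >= k."""
    # Work on a copy so the caller's graph is left unchanged.
    remaining = {node: set(neighbors) for node, neighbors in adjacency.items()}
    changed = True
    while changed:
        changed = False
        for node in list(remaining):
            if len(remaining[node]) < k:
                # Peel the low-degree node and detach it from its neighbors.
                for neighbor in remaining[node]:
                    remaining[neighbor].discard(node)
                del remaining[node]
                changed = True
    return set(remaining)


# Toy graph: a triangle (a, b, c) with one pendant node d attached to c.
graph = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c"},
}
two_core = k_core(graph, 2)
three_core = k_core(graph, 3)
```

The pendant node is peeled away at k = 2, leaving only the densely connected triangle, and nothing survives at k = 3.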


Subject(s)
Gastrointestinal Microbiome; Microbiota; Humans; DNA Copy Number Variations; Computational Biology; Microbiota/genetics; Metagenome; Metagenomics/methods
12.
medRxiv ; 2023 Nov 12.
Article in English | MEDLINE | ID: mdl-37986916

ABSTRACT

We present Crykey, a computational tool for rapidly identifying cryptic mutations of SARS-CoV-2. Specifically, we identify co-occurring single nucleotide mutations on the same sequencing read, called linked-read mutations, that are rare or entirely missing in existing databases, and have the potential to represent novel cryptic lineages found in wastewater. While previous approaches exist for identifying cryptic linked-read mutations from specific regions of the SARS-CoV-2 genome, there is a need for computational tools capable of efficiently tracking cryptic mutations across the entire genome and for tens of thousands of samples and with increased scrutiny, given their potential to represent either artifacts or hidden SARS-CoV-2 lineages. Crykey fills this gap by identifying rare linked-read mutations that pass stringent computational filters to limit the potential for artifacts. We evaluate the utility of Crykey on >3,000 wastewater and >22,000 clinical samples; our findings are three-fold: i) we identify hundreds of cryptic mutations that cover the entire SARS-CoV-2 genome, ii) we track the presence of these cryptic mutations across multiple wastewater treatment plants and over three years of sampling in Houston, and iii) we find a handful of cryptic mutations in wastewater mirror cryptic mutations in clinical samples and investigate their potential to represent real cryptic lineages. In summary, Crykey enables large-scale detection of cryptic mutations representing potential cryptic lineages in wastewater.
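
The notion of linked-read mutations, that is, mutations observed together on the same sequencing read, can be sketched in a few lines of Python. This is not Crykey's pipeline: the function name and the per-read mutation lists are hypothetical, the mutation labels are placeholders, and the database lookups and artifact filters described in the abstract are not modeled.

```python
from collections import Counter
from itertools import combinations
from typing import Iterable, List


def count_linked_mutations(reads: Iterable[List[str]]) -> Counter:
    """Count how many reads carry each pair of mutations together."""
    counts: Counter = Counter()
    for mutations in reads:
        # Sort so that each pair has one canonical order regardless of read order.
        for pair in combinations(sorted(set(mutations)), 2):
            counts[pair] += 1
    return counts


# Hypothetical mutation calls, one list per sequencing read.
reads = [
    ["C241T", "A23403G"],
    ["C241T", "A23403G", "G28881A"],
    ["G28881A"],
]
linked = count_linked_mutations(reads)
```

A tool in this spirit would then compare the observed combinations with known lineage definitions and keep the rare or absent ones for closer inspection.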

13.
Microbiol Spectr ; 11(6): e0153923, 2023 Dec 12.
Article in English | MEDLINE | ID: mdl-37800925

ABSTRACT

IMPORTANCE: Our study provides insights into the evolution of the coronavirus disease 2019 (COVID-19) pandemic in Malta, a highly connected and understudied country. We combined epidemiological and phylodynamic analyses to analyze trends in the number of new cases, deaths, tests, positivity rates, and evolutionary and dispersal patterns from August 2020 to January 2022. Our reconstructions inferred 173 independent severe acute respiratory syndrome coronavirus 2 introductions into Malta from various global regions. Our study demonstrates that characterizing epidemiological trends coupled with phylodynamic modeling can inform the implementation of public health interventions to help control COVID-19 transmission in the community.


Subject(s)
COVID-19; SARS-CoV-2; Humans; SARS-CoV-2/genetics; COVID-19/epidemiology; Malta; Public Health; Spatio-Temporal Analysis; Phylogeny
14.
Nat Biotechnol ; 41(10): 1373-1374, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37626231
15.
Bioinformatics ; 39(9), 2023 Sep 02.
Article in English | MEDLINE | ID: mdl-37603771

ABSTRACT

MOTIVATION: The Jaccard similarity on k-mer sets has been shown to be a convenient proxy for sequence identity. By avoiding expensive base-level alignments and comparing reduced sequence representations, tools such as MashMap can scale to massive numbers of pairwise comparisons while still providing useful similarity estimates. However, due to their reliance on minimizer winnowing, previous versions of MashMap were shown to be biased and inconsistent estimators of Jaccard similarity. This directly impacts downstream tools that rely on the accuracy of these estimates. RESULTS: To address this, we propose the minmer winnowing scheme, which generalizes the minimizer scheme by use of a rolling minhash with multiple sampled k-mers per window. We show both theoretically and empirically that minmers yield an unbiased estimator of local Jaccard similarity, and we implement this scheme in an updated version of MashMap. The minmer-based implementation is over 10 times faster than the minimizer-based version under the default ANI threshold, making it well-suited for large-scale comparative genomics applications. AVAILABILITY AND IMPLEMENTATION: MashMap3 is available at https://github.com/marbl/MashMap.
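
As background for this abstract, the exact Jaccard similarity on k-mer sets can be computed directly for short sequences. The Python sketch below shows only that baseline quantity. It is not MashMap code and does not implement minimizer or minmer sampling; the function names, the example sequences, and the choice of k = 4 are assumptions made for illustration.

```python
from typing import Set


def kmer_set(sequence: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of a sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Return |a ∩ b| / |a ∪ b|, treating two empty sets as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Two hypothetical sequences that differ at a single position.
seq_a = "ACGTACGT"
seq_b = "ACGTTCGT"
similarity = jaccard(kmer_set(seq_a, 4), kmer_set(seq_b, 4))
```

Sketching schemes such as those discussed in the abstract estimate this value from a sample of the k-mers rather than from the full sets, which is what lets them scale to large numbers of comparisons.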


Subject(s)
Computational Biology; Genomics
16.
Bioinformatics ; 39(39 Suppl 1): i47-i56, 2023 Jun 30.
Article in English | MEDLINE | ID: mdl-37387148

ABSTRACT

MOTIVATION: Interactions among microbes within microbial communities have been shown to play crucial roles in human health. In spite of recent progress, detailed knowledge of the bacteria driving microbial interactions within microbiomes is still lacking, limiting our ability to fully decipher and control microbial communities. RESULTS: We present a novel approach for identifying species driving interactions within microbiomes. Bakdrive infers ecological networks of given metagenomic sequencing samples and identifies minimum sets of driver species (MDS) using control theory. Bakdrive has three key innovations in this space: (i) it leverages inherent information from metagenomic sequencing samples to identify driver species, (ii) it explicitly takes host-specific variation into consideration, and (iii) it does not require a known ecological network. Using extensive simulated data, we demonstrate that by identifying driver species in healthy donor samples and introducing them into disease samples, we can restore the gut microbiome of patients with recurrent Clostridioides difficile infection (rCDI) to a healthy state. We also applied Bakdrive to two real datasets, from rCDI and Crohn's disease patients, uncovering driver species consistent with previous work. Bakdrive represents a novel approach for capturing microbial interactions. AVAILABILITY AND IMPLEMENTATION: Bakdrive is open-source and available at: https://gitlab.com/treangenlab/bakdrive.


Subject(s)
Crohn Disease; Gastrointestinal Microbiome; Microbiota; Humans; Metagenome; Bacteria/genetics
17.
bioRxiv ; 2023 May 18.
Article in English | MEDLINE | ID: mdl-37325780

ABSTRACT

Motivation: The Jaccard similarity on k-mer sets has been shown to be a convenient proxy for sequence identity. By avoiding expensive base-level alignments and comparing reduced sequence representations, tools such as MashMap can scale to massive numbers of pairwise comparisons while still providing useful similarity estimates. However, due to their reliance on minimizer winnowing, previous versions of MashMap were shown to be biased and inconsistent estimators of Jaccard similarity. This directly impacts downstream tools that rely on the accuracy of these estimates. Results: To address this, we propose the minmer winnowing scheme, which generalizes the minimizer scheme by use of a rolling minhash with multiple sampled k-mers per window. We show both theoretically and empirically that minmers yield an unbiased estimator of local Jaccard similarity, and we implement this scheme in an updated version of MashMap. The minmer-based implementation is over 10 times faster than the minimizer-based version under the default ANI threshold, making it well-suited for large-scale comparative genomics applications.

18.
Nat Commun ; 14(1): 2834, 2023 May 17.
Article in English | MEDLINE | ID: mdl-37198181

ABSTRACT

As clinical testing declines, wastewater monitoring can provide crucial surveillance on the emergence of SARS-CoV-2 variants of concern (VoCs) in communities. In this paper we present QuaID, a novel bioinformatics tool for VoC detection based on quasi-unique mutations. The benefits of QuaID are three-fold: (i) it provides up to 3-week earlier VoC detection, (ii) it detects VoCs accurately (>95% precision on simulated benchmarks), and (iii) it leverages all mutational signatures (including insertions and deletions).


Subject(s)
COVID-19; Humans; COVID-19/diagnosis; SARS-CoV-2/genetics; Wastewater; Benchmarking
19.
Front Bioeng Biotechnol ; 11: 1124100, 2023.
Article in English | MEDLINE | ID: mdl-37180048

ABSTRACT

Regulation of research on microbes that cause disease in humans has historically been focused on taxonomic lists of 'bad bugs'. However, given our increased knowledge of these pathogens through inexpensive genome sequencing, five decades of research in microbial pathogenesis, and the burgeoning capacity of synthetic biologists, the limitations of this approach are apparent. With heightened scientific and public attention focused on biosafety and biosecurity, and an ongoing review by US authorities of dual-use research oversight, this article proposes the incorporation of sequences of concern (SoCs) into the biorisk management regime governing genetic engineering of pathogens. SoCs enable pathogenesis in all microbes infecting hosts that are 'of concern' to human civilization. Here we review the functions of SoCs (FunSoCs) and discuss how they might bring clarity to potentially problematic research outcomes involving infectious agents. We believe that annotation of SoCs with FunSoCs has the potential to improve the likelihood that dual-use research of concern is recognized by both scientists and regulators before it occurs.

20.
bioRxiv ; 2023 Jul 06.
Article in English | MEDLINE | ID: mdl-36993481

ABSTRACT

Massively parallel genetic screens have been used to map sequence-to-function relationships for a variety of genetic elements. However, because these approaches only interrogate short sequences, it remains challenging to perform high-throughput (HT) assays on constructs containing combinations of sequence elements arranged across multi-kb length scales. Overcoming this barrier could accelerate synthetic biology; by screening diverse gene circuit designs, "composition-to-function" mappings could be created that reveal genetic part composability rules and enable rapid identification of behavior-optimized variants. Here, we introduce CLASSIC, a generalizable genetic screening platform that combines long- and short-read next-generation sequencing (NGS) modalities to quantitatively assess pooled libraries of DNA constructs of arbitrary length. We show that CLASSIC can measure expression profiles of >10^5 drug-inducible gene circuit designs (ranging from 6-9 kb) in a single experiment in human cells. Using statistical inference and machine learning (ML) approaches, we demonstrate that data obtained with CLASSIC enables predictive modeling of an entire circuit design landscape, offering critical insight into underlying design principles. Our work shows that by expanding the throughput and understanding gained with each design-build-test-learn (DBTL) cycle, CLASSIC dramatically augments the pace and scale of synthetic biology and establishes an experimental basis for data-driven design of complex genetic systems.

...