Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 56
Filter
1.
Nat Commun ; 15(1): 5210, 2024 Jun 18.
Article in English | MEDLINE | ID: mdl-38890321

ABSTRACT

Cell-fate decisions during mammalian gastrulation are poorly understood outside of rodent embryos. The embryonic disc of pig embryos mirrors humans, making them a useful proxy for studying gastrulation. Here we present a single-cell transcriptomic atlas of pig gastrulation, revealing cell-fate emergence dynamics, as well as conserved and divergent gene programs governing early porcine, primate, and murine development. We highlight heterochronicity in extraembryonic cell-types, despite the broad conservation of cell-type-specific transcriptional programs. We apply these findings in combination with functional investigations, to outline conserved spatial, molecular, and temporal events during definitive endoderm specification. We find early FOXA2 + /TBXT- embryonic disc cells directly form definitive endoderm, contrasting later-emerging FOXA2/TBXT+ node/notochord progenitors. Unlike mesoderm, none of these progenitors undergo epithelial-to-mesenchymal transition. Endoderm/Node fate hinges on balanced WNT and hypoblast-derived NODAL, which is extinguished upon endodermal differentiation. These findings emphasise the interplay between temporal and topological signalling in fate determination during gastrulation.


Subject(s)
Embryo, Mammalian , Endoderm , Gastrulation , Gene Expression Regulation, Developmental , Single-Cell Analysis , Animals , Endoderm/cytology , Endoderm/metabolism , Endoderm/embryology , Swine , Mice , Embryo, Mammalian/cytology , Embryo, Mammalian/metabolism , Cell Differentiation , Mesoderm/cytology , Mesoderm/embryology , Mesoderm/metabolism , Transcriptome , Hepatocyte Nuclear Factor 3-beta/metabolism , Hepatocyte Nuclear Factor 3-beta/genetics , Cell Lineage , T-Box Domain Proteins/metabolism , T-Box Domain Proteins/genetics , Epithelial-Mesenchymal Transition/genetics
2.
Bioinformatics ; 40(4)2024 Mar 29.
Article in English | MEDLINE | ID: mdl-38478392

ABSTRACT

MOTIVATION: Oxford Nanopore Technologies (ONT) sequencers enable real-time generation of sequence data, which allows for concurrent analysis during a run. Adaptive sampling leverages this real-time capability in extremis, rejecting or accepting reads for sequencing based on assessment of the sequence from the start of each read. This functionality is provided by ONT's software, MinKNOW (Oxford Nanopore Technologies). Designing and developing software to take advantage of adaptive sampling can be costly in terms of sequencing consumables, using precious samples and preparing sequencing libraries. MinKNOW addresses this in part by allowing the replay of previously sequenced runs for testing. However, as we show, the sequencing output only partially changes in response to adaptive sampling instructions. Here we present Icarust, a tool enabling more accurate approximations of sequencing runs. Icarust recreates all the required endpoints of MinKNOW to perform adaptive sampling and writes output compatible with current base-callers and analysis pipelines. Icarust serves nanopore signal simulating a MinION or PromethION flow cell experiment from any reference genome using either R9 or R10 pore models. We show that simulating sequencing runs with Icarust provides a realistic testing and development environment for software exploiting the real-time nature of Nanopore sequencing. AVAILABILITY AND IMPLEMENTATION: All code is open source and freely available here-https://github.com/LooseLab/Icarust. Icarust is implemented in Rust, with a docker container also available. The data underlying this article will be shared on reasonable request to the corresponding author.


Subject(s)
Nanopore Sequencing , Nanopores , Sequence Analysis, DNA , Software , High-Throughput Nucleotide Sequencing
3.
J Gen Virol ; 104(12)2023 12.
Article in English | MEDLINE | ID: mdl-38059490

ABSTRACT

Repeat spillover of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) into new hosts has highlighted the critical role of cross-species transmission of coronaviruses and establishment of new reservoirs of virus in pandemic and epizootic spread of coronaviruses. Species particularly susceptible to SARS-CoV-2 spillover include Mustelidae (mink, ferrets and related animals), cricetid rodents (hamsters and related animals), felids (domestic cats and related animals) and white-tailed deer. These predispositions led us to screen British wildlife with sarbecovirus-specific quantitative PCR and pan coronavirus PCR assays for SARS-CoV-2 using samples collected during the human pandemic to establish if widespread spillover was occurring. Fourteen wildlife species (n=402) were tested, including: two red foxes (Vulpes vulpes), 101 badgers (Meles meles), two wild American mink (Neogale vison), 41 pine marten (Martes martes), two weasels (Mustela nivalis), seven stoats (Mustela erminea), 108 water voles (Arvicola amphibius), 39 bank voles (Myodes glareolous), 10 field voles (Microtus agrestis), 15 wood mice (Apodemus sylvaticus), one common shrew (Sorex aranaeus), two pygmy shrews (Sorex minutus), two hedgehogs (Erinaceus europaeus) and 75 Eurasian otters (Lutra lutra). No cases of SARS-CoV-2 were detected in any animals, but a novel minacovirus related to mink and ferret alphacoronaviruses was detected in stoats recently introduced to the Orkney Islands. This group of viruses is of interest due to pathogenicity in ferrets. The impact of this virus on the health of stoat populations remains to be established.


Subject(s)
Alphacoronavirus , COVID-19 , Deer , Otters , Viruses , Animals , Humans , Cats , Mice , Animals, Wild , Ferrets , Mink , SARS-CoV-2/genetics , COVID-19/veterinary , Arvicolinae
4.
PLoS Biol ; 21(6): e3002121, 2023 Jun.
Article in English | MEDLINE | ID: mdl-37315073

ABSTRACT

Pluripotency defines the unlimited potential of individual cells of vertebrate embryos, from which all adult somatic cells and germ cells are derived. Understanding how the programming of pluripotency evolved has been obscured in part by a lack of data from lower vertebrates; in model systems such as frogs and zebrafish, the function of the pluripotency genes NANOG and POU5F1 have diverged. Here, we investigated how the axolotl ortholog of NANOG programs pluripotency during development. Axolotl NANOG is absolutely required for gastrulation and germ-layer commitment. We show that in axolotl primitive ectoderm (animal caps; ACs) NANOG and NODAL activity, as well as the epigenetic modifying enzyme DPY30, are required for the mass deposition of H3K4me3 in pluripotent chromatin. We also demonstrate that all 3 protein activities are required for ACs to establish the competency to differentiate toward mesoderm. Our results suggest the ancient function of NANOG may be establishing the competence for lineage differentiation in early cells. These observations provide insights into embryonic development in the tetrapod ancestor from which terrestrial vertebrates evolved.


Subject(s)
Homeodomain Proteins , Pluripotent Stem Cells , Animals , Homeodomain Proteins/metabolism , Ambystoma mexicanum/genetics , Ambystoma mexicanum/metabolism , Zebrafish/genetics , Cell Differentiation , Nanog Homeobox Protein/genetics , Nanog Homeobox Protein/metabolism , Gene Expression Regulation, Developmental
5.
J Gen Virol ; 104(6)2023 06.
Article in English | MEDLINE | ID: mdl-37319000

ABSTRACT

Horseshoe bats are the natural hosts of the Sarbecovirus subgenus that includes SARS-CoV and SARS-CoV- 2. Despite the devastating impact of the COVID-19 pandemic, there is still little known about the underlying epidemiology and virology of sarbecoviruses in their natural hosts, leaving large gaps in our pandemic preparedness. Here we describe the results of PCR testing for sarbecoviruses in the two horseshoe bat species (Rhinolophus hipposideros and R. ferrumequinum) present in Great Britain, collected in 2021-22 during the peak of COVID-19 pandemic. One hundred and ninety seven R. hipposideros samples from 33 roost sites and 277 R. ferrumequinum samples from 20 roost sites were tested. No coronaviruses were detected in any samples from R. ferrumequinum whereas 44 and 56 % of individual and pooled (respectively) faecal samples from R. hipposideros across multiple roost sites tested positive in a sarbecovirus-specific qPCR. Full genome sequences were generated from three of the positive samples (and partial genomes from two more) using Illumina RNAseq on unenriched samples. Phylogenetic analyses showed that the obtained sequences belong to the same monophyletic clade, with >95 % similarity to previously-reported European isolates from R. hipposideros. The sequences differed in the presence or absence of accessory genes ORF 7b, 9b and 10. All lacked the furin cleavage site of SARS-CoV-2 spike gene and are therefore unlikely to be infective for humans. These results demonstrate a lack, or at least low incidence, of SARS-CoV-2 spill over from humans to susceptible GB bats, and confirm that sarbecovirus infection is widespread in R. hipposideros. Despite frequently sharing roost sites with R. ferrumequinum, no evidence of cross-species transmission was found.


Subject(s)
COVID-19 , Chiroptera , Severe acute respiratory syndrome-related coronavirus , Animals , Humans , Phylogeny , Pandemics , COVID-19/epidemiology , SARS-CoV-2/genetics
6.
Microb Genom ; 9(4)2023 04.
Article in English | MEDLINE | ID: mdl-37074153

ABSTRACT

Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinical sequencing, supporting the insights gained and helping to make informed public-health decisions. Consequently, many groups globally have developed bioinformatics pipelines to analyse sequencing data from wastewater. Accurate calling of mutations is critical in this process and in the assignment of circulating variants; yet, to date, the performance of variant-calling algorithms in wastewater samples has not been investigated. To address this, we compared the performance of six variant callers (VarScan, iVar, GATK, FreeBayes, LoFreq and BCFtools), used widely in bioinformatics pipelines, on 19 synthetic samples with known ratios of three different SARS-CoV-2 variants of concern (VOCs) (Alpha, Beta and Delta), as well as 13 wastewater samples collected in London between the 15th and 18th December 2021. We used the fundamental parameters of recall (sensitivity) and precision (specificity) to confirm the presence of mutational profiles defining specific variants across the six variant callers. Our results show that BCFtools, FreeBayes and VarScan found the expected variants with higher precision and recall than GATK or iVar, although the latter identified more expected defining mutations than other callers. LoFreq gave the least reliable results due to the high number of false-positive mutations detected, resulting in lower precision. Similar results were obtained for both the synthetic and wastewater samples.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , COVID-19/epidemiology , Wastewater-Based Epidemiological Monitoring , Wastewater , Algorithms
7.
Nat Biotechnol ; 41(7): 1018-1025, 2023 Jul.
Article in English | MEDLINE | ID: mdl-36593407

ABSTRACT

Nanopore sequencers can select which DNA molecules to sequence, rejecting a molecule after analysis of a small initial part. Currently, selection is based on predetermined regions of interest that remain constant throughout an experiment. Sequencing efforts, thus, cannot be re-focused on molecules likely contributing most to experimental success. Here we present BOSS-RUNS, an algorithmic framework and software to generate dynamically updated decision strategies. We quantify uncertainty at each genome position with real-time updates from data already observed. For each DNA fragment, we decide whether the expected decrease in uncertainty that it would provide warrants fully sequencing it, thus optimizing information gain. BOSS-RUNS mitigates coverage bias between and within members of a microbial community, leading to improved variant calling; for example, low-coverage sites of a species at 1% abundance were reduced by 87.5%, with 12.5% more single-nucleotide polymorphisms detected. Such data-driven updates to molecule selection are applicable to many sequencing scenarios, such as enriching for regions with increased divergence or low coverage, reducing time-to-answer.


Subject(s)
Nanopore Sequencing , Nanopores , Research Design , Bayes Theorem , Genome , Software , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA
8.
Genome Biol Evol ; 15(1)2023 01 04.
Article in English | MEDLINE | ID: mdl-36542479

ABSTRACT

Koala populations show marked differences in inbreeding levels and in the presence or absence of the endogenous Koala retrovirus (KoRV). These genetic differences among populations may lead to severe disease impacts threatening koala population viability. In addition, the recent colonization of the koala genome by KoRV provides a unique opportunity to study the process of retroviral adaptation to vertebrate genomes and the impact this has on speciation, genome structure, and function. The genome build described here is from an animal from the bottlenecked Southern population free of endogenous and exogenous KoRV. It provides a more contiguous genome build than the previous koala reference derived from an animal from a more outbred Northern population and is the first koala genome from a KoRV polymerase-free animal.


Subject(s)
Endogenous Retroviruses , Gammaretrovirus , Phascolarctidae , Retroviridae Infections , Animals , Phascolarctidae/genetics , Australia/epidemiology , Retroviridae Infections/epidemiology , Retroviridae Infections/genetics , Retroviridae/genetics , Gammaretrovirus/genetics
9.
ISME J ; 17(1): 21-35, 2023 01.
Article in English | MEDLINE | ID: mdl-36151458

ABSTRACT

A debate is currently ongoing as to whether intensive livestock farms may constitute reservoirs of clinically relevant antimicrobial resistance (AMR), thus posing a threat to surrounding communities. Here, combining shotgun metagenome sequencing, machine learning (ML), and culture-based methods, we focused on a poultry farm and connected slaughterhouse in China, investigating the gut microbiome of livestock, workers and their households, and microbial communities in carcasses and soil. For both the microbiome and resistomes in this study, differences are observed across environments and hosts. However, at a finer scale, several similar clinically relevant antimicrobial resistance genes (ARGs) and similar associated mobile genetic elements were found in both human and broiler chicken samples. Next, we focused on Escherichia coli, an important indicator for the surveillance of AMR on the farm. Strains of E. coli were found intermixed between humans and chickens. We observed that several ARGs present in the chicken faecal resistome showed correlation to resistance/susceptibility profiles of E. coli isolates cultured from the same samples. Finally, by using environmental sensing these ARGs were found to be correlated to variations in environmental temperature and humidity. Our results show the importance of adopting a multi-domain and multi-scale approach when studying microbial communities and AMR in complex, interconnected environments.


Subject(s)
Anti-Infective Agents , Microbiota , Soil Microbiology , Animals , Humans , Anti-Bacterial Agents , Chickens/microbiology , Escherichia coli/genetics , Genes, Bacterial , Livestock/microbiology , Drug Resistance, Bacterial
10.
Elife ; 112022 Sep 13.
Article in English | MEDLINE | ID: mdl-36098502

ABSTRACT

Background: Viral sequencing of SARS-CoV-2 has been used for outbreak investigation, but there is limited evidence supporting routine use for infection prevention and control (IPC) within hospital settings. Methods: We conducted a prospective non-randomised trial of sequencing at 14 acute UK hospital trusts. Sites each had a 4-week baseline data collection period, followed by intervention periods comprising 8 weeks of 'rapid' (<48 hr) and 4 weeks of 'longer-turnaround' (5-10 days) sequencing using a sequence reporting tool (SRT). Data were collected on all hospital-onset COVID-19 infections (HOCIs; detected ≥48 hr from admission). The impact of the sequencing intervention on IPC knowledge and actions, and on the incidence of probable/definite hospital-acquired infections (HAIs), was evaluated. Results: A total of 2170 HOCI cases were recorded from October 2020 to April 2021, corresponding to a period of extreme strain on the health service, with sequence reports returned for 650/1320 (49.2%) during intervention phases. We did not detect a statistically significant change in weekly incidence of HAIs in longer-turnaround (incidence rate ratio 1.60, 95% CI 0.85-3.01; p=0.14) or rapid (0.85, 0.48-1.50; p=0.54) intervention phases compared to baseline phase. However, IPC practice was changed in 7.8 and 7.4% of all HOCI cases in rapid and longer-turnaround phases, respectively, and 17.2 and 11.6% of cases where the report was returned. In a 'per-protocol' sensitivity analysis, there was an impact on IPC actions in 20.7% of HOCI cases when the SRT report was returned within 5 days. Capacity to respond effectively to insights from sequencing was breached in most sites by the volume of cases and limited resources. Conclusions: While we did not demonstrate a direct impact of sequencing on the incidence of nosocomial transmission, our results suggest that sequencing can inform IPC response to HOCIs, particularly when returned within 5 days. Funding: COG-UK is supported by funding from the Medical Research Council (MRC) part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) (grant code: MC_PC_19027), and Genome Research Limited, operating as the Wellcome Sanger Institute. Clinical trial number: NCT04405934.


Subject(s)
COVID-19 , Cross Infection , Humans , SARS-CoV-2/genetics , COVID-19/epidemiology , COVID-19/prevention & control , Prospective Studies , Infection Control/methods , Cross Infection/epidemiology , Cross Infection/prevention & control , Hospitals
11.
Viruses ; 14(4)2022 04 08.
Article in English | MEDLINE | ID: mdl-35458508

ABSTRACT

Whole-genome sequencing (WGS) has played a significant role in understanding the epidemiology and biology of SARS-CoV-2 virus. Here, we investigate the use of SARS-CoV-2 WGS in Southeast and East Asian countries as a genomic surveillance during the COVID-19 pandemic. Nottingham-Indonesia Collaboration for Clinical Research and Training (NICCRAT) initiative has facilitated collaboration between the University of Nottingham and a team in the Research Center for Biotechnology, National Research and Innovation Agency (BRIN), to carry out a small number of SARS-CoV-2 WGS in Indonesia using Oxford Nanopore Technology (ONT). Analyses of SARS- CoV-2 genomes deposited on GISAID reveal the importance of clinical and demographic metadata collection and the importance of open access and data sharing. Lineage and phylogenetic analyses of two periods defined by the Delta variant outbreak reveal that: (1) B.1.466.2 variants were the most predominant in Indonesia before the Delta variant outbreak, having a unique spike gene mutation N439K at more than 98% frequency, (2) Delta variants AY.23 sub-lineage took over after June 2021, and (3) the highest rate of virus transmissions between Indonesia and other countries was through interactions with Singapore and Japan, two neighbouring countries with a high degree of access and travels to and from Indonesia.


Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiology , Humans , Indonesia/epidemiology , Mutation , Pandemics , Phylogeny , SARS-CoV-2/genetics
13.
Genome Biol ; 23(1): 54, 2022 02 14.
Article in English | MEDLINE | ID: mdl-35164830

ABSTRACT

BACKGROUND: Ribosomal DNA (rDNA) displays substantial inter-individual genetic variation in human and mouse. A systematic analysis of how this variation impacts epigenetic states and expression of the rDNA has thus far not been performed. RESULTS: Using a combination of long- and short-read sequencing, we establish that 45S rDNA units in the C57BL/6J mouse strain exist as distinct genetic haplotypes that influence the epigenetic state and transcriptional output of any given unit. DNA methylation dynamics at these haplotypes are dichotomous and life-stage specific: at one haplotype, the DNA methylation state is sensitive to the in utero environment, but refractory to post-weaning influences, whereas other haplotypes entropically gain DNA methylation during aging only. On the other hand, individual rDNA units in human show limited evidence of genetic haplotypes, and hence little discernible correlation between genetic and epigenetic states. However, in both species, adjacent units show similar epigenetic profiles, and the overall epigenetic state at rDNA is strongly positively correlated with the total rDNA copy number. Analysis of different mouse inbred strains reveals that in some strains, such as 129S1/SvImJ, the rDNA copy number is only approximately 150 copies per diploid genome and DNA methylation levels are < 5%. CONCLUSIONS: Our work demonstrates that rDNA-associated genetic variation has a considerable influence on rDNA epigenetic state and consequently rRNA expression outcomes. In the future, it will be important to consider the impact of inter-individual rDNA (epi)genetic variation on mammalian phenotypes and diseases.


Subject(s)
DNA Methylation , RNA, Ribosomal , Animals , DNA, Ribosomal/genetics , Epigenesis, Genetic , Genetic Variation , Humans , Mammals/genetics , Mice , Mice, Inbred C57BL , RNA, Ribosomal/genetics , RNA, Ribosomal/metabolism
14.
J Infect Dis ; 225(1): 10-18, 2022 01 05.
Article in English | MEDLINE | ID: mdl-34555152

ABSTRACT

Nosocomial severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections have severely affected bed capacity and patient flow. We utilized whole-genome sequencing (WGS) to identify outbreaks and focus infection control resources and intervention during the United Kingdom's second pandemic wave in late 2020. Phylogenetic analysis of WGS and epidemiological data pinpointed an initial transmission event to an admission ward, with immediate prior community infection linkage documented. High incidence of asymptomatic staff infection with genetically identical viral sequences was also observed, which may have contributed to the propagation of the outbreak. WGS allowed timely nosocomial transmission intervention measures, including admissions ward point-of-care testing and introduction of portable HEPA14 filters. Conversely, WGS excluded nosocomial transmission in 2 instances with temporospatial linkage, conserving time and resources. In summary, WGS significantly enhanced understanding of SARS-CoV-2 clusters in a hospital setting, both identifying high-risk areas and conversely validating existing control measures in other units, maintaining clinical service overall.


Subject(s)
COVID-19 , Cross Infection , Disease Outbreaks/prevention & control , Reverse Transcriptase Polymerase Chain Reaction/methods , Whole Genome Sequencing , Asymptomatic Infections , Cross Infection/epidemiology , Delivery of Health Care , Health Personnel , Humans , Personal Protective Equipment , Phylogeny , SARS-CoV-2
15.
Bioinformatics ; 38(4): 1133-1135, 2022 01 27.
Article in English | MEDLINE | ID: mdl-34791062

ABSTRACT

SUMMARY: minoTour offers a Laboratory Informations Management System (LIMS) system for Oxford Nanopore Technology sequencers, with real-time metrics and analysis available permanently for review. Integration of unique real-time automated analysis can reduce the time required to answer biological questions, including mapping and classification of sequence while a run is in progress. Real-time sequence data require new methods of analysis which do not wait for the completion of a run and minoTour provides a framework to allow users to exploit these features. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available at https://github.com/LooseLab/minotourcli and https://github.com/LooseLab/minotourapp. Docker images are available from https://hub.docker.com/r/adoni5/, and can be installed using a preconfigured docker-compose script at https://github.com/LooseLab/minotour-docker. An example server is available at http://137.44.59.170. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Nanopores , Software
17.
J Gen Virol ; 102(6)2021 06.
Article in English | MEDLINE | ID: mdl-34130773

ABSTRACT

In the early phases of the SARS coronavirus type 2 (SARS-CoV-2) pandemic, testing focused on individuals fitting a strict case definition involving a limited set of symptoms together with an identified epidemiological risk, such as contact with an infected individual or travel to a high-risk area. To assess whether this impaired our ability to detect and control early introductions of the virus into the UK, we PCR-tested archival specimens collected on admission to a large UK teaching hospital who retrospectively were identified as having a clinical presentation compatible with COVID-19. In addition, we screened available archival specimens submitted for respiratory virus diagnosis, and dating back to early January 2020, for the presence of SARS-CoV-2 RNA. Our data provides evidence for widespread community circulation of SARS-CoV-2 in early February 2020 and into March that was undetected at the time due to restrictive case definitions informing testing policy. Genome sequence data showed that many of these early cases were infected with a distinct lineage of the virus. Sequences obtained from the first officially recorded case in Nottinghamshire - a traveller returning from Daegu, South Korea - also clustered with these early UK sequences suggesting acquisition of the virus occurred in the UK and not Daegu. Analysis of a larger sample of sequences obtained in the Nottinghamshire area revealed multiple viral introductions, mainly in late February and through March. These data highlight the importance of timely and extensive community testing to prevent future widespread transmission of the virus.


Subject(s)
COVID-19/diagnosis , COVID-19/virology , Respiratory System/virology , SARS-CoV-2/isolation & purification , Adult , Aged , COVID-19/epidemiology , COVID-19/transmission , COVID-19 Nucleic Acid Testing , Female , Humans , Male , Mass Screening/methods , Middle Aged , Phylogeny , RNA, Viral/genetics , Retrospective Studies , SARS-CoV-2/genetics , United Kingdom/epidemiology
18.
Nat Biotechnol ; 39(4): 442-450, 2021 04.
Article in English | MEDLINE | ID: mdl-33257864

ABSTRACT

Nanopore sequencers can be used to selectively sequence certain DNA molecules in a pool by reversing the voltage across individual nanopores to reject specific sequences, enabling enrichment and depletion to address biological questions. Previously, we achieved this using dynamic time warping to map the signal to a reference genome, but the method required substantial computational resources and did not scale to gigabase-sized references. Here we overcome this limitation by using graphical processing unit (GPU) base-calling. We show enrichment of specific chromosomes from the human genome and of low-abundance organisms in mixed populations without a priori knowledge of sample composition. Finally, we enrich targeted panels comprising 25,600 exons from 10,000 human genes and 717 genes implicated in cancer, identifying PML-RARA fusions in the NB4 cell line in <15 h sequencing. These methods can be used to efficiently screen any target panel of genes without specialized sample preparation using any computer and a suitable GPU. Our toolkit, readfish, is available at https://www.github.com/looselab/readfish .


Subject(s)
Computational Biology/methods , Nanopore Sequencing/instrumentation , Neoplasms/genetics , Oncogene Proteins, Fusion/genetics , Cell Line, Tumor , Exons , Genome Size , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , Sequence Analysis, DNA , Software
19.
Math Biosci Eng ; 17(4): 2881-2904, 2020 03 30.
Article in English | MEDLINE | ID: mdl-32987505

ABSTRACT

MicroRNAs are known to regulate gene expression either by repressing translation or by directing sequence-specific degradation of target mRNAs, and are therefore considered to be key regulators of gene expression. A gene-regulatory pathway involving heterochronic genes controls the temporal pattern of Caenorhabditis elegans postembryonic cell lineages. Based on experimental data, we propose and analyze a mathematical model of a gene-regulatory module in this nematode involving two heterochronic genes, lin-14 and lin-28, which are both regulated by lin-4, encoding a microRNA. The conditions under which the model experiences bifurcations are investigated. We determine the parameter regimes for which the system exhibits monostability and bistability, the latter associated with a biological switch. We observe in particular that bistability occurs without co-operativity, in keeping with knowledge about the regulatory behaviour of lin-14 and lin-28. The analytical results are confirmed by numerical simulations that illustrate how the microRNA lin-4 plays a crucial role in determining of the qualitative dynamics of the model.


Subject(s)
Caenorhabditis elegans Proteins , MicroRNAs , Animals , Caenorhabditis elegans/genetics , Caenorhabditis elegans Proteins/genetics , Gene Regulatory Networks , MicroRNAs/genetics , Models, Theoretical , RNA, Messenger/genetics , Repressor Proteins/genetics
20.
Nature ; 585(7823): 79-84, 2020 09.
Article in English | MEDLINE | ID: mdl-32663838

ABSTRACT

After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist1,2. Here we present a human genome assembly that surpasses the continuity of GRCh382, along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome3, we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes.


Subject(s)
Chromosomes, Human, X/genetics , Genome, Human/genetics , Telomere/genetics , Centromere/genetics , CpG Islands/genetics , DNA Methylation , DNA, Satellite/genetics , Female , Humans , Hydatidiform Mole/genetics , Male , Pregnancy , Reproducibility of Results , Testis/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...