RESUMEN
5-methylcytosine (5mC) is the most important DNA modification in mammalian genomes. The ideal method for 5mC localization would be both nondestructive of DNA and direct, without requiring inference based on detection of unmodified cytosines. Here we present direct methylation sequencing (DM-Seq), a bisulfite-free method for profiling 5mC at single-base resolution using nanogram quantities of DNA. DM-Seq employs two key DNA-modifying enzymes: a neomorphic DNA methyltransferase and a DNA deaminase capable of precise discrimination between cytosine modification states. Coupling these activities with deaminase-resistant adapters enables accurate detection of only 5mC via a C-to-T transition in sequencing. By comparison, we uncover a PCR-related underdetection bias with the hybrid enzymatic-chemical TET-assisted pyridine borane sequencing approach. Importantly, we show that DM-Seq, unlike bisulfite sequencing, unmasks prognostically important CpGs in a clinical tumor sample by not confounding 5mC with 5-hydroxymethylcytosine. DM-Seq thus offers an all-enzymatic, nondestructive, faithful and direct method for the reading of 5mC alone.
Asunto(s)
5-Metilcitosina , Metilación de ADN , Animales , Citosina , ADN/genética , Análisis de Secuencia de ADN/métodos , Mamíferos/genéticaRESUMEN
The homology, recombination, variation, and repetitive elements in the natural killer-cell immunoglobulin-like receptor (KIR) region has made full haplotype DNA interpretation impossible in a high-throughput workflow. Here, we present a new approach using long-read sequencing to efficiently capture, sequence, and assemble diploid human KIR haplotypes. Probes were designed to capture KIR fragments efficiently by leveraging the repeating homology of the region. IDT xGen® Lockdown probes were used to capture 2-8 kb of sheared DNA fragments followed by sequencing on a PacBio Sequel. The sequences were error corrected, binned, and then assembled using the Canu assembler. The location of genes and their exon/intron boundaries are included in the workflow. The assembly and annotation was evaluated on 16 individuals (8 African American and 8 Europeans) from whom ground truth was known via long-range sequencing with fosmid library preparation. Using only 18 capture probes, the results show that the assemblies cover 97% of the GenBank reference, are 99.97% concordant, and it takes only 1.8 haplotigs to cover 75% of the reference. We also report the first assembly of diploid KIR haplotypes from long-read WGS. Our targeted hybridization probe capture and sequencing approach is the first of its kind to fully sequence and phase all diploid human KIR haplotypes, and it is efficient enough for population-scale studies and clinical use. The open and free software is available at https://github.com/droeatumn/kass and supported by a environment at https://hub.docker.com/repository/docker/droeatumn/kass.
Asunto(s)
Enfermedades Autoinmunes/inmunología , Biología Computacional/métodos , Haplotipos/genética , Inmunoterapia/métodos , Células Asesinas Naturales/inmunología , Receptores KIR/genética , Programas Informáticos , Virosis/inmunología , Negro o Afroamericano , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Anotación de Secuencia Molecular , Análisis de Secuencia de ADN/métodos , Población BlancaRESUMEN
The mitochondrial DNA of Trypanosoma brucei, termed kinetoplast DNA or kDNA, consists of thousands of minicircles and a small number of maxicircles catenated into a single network organized as a nucleoprotein disk at the base of the flagellum. Minicircles are replicated free of the network but still contain nicks and gaps after rejoining to the network. Covalent closure of remaining discontinuities in newly replicated minicircles after their rejoining to the network is delayed until all minicircles have been replicated. The DNA ligase involved in this terminal step in minicircle replication has not been identified. A search of kinetoplastid genome databases has identified two putative DNA ligase genes in tandem. These genes (LIG k alpha and LIG k beta) are highly diverged from mitochondrial and nuclear DNA ligase genes of higher eukaryotes. Expression of epitope-tagged versions of these genes shows that both LIG k alpha and LIG k beta are mitochondrial DNA ligases. Epitope-tagged LIG k alpha localizes throughout the kDNA, whereas LIG k beta shows an antipodal localization close to, but not overlapping, that of topoisomerase II, suggesting that these proteins may be contained in distinct structures or protein complexes. Knockdown of the LIG k alpha mRNA by RNA interference led to a cessation of the release of minicircles from the network and resulted in a reduction in size of the kDNA networks and rapid loss of the kDNA from the cell. Closely related pairs of mitochondrial DNA ligase genes were also identified in Leishmania major and Crithidia fasciculata.