RESUMEN
In the deep-sea environment, the volume available for an in-situ gene sequencer is severely limited. In addition, optical imaging systems are subject to real-time, large-scale defocusing problems caused by ambient temperature fluctuations and vibrational perturbations. To address these challenges, we propose an edge detection algorithm for defocused images based on grayscale gradients and establish a defocus state detection model with nanometer resolution capabilities by relying on the inherent critical illumination light field. The model has been applied to a prototype deep-sea gene sequencing microscope with a 20× objective. It has demonstrated the ability to focus within a dynamic range of ±40 µm with an accuracy of 200 nm by a single iteration within 160 ms. By increasing the number of iterations and exposures, the focusing accuracy can be refined to 78 nm within a dynamic range of ±100 µm within 1.2 s. Notably, unlike conventional photoelectric hill-climbing, this method requires no additional hardware and meets the wide dynamic range, speed, and high-accuracy autofocusing requirements of deep-sea gene sequencing in a compact form factor.
Asunto(s)
Algoritmos , Microscopía/métodos , Microscopía/instrumentación , Iluminación/instrumentación , Análisis de Secuencia de ADN/métodos , Análisis de Secuencia de ADN/instrumentaciónRESUMEN
In this article, we introduce GEMA, a genome exact mapping accelerator based on learned indexes, specifically designed for FPGA implementation. GEMA utilizes a machine learning (ML) algorithm to precisely locate the exact position of read sequences within the original sequence. To enhance the accuracy of the trained ML model, we incorporate data augmentation and data-distribution-aware partitioning techniques. Additionally, we present an efficient yet low-overhead error recovery technique. To map long reads more efficiently, we propose a speculative prefetching approach, which reduces the required memory bandwidth. Furthermore, we suggest an FPGA-based architecture for implementing the proposed mapping accelerator, optimizing the accesses to off-chip memory. Our studies demonstrate that GEMA achieves up to 1.36 × higher speed for short reads compared to the corresponding results reported in recently published exact mapping accelerators. Moreover, GEMA achieves up to â¼22 × faster mapping of long reads compared to the available results for the longest mapped reads using these accelerators.
Asunto(s)
Algoritmos , Aprendizaje Automático , Humanos , Análisis de Secuencia de ADN/métodos , Análisis de Secuencia de ADN/instrumentación , Mapeo Cromosómico/métodos , Mapeo Cromosómico/instrumentaciónRESUMEN
Increasing potential for fast, cheap genomes may break open biology's bottleneck and broaden clinical uses.
Asunto(s)
Análisis de Secuencia de ADN , Detección Precoz del Cáncer/instrumentación , Análisis de Secuencia de ADN/instrumentación , Análisis de Secuencia de ADN/tendencias , Análisis de la Célula Individual/instrumentaciónRESUMEN
Os avanços metodológicos e instrumentais decorrentes do Projeto Genoma Humano formaram o arcabouço necessário para o surgimento das tecnologias de sequenciamento de DNA de Nova Geração, as quais se caracterizam por um custo reduzido, uma baixa demanda operacional e a produção de um grande volume de dados por experimento. Concomitantemente a isso, o aumento no poder de processamento computacional permitiu o desenvolvimento de análises genéticas em larga escala, de modo que, atualmente, é possível estudar características genômicas individualizadas e, até então, pouco ou nunca exploradas. Dentre essas características, aquelas relacionadas às variações estruturais em genomas têm recebido bastante atenção. Os pseudogenes processados, ou retrocópias, são variações estruturais causadas pela duplicação de genes codificadores mediante à transposição de seu RNA mensageiro maduro pela maquinaria enzimática de LINE- 1. As retrocópias podem estar fixadas, ou seja, presentes em todos os genomas de uma dada espécie, os quais são representados pela montagem modelo do genoma de referência, ou podem não estar fixadas, sendo polimórficas, germinativas ou somáticas. No entanto, o conhecimento acerca das retrocópias não fixadas ainda é limitado devido à falta de ferramentas de bioinformática dedicadas a sua identificação e anotação em dados de sequenciamento de DNA. Posto isso, este trabalho apresenta o sideRETRO um programa computacional especializado na detecção de pseudogenes processados ausentes do genoma de referência, mas presentes em dados de sequenciamento de genoma completo e exoma de outros indivíduos. Além de apontar para a presença de retrocópias não fixadas, o sideRETRO é capaz de anotar várias outras características relacionadas a esses evento, tais como: a coordenada genômica de inserção do pseudogene processado, a qual constitui o cromossomo, o ponto de inserção e a fita de DNA (líder or retardada); o contexto genômico do evento (exônico, intrônico ou intergênico); a genotipagem (presente ou ausente) e a haplotipagem (em homozigose ou heterozigose). Para atestar a eficiência da ferramenta, o sideRETRO foi executado para dados simulados e para dados reais validados experimentalmente por um grupo independente. Portanto, em resumo, nesta tese são descritos o desenvolvimento e o uso do sideRETRO uma ferramenta computacional robusta e eficiente, designada para identificar e anotar pseudogenes processados não fixados. Por fim, vale destacar que o sideRETRO preenche uma lacuna metodológica e possibilita novas hipóteses e investigações sistemáticas no campo de chamada de variantes estruturais
The methodological and instrumental advances resulting from the Human Genome Project have created the necessary framework to the emergence of Next Generation DNA sequencing technologies, which are characterized by a reduced cost, low operational demand and the generation of a large volume of data per experiment. Concomitantly with this, the increase in computational processing power has driven the development of large-scale genetic analyses, which allowed us to study individualized genomic traits little or never explored before. Among these characteristics, those related to structural variations in genomes have received much attention. Processed pseudogenes, or retrocopies, are structural variations caused by the duplication of coding genes through the transposition of their mature messenger RNA by the LINE-1 enzymatic machinery. Retrocopies can be fixed (i.e., present in all genomes of a given species and included into the assembly of the reference genome) or unfixed, being polymorphic, germinal or somatic. However, knowledge about unfixed retrocopies is still limited due to the lack of bioinformatics tools dedicated to their identification and annotation in DNA sequencing data. Therefore, this work presents sideRETRO a computer program specialized in the detection of processed pseudogenes absent from the reference genome, but present in whole genome and exome sequencing data from other individuals. In addition to pointing out the presence of unfixed retrocopies, sideRETRO is able to annotate several other characteristics related to these events, such as: the genomic coordinate of the processed pseudogene insetion, which constitutes the chromosome, the insertion point and the DNA strand (leader or retard); the genomic context of the event (exonic, intronic or intergenic); genotyping (present or absent) and haplotyping (homozygous or heterozygous). To certify the sideRETRO efficiency, it was run on simulated data and on real data experimentally validated by an independent group. Therefore, in summary, this thesis describes the development and use of sideRETRO a robust and efficient computational tool, designed to identify and annotate unfixed processed pseudogenes. Finally, it is worth noting that sideRETRO fills a methodological gap and allows new hypotheses and systematic investigations in the field of structural variant calling
Asunto(s)
Polimorfismo Genético/genética , Biología Computacional/clasificación , Biología Computacional/instrumentación , Costos y Análisis de Costo , Genómica/instrumentación , Análisis de Secuencia de ADN/instrumentación , Codificación ClínicaRESUMEN
Oxford Nanopore Technologies' (ONT) long read sequencers offer access to longer DNA fragments than previous sequencer generations, at the cost of a higher error rate. While many papers have studied read correction methods, few have addressed the detailed characterization of observed errors, a task complicated by frequent changes in chemistry and software in ONT technology. The MinION sequencer is now more stable and this paper proposes an up-to-date view of its error landscape, using the most mature flowcell and basecaller. We studied Nanopore sequencing error biases on both bacterial and human DNA reads. We found that, although Nanopore sequencing is expected not to suffer from GC bias, it is a crucial parameter with respect to errors. In particular, low-GC reads have fewer errors than high-GC reads (about 6% and 8% respectively). The error profile for homopolymeric regions or regions with short repeats, the source of about half of all sequencing errors, also depends on the GC rate and mainly shows deletions, although there are some reads with long insertions. Another interesting finding is that the quality measure, although over-estimated, offers valuable information to predict the error rate as well as the abundance of reads. We supplemented this study with an analysis of a rapeseed RNA read set and shown a higher level of errors with a higher level of deletion in these data. Finally, we have implemented an open source pipeline for long-term monitoring of the error profile, which enables users to easily compute various analysis presented in this work, including for future developments of the sequencing device. Overall, we hope this work will provide a basis for the design of better error-correction methods.
Asunto(s)
Secuenciación de Nanoporos , Nanoporos , Análisis de Secuencia de ADN/instrumentación , Humanos , Programas InformáticosRESUMEN
Aptamers feature a number of advantages, compared to antibodies. However, their application has been limited so far, mainly because of the complex selection process. 'High-throughput sequencing fluorescent ligand interaction profiling' (HiTS-FLIP) significantly increases the selection efficiency and is consequently a very powerful and versatile technology for the selection of high-performance aptamers. It is the first experiment to allow the direct and quantitative measurement of the affinity and specificity of millions of aptamers simultaneously by harnessing the potential of optical next-generation sequencing platforms to perform fluorescence-based binding assays on the clusters displayed on the flow cells and determining their sequence and position in regular high-throughput sequencing. Many variants of the experiment have been developed that allow automation and in situ conversion of DNA clusters into base-modified DNA, RNA, peptides, and even proteins. In addition, the information from mutational assays, performed with HiTS-FLIP, provides deep insights into the relationship between the sequence, structure, and function of aptamers. This enables a detailed understanding of the sequence-specific rules that determine affinity, and thus, supports the evolution of aptamers. Current variants of the HiTS-FLIP experiment and its application in the field of aptamer selection, characterisation, and optimisation are presented in this review.
Asunto(s)
Aptámeros de Nucleótidos/química , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Automatización de Laboratorios/instrumentación , Automatización de Laboratorios/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Mutagénesis , Dispositivos Ópticos , Análisis de Secuencia de ADN/instrumentaciónRESUMEN
Since December 2019, a novel coronavirus responsible for a severe acute respiratory syndrome (SARS-CoV-2) is accountable for a major pandemic situation. The emergence of the B.1.1.7 strain, as a highly transmissible variant has accelerated the world-wide interest in tracking SARS-CoV-2 variants' occurrence. Similarly, other extremely infectious variants, were described and further others are expected to be discovered due to the long period of time on which the pandemic situation is lasting. All described SARS-CoV-2 variants present several mutations within the gene encoding the Spike protein, involved in host receptor recognition and entry into the cell. Hence, instead of sequencing the whole viral genome for variants' tracking, herein we propose to focus on the SPIKE region to increase the number of candidate samples to screen at once; an essential aspect to accelerate diagnostics, but also variants' emergence/progression surveillance. This proof of concept study accomplishes both at once, population-scale diagnostics and variants' tracking. This strategy relies on (1) the use of the portable MinION DNA sequencer; (2) a DNA barcoding and a SPIKE gene-centered variant's tracking, increasing the number of candidates per assay; and (3) a real-time diagnostics and variant's tracking monitoring thanks to our software RETIVAD. This strategy represents an optimal solution for addressing the current needs on SARS-CoV-2 progression surveillance, notably due to its affordable implementation, allowing its implantation even in remote places over the world.
Asunto(s)
COVID-19/diagnóstico , SARS-CoV-2/genética , Análisis de Secuencia de ADN/métodos , COVID-19/virología , Prueba de Ácido Nucleico para COVID-19/instrumentación , Prueba de Ácido Nucleico para COVID-19/métodos , Genoma Viral , Humanos , Nanoporos , ARN Viral/genética , Análisis de Secuencia de ADN/instrumentación , Glicoproteína de la Espiga del Coronavirus/genéticaRESUMEN
Recent developments have revolutionized the study of biomolecules. Among them are molecular markers, amplification and sequencing of nucleic acids. The latter is classified into three generations. The first allows to sequence small DNA fragments. The second one increases throughput, reducing turnaround and pricing, and is therefore more convenient to sequence full genomes and transcriptomes. The third generation is currently pushing technology to its limits, being able to sequence single molecules, without previous amplification, which was previously impossible. Besides, this represents a new revolution, allowing researchers to directly sequence RNA without previous retrotranscription. These technologies are having a significant impact on different areas, such as medicine, agronomy, ecology and biotechnology. Additionally, the study of biomolecules is revealing interesting evolutionary information. That includes deciphering what makes us human, including phenomena like non-coding RNA expansion. All this is redefining the concept of gene and transcript. Basic analyses and applications are now facilitated with new genome editing tools, such as CRISPR. All these developments, in general, and nucleic-acid sequencing, in particular, are opening a new exciting era of biomolecule analyses and applications, including personalized medicine, and diagnosis and prevention of diseases for humans and other animals.
Asunto(s)
Genoma , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Análisis de Secuencia de ARN/métodos , Secuenciación Completa del Genoma/métodos , Animales , Secuencia de Bases , ADN/química , Genómica/historia , Secuenciación de Nucleótidos de Alto Rendimiento/historia , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Historia del Siglo XX , Historia del Siglo XXI , Humanos , ARN Mensajero/química , Análisis de Secuencia de ADN/historia , Análisis de Secuencia de ADN/instrumentación , Análisis de Secuencia de ARN/historia , Análisis de Secuencia de ARN/instrumentación , Secuenciación Completa del Genoma/historia , Secuenciación Completa del Genoma/instrumentaciónRESUMEN
Microbial communities are commonly studied by using amplicon sequencing of part of the 16S rRNA gene. Sequencing of the full-length 16S rRNA gene can provide higher taxonomic resolution and accuracy. To obtain even higher taxonomic resolution, with as few false-positives as possible, we assessed a method using long amplicon sequencing targeting the rRNA operon combined with a CCMetagen pipeline. Taxonomic assignment had > 90% accuracy at the species level in a mock sample and at the family level in equine fecal samples, generating similar taxonomic composition as shotgun sequencing. The rRNA operon amplicon sequencing of equine fecal samples underestimated compositional percentages of bacterial strains containing unlinked rRNA genes by a fourth to a third, but unlinked rRNA genes had a limited effect on the overall results. The rRNA operon amplicon sequencing with the A519F + U2428R primer set was able to detect some kind of archaeal genomes such as Methanobacteriales and Methanomicrobiales, whereas full-length 16S rRNA with 27F + 1492R could not. Therefore, we conclude that amplicon sequencing targeting the rRNA operon captures more detailed variations of equine microbiota.
Asunto(s)
ADN Intergénico , Microbioma Gastrointestinal , Microbiota , ARN Ribosómico 16S/genética , ARN Ribosómico 23S/genética , Análisis de Secuencia de ADN/métodos , Animales , Bacterias/genética , ADN Bacteriano/genética , Heces , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Caballos , Filogenia , Análisis de Secuencia de ADN/instrumentación , Operón de ARNrRESUMEN
The NovaSeq 6000 is a sequencing platform from Illumina that enables the sequencing of short reads with an output up to 6 Tb. The NovaSeq 6000 uses the typical Illumina sequencing workflow based on library preparation, cluster generation by in situ amplification, and sequencing by synthesis. Flexibility is one of the major features of the NovaSeq 6000. Several types of sequencing kits coupled with dual flow cell mode enable high scalability of sequencing outputs to match a wide range of applications from complete genome sequencing to metagenomics analysis. In this chapter, after explaining how to assemble a normalized pool of libraries for sequencing, we will describe the experimental steps required to run the pools on the NovaSeq 6000 platform.
Asunto(s)
Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN , Biblioteca Genómica , Genómica/instrumentación , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Proyectos de Investigación , Análisis de Secuencia de ADN/instrumentación , Flujo de TrabajoRESUMEN
Cell-free fetal DNA analysis has an established role in prenatal assessments. It serves as a source of fetal genetic material that is accessible non-invasively from maternal blood. Through the years, evidence has accumulated to show that cell-free fetal DNA molecules are derived from placental tissues, are mainly of short DNA fragments and have rapid post-delivery clearance profiles. But questions regarding how they come to being short molecules from placental cells and in which physical forms do they exist remained largely unanswered until recently. We now know that the distributions of ending sites of cell-free DNA molecules are non-random across the genome and bear correlations with the chromatin structures of cells from which they have originated. Such an insight offers ways to deduce the tissue-of-origin of these molecules. Besides, the physical nature and sequence characteristics of the ends of each cell-free DNA molecule provide tell-tale signs of how the DNA fragmentation processes are orchestrated by nuclease enzymes. These realizations offered opportunities to develop methods for enriching cell-free fetal DNA to facilitate non-invasive prenatal diagnostics. Here we aimed to collate what is known about the biological and physical characteristics of cell-free fetal DNA into one article and explain the implications of these observations.
Asunto(s)
Ácidos Nucleicos Libres de Células/análisis , Análisis de Secuencia de ADN/métodos , Ácidos Nucleicos Libres de Células/sangre , Femenino , Humanos , Embarazo , Diagnóstico Prenatal/métodos , Análisis de Secuencia de ADN/instrumentación , Análisis de Secuencia de ADN/tendenciasRESUMEN
Annotation of structural variations (SVs) and base-level karyotyping in cancer cells remains challenging. Here, we present Integrative Framework for Genome Reconstruction (InfoGenomeR)-a graph-based framework that can reconstruct individual SVs into karyotypes based on whole-genome sequencing data, by integrating SVs, total copy number alterations, allele-specific copy numbers, and haplotype information. Using whole-genome sequencing data sets of patients with breast cancer, glioblastoma multiforme, and ovarian cancer, we demonstrate the analytical potential of InfoGenomeR. We identify recurrent derivative chromosomes derived from chromosomes 11 and 17 in breast cancer samples, with homogeneously staining regions for CCND1 and ERBB2, and double minutes and breakage-fusion-bridge cycles in glioblastoma multiforme and ovarian cancer samples, respectively. Moreover, we show that InfoGenomeR can discriminate private and shared SVs between primary and metastatic cancer sites that could contribute to tumour evolution. These findings indicate that InfoGenomeR can guide targeted therapies by unravelling cancer-specific SVs on a genome-wide scale.
Asunto(s)
Neoplasias de la Mama/genética , Genoma Humano/genética , Variación Estructural del Genoma/genética , Glioblastoma/genética , Neoplasias Ováricas/genética , Células A549 , Línea Celular Tumoral , Aberraciones Cromosómicas , Ciclina D1/genética , Variaciones en el Número de Copia de ADN/genética , Femenino , Células HeLa , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Cariotipificación , Poliploidía , Receptor ErbB-2/genética , Análisis de Secuencia de ADN/instrumentación , Análisis de Secuencia de ADN/métodos , Secuenciación Completa del GenomaRESUMEN
Species identification of non-human biological evidence through DNA nucleotide sequencing is routinely used for forensic genetic analysis to support law enforcement. The gold standard for forensic genetics is conventional Sanger sequencing; however, this is gradually being replaced by high-throughput sequencing (HTS) approaches which can generate millions of individual reads in a single experiment. HTS sequencing, which now dominates molecular biology research, has already been demonstrated for use in a number of forensic genetic analysis applications, including species identification. However, the generation of HTS data to date requires expensive equipment and is cost-effective only when large numbers of samples are analysed simultaneously. The Oxford Nanopore Technologies (ONT) MinION™ is an affordable and small footprint DNA sequencing device with the potential to quickly deliver reliable and cost effective data. However, there has been no formal validation of forensic species identification using high-throughput (deep read) sequence data from the MinION making it currently impractical for many wildlife forensic end-users. Here, we present a MinION deep read sequence data validation study for species identification. First, we tested whether the clustering-based bioinformatics pipeline NGSpeciesID can be used to generate an accurate consensus sequence for species identification. Second, we systematically evaluated the read variation distribution around the generated consensus sequences to understand what confidence we have in the accuracy of the resulting consensus sequence and to determine how to interpret individual sample results. Finally, we investigated the impact of differences between the MinION consensus and Sanger control sequences on correct species identification to understand the ability and accuracy of the MinION consensus sequence to differentiate the true species from the next most similar species. This validation study establishes that ONT MinION sequence data used in conjunction with the NGSpeciesID pipeline can produce consensus DNA sequences of sufficient accuracy for forensic genetic species identification.
Asunto(s)
Genética Forense , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Análisis de Secuencia de ADN/instrumentación , Especificidad de la Especie , Animales , Aves/genética , Citocromos b/genética , ADN Mitocondrial/genética , Ciervos/genética , Humanos , Lynx/genética , Nanoporos , Panthera/genética , Reproducibilidad de los Resultados , Rupicapra/genética , Sus scrofa/genéticaRESUMEN
Since the days of Sanger sequencing, next-generation sequencing technologies have significantly evolved to provide increased data output, efficiencies, and applications. These next generations of technologies can be categorized based on read length. This review provides an overview of these technologies as two paradigms: short-read, or "second-generation," technologies, and long-read, or "third-generation," technologies. Herein, short-read sequencing approaches are represented by the most prevalent technologies, Illumina and Ion Torrent, and long-read sequencing approaches are represented by Pacific Biosciences and Oxford Nanopore technologies. All technologies are reviewed along with reported advantages and disadvantages. Until recently, short-read sequencing was thought to provide high accuracy limited by read-length, while long-read technologies afforded much longer read-lengths at the expense of accuracy. Emerging developments for third-generation technologies hold promise for the next wave of sequencing evolution, with the co-existence of longer read lengths and high accuracy.
Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Sondas de ADN de HLA , Técnicas de Genotipaje/instrumentación , Técnicas de Genotipaje/métodos , Antígenos HLA/genética , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Humanos , Análisis de Secuencia de ADN/instrumentaciónRESUMEN
Analyzing genetic markers in nuclear and mitochondrial genomes is helpful in various forensic applications, such as individual identifications and kinship analyses. However, most commercial kits detect these markers separately, which is time-consuming, laborious, and more error-prone (mislabelling, contamination, ...). The MGIEasy Signature Identification Library Prep Kit (hereinafter "MGIEasy identification system"; MGI Tech, Shenzhen, China) has been designed to provide a simple, fast, and robust way to detect appropriate markers in one multiplex PCR reaction: 52 autosomal STRs, 27 X-chromosomal STRs, 48 Y-chromosomal STRs, 145 identity-informative SNPs, 53 ancestry-informative SNPs, 29 phenotype-informative SNPs, and the hypervariable regions of mitochondrial DNA (mtDNA). Here, we validated the performance of MGIEasy identification system following the guidelines of the Scientific Working Group on DNA Analysis Methods (SWGDAM), assessing species specificity, sensitivity, mixture identification, stability under non-optimal conditions (degraded samples, inhibitor contamination, and various substrates), repeatability, and concordance. Libraries prepared using MGIEasy identification system were sequenced on a MGISEQ-2000 instrument (MGI Tech). MGIEasy-derived STR, SNP, and mtDNA genotypes were highly concordant with CE-based STR genotypes (99.79%), MiSeq FGx-based SNP genotypes (99.78%), and Sanger-based mtDNA genotypes (100%), respectively. This system was strongly human-specific, resistant to four common PCR inhibitors, and reliably amplified both low quantities of DNA (as low as 0.125 ng) and degraded DNA (~ 150 nt). Most of the unique alleles from the minor contributor were detected in 1:10 male-female and male-male mixtures; some minor Y-STR alleles were even detected in 1:1000 male-female mixtures. MGIEasy also successfully directly amplified markers from blood stains on FTA cards, filter papers, and swabs. Thus, our results demonstrated that MGIEasy identification system was suitable for use in forensic analyses due to its robust and reliable performance on samples of varying quality and quantity.
Asunto(s)
ADN Mitocondrial , Marcadores Genéticos , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Repeticiones de Microsatélite , Reacción en Cadena de la Polimerasa Multiplex/métodos , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN/instrumentación , Animales , Femenino , Biblioteca de Genes , Inestabilidad Genómica , Genotipo , Humanos , Masculino , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Especificidad de la EspecieRESUMEN
The introduction of next generation sequencing (NGS; also known as massively parallel sequencing) technology in the field of forensic genetics has been welcomed by the scientific community, above all because it complements the weaknesses of capillary electrophoresis (CE) in the analysis of genetic markers, such as single nucleotide polymorphism (SNP) typing. However, one of the main obstacles to its adoption does not seem to be the cost of the instrumentation, but rather the cost of the NGS library preparation kits. With the aim of reducing the cost of library preparation without compromising the quality of the results, we tried to scale down reaction volumes for the first two polymerase chain reactions in the amplification and enrichment phases of the targeted loci of library preparation using the ForenSeq™ DNA Signature Prep kit. We used 1 µL templated DNA input to a concentration of 1 ng/µL, instead of the 5 µL at 0.2 ng/µL recommended by the manufacturer. Our findings indicate that reduction of the library preparation volume using the ForenSeq™ DNA Signature Prep kit did not interfere with the quality and reproducibility of the DNA profiles obtained and can help lower the overall cost of NGS.
Asunto(s)
Genética Forense/instrumentación , Biblioteca de Genes , Secuenciación de Nucleótidos de Alto Rendimiento/economía , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Reacción en Cadena de la Polimerasa , Análisis de Secuencia de ADN/economía , Análisis de Secuencia de ADN/instrumentación , Humanos , Reproducibilidad de los ResultadosRESUMEN
Extended line defects in graphene (ELDG) sheets have been found to be promising for biomolecule sensing applications. By means of the consistent-exchange van der Waals density-functional (vdW-DF-cx) method, the electronic, structural, and quantum transport properties of the ELDG nanogap setup has been studied when a DNA nucleotide molecule is positioned inside the nanogap electrodes. The interaction energy (Ei) values indicate charge transfer interaction between the nucleotide molecule and electrode edges. The charge density difference plots reveal that charge fluctuates around the ELDG nanogap edges adjacent to the nucleotides. This charge redistribution grounds the modulation of electronic charge transport in the ELDG nanogap device. Further, we study the electronic transverse-conductance and tunnelling current-voltage (I-V) characteristics across two closely spaced ELDG nanogap electrodes using the density functional theory and the nonequilibrium Green's function methods when a DNA nucleotide is translocated through the nanogap. Our outcomes indicate that the ELDG nano gap device could allow sequencing of DNA nucleotides with a robust and consistent yield, giving the tunneling electric current signals that vary by more than 1 order of magnitude electric current (I) for the different DNA nucleotides. So, we predict that the ELDG nanogap-based tunneling device can be suitable for sequencing DNA nucleobases.
Asunto(s)
ADN/química , Técnicas Electroquímicas/instrumentación , Grafito/química , Análisis de Secuencia de ADN/instrumentación , Análisis de Secuencia de ADN/métodos , Secuencia de Bases , Simulación por Computador , Técnicas Electroquímicas/métodos , Electrodos , Modelos Químicos , Estructura MolecularRESUMEN
<b>Background and Objective:</b> Protocols commonly used in plant DNA extraction were known to be highly time-consuming and harmful due to the application of some hazardous reagents. Therefore, it was not applicable for such laboratories with limited resources as well as for high-throughput analysis. This study was aimed to develop a rapid yet less hazardous DNA extraction protocol for a plant using potassium phosphate buffer. <b>Materials and Methods:</b> Genomic DNA of chili pepper (<i>Capsicum annuum</i>) was extracted using potassium phosphate buffer and its efficacy was compared to three widely known protocols (CTAB-based, mini preparation and commercial kit). The extracted DNA from those four methods was evaluated based on its quality, quantity, practicality and cost per reaction. <b>Results:</b> Genomic DNA resulted from potassium phosphate buffer-based protocol exhibited comparable quality with adequate concentration for further downstream analysis. Results of PCR and sequencing were also emphasized the amplifiable DNA quality from this developed protocol. Compared to those commonly used protocols, potassium phosphate buffer consisted of 5 main working steps only, thus providing a simple yet rapid plant DNA extraction protocol. Since this protocol used ethanol only, it also offered a less hazardous and low-cost protocol that applicable for those resource-limited laboratories. <b>Conclusion:</b> This developed protocol provided a promising alternative of plant DNA extraction that might be applicable for both large scale analysis and any laboratory type. Further investigation was needed to evaluate its efficacy in extracting genomic DNA from various plants with different morphological characteristic.
Asunto(s)
Tampones (Química) , ADN de Plantas/análisis , Fosfatos/metabolismo , Compuestos de Potasio/metabolismo , Análisis de Secuencia de ADN/métodos , Fosfatos/uso terapéutico , Reacción en Cadena de la Polimerasa/métodos , Compuestos de Potasio/uso terapéutico , Análisis de Secuencia de ADN/instrumentaciónRESUMEN
The SureID® PathFinder Plus is a new 6-dye, 41-plex Y-STR system that includes the 17 loci from the Yfiler® kit (DYS19, DYS385a/b, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, and Y-GATA-H4) plus 14 rapidly mutating Y-STR loci (DYS449, DYS481, DYS518, DYS527a/b, DYS533, DYS549, DYS570, DYS576, DYS627, DYF387S1a/b, and DYF404S1), and 10 low-medium mutation loci (DYS388, DYS444, DYS447, DYS460, DYS522, DYS557, DYS593, DYS596, DYS643, and DYS645). The inclusion of the 14 rapidly mutating Y-STR loci improves the discrimination of related individuals. Conversely, the 10 low-medium mutation loci are suitable not only for familial searching but also for providing a higher refinement in the construction of Y chromosome phylogenetic relationships among lineages. The 41-plex Y-STR system is designed for direct amplification of reference samples, such as blood samples on an FTA® Card, gauze, tissue, or cotton substrates as well as hair root or buccal samples on swabs. We performed developmental validation work including accuracy, stability, stutter precision, species specificity, sensitivity, PCR inhibitors, reproducibility, parallel testing of the system, and suitability for use on DNA mixtures. In addition, mutations of the loci were analyzed by 754 DNA-confirmed father-son pairs. The results demonstrate that this kit, developed in-house, is time-efficient, accurate, reliable, and highly informative for forensic database, familial searching, and distinguishing related males.
Asunto(s)
Cromosomas Humanos Y/genética , Dermatoglifia del ADN/métodos , ADN/análisis , Repeticiones de Microsatélite/genética , Reacción en Cadena de la Polimerasa Multiplex/instrumentación , Análisis de Secuencia de ADN/instrumentación , Pueblo Asiatico/genética , Etnicidad/genética , Humanos , Masculino , Reproducibilidad de los Resultados , Sensibilidad y EspecificidadRESUMEN
The MiSeq® FGX Forensic system and the HID-Ion AmpliSeq Panel were previously developed for massively parallel sequencing (MPS) for forensic casework. Among the three major sequencing platforms, BGISEQ-500TM, which is based on multiple PCRs, is still lacking in forensics. Here, a novel forensic panel was constructed to detect 186 single-nucleotide polymorphisms (SNPs) and 123 short tandem repeats (STRs) with MPS technology on the BGISEQ-500™ platform. First, the library preparation, sequencing process, and data analysis were performed, focusing on the average depth of coverage and heterozygote balance. We calculated the allelic frequencies and forensic parameters of STR and SNP loci in 73 unrelated Chinese Han individuals. In addition, performance was evaluated with accuracy, uniformity, sensitivity, PCR inhibitor, repeatability and reproducibility, mixtures, degraded samples, case-type samples, and pedigree analyses. The results showed that 100% accurate and concordant genotypes can be obtained, and the loci with an abundance in the interquartile range accounted for 92.90% of the total, suggesting reliable uniformity in this panel. We obtained a locus detection rate that was higher than 98.78% from 78 pg of input DNA, and the optimal amount was 1.25-10 ng. The maximum concentrations of hematin and humic acid were 200 and 100 µM, respectively (the ratios of detected loci were 96.52% and 92.41%), in this panel. As a mixture, compared with those of SNPs, minor-contributor alleles of STRs could be detected at higher levels. For the degraded sample, the ratio of detected loci was 98.41%, and most profiles from case-type samples were not significantly different in abundance in our studies. As a whole, this panel showed high-performance, reliable, robust, repeatable, and reproducible results, which are sufficient for paternity testing, individual identification, and use for potentially degraded samples in forensic science.