Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Appl Environ Microbiol ; 87(1)2020 12 17.
Artículo en Inglés | MEDLINE | ID: mdl-33097499

RESUMEN

A reliable and standardized classification of Listeria monocytogenes is important for accurate strain identification during outbreak investigations. Current whole-genome sequencing (WGS)-based approaches for strain characterization are either difficult to standardize, rendering them less suitable for data exchange, or are not freely available. Thus, we developed a portable and open-source tool, Haplo-ST, to improve standardization and provide maximum discriminatory potential to WGS data tied to a multilocus sequence typing (MLST) framework. Haplo-ST performs whole-genome MLST (wgMLST) for L. monocytogenes while allowing for data exchangeability worldwide. This tool takes in (i) raw WGS reads as input, (ii) cleans the raw data according to user-specified parameters, (iii) assembles genes across loci by mapping to genes from reference strains, and (iv) assigns allelic profiles to assembled genes and provides a wgMLST subtyping for each isolate. Data exchangeability relies on the tool assigning allelic profiles based on a centralized nomenclature defined by the widely used BIGSdb-Lm database. Tests of Haplo-ST's performance with simulated reads from L. monocytogenes reference strains demonstrated high sensitivity (97.5%), and coverage depths of ≥20× were found to be sufficient for wgMLST profiling. We then used Haplo-ST to characterize and differentiate between two groups of L. monocytogenes isolates derived from the natural environment and poultry processing plants. Phylogenetic reconstruction identified lineages within each group, and no lineage specificity was observed with isolate phenotypes (transient versus persistent) or origins. Genetic differentiation analyses between isolate groups identified 21 significantly differentiated loci, potentially enriched for adaptation and persistence of L. monocytogenes within poultry processing plants.IMPORTANCE We have developed an open-source tool (https://github.com/swarnalilouha/Haplo-ST) that provides allele-based subtyping of L. monocytogenes isolates at the whole-genome level. Along with allelic profiles, this tool also generates allele sequences and identifies paralogs, which is useful for phylogenetic tree reconstruction and deciphering relationships between closely related isolates. More broadly, Haplo-ST is flexible and can be adapted to characterize the genome of any haploid organism simply by installing an organism-specific gene database. Haplo-ST also allows for scalable subtyping of isolates; fewer reference genes can be used for low-resolution typing, whereas higher resolution can be achieved by increasing the number of genes used in the analysis. Our tool enabled clustering of L. monocytogenes isolates into lineages and detection of potential loci for adaptation and persistence in food processing environments. Findings from these analyses highlight the effectiveness of Haplo-ST in subtyping and evaluating relationships among isolates in studies of bacterial population genetics.


Asunto(s)
Microbiología Ambiental , Variación Genética , Listeria monocytogenes/genética , Tipificación de Secuencias Multilocus , Secuenciación Completa del Genoma , Mataderos , Animales , Aves de Corral
2.
PLoS One ; 17(1): e0262616, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35030215

RESUMEN

Sequencing large numbers of individual samples is often needed for countrywide antimalarial drug resistance surveillance. Pooling DNA from several individual samples is an alternative cost and time saving approach for providing allele frequency (AF) estimates at a population level. Using 100 individual patient DNA samples of dried blood spots from a 2017 nationwide drug resistance surveillance study in Haiti, we compared codon coverage of drug resistance-conferring mutations in four Plasmodium falciparum genes (crt, dhps, dhfr, and mdr1), for the same deep sequenced samples run individually and pooled. Samples with similar real-time PCR cycle threshold (Ct) values (+/- 1.0 Ct value) were combined with ten samples per pool. The sequencing success for samples in pools were higher at a lower parasite density than the individual samples sequence method. The median codon coverage for drug resistance-associated mutations in all four genes were greater than 3-fold higher in the pooled samples than in individual samples. The overall codon coverage distribution for pooled samples was wider than the individual samples. The sample pools with < 40 parasites/µL blood showed more discordance in AF calls for dhfr and mdr1 between the individual and pooled samples. This discordance in AF estimation may be due to low amounts of parasite DNA, which could lead to variable PCR amplification efficiencies. Grouping samples with an estimated ≥ 40 parasites/µL blood prior to pooling and deep sequencing yielded the expected population level AF. Pooling DNA samples based on estimates of > 40 parasites/µL prior to deep sequencing can be used for rapid genotyping of a large number of samples for these four genes and possibly other drug resistant markers in population-based studies. As Haiti is a low malaria transmission country with very few mixed infections and continued chloroquine sensitivity, the pooled sequencing approach can be used for routine national molecular surveillance of resistant parasites.


Asunto(s)
Resistencia a Medicamentos/genética , Ensayos Analíticos de Alto Rendimiento/métodos , Plasmodium falciparum/genética , Animales , Antimaláricos/farmacología , Pruebas con Sangre Seca/métodos , Monitoreo Epidemiológico , Haití , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Malaria/epidemiología , Malaria Falciparum/parasitología , Técnicas de Amplificación de Ácido Nucleico/métodos , Parásitos/genética , Plasmodium falciparum/efectos de los fármacos , Plasmodium falciparum/patogenicidad , Reacción en Cadena de la Polimerasa/métodos , Análisis de Secuencia de ADN
3.
PLoS One ; 16(2): e0242297, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33630832

RESUMEN

We performed whole-genome multi-locus sequence typing for 2554 genes in a large and heterogenous panel of 180 Listeria monocytogenes strains having diverse geographical and temporal origins. The subtyping data was used for characterizing genetic variation and evaluating patterns of linkage disequilibrium in the pan-genome of L. monocytogenes. Our analysis revealed the presence of strong linkage disequilibrium in L. monocytogenes, with ~99% of genes showing significant non-random associations with a large majority of other genes in the genome. Twenty-seven loci having lower levels of association with other genes were considered to be potential "hot spots" for horizontal gene transfer (i.e., recombination via conjugation, transduction, and/or transformation). The patterns of linkage disequilibrium in L. monocytogenes suggest limited exchange of foreign genetic material in the genome and can be used as a tool for identifying new recombinant strains. This can help understand processes contributing to the diversification and evolution of this pathogenic bacteria, thereby facilitating development of effective control measures.


Asunto(s)
Genoma Bacteriano , Desequilibrio de Ligamiento , Listeria monocytogenes/genética , Listeriosis/microbiología , Microbiología de Alimentos , Variación Genética , Humanos , Listeria monocytogenes/aislamiento & purificación , Tipificación de Secuencias Multilocus , Filogenia
4.
G3 (Bethesda) ; 10(4): 1159-1166, 2020 04 09.
Artículo en Inglés | MEDLINE | ID: mdl-32075855

RESUMEN

The song sparrow, Melospiza melodia, is one of the most widely distributed species of songbirds found in North America. It has been used in a wide range of behavioral and ecological studies. This species' pronounced morphological and behavioral diversity across populations makes it a favorable candidate in several areas of biomedical research. We have generated a high-quality de novo genome assembly of M. melodia using Illumina short read sequences from genomic and in vitro proximity-ligation libraries. The assembled genome is 978.3 Mb, with a physical coverage of 24.9×, N50 scaffold size of 5.6 Mb and N50 contig size of 31.7 Kb. Our genome assembly is highly complete, with 87.5% full-length genes present out of a set of 4,915 universal single-copy orthologs present in most avian genomes. We annotated our genome assembly and constructed 15,086 gene models, a majority of which have high homology to related birds, Taeniopygia guttata and Junco hyemalis In total, 83% of the annotated genes are assigned with putative functions. Furthermore, only ∼7% of the genome is found to be repetitive; these regions and other non-coding functional regions are also identified. The high-quality M. melodia genome assembly and annotations we report will serve as a valuable resource for facilitating studies on genome structure and evolution that can contribute to biomedical research and serve as a reference in population genomic and comparative genomic studies of closely related species.


Asunto(s)
Gorriones , Animales , Genoma , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , América del Norte , Gorriones/genética , Estados Unidos
5.
Genome Biol Evol ; 12(1): 3635-3646, 2020 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-31821505

RESUMEN

Crocodilians are an economically, culturally, and biologically important group. To improve researchers' ability to study genome structure, evolution, and gene regulation in the clade, we generated a high-quality de novo genome assembly of the saltwater crocodile, Crocodylus porosus, from Illumina short read data from genomic libraries and in vitro proximity-ligation libraries. The assembled genome is 2,123.5 Mb, with N50 scaffold size of 17.7 Mb and N90 scaffold size of 3.8 Mb. We then annotated this new assembly, increasing the number of annotated genes by 74%. In total, 96% of 23,242 annotated genes were associated with a functional protein domain. Furthermore, multiple noncoding functional regions and mappable genetic markers were identified. Upon analysis and overlapping the results of branch length estimation and site selection tests for detecting potential selection, we found 16 putative genes under positive selection in crocodilians, 10 in C. porosus and 6 in Alligator mississippiensis. The annotated C. porosus genome will serve as an important platform for osmoregulatory, physiological, and sex determination studies, as well as an important reference in investigating the phylogenetic relationships of crocodilians, birds, and other tetrapods.


Asunto(s)
Caimanes y Cocodrilos/genética , Genoma , Animales , Redes Reguladoras de Genes , Genes , Repeticiones de Microsatélite , Anotación de Secuencia Molecular , ARN de Transferencia/genética , Selección Genética
6.
PeerJ ; 7: e7724, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31616583

RESUMEN

Molecular ecologists frequently use genome reduction strategies that rely upon restriction enzyme digestion of genomic DNA to sample consistent portions of the genome from many individuals (e.g., RADseq, GBS). However, researchers often find the existing methods expensive to initiate and/or difficult to implement consistently, especially because it is difficult to multiplex sufficient numbers of samples to fill entire sequencing lanes. Here, we introduce a low-cost and highly robust approach for the construction of dual-digest RADseq libraries that build on adapters and primers designed in Adapterama I. Major features of our method include: (1) minimizing the number of processing steps; (2) focusing on a single strand of sample DNA for library construction, allowing the use of a non-phosphorylated adapter on one end; (3) ligating adapters in the presence of active restriction enzymes, thereby reducing chimeras; (4) including an optional third restriction enzyme to cut apart adapter-dimers formed by the phosphorylated adapter, thus increasing the efficiency of adapter ligation to sample DNA, which is particularly effective when only low quantity/quality DNA samples are available; (5) interchangeable adapter designs; (6) incorporating variable-length internal indexes within the adapters to increase the scope of sample indexing, facilitate pooling, and increase sequence diversity; (7) maintaining compatibility with universal dual-indexed primers and thus, Illumina sequencing reagents and libraries; and, (8) easy modification for the identification of PCR duplicates. We present eight adapter designs that work with 72 restriction enzyme combinations. We demonstrate the efficiency of our approach by comparing it with existing methods, and we validate its utility through the discovery of many variable loci in a variety of non-model organisms. Our 2RAD/3RAD method is easy to perform, has low startup costs, has increased utility with low-concentration input DNA, and produces libraries that can be highly-multiplexed and pooled with other Illumina libraries.

7.
PeerJ ; 7: e7786, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31616589

RESUMEN

Next-generation sequencing (NGS) of amplicons is used in a wide variety of contexts. In many cases, NGS amplicon sequencing remains overly expensive and inflexible, with library preparation strategies relying upon the fusion of locus-specific primers to full-length adapter sequences with a single identifying sequence or ligating adapters onto PCR products. In Adapterama I, we presented universal stubs and primers to produce thousands of unique index combinations and a modifiable system for incorporating them into Illumina libraries. Here, we describe multiple ways to use the Adapterama system and other approaches for amplicon sequencing on Illumina instruments. In the variant we use most frequently for large-scale projects, we fuse partial adapter sequences (TruSeq or Nextera) onto the 5' end of locus-specific PCR primers with variable-length tag sequences between the adapter and locus-specific sequences. These fusion primers can be used combinatorially to amplify samples within a 96-well plate (8 forward primers + 12 reverse primers yield 8 × 12 = 96 combinations), and the resulting amplicons can be pooled. The initial PCR products then serve as template for a second round of PCR with dual-indexed iTru or iNext primers (also used combinatorially) to make full-length libraries. The resulting quadruple-indexed amplicons have diversity at most base positions and can be pooled with any standard Illumina library for sequencing. The number of sequencing reads from the amplicon pools can be adjusted, facilitating deep sequencing when required or reducing sequencing costs per sample to an economically trivial amount when deep coverage is not needed. We demonstrate the utility and versatility of our approaches with results from six projects using different implementations of our protocols. Thus, we show that these methods facilitate amplicon library construction for Illumina instruments at reduced cost with increased flexibility. A simple web page to design fusion primers compatible with iTru primers is available at: http://baddna.uga.edu/tools-taggi.html. A fast and easy to use program to demultiplex amplicon pools with internal indexes is available at: https://github.com/lefeverde/Mr_Demuxy.

8.
PeerJ ; 7: e7755, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31616586

RESUMEN

Massively parallel DNA sequencing offers many benefits, but major inhibitory cost factors include: (1) start-up (i.e., purchasing initial reagents and equipment); (2) buy-in (i.e., getting the smallest possible amount of data from a run); and (3) sample preparation. Reducing sample preparation costs is commonly addressed, but start-up and buy-in costs are rarely addressed. We present dual-indexing systems to address all three of these issues. By breaking the library construction process into universal, re-usable, combinatorial components, we reduce all costs, while increasing the number of samples and the variety of library types that can be combined within runs. We accomplish this by extending the Illumina TruSeq dual-indexing approach to 768 (384 + 384) indexed primers that produce 384 unique dual-indexes or 147,456 (384 × 384) unique combinations. We maintain eight nucleotide indexes, with many that are compatible with Illumina index sequences. We synthesized these indexing primers, purifying them with only standard desalting and placing small aliquots in replicate plates. In qPCR validation tests, 206 of 208 primers tested passed (99% success). We then created hundreds of libraries in various scenarios. Our approach reduces start-up and per-sample costs by requiring only one universal adapter that works with indexed PCR primers to uniquely identify samples. Our approach reduces buy-in costs because: (1) relatively few oligonucleotides are needed to produce a large number of indexed libraries; and (2) the large number of possible primers allows researchers to use unique primer sets for different projects, which facilitates pooling of samples during sequencing. Our libraries make use of standard Illumina sequencing primers and index sequence length and are demultiplexed with standard Illumina software, thereby minimizing customization headaches. In subsequent Adapterama papers, we use these same primers with different adapter stubs to construct amplicon and restriction-site associated DNA libraries, but their use can be expanded to any type of library sequenced on Illumina platforms.

9.
G3 (Bethesda) ; 8(6): 1855-1861, 2018 05 31.
Artículo en Inglés | MEDLINE | ID: mdl-29703783

RESUMEN

The western mosquitofish, Gambusia affinis, is a freshwater poecilid fish native to the southeastern United States but with a global distribution due to widespread human introduction. Gambusia affinis has been used as a model species for a broad range of evolutionary and ecological studies. We sequenced the genome of a male G. affinis to facilitate genetic studies in diverse fields including invasion biology and comparative genetics. We generated Illumina short read data from paired-end libraries and in vitro proximity-ligation libraries. We obtained 54.9× coverage, N50 contig length of 17.6 kb, and N50 scaffold length of 6.65 Mb. Compared to two other species in the Poeciliidae family, G. affinis has slightly fewer genes that have shorter total, exon, and intron length on average. Using a set of universal single-copy orthologs in fish genomes, we found 95.5% of these genes were complete in the G. affinis assembly. The number of transposable elements in the G. affinis assembly is similar to those of closely related species. The high-quality genome sequence and annotations we report will be valuable resources for scientists to map the genetic architecture of traits of interest in this species.


Asunto(s)
Ciprinodontiformes/genética , Biblioteca de Genes , Genoma , Especies Introducidas , Animales , Elementos Transponibles de ADN/genética , Masculino , Anotación de Secuencia Molecular , ARN de Transferencia/genética , ARN no Traducido/genética , Estándares de Referencia , Análisis de Secuencia de ADN
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA