Results 1 - 20 of 38
1.
G3 (Bethesda) ; 14(1)2023 Dec 29.
Article in English | MEDLINE | ID: mdl-37875130

ABSTRACT

Black spruce (Picea mariana [Mill.] B.S.P.) is a dominant conifer species in the North American boreal forest that plays important ecological and economic roles. Here, we present the first genome assembly of P. mariana with a reconstructed genome size of 18.3 Gbp and NG50 scaffold length of 36.0 kbp. A total of 66,332 protein-coding sequences were predicted in silico and annotated based on sequence homology. We analyzed the evolutionary relationships between P. mariana and 5 other spruces for which complete nuclear and organelle genome sequences were available. The phylogenetic tree estimated from mitochondrial genome sequences agrees with biogeography; specifically, P. mariana was strongly supported as a sister lineage to P. glauca and 3 other taxa found in western North America, followed by the European Picea abies. We obtained mixed topologies with weaker statistical support in phylogenetic trees estimated from nuclear and chloroplast genome sequences, indicative of ancient reticulate evolution affecting these 2 genomes. Clustering of protein-coding sequences from the 6 Picea taxa and 2 Pinus species resulted in 34,776 orthogroups, 560 of which appeared to be specific to P. mariana. Analysis of these specific orthogroups and dN/dS analysis of positive selection signatures for 497 single-copy orthogroups identified gene functions mostly related to plant development and stress response. The P. mariana genome assembly and annotation provides a valuable resource for forest genetics research and applications in this broadly distributed species, especially in relation to climate adaptation.


Subject(s)
Picea , Phylogeny , Picea/genetics , North America
2.
F1000Res ; 12: 336, 2023.
Article in English | MEDLINE | ID: mdl-37455852

ABSTRACT

We present a genome assembly of Caretta caretta (the Loggerhead sea turtle; Chordata, Testudines, Cheloniidae), generated from genomic data from two unrelated females. The genome sequence is 2.13 gigabases in size. The assembly has a BUSCO completion score of 96.1% and N50 of 130.95 Mb. The majority of the assembly is scaffolded into 28 chromosomal representations with a remaining 2% of the assembly being excluded from these.


Subject(s)
Turtles , Animals , Female , Turtles/genetics , Reptiles , Genome , Genomics
3.
J Pathol ; 253(2): 225-233, 2021 02.
Article in English | MEDLINE | ID: mdl-33135777

ABSTRACT

The practical application of genome-scale technologies to precision oncology research requires flexible tissue processing strategies that can be used to differentially select both tumour and normal cell populations from formalin-fixed, paraffin-embedded tissues. As tumour sequencing scales towards clinical implementation, practical difficulties in scheduling and obtaining fresh tissue biopsies at scale, including blood samples as surrogates for matched 'normal' DNA, have focused attention on the use of formalin-preserved clinical samples collected routinely for diagnostic purposes. In practice, such samples often contain both tumour and normal cells which, if correctly partitioned, could be used to profile both tumour and normal genomes, thus identifying somatic alterations. Here we report a semi-automated method for laser microdissecting entire slide-mounted tissue sections to enrich for cells of interest with sufficient yield for whole genome and transcriptome sequencing. Using this method, we demonstrated enrichment of tumour material from mixed tumour-normal samples by up to 67%. Leveraging new methods that allow for the extraction of high-quality nucleic acids from small amounts of formalin-fixed tissues, we further showed that the method was successful in yielding sequence data of sufficient quality for use in BC Cancer's Personalized OncoGenomics (POG) program. © 2020 The Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.


Subject(s)
Laser Capture Microdissection , Neoplasms/pathology , Precision Medicine , Animals , Formaldehyde , Humans , Liver/pathology , Mice , Inbred C57BL Mice , Tissue Fixation
4.
Microbiol Resour Announc ; 9(39)2020 Sep 24.
Article in English | MEDLINE | ID: mdl-32972944

ABSTRACT

Here, we present the chloroplast genome sequence of black spruce (Picea mariana), a conifer widely distributed throughout North American boreal forests. This complete and annotated chloroplast sequence is 123,961 bp long and will contribute to future studies on the genetic basis of evolutionary change in spruce and adaptation in conifers.

5.
Genome Biol Evol ; 12(7): 1174-1179, 2020 07 01.
Article in English | MEDLINE | ID: mdl-32449750

ABSTRACT

Plant mitochondrial genomes vary widely in size. Although many plant mitochondrial genomes have been sequenced and assembled, the vast majority are of angiosperms, and few are of gymnosperms. Most plant mitochondrial genomes are smaller than a megabase, with a few notable exceptions. We have sequenced and assembled the complete 5.5-Mb mitochondrial genome of Sitka spruce (Picea sitchensis), to date, one of the largest mitochondrial genomes of a gymnosperm. We sequenced the whole genome using Oxford Nanopore MinION, and then identified contigs of mitochondrial origin assembled from these long reads based on sequence homology to the white spruce mitochondrial genome. The assembly graph shows a multipartite genome structure, composed of one smaller 168-kb circular segment of DNA, and a larger 5.4-Mb single component with a branching structure. The assembly graph gives insight into a putative complex physical genome structure, and its branching points may represent active sites of recombination.


Subject(s)
Mitochondrial Genome , Plant Genome , Picea/genetics , Molecular Structure
6.
PLoS One ; 14(10): e0224578, 2019.
Article in English | MEDLINE | ID: mdl-31671154

ABSTRACT

Next generation RNA-sequencing (RNA-seq) is a flexible approach that can be applied to a range of applications including global quantification of transcript expression, the characterization of RNA structure such as splicing patterns and profiling of expressed mutations. Many RNA-seq protocols require up to microgram levels of total RNA input amounts to generate high quality data, and thus remain impractical for the limited starting material amounts typically obtained from rare cell populations, such as those from early developmental stages or from laser micro-dissected clinical samples. Here, we present an assessment of the contemporary ribosomal RNA depletion-based protocols, and identify those that are suitable for inputs as low as 1-10 ng of intact total RNA and 100-500 ng of partially degraded RNA from formalin-fixed paraffin-embedded tissues.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Ribosomal RNA/genetics , RNA Sequence Analysis/methods , Animals , Base Sequence/genetics , Gene Expression Profiling/methods , Humans , Mammals/genetics , RNA/genetics , Messenger RNA/genetics , Tissue Fixation/methods , Transcriptome/genetics
7.
Genes (Basel) ; 10(7)2019 06 26.
Article in English | MEDLINE | ID: mdl-31248052

ABSTRACT

The Steller sea lion is the largest member of the Otariidae family and is found in the coastal waters of the northern Pacific Rim. Here, we present the Steller sea lion genome, determined through DNA sequencing approaches that utilized microfluidic partitioning library construction, as well as nanopore technologies. These methods constructed a highly contiguous assembly with a scaffold N50 length of over 14 megabases, a contig N50 length of over 242 kilobases and a total length of 2.404 gigabases. As a measure of completeness, 95.1% of 4104 highly conserved mammalian genes were found to be complete within the assembly. Further annotation identified 19,668 protein coding genes. The assembled genome sequence and underlying sequence data can be found at the National Center for Biotechnology Information (NCBI) under the BioProject accession number PRJNA475770.


Subject(s)
Genome , Sea Lions/genetics , Animals , Genomic Library , Microfluidics/methods , Nanopores , Whole Genome Sequencing
8.
Microbiol Resour Announc ; 8(24)2019 Jun 13.
Article in English | MEDLINE | ID: mdl-31196920

ABSTRACT

Engelmann spruce (Picea engelmannii) is a conifer found primarily on the west coast of North America. Here, we present the complete chloroplast genome sequence of Picea engelmannii genotype Se404-851. This chloroplast sequence will benefit future conifer genomic research and contribute resources to further species conservation efforts.

9.
Genes (Basel) ; 10(6)2019 06 03.
Article in English | MEDLINE | ID: mdl-31163709

ABSTRACT

The threespine stickleback is a geographically widespread and ecologically highly diverse fish that has emerged as a powerful model system for evolutionary genomics and developmental biology. Investigations in this species currently rely on a single high-quality reference genome, but would benefit from the availability of additional, independently sequenced and assembled genomes. We present here the assembly of four new stickleback genomes, based on the sequencing of microfluidic partitioned DNA libraries. The base pair lengths of the four genomes reach 92-101% of the standard reference genome length. Together with their de novo gene annotation, these assemblies offer a resource enhancing genomic investigations in stickleback. The genomes and their annotations are available from the Dryad Digital Repository (https://doi.org/10.5061/dryad.113j3h7).


Subject(s)
Genome/genetics , Molecular Sequence Annotation , Smegmamorpha/genetics , Animals , Gene Library , Genomics/methods , Microfluidics , DNA Sequence Analysis
10.
Microbiol Resour Announc ; 8(23)2019 Jun 06.
Article in English | MEDLINE | ID: mdl-31171622

ABSTRACT

Here, we present the complete chloroplast genome sequence of white spruce (Picea glauca, genotype WS77111), a coniferous tree widespread in the boreal forests of North America. This sequence contributes to genomic and phylogenetic analyses of the Picea genus that are part of ongoing research to understand their adaptation to environmental stress.

11.
Biotechniques ; 66(2): 85-92, 2019 02.
Article in English | MEDLINE | ID: mdl-30744412

ABSTRACT

The analysis of cell-free circulating tumor DNA (ctDNA) is potentially a less invasive, more dynamic assessment of cancer progression and treatment response than characterizing solid tumor biopsies. Standard isolation methods require separation of plasma by centrifugation, a time-consuming step that complicates automation. To address these limitations, we present an automatable magnetic bead-based ctDNA isolation method that eliminates centrifugation to purify ctDNA directly from peripheral blood (PB). To develop and test our method, ctDNA from cancer patients was purified from PB and plasma. We found that allelic fractions of somatic single-nucleotide variants from target gene capture libraries were comparable, indicating that the PB ctDNA purification method may be a suitable replacement for the plasma-based protocols currently in use.


Subject(s)
Cell-Free Nucleic Acids/blood , Circulating Tumor DNA/blood , High-Throughput Screening Assays/methods , Neoplasms/blood , Tumor Biomarkers/blood , Tumor Biomarkers/isolation & purification , Cell-Free Nucleic Acids/isolation & purification , Circulating Tumor DNA/isolation & purification , High-Throughput Nucleotide Sequencing , Humans , Mutation , Neoplasms/genetics
12.
Nucleic Acids Res ; 47(2): e12, 2019 01 25.
Article in English | MEDLINE | ID: mdl-30418619

ABSTRACT

Tissues used in pathology laboratories are typically stored in the form of formalin-fixed, paraffin-embedded (FFPE) samples. One important consideration in repurposing FFPE material for next generation sequencing (NGS) analysis is the sequencing artifacts that can arise from the significant damage to nucleic acids due to treatment with formalin, storage at room temperature and extraction. One such class of artifacts consists of chimeric reads that appear to be derived from non-contiguous portions of the genome. Here, we show that a major proportion of such chimeric reads align to both the 'Watson' and 'Crick' strands of the reference genome. We refer to these as strand-split artifact reads (SSARs). This study provides a conceptual framework for the mechanistic basis of the genesis of SSARs and other chimeric artifacts along with supporting experimental evidence, which have led to approaches to reduce the levels of such artifacts. We demonstrate that one of these approaches, involving S1 nuclease-mediated removal of single-stranded fragments and overhangs, also reduces sequence bias, base error rates, and false positive detection of copy number and single nucleotide variants. Finally, we describe an analytical approach for quantifying SSARs from NGS data.


Subject(s)
Artifacts , Fixatives , Formaldehyde , High-Throughput Nucleotide Sequencing , DNA Sequence Analysis , Animals , Genomic Library , Genomics , Hot Temperature , Inbred C57BL Mice , Paraffin Embedding
13.
Genes (Basel) ; 9(12)2018 Nov 30.
Article in English | MEDLINE | ID: mdl-30513700

ABSTRACT

The grizzly bear (Ursus arctos ssp. horribilis) represents the largest population of brown bears in North America. Its genome was sequenced using a microfluidic partitioning library construction technique, and these data were supplemented with sequencing from a nanopore-based long read platform. The final assembly was 2.33 Gb with a scaffold N50 of 36.7 Mb, and the genome is of comparable size to that of its close relative the polar bear (2.30 Gb). An analysis using 4104 highly conserved mammalian genes indicated that 96.1% were found to be complete within the assembly. An automated annotation of the genome identified 19,848 protein coding genes. Our study shows that the combination of the two sequencing modalities that we used is sufficient for the construction of highly contiguous reference quality mammalian genomes. The assembled genome sequence and the supporting raw sequence reads are available from the NCBI (National Center for Biotechnology Information) under the bioproject identifier PRJNA493656, and the assembly described in this paper is version QXTK01000000.

14.
Genes (Basel) ; 8(12)2017 Dec 11.
Article in English | MEDLINE | ID: mdl-29232880

ABSTRACT

The northern sea otter inhabits coastal waters of the northern Pacific Ocean and is the largest member of the Mustelidae family. DNA sequencing methods that utilize microfluidic partitioned and non-partitioned library construction were used to establish the sea otter genome. The final assembly provided 2.426 Gbp of highly contiguous assembled genomic sequences with a scaffold N50 length of over 38 Mbp. We generated transcriptome data derived from a lymphoma to aid in the determination of functional elements. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information (NCBI) under the BioProject accession number PRJNA388419.

15.
Genes (Basel) ; 8(12)2017 Dec 11.
Article in English | MEDLINE | ID: mdl-29232881

ABSTRACT

The beluga whale is a cetacean that inhabits arctic and subarctic regions, and is the only living member of the genus Delphinapterus. The genome of the beluga whale was determined using DNA sequencing approaches that employed both microfluidic partitioning library and non-partitioned library construction. The former allowed for the construction of a highly contiguous assembly with a scaffold N50 length of over 19 Mbp and total reconstruction of 2.32 Gbp. To aid our understanding of the functional elements, transcriptome data was also derived from brain, duodenum, heart, lung, spleen, and liver tissue. Assembled sequence and all of the underlying sequence data are available at the National Center for Biotechnology Information (NCBI) under the Bioproject accession number PRJNA360851A.

16.
Nat Commun ; 8(1): 1433, 2017 11 10.
Article in English | MEDLINE | ID: mdl-29127278

ABSTRACT

Frogs play important ecological roles, and several species are important model organisms for scientific research. The globally distributed Ranidae (true frogs) are the largest frog family, and have substantial evolutionary distance from the model laboratory Xenopus frog species. Unfortunately, there are currently no genomic resources for the former, important group of amphibians. More widely applicable amphibian genomic data is urgently needed as more than two-thirds of known species are currently threatened or are undergoing population declines. We report a 5.8 Gbp (NG50 = 69 kbp) genome assembly of a representative North American bullfrog (Rana [Lithobates] catesbeiana). The genome contains over 22,000 predicted protein-coding genes and 6,223 candidate long noncoding RNAs (lncRNAs). RNA-Seq experiments show thyroid hormone causes widespread transcriptional change among protein-coding and putative lncRNA genes. This initial bullfrog draft genome will serve as a key resource with broad utility including amphibian research, developmental biology, and environmental research.


Subject(s)
Genome , Long Noncoding RNA/genetics , Rana catesbeiana/genetics , Animals , Computational Biology , Mitochondrial Genome , Male , Molecular Sequence Annotation , North America , Phylogeny , Long Noncoding RNA/metabolism , Rana catesbeiana/metabolism , Thyroid Hormones/metabolism
17.
BMC Genomics ; 18(1): 515, 2017 07 05.
Article in English | MEDLINE | ID: mdl-28679365

ABSTRACT

BACKGROUND: RNA-Sequencing (RNA-seq) is now commonly used to reveal quantitative spatiotemporal snapshots of the transcriptome, the structures of transcripts (splice variants and fusions) and landscapes of expressed mutations. However, standard approaches for library construction typically require relatively high amounts of input RNA, are labor intensive, and are time consuming. METHODS: Here, we report the outcome of a systematic effort to optimize and streamline steps in strand-specific RNA-seq library construction. RESULTS: This work has resulted in the identification of an optimized messenger RNA isolation protocol, a potent reverse transcriptase for cDNA synthesis, and an efficient chemistry and a simplified formulation of library construction reagents. We also present an optimization of bead-based purification and size selection designed to maximize the recovery of cDNA fragments. CONCLUSIONS: These developments have allowed us to assemble a rapid high throughput pipeline that produces high quality data from amounts of total RNA as low as 25 ng. While the focus of this study is on RNA-seq sample preparation, some of these developments are also relevant to other next-generation sequencing library types.


Subject(s)
Gene Library , Messenger RNA , RNA Sequence Analysis/methods , Specimen Handling/standards , HL-60 Cells , Humans
18.
PLoS One ; 12(6): e0178706, 2017.
Article in English | MEDLINE | ID: mdl-28570594

ABSTRACT

Curation and storage of formalin-fixed, paraffin-embedded (FFPE) samples are standard procedures in hospital pathology laboratories around the world. Many thousands of such samples exist and could be used for next generation sequencing analysis. Retrospective analyses of such samples are important for identifying molecular correlates of carcinogenesis, treatment history and disease outcomes. Two major hurdles in using FFPE material for sequencing are the damaged nature of the nucleic acids and the labor-intensive nature of nucleic acid purification. These limitations and a number of other issues that span multiple steps from nucleic acid purification to library construction are addressed here. We optimized and automated a 96-well magnetic bead-based extraction protocol that can be scaled to large cohorts and is compatible with automation. Using sets of 32 and 91 individual FFPE samples respectively, we generated libraries from 100 ng of total RNA and DNA starting amounts with 95-100% success rate. The use of the resulting RNA in micro-RNA sequencing was also demonstrated. In addition to offering the potential of scalability and rapid throughput, the yield obtained with lower input requirements makes these methods applicable to clinical samples where tissue abundance is limiting.


Subject(s)
Automation , DNA/isolation & purification , Formaldehyde/chemistry , High-Throughput Nucleotide Sequencing , Paraffin Embedding , RNA/isolation & purification , Tissue Fixation/methods , DNA/genetics , RNA/genetics
19.
New Phytol ; 203(1): 257-66, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24684288

ABSTRACT

Plant defense against herbivores may compromise attraction of mutualists, yet information remains limited about the mechanisms underlying such signaling tradeoffs. Here, we investigated the effects of foliar herbivory by two herbivore species on defense compounds, floral signaling, pollinator and parasitoid attraction, and seed production. Herbivory generally reduced the quantity of many floral volatile organic compounds (VOCs) in Brassica rapa. By contrast, floral color, flower diameter, and plant height remained unaffected. The decreased amounts of floral volatiles led to reduced attractiveness of flowers to pollinators, but increased the attractiveness of herbivore-infested plants to parasitoids. Plants infested with the native butterfly Pieris brassicae produced more flowers during early flowering, effectively compensating for the lower olfactory attractiveness. Herbivory by the invasive Spodoptera littoralis increased the amounts of glucobrassicanapin, and led to delayed flowering. These plants tended to attract fewer pollinators and to produce fewer seeds. Our study indicates a tradeoff between pollinator attraction and indirect defense (parasitoid attraction), which can be mitigated by reduced floral VOC emission and production of more early flowers. We suggest that this compensatory mechanism is specific to plant-herbivore associations with a coevolutionary history.


Subject(s)
Flowers/physiology , Herbivory , Pollination , Volatile Organic Compounds/chemistry , Animals , Bees , Brassica rapa/physiology , Butterflies , Color , Flowers/anatomy & histology , Flowers/chemistry , Spodoptera , Symbiosis , Wasps
20.
Bioinformatics ; 29(12): 1492-7, 2013 Jun 15.
Article in English | MEDLINE | ID: mdl-23698863

ABSTRACT

White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome, though, pushes the boundaries of the current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pairs draft genome in 4.9 million scaffolds, with a scaffold N50 of 20,356 bp. We demonstrate how recent improvements in the sequencing technology, especially increasing read lengths and paired end reads from longer fragments, have a major impact on the assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies. AVAILABILITY: The Picea glauca genome sequencing and assembly data are available through NCBI (Accession#: ALWZ0100000000 PID: PRJNA83435). http://www.ncbi.nlm.nih.gov/bioproject/83435.


Subject(s)
Plant Genome , Genomics/methods , Picea/genetics , Base Sequence , Molecular Sequence Data , DNA Sequence Analysis , Software