Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 74
Filtrar
Más filtros

Base de datos
Tipo del documento
Intervalo de año de publicación
1.
Genome Biol Evol ; 16(5)2024 05 02.
Artículo en Inglés | MEDLINE | ID: mdl-38701023

RESUMEN

Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics.


Asunto(s)
Genoma , Escorpiones , Animales , Escorpiones/genética , Cromosomas/genética , Filogenia , Anotación de Secuencia Molecular , Evolución Molecular
2.
G3 (Bethesda) ; 14(7)2024 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-38781445

RESUMEN

The first chromosome-scale reference genome of the rare narrow-endemic African moss Physcomitrellopsis africana (P. africana) is presented here. Assembled from 73 × Oxford Nanopore Technologies (ONT) long reads and 163 × Beijing Genomics Institute (BGI)-seq short reads, the 414 Mb reference comprises 26 chromosomes and 22,925 protein-coding genes [Benchmarking Universal Single-Copy Ortholog (BUSCO) scores: C:94.8% (D:13.9%)]. This genome holds 2 genes that withstood rigorous filtration of microbial contaminants, have no homolog in other land plants, and are thus interpreted as resulting from 2 unique horizontal gene transfers (HGTs) from microbes. Further, P. africana shares 176 of the 273 published HGT candidates identified in Physcomitrium patens (P. patens), but lacks 98 of these, highlighting that perhaps as many as 91 genes were acquired in P. patens in the last 40 million years following its divergence from its common ancestor with P. africana. These observations suggest rather continuous gene gains via HGT followed by potential losses during the diversification of the Funariaceae. Our findings showcase both dynamic flux in plant HGTs over evolutionarily "short" timescales, alongside enduring impacts of successful integrations, like those still functionally maintained in extant P. africana. Furthermore, this study describes the informatic processes employed to distinguish contaminants from candidate HGT events.


Asunto(s)
Bryopsida , Transferencia de Gen Horizontal , Genoma de Planta , Filogenia , Bryopsida/genética , Genómica/métodos , Anotación de Secuencia Molecular
3.
Evol Appl ; 17(4): e13669, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38633133

RESUMEN

DNA methylation is critical to the regulation of transposable elements and gene expression and can play an important role in the adaptation of stress response mechanisms in plants. Traditional methods of methylation quantification rely on bisulfite conversion that can compromise accuracy. Recent advances in long-read sequencing technologies allow for methylation detection in real time. The associated algorithms that interpret these modifications have evolved from strictly statistical approaches to Hidden Markov Models and, recently, deep learning approaches. Much of the existing software focuses on methylation in the CG context, but methylation in other contexts is important to quantify, as it is extensively leveraged in plants. Here, we present methylation profiles for two maple species across the full range of 5mC sequence contexts using Oxford Nanopore Technologies (ONT) long-reads. Hybrid and reference-guided assemblies were generated for two new Acer accessions: Acer negundo (box elder; 65x ONT and 111X Illumina) and Acer saccharum (sugar maple; 93x ONT and 148X Illumina). The ONT reads generated for these assemblies were re-basecalled, and methylation detection was conducted in a custom pipeline with the published Acer references (PacBio assemblies) and hybrid assemblies reported herein to generate four epigenomes. Examination of the transposable element landscape revealed the dominance of LTR Copia elements and patterns of methylation associated with different classes of TEs. Methylation distributions were examined at high resolution across gene and repeat density and described within the broader angiosperm context, and more narrowly in the context of gene family dynamics and candidate nutrient stress genes.

4.
G3 (Bethesda) ; 14(5)2024 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-38526344

RESUMEN

Whitebark pine (WBP, Pinus albicaulis) is a white pine of subalpine regions in the Western contiguous United States and Canada. WBP has become critically threatened throughout a significant part of its natural range due to mortality from the introduced fungal pathogen white pine blister rust (WPBR, Cronartium ribicola) and additional threats from mountain pine beetle (Dendroctonus ponderosae), wildfire, and maladaptation due to changing climate. Vast acreages of WBP have suffered nearly complete mortality. Genomic technologies can contribute to a faster, more cost-effective approach to the traditional practices of identifying disease-resistant, climate-adapted seed sources for restoration. With deep-coverage Illumina short reads of haploid megagametophyte tissue and Oxford Nanopore long reads of diploid needle tissue, followed by a hybrid, multistep assembly approach, we produced a final assembly containing 27.6 Gb of sequence in 92,740 contigs (N50 537,007 bp) and 34,716 scaffolds (N50 2.0 Gb). Approximately 87.2% (24.0 Gb) of total sequence was placed on the 12 WBP chromosomes. Annotation yielded 25,362 protein-coding genes, and over 77% of the genome was characterized as repeats. WBP has demonstrated the greatest variation in resistance to WPBR among the North American white pines. Candidate genes for quantitative resistance include disease resistance genes known as nucleotide-binding leucine-rich repeat receptors (NLRs). A combination of protein domain alignments and direct genome scanning was employed to fully describe the 3 subclasses of NLRs. Our high-quality reference sequence and annotation provide a marked improvement in NLR identification compared to previous assessments that leveraged de novo-assembled transcriptomes.


Asunto(s)
Genoma de Planta , Anotación de Secuencia Molecular , Pinus , Pinus/genética , Pinus/parasitología , Genómica/métodos , Especies en Peligro de Extinción , Secuenciación de Nucleótidos de Alto Rendimiento
5.
Glob Chang Biol ; 30(1): e17145, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38273516

RESUMEN

Human activity changes multiple factors in the environment, which can have positive or negative synergistic effects on organisms. However, few studies have explored the causal effects of multiple anthropogenic factors, such as urbanization and invasive species, on animals and the mechanisms that mediate these interactions. This study examines the influence of urbanization on the detrimental effect of invasive avian vampire flies (Philornis downsi) on endemic Darwin's finches in the Galápagos Islands. We experimentally manipulated nest fly abundance in urban and non-urban locations and then characterized nestling health, fledging success, diet, and gene expression patterns related to host defense. Fledging success of non-parasitized nestlings from urban (79%) and non-urban (75%) nests did not differ significantly. However, parasitized, non-urban nestlings lost more blood, and fewer nestlings survived (8%) compared to urban nestlings (50%). Stable isotopic values (δ15 N) from urban nestling feces were higher than those from non-urban nestlings, suggesting that urban nestlings are consuming more protein. δ15 N values correlated negatively with parasite abundance, which suggests that diet might influence host defenses (e.g., tolerance and resistance). Parasitized, urban nestlings differentially expressed genes within pathways associated with red blood cell production (tolerance) and pro-inflammatory response (innate immunological resistance), compared to parasitized, non-urban nestlings. In contrast, parasitized non-urban nestlings differentially expressed genes within pathways associated with immunoglobulin production (adaptive immunological resistance). Our results suggest that urban nestlings are investing more in pro-inflammatory responses to resist parasites but also recovering more blood cells to tolerate blood loss. Although non-urban nestlings are mounting an adaptive immune response, it is likely a last effort by the immune system rather than an effective defense against avian vampire flies since few nestlings survived.


Asunto(s)
Pinzones , Muscidae , Parásitos , Animales , Humanos , Pinzones/parasitología , Ecuador
6.
G3 (Bethesda) ; 14(2)2024 Feb 07.
Artículo en Inglés | MEDLINE | ID: mdl-37703053

RESUMEN

With the advent of affordable and more accurate third-generation sequencing technologies, and the associated bioinformatic tools, it is now possible to sequence, assemble, and annotate more species of conservation concern than ever before. Juglans cinerea, commonly known as butternut or white walnut, is a member of the walnut family, native to the Eastern United States and Southeastern Canada. The species is currently listed as Endangered on the IUCN Red List due to decline from an invasive fungus known as Ophiognomonia clavigignenti-juglandacearum (Oc-j) that causes butternut canker. Oc-j creates visible sores on the trunks of the tree which essentially starves and slowly kills the tree. Natural resistance to this pathogen is rare. Conserving butternut is of utmost priority due to its critical ecosystem role and cultural significance. As part of an integrated undergraduate and graduate student training program in biodiversity and conservation genomics, the first reference genome for Juglans cinerea is described here. This chromosome-scale 539 Mb assembly was generated from over 100 × coverage of Oxford Nanopore long reads and scaffolded with the Juglans mandshurica genome. Scaffolding with a closely related species oriented and ordered the sequences in a manner more representative of the structure of the genome without altering the sequence. Comparisons with sequenced Juglandaceae revealed high levels of synteny and further supported J. cinerea's recent phylogenetic placement. Comparative assessment of gene family evolution revealed a significant number of contracting families, including several associated with biotic stress response.


Asunto(s)
Juglans , Humanos , Estados Unidos , Juglans/genética , Filogenia , Ecosistema , Cromosomas , América del Norte
7.
bioRxiv ; 2023 Nov 17.
Artículo en Inglés | MEDLINE | ID: mdl-38014212

RESUMEN

Whitebark pine (WBP, Pinus albicaulis ) is a white pine of subalpine regions in western contiguous US and Canada. WBP has become critically threatened throughout a significant part of its natural range due to mortality from the introduced fungal pathogen white pine blister rust (WPBR, Cronartium ribicola ) and additional threats from mountain pine beetle ( Dendroctonus ponderosae ), wildfire, and maladaptation due to changing climate. Vast acreages of WBP have suffered nearly complete mortality. Genomic technologies can contribute to a faster, more cost-effective approach to the traditional practices of identifying disease-resistant, climate-adapted seed sources for restoration. With deep-coverage Illumina short-reads of haploid megametophyte tissue and Oxford Nanopore long-reads of diploid needle tissue, followed by a hybrid, multistep assembly approach, we produced a final assembly containing 27.6 Gbp of sequence in 92,740 contigs (N50 537,007 bp) and 34,716 scaffolds (N50 2.0 Gbp). Approximately 87.2% (24.0 Gbp) of total sequence was placed on the twelve WBP chromosomes. Annotation yielded 25,362 protein-coding genes, and over 77% of the genome was characterized as repeats. WBP has demonstrated the greatest variation in resistance to WPBR among the North American white pines. Candidate genes for quantitative resistance include disease resistance genes known as nucleotide-binding leucine-rich-repeat receptors (NLRs). A combination of protein domain alignments and direct genome scanning was employed to fully describe the three subclasses of NLRs (TNL, CNL, RNL). Our high-quality reference sequence and annotation provide a marked improvement in NLR identification compared to previous assessments that leveraged de novo assembled transcriptomes.

8.
Appl Plant Sci ; 11(4): e11533, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37601314

RESUMEN

Premise: Robust standards to evaluate quality and completeness are lacking in eukaryotic structural genome annotation, as genome annotation software is developed using model organisms and typically lacks benchmarking to comprehensively evaluate the quality and accuracy of the final predictions. The annotation of plant genomes is particularly challenging due to their large sizes, abundant transposable elements, and variable ploidies. This study investigates the impact of genome quality, complexity, sequence read input, and method on protein-coding gene predictions. Methods: The impact of repeat masking, long-read and short-read inputs, and de novo and genome-guided protein evidence was examined in the context of the popular BRAKER and MAKER workflows for five plant genomes. The annotations were benchmarked for structural traits and sequence similarity. Results: Benchmarks that reflect gene structures, reciprocal similarity search alignments, and mono-exonic/multi-exonic gene counts provide a more complete view of annotation accuracy. Transcripts derived from RNA-read alignments alone are not sufficient for genome annotation. Gene prediction workflows that combine evidence-based and ab initio approaches are recommended, and a combination of short and long reads can improve genome annotation. Adding protein evidence from de novo assemblies, genome-guided transcriptome assemblies, or full-length proteins from OrthoDB generates more putative false positives as implemented in the current workflows. Post-processing with functional and structural filters is highly recommended. Discussion: While the annotation of non-model plant genomes remains complex, this study provides recommendations for inputs and methodological approaches. We discuss a set of best practices to generate an optimal plant genome annotation and present a more robust set of metrics to evaluate the resulting predictions.

9.
Plant Cell Environ ; 46(5): 1705-1725, 2023 05.
Artículo en Inglés | MEDLINE | ID: mdl-36541367

RESUMEN

Fusarium circinatum poses a threat to both commercial and natural pine forests. Large variation in host resistance exists between species, with many economically important species being susceptible. Development of resistant genotypes could be expedited and optimised by investigating the molecular mechanisms underlying host resistance and susceptibility as well as increasing the available genetic resources. RNA-seq data, from F. circinatum inoculated and mock-inoculated ca. 6-month-old shoot tissue at 3- and 7-days postinoculation, was generated for three commercially important tropical pines, Pinus oocarpa, Pinus maximinoi and Pinus greggii. De novo transcriptomes were assembled and used to investigate the NLR and PR gene content within available pine references. Host responses to F. circinatum challenge were investigated in P. oocarpa (resistant) and P. greggii (susceptible), in comparison to previously generated expression profiles from Pinus tecunumanii (resistant) and Pinus patula (susceptible). Expression results indicated crosstalk between induced salicylate, jasmonate and ethylene signalling is involved in host resistance and compromised in susceptible hosts. Additionally, higher constitutive expression of sulfur metabolism and flavonoid biosynthesis in resistant hosts suggest involvement of these metabolites in resistance.


Asunto(s)
Fusarium , Pinus , Transcriptoma/genética , Fusarium/fisiología , Genotipo , Pinus/genética , Enfermedades de las Plantas/genética
10.
G3 (Bethesda) ; 13(2)2023 02 09.
Artículo en Inglés | MEDLINE | ID: mdl-36454025

RESUMEN

Douglas-fir (Pseudotsuga menziesii) is native to western North America. It grows in a wide range of environmental conditions and is an important timber tree. Although there are several studies on the gene expression responses of Douglas-fir to abiotic cues, the absence of high-quality transcriptome and genome data is a barrier to further investigation. Like for most conifers, the available transcriptome and genome reference dataset for Douglas-fir remains fragmented and requires refinement. We aimed to generate a highly accurate, and complete reference transcriptome and genome annotation. We deep-sequenced the transcriptome of Douglas-fir needles from seedlings that were grown under nonstress control conditions or a combination of heat and drought stress conditions using long-read (LR) and short-read (SR) sequencing platforms. We used 2 computational approaches, namely de novo and genome-guided LR transcriptome assembly. Using the LR de novo assembly, we identified 1.3X more high-quality transcripts, 1.85X more "complete" genes, and 2.7X more functionally annotated genes compared to the genome-guided assembly approach. We predicted 666 long noncoding RNAs and 12,778 unique protein-coding transcripts including 2,016 putative transcription factors. We leveraged the LR de novo assembled transcriptome with paired-end SR and a published single-end SR transcriptome to generate an improved genome annotation. This was conducted with BRAKER2 and refined based on functional annotation, repetitive content, and transcriptome alignment. This high-quality genome annotation has 51,419 unique gene models derived from 322,631 initial predictions. Overall, our informatics approach provides a new reference Douglas-fir transcriptome assembly and genome annotation with considerably improved completeness and functional annotation.


Asunto(s)
Pseudotsuga , Transcriptoma , Pseudotsuga/genética , Perfilación de la Expresión Génica , Anotación de Secuencia Molecular , Secuencia de Bases
11.
Tree Physiol ; 43(2): 315-334, 2023 02 04.
Artículo en Inglés | MEDLINE | ID: mdl-36210755

RESUMEN

Climate change challenges the adaptive capacity of several forest tree species in the face of increasing drought and rising temperatures. Therefore, understanding the mechanistic connections between genetic diversity and drought resilience is highly valuable for conserving drought-sensitive forests. Nonetheless, the post-drought recovery in trees from a transcriptomic perspective has not yet been studied by comparing contrasting phenotypes. Here, experimental drought treatments, gas-exchange dynamics and transcriptomic analysis (RNA-seq) were performed in the relict and drought-sensitive fir Abies pinsapo Boiss. to identify gene expression differences over immediate (24 h) and extended drought (20 days). Post-drought responses were investigated to define resilient and sensitive phenotypes. Single nucleotide polymorphisms (SNPs) were also studied to characterize the genomic basis of A. pinsapo drought resilience. Weighted gene co-expression network analysis showed an activation of stomatal closing and an inhibition of plant growth-related genes during the immediate drought, consistent with an isohydric dynamic. During the extended drought, transcription factors, as well as cellular damage and homeostasis protection-related genes prevailed. Resilient individuals activate photosynthesis-related genes and inhibit aerial growth-related genes, suggesting a shifting shoot/root biomass allocation to improve water uptake and whole-plant carbon balance. About, 152 fixed SNPs were found between resilient and sensitive seedlings, which were mostly located in RNA-activity-related genes, including epigenetic regulation. Contrasting gene expression and SNPs were found between different post-drought resilience phenotypes for the first time in a forest tree, suggesting a transcriptomic and genomic basis for drought resilience. The obtained drought-related transcriptomic profile and drought-resilience candidate genes may guide conservation programs for this threatened tree species.


Asunto(s)
Abies , Abies/fisiología , Transcriptoma , Sequías , Epigénesis Genética , Bosques , Árboles/genética , Genómica
12.
Nat Plants ; 8(4): 389-401, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35437001

RESUMEN

Cycads represent one of the most ancient lineages of living seed plants. Identifying genomic features uniquely shared by cycads and other extant seed plants, but not non-seed-producing plants, may shed light on the origin of key innovations, as well as the early diversification of seed plants. Here, we report the 10.5-Gb reference genome of Cycas panzhihuaensis, complemented by the transcriptomes of 339 cycad species. Nuclear and plastid phylogenomic analyses strongly suggest that cycads and Ginkgo form a clade sister to all other living gymnosperms, in contrast to mitochondrial data, which place cycads alone in this position. We found evidence for an ancient whole-genome duplication in the common ancestor of extant gymnosperms. The Cycas genome contains four homologues of the fitD gene family that were likely acquired via horizontal gene transfer from fungi, and these genes confer herbivore resistance in cycads. The male-specific region of the Y chromosome of C. panzhihuaensis contains a MADS-box transcription factor expressed exclusively in male cones that is similar to a system reported in Ginkgo, suggesting that a sex determination mechanism controlled by MADS-box genes may have originated in the common ancestor of cycads and Ginkgo. The C. panzhihuaensis genome provides an important new resource of broad utility for biologists.


Asunto(s)
Cycas , Cycadopsida/genética , Cycas/genética , Genes de Plantas , Ginkgo biloba/genética , Filogenia , Semillas/genética
13.
BMC Plant Biol ; 22(1): 143, 2022 Mar 25.
Artículo en Inglés | MEDLINE | ID: mdl-35337270

RESUMEN

Aronia is a group of deciduous fruiting shrubs, of the Rosaceae family, native to eastern North America. Interest in Aronia has increased because of the high levels of dietary antioxidants in Aronia fruits. Using Illumina RNA-seq transcriptome analysis, this study investigates the molecular mechanisms of polyphenol biosynthesis during Aronia fruit development. Six A. melanocarpa (diploid) accessions were collected at four fruit developmental stages. De novo assembly was performed with 341 million clean reads from 24 samples and assembled into 90,008 transcripts with an average length of 801 bp. The transcriptome had 96.1% complete according to Benchmarking Universal Single-Copy Orthologs (BUSCOs). The differentially expressed genes (DEGs) were identified in flavonoid biosynthetic and metabolic processes, pigment biosynthesis, carbohydrate metabolic processes, and polysaccharide metabolic processes based on significant Gene Ontology (GO) biological terms. The expression of ten anthocyanin biosynthetic genes showed significant up-regulation during fruit development according to the transcriptomic data, which was further confirmed using qRT-PCR expression analyses. Additionally, transcription factor genes were identified among the DEGs. Using a transient expression assay, we confirmed that AmMYB10 induces anthocyanin biosynthesis. The de novo transcriptome data provides a valuable resource for the understanding the molecular mechanisms of fruit anthocyanin biosynthesis in Aronia and species of the Rosaceae family.


Asunto(s)
Photinia , Transcriptoma , Antocianinas/metabolismo , Frutas , Regulación de la Expresión Génica de las Plantas , Photinia/genética , Photinia/metabolismo , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo
14.
Ecol Evol ; 12(2): e8611, 2022 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-35222971

RESUMEN

In land plants, heteroblasty broadly refers to a drastic change in morphology during growth through ontogeny. Juniperus flaccida and Pinus cembroides are conifers of independent lineages known to exhibit leaf heteroblasty between the juvenile and adult life stage of development. Juvenile leaves of P. cembroides develop spirally on the main stem and appear decurrent, flattened, and needle-like; whereas adult photosynthetic leaves are triangular or semi-circular needle-like, and grow in whorls on secondary or tertiary compact dwarf shoots. By comparison, J. flaccida juvenile leaves are decurrent and needle-like, and adult leaves are compact, short, and scale-like. Comparative analyses were performed to evaluate differences in anatomy and gene expression patterns between developmental phases in both species. RNA from 12 samples was sequenced and analyzed with available software. They were assembled de novo from the RNA-Seq reads. Following assembly, 63,741 high-quality transcripts were functionally annotated in P. cembroides and 69,448 in J. flaccida. Evaluation of the orthologous groups yielded 4140 shared gene families among the four references (adult and juvenile from each species). Activities related to cell division and development were more abundant in juveniles than adults in P. cembroides, and more abundant in adults than juveniles in J. flaccida. Overall, there were 509 up-regulated and 81 down-regulated genes in the juvenile condition of P. cembroides and 14 up-regulated and 22 down-regulated genes in J. flaccida. Gene interaction network analysis showed evidence of co-expression and co-localization of up-regulated genes involved in cell wall and cuticle formation, development, and phenylpropanoid pathway, in juvenile P. cembroides leaves. Whereas in J. flaccida, differential expression and gene interaction patterns were detected in genes involved in photosynthesis and chloroplast biogenesis. Although J. flaccida and P. cembroides both exhibit leaf heteroblastic development, little overlap was detected, and unique genes and pathways were highlighted in this study.

15.
G3 (Bethesda) ; 12(1)2022 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-35100403

RESUMEN

Sequencing, assembly, and annotation of the 26.5 Gbp hexaploid genome of coast redwood (Sequoia sempervirens) was completed leading toward discovery of genes related to climate adaptation and investigation of the origin of the hexaploid genome. Deep-coverage short-read Illumina sequencing data from haploid tissue from a single seed were combined with long-read Oxford Nanopore Technologies sequencing data from diploid needle tissue to create an initial assembly, which was then scaffolded using proximity ligation data to produce a highly contiguous final assembly, SESE 2.1, with a scaffold N50 size of 44.9 Mbp. The assembly included several scaffolds that span entire chromosome arms, confirmed by the presence of telomere and centromere sequences on the ends of the scaffolds. The structural annotation produced 118,906 genes with 113 containing introns that exceed 500 Kbp in length and one reaching 2 Mb. Nearly 19 Gbp of the genome represented repetitive content with the vast majority characterized as long terminal repeats, with a 2.9:1 ratio of Copia to Gypsy elements that may aid in gene expression control. Comparison of coast redwood to other conifers revealed species-specific expansions for a plethora of abiotic and biotic stress response genes, including those involved in fungal disease resistance, detoxification, and physical injury/structural remodeling and others supporting flavonoid biosynthesis. Analysis of multiple genes that exist in triplicate in coast redwood but only once in its diploid relative, giant sequoia, supports a previous hypothesis that the hexaploidy is the result of autopolyploidy rather than any hybridizations with separate but closely related conifer species.


Asunto(s)
Sequoia , Evolución Biológica , Cromosomas , Genoma , Secuenciación de Nucleótidos de Alto Rendimiento , Sequoia/genética
16.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-35042802

RESUMEN

A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.


Asunto(s)
Secuencia de Bases/genética , Eucariontes/genética , Genómica/normas , Animales , Biodiversidad , Genómica/métodos , Humanos , Estándares de Referencia , Valores de Referencia , Análisis de Secuencia de ADN/métodos , Análisis de Secuencia de ADN/normas
17.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-35042803

RESUMEN

Green plants play a fundamental role in ecosystems, human health, and agriculture. As de novo genomes are being generated for all known eukaryotic species as advocated by the Earth BioGenome Project, increasing genomic information on green land plants is essential. However, setting standards for the generation and storage of the complex set of genomes that characterize the green lineage of life is a major challenge for plant scientists. Such standards will need to accommodate the immense variation in green plant genome size, transposable element content, and structural complexity while enabling research into the molecular and evolutionary processes that have resulted in this enormous genomic variation. Here we provide an overview and assessment of the current state of knowledge of green plant genomes. To date fewer than 300 complete chromosome-scale genome assemblies representing fewer than 900 species have been generated across the estimated 450,000 to 500,000 species in the green plant clade. These genomes range in size from 12 Mb to 27.6 Gb and are biased toward agricultural crops with large branches of the green tree of life untouched by genomic-scale sequencing. Locating suitable tissue samples of most species of plants, especially those taxa from extreme environments, remains one of the biggest hurdles to increasing our genomic inventory. Furthermore, the annotation of plant genomes is at present undergoing intensive improvement. It is our hope that this fresh overview will help in the development of genomic quality standards for a cohesive and meaningful synthesis of green plant genomes as we scale up for the future.


Asunto(s)
Secuencia de Bases/genética , Genómica/tendencias , Viridiplantae/genética , Biodiversidad , Evolución Biológica , Elementos Transponibles de ADN/genética , Ecología , Ecosistema , Embryophyta/genética , Evolución Molecular , Genoma , Genoma de Planta/genética , Genómica/métodos , Difusión de la Información/métodos , Almacenamiento y Recuperación de la Información/métodos , Filogenia , Plantas/genética
19.
Gigascience ; 122022 12 28.
Artículo en Inglés | MEDLINE | ID: mdl-37632754

RESUMEN

BACKGROUND: De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species. FINDINGS: Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements. CONCLUSIONS: Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.


Asunto(s)
Eucalyptus , Animales , Eucalyptus/genética , Árboles , Bosques , Reordenamiento Génico , Haplotipos
20.
Appl Plant Sci ; 9(6): e11439, 2021 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-34268018

RESUMEN

PREMISE: An informatics approach was used for the construction of an Axiom genotyping array from heterogeneous, high-throughput sequence data to assess the complex genome of loblolly pine (Pinus taeda). METHODS: High-throughput sequence data, sourced from exome capture and whole genome reduced-representation approaches from 2698 trees across five sequence populations, were analyzed with the improved genome assembly and annotation for the loblolly pine. A variant detection, filtering, and probe design pipeline was developed to detect true variants across and within populations. From 8.27 million variants, a total of 642,275 were evaluated and 423,695 of those were screened across a range-wide population. RESULTS: The final informatics and screening approach delivered an Axiom array representing 46,439 high-confidence variants to the forest tree breeding and genetics community. Based on the annotated reference genome, 34% were located in or directly upstream or downstream of genic regions. DISCUSSION: The Pita50K array represents a genome-wide resource developed from sequence data for an economically important conifer, loblolly pine. It uniquely integrates independent projects that assessed trees sampled across the native range. The challenges associated with the large and repetitive genome are addressed in the development of this resource.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA