Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Bioinformatics ; 39(8)2023 08 01.
Artículo en Inglés | MEDLINE | ID: mdl-37607004

RESUMEN

SUMMARY: Genome-wide association studies (GWAS) excels at harnessing dense genomic variant datasets to identify candidate regions responsible for producing a given phenotype. However, GWAS and traditional fine-mapping methods do not provide insight into the complex local landscape of linkage that contains and has been shaped by the causal variant(s). Here, we present crosshap, an R package that performs robust density-based clustering of variants based on their linkage profiles to capture haplotype structures in a local genomic region of interest. Following this, crosshap is equipped with visualization tools for choosing optimal clustering parameters (ɛ) before producing an intuitive figure that provides an overview of the complex relationships between linked variants, haplotype combinations, phenotype, and metadata traits. AVAILABILITY AND IMPLEMENTATION: The crosshap package is freely available under the MIT license and can be downloaded directly from CRAN with R >4.0.0. The development version is available on GitHub alongside issue support (https://github.com/jacobimarsh/crosshap). Tutorial vignettes and documentation are available (https://jacobimarsh.github.io/crosshap/).


Asunto(s)
Documentación , Estudio de Asociación del Genoma Completo , Análisis por Conglomerados , Haplotipos , Fenotipo
2.
Plant J ; 115(1): 68-80, 2023 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-36970933

RESUMEN

Pod dehiscence is a major source of yield loss in legumes, which is exacerbated by aridity. Disruptive mutations in "Pod indehiscent 1" (PDH1), a pod sclerenchyma-specific lignin biosynthesis gene, has been linked to significant reductions in dehiscence in several legume species. We compared syntenic PDH1 regions across 12 legumes and two outgroups to uncover key historical evolutionary trends at this important locus. Our results clarified the extent to which PDH1 orthologs are present in legumes, showing the typical genomic context surrounding PDH1 has only arisen relatively recently in certain phaseoloid species (Vigna, Phaseolus, Glycine). The notable absence of PDH1 in Cajanus cajan may be a major contributor to its indehiscent phenotype compared with other phaseoloids. In addition, we identified a novel PDH1 ortholog in Vigna angularis and detected remarkable increases in PDH1 transcript abundance during Vigna unguiculata pod development. Investigation of the shared genomic context of PDH1 revealed it lies in a hotspot of transcription factors and signaling gene families that respond to abscisic acid and drought stress, which we hypothesize may be an additional factor influencing expression of PDH1 under specific environmental conditions. Our findings provide key insights into the evolutionary history of PDH1 and lay the foundation for optimizing the pod dehiscence role of PDH1 in major and understudied legume species.


Asunto(s)
Phaseolus , Vigna , Vigna/genética , Sitios de Carácter Cuantitativo , Genoma de Planta/genética , Phaseolus/genética , Genómica
3.
Int J Mol Sci ; 23(19)2022 Sep 21.
Artículo en Inglés | MEDLINE | ID: mdl-36232406

RESUMEN

Rye (Secale cereale) is a climate-resilient cereal grown extensively as grain or forage crop in Northern and Eastern Europe. In addition to being an important crop, it has been used to improve wheat through introgression of genomic regions for improved yield and disease resistance. Understanding the genomic diversity of rye will assist both the improvement of this crop and facilitate the introgression of more valuable traits into wheat. Here, we isolated and sequenced the short arm of rye chromosome 7 (7RS) from Triticale 380SD using flow cytometry and compared it to the public Lo7 rye whole genome reference assembly. We identify 2747 Lo7 genes present on the isolated chromosome arm and two clusters containing seven and sixty-five genes that are present on Triticale 380SD 7RS, but absent from Lo7 7RS. We identified 29 genes that are not assigned to chromosomal locations in the Lo7 assembly but are present on Triticale 380SD 7RS, suggesting a chromosome arm location for these genes. Our study supports the Lo7 reference assembly and provides a repertoire of genes on Triticale 7RS.


Asunto(s)
Secale , Triticale , Cromosomas de las Plantas/genética , Resistencia a la Enfermedad/genética , Grano Comestible/genética , Secale/genética , Triticale/genética , Triticum/genética
4.
Methods Mol Biol ; 2512: 73-80, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35818000

RESUMEN

Presence-absence variants (PAV) are genomic regions present in some individuals of a species, but not others. PAVs have been shown to contribute to genomic diversity, especially in bacteria and plants. These structural variations have been linked to traits and can be used to track a species' evolutionary history. PAVs are usually called by aligning short read sequence data from one or more individuals to a reference genome or pangenome assembly, and then comparing coverage. Regions where reads do not align define absence in that individual, and the regions are classified as PAVs. The method below details how to align sequence reads to a reference and how to use the sequencing-coverage calculator Mosdepth to identify PAVs and construct a PAV table for use in downstream comparative genome analysis.


Asunto(s)
Genoma , Genómica , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Análisis de Secuencia de ADN/métodos
5.
Plants (Basel) ; 11(9)2022 Apr 27.
Artículo en Inglés | MEDLINE | ID: mdl-35567182

RESUMEN

Soybean (Glycine max) is a legume species of significant economic and nutritional value. The yield of soybean continues to increase with the breeding of improved varieties, and this is likely to continue with the application of advanced genetic and genomic approaches for breeding. Genome technologies continue to advance rapidly, with an increasing number of high-quality genome assemblies becoming available. With accumulating data from marker arrays and whole-genome resequencing, studying variations between individuals and populations is becoming increasingly accessible. Furthermore, the recent development of soybean pangenomes has highlighted the significant structural variation between individuals, together with knowledge of what has been selected for or lost during domestication and breeding, information that can be applied for the breeding of improved cultivars. Because of this, resources such as genome assemblies, SNP datasets, pangenomes and associated databases are becoming increasingly important for research underlying soybean crop improvement.

6.
Theor Appl Genet ; 135(4): 1443-1455, 2022 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-35141762

RESUMEN

KEY MESSAGE: The major soy protein QTL, cqProt-003, was analysed for haplotype diversity and global distribution, and results indicate 304 bp deletion and variable tandem repeats in protein coding regions are likely causal candidates. Here, we present association and linkage analysis of 985 wild, landrace and cultivar soybean accessions in a pan genomic dataset to characterize the major high-protein/low-oil associated locus cqProt-003 located on chromosome 20. A significant trait-associated region within a 173 kb linkage block was identified, and variants in the region were characterized, identifying 34 high confidence SNPs, 4 insertions, 1 deletion and a larger 304 bp structural variant in the high-protein haplotype. Trinucleotide tandem repeats of variable length present in the second exon of gene Glyma.20G085100 are strongly correlated with the high-protein phenotype and likely represent causal variation. Structural variation has previously been found in the same gene, for which we report the global distribution of the 304 bp deletion and have identified additional nested variation present in high-protein individuals. Mapping variation at the cqProt-003 locus across demographic groups suggests that the high-protein haplotype is common in wild accessions (94.7%), rare in landraces (10.6%) and near absent in cultivated breeding pools (4.1%), suggesting its decrease in frequency primarily correlates with domestication and continued during subsequent improvement. However, the variation that has persisted in under-utilized wild and landrace populations holds high breeding potential for breeders willing to forego seed oil to maximize protein content. The results of this study include the identification of distinct haplotype structures within the high-protein population, and a broad characterization of the genomic context and linkage patterns of cqProt-003 across global populations, supporting future functional characterization and modification.


Asunto(s)
Fabaceae , Glycine max , Fabaceae/genética , Haplotipos , Fitomejoramiento , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Semillas/metabolismo , Glycine max/genética , Glycine max/metabolismo
7.
Int J Mol Sci ; 23(4)2022 Feb 18.
Artículo en Inglés | MEDLINE | ID: mdl-35216392

RESUMEN

Pangenomes aim to represent the complete repertoire of the genome diversity present within a species or cohort of species, capturing the genomic structural variance between individuals. This genomic information coupled with phenotypic data can be applied to identify genes and alleles involved with abiotic stress tolerance, disease resistance, and other desirable traits. The characterisation of novel structural variants from pangenomes can support genome editing approaches such as Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR associated protein Cas (CRISPR-Cas), providing functional information on gene sequences and new target sites in variant-specific genes with increased efficiency. This review discusses the application of pangenomes in genome editing and crop improvement, focusing on the potential of pangenomes to accurately identify target genes for CRISPR-Cas editing of plant genomes while avoiding adverse off-target effects. We consider the limitations of applying CRISPR-Cas editing with pangenome references and potential solutions to overcome these limitations.


Asunto(s)
Sistemas CRISPR-Cas/genética , Productos Agrícolas/genética , Genoma de Planta/genética , Edición Génica/métodos , Fenotipo , Fitomejoramiento/métodos , Plantas Modificadas Genéticamente/genética
8.
Plant Genome ; 15(1): e20109, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34169673

RESUMEN

The gene content of plants varies between individuals of the same species due to gene presence/absence variation, and selection can alter the frequency of specific genes in a population. Selection during domestication and breeding will modify the genomic landscape, though the nature of these modifications is only understood for specific genes or on a more general level (e.g., by a loss of genetic diversity). Here we have assembled and analyzed a soybean (Glycine spp.) pangenome representing more than 1,000 soybean accessions derived from the USDA Soybean Germplasm Collection, including both wild and cultivated lineages, to assess genomewide changes in gene and allele frequency during domestication and breeding. We identified 3,765 genes that are absent from the Lee reference genome assembly and assessed the presence/absence of all genes across this population. In addition to a loss of genetic diversity, we found a significant reduction in the average number of protein-coding genes per individual during domestication and subsequent breeding, though with some genes and allelic variants increasing in frequency associated with selection for agronomic traits. This analysis provides a genomic perspective of domestication and breeding in this important oilseed crop.


Asunto(s)
Domesticación , Fabaceae , Fabaceae/genética , Genoma de Planta , Fitomejoramiento , Glycine max/genética , Estados Unidos , United States Department of Agriculture
9.
Theor Appl Genet ; 134(6): 1677-1690, 2021 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-33852055

RESUMEN

KEY MESSAGE: Safeguarding crop yields in a changing climate requires bioinformatics advances in harnessing data from vast phenomics and genomics datasets to translate research findings into climate smart crops in the field. Climate change and an additional 3 billion mouths to feed by 2050 raise serious concerns over global food security. Crop breeding and land management strategies will need to evolve to maximize the utilization of finite resources in coming years. High-throughput phenotyping and genomics technologies are providing researchers with the information required to guide and inform the breeding of climate smart crops adapted to the environment. Bioinformatics has a fundamental role to play in integrating and exploiting this fast accumulating wealth of data, through association studies to detect genomic targets underlying key adaptive climate-resilient traits. These data provide tools for breeders to tailor crops to their environment and can be introduced using advanced selection or genome editing methods. To effectively translate research into the field, genomic and phenomic information will need to be integrated into comprehensive clade-specific databases and platforms alongside accessible tools that can be used by breeders to inform the selection of climate adaptive traits. Here we discuss the role of bioinformatics in extracting, analysing, integrating and managing genomic and phenomic data to improve climate resilience in crops, including current, emerging and potential approaches, applications and bottlenecks in the research and breeding pipeline.


Asunto(s)
Cambio Climático , Biología Computacional , Productos Agrícolas/genética , Genómica , Fenómica , Fitomejoramiento/métodos , Adaptación Fisiológica , Edición Génica , Fenotipo
10.
Methods Mol Biol ; 2222: 149-166, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33301093

RESUMEN

Molecular markers provide researchers with a powerful tool for variation analysis between plant genomes. They are heritable and widely distributed across the genome and for this reason have many applications in plant taxonomy and genotyping. Over the last decade, molecular marker technology has developed rapidly and is now a crucial component for genetic linkage analysis, trait mapping, diversity analysis, and association studies. This chapter focuses on molecular marker discovery, its application, and future perspectives for plant genotyping through pangenome assemblies. Included are descriptions of automated methods for genome and sequence distance estimation, genome contaminant analysis in sequence reads, genome structural variation, and SNP discovery methods.


Asunto(s)
Código de Barras del ADN Taxonómico , Técnicas de Genotipaje , Ensayos Analíticos de Alto Rendimiento , Plantas/clasificación , Plantas/genética , Biología Computacional/métodos , Código de Barras del ADN Taxonómico/métodos , Código de Barras del ADN Taxonómico/normas , Contaminación de ADN , Evolución Molecular , Marcadores Genéticos , Genoma de Planta , Genómica/métodos , Genotipo , Ensayos Analíticos de Alto Rendimiento/normas , Filogenia , Polimorfismo de Nucleótido Simple
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...