Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
1.
Nucleic Acids Res ; 52(D1): D1024-D1032, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-37941143

RESUMEN

The silkworm Bombyx mori is a domesticated insect that serves as an animal model for research and agriculture. The silkworm super-pan-genome dataset, which we published last year, is a unique resource for the study of global genomic diversity and phenotype-genotype association. Here we present SilkMeta (http://silkmeta.org.cn), a comprehensive database covering the available silkworm pan-genome and multi-omics data. The database contains 1082 short-read genomes, 546 long-read assembled genomes, 1168 transcriptomes, 294 phenotype characterizations (phenome), tens of millions of variations (variome), 7253 long non-coding RNAs (lncRNAs), 18 717 full length transcripts and a set of population statistics. We have compiled publications on functional genomics research and genetic stock deciphering (mutant map). A range of bioinformatics tools is also provided for data visualization and retrieval. The large batch of omics data and tools were integrated in twelve functional modules that provide useful strategies and data for comparative and functional genomics research. The interactive bioinformatics platform SilkMeta will benefit not only the silkworm but also the insect biology communities.


Asunto(s)
Bombyx , Genoma de los Insectos , Animales , Bombyx/genética , Biología Computacional , Genómica , Metadatos , Multiómica
2.
BMC Bioinformatics ; 25(1): 173, 2024 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-38693489

RESUMEN

Principal component analysis (PCA) is an important and widely used unsupervised learning method that determines population structure based on genetic variation. Genome sequencing of thousands of individuals usually generate tens of millions of SNPs, making it challenging for PCA analysis and interpretation. Here we present VCF2PCACluster, a simple, fast and memory-efficient tool for Kinship estimation, PCA and clustering analysis, and visualization based on VCF formatted SNPs. We implemented five Kinship estimation methods and three clustering methods for its users to choose from. Moreover, unlike other PCA tools, VCF2PCACluster possesses a clustering function based on PCA result, which enabling users to automatically and clearly know about population structure. We demonstrated the same accuracy but a higher performance of this tool in performing PCA analysis on tens of millions of SNPs compared to another popular PLINK2 software, especially in peak memory usage that is independent of the number of SNPs in VCF2PCACluster.


Asunto(s)
Polimorfismo de Nucleótido Simple , Análisis de Componente Principal , Programas Informáticos , Análisis por Conglomerados , Humanos
3.
Nature ; 557(7703): 43-49, 2018 05.
Artículo en Inglés | MEDLINE | ID: mdl-29695866

RESUMEN

Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within- and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.


Asunto(s)
Productos Agrícolas/clasificación , Productos Agrícolas/genética , Variación Genética , Genoma de Planta/genética , Oryza/clasificación , Oryza/genética , Asia , Evolución Molecular , Genes de Plantas/genética , Genética de Población , Genómica , Haplotipos , Mutación INDEL/genética , Filogenia , Fitomejoramiento , Polimorfismo de Nucleótido Simple/genética
4.
Plant Cell Environ ; 42(1): 212-229, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-29749073

RESUMEN

Developing crops with better root systems is a promising strategy to ensure productivity in both optimum and stress environments. Root system architectural traits in 397 soybean accessions were characterized and a high-density single nucleotide polymorphisms (SNPs)-based genome-wide association study was performed to identify the underlying genes associated with root structure. SNPs associated with root architectural traits specific to landraces and elite germplasm pools were detected. Four loci were detected in landraces for lateral root number (LRN) and distribution of root thickness in diameter Class I with a major locus on chromosome 16. This major loci was detected in the coding region of unknown protein, and subsequent analyses demonstrated that root traits are affected with mutated haplotypes of the gene. In elite germplasm pool, 3 significant SNPs in alanine-glyoxalate aminotransferase, Leucine-Rich Repeat receptor/No apical meristem, and unknown functional genes were found to govern multiple traits including root surface area and volume. However, no major loci were detected for LRN in elite germplasm. Nucleotide diversity analysis found evidence of selective sweeps around the landraces LRN gene. Soybean accessions with minor and mutated allelic variants of LRN gene were found to perform better in both water-limited and optimal field conditions.


Asunto(s)
Glycine max/genética , Raíces de Plantas/genética , Genes de Plantas/genética , Variación Genética , Estudio de Asociación del Genoma Completo , Raíces de Plantas/anatomía & histología , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética , Carácter Cuantitativo Heredable , Análisis de Secuencia de ADN , Glycine max/anatomía & histología , Transcriptoma
5.
Nucleic Acids Res ; 43(Database issue): D1023-7, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25429973

RESUMEN

We have identified about 20 million rice SNPs by aligning reads from the 3000 rice genomes project with the Nipponbare genome. The SNPs and allele information are organized into a SNP-Seek system (http://www.oryzasnp.org/iric-portal/), which consists of Oracle database having a total number of rows with SNP genotypes close to 60 billion (20 M SNPs × 3 K rice lines) and web interface for convenient querying. The database allows quick retrieving of SNP alleles for all varieties in a given genome region, finding different alleles from predefined varieties and querying basic passport and morphological phenotypic information about sequenced rice lines. SNPs can be visualized together with the gene structures in JBrowse genome browser. Evolutionary relationships between rice varieties can be explored using phylogenetic trees or multidimensional scaling plots.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma de Planta , Oryza/genética , Polimorfismo de Nucleótido Simple , Oryza/anatomía & histología
6.
Plant Biotechnol J ; 14(12): 2240-2253, 2016 12.
Artículo en Inglés | MEDLINE | ID: mdl-27155090

RESUMEN

Next-generation sequencing of complete genomes has given researchers unprecedented levels of information to study the multifaceted evolutionary changes that have shaped elite plant germplasm. In conjunction with population genetic analytical techniques and detailed online databases, we can more accurately capture the effects of domestication on entire biological pathways of agronomic importance. In this study, we explore the genetic diversity and signatures of selection in all predicted gene models of the storage starch synthesis pathway of Sorghum bicolor, utilizing a diversity panel containing lines categorized as either 'Landraces' or 'Wild and Weedy' genotypes. Amongst a total of 114 genes involved in starch synthesis, 71 had at least a single signal of purifying selection and 62 a signal of balancing selection and others a mix of both. This included key genes such as STARCH PHOSPHORYLASE 2 (SbPHO2, under balancing selection), PULLULANASE (SbPUL, under balancing selection) and ADP-glucose pyrophosphorylases (SHRUNKEN2, SbSH2 under purifying selection). Effectively, many genes within the primary starch synthesis pathway had a clear reduction in nucleotide diversity between the Landraces and wild and weedy lines indicating that the ancestral effects of domestication are still clearly identifiable. There was evidence of the positional rate variation within the well-characterized primary starch synthesis pathway of sorghum, particularly in the Landraces, whereby low evolutionary rates upstream and high rates downstream in the metabolic pathway were expected. This observation did not extend to the wild and weedy lines or the minor starch synthesis pathways.


Asunto(s)
Genoma de Planta/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Sorghum/genética , Sorghum/metabolismo , Almidón/metabolismo
7.
BMC Genomics ; 16: 1078, 2015 Dec 21.
Artículo en Inglés | MEDLINE | ID: mdl-26691201

RESUMEN

BACKGROUND: To safeguard the food supply for the growing human population, it is important to understand and exploit the genetic basis of quantitative traits. Next-generation sequencing technology performs advantageously and effectively in genetic mapping and genome analysis of diverse genetic resources. Hence, we combined re-sequencing technology and a bin map strategy to construct an ultra-high-density bin map with thousands of bin markers to precisely map a quantitative trait locus. RESULTS: In this study, we generated a linkage map containing 1,151,856 high quality SNPs between Mo17 and B73, which were verified in the maize intermated B73 × Mo17 (IBM) Syn10 population. This resource is an excellent complement to existing maize genetic maps available in an online database  (iPlant, http://data.maizecode.org/maize/qtl/syn10/ ). Moreover, in this population combined with the IBM Syn4 RIL population, we detected 135 QTLs for flowering time and plant height traits across the two populations. Eighteen known functional genes and twenty-five candidate genes for flowering time and plant height trait were fine-mapped into a 2.21-4.96 Mb interval. Map expansion and segregation distortion were also analyzed, and evidence for inadvertent selection of early flowering time in the process of mapping population development was observed. Furthermore, an updated integrated map with 1,151,856 high-quality SNPs, 2,916 traditional markers and 6,618 bin markers was constructed. The data were deposited into the iPlant Discovery Environment (DE), which provides a fundamental resource of genetic data for the maize genetic research community. CONCLUSIONS: Our findings provide basic essential genetic data for the maize genetic research community. An updated IBM Syn10 population and a reliable, verified high-quality SNP set between Mo17 and B73 will aid in future molecular breeding efforts.


Asunto(s)
Mapeo Cromosómico/métodos , Sitios de Carácter Cuantitativo , Zea mays/genética , Ligamiento Genético , Genoma de Planta , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN , Zea mays/fisiología
9.
BMC Plant Biol ; 14: 253, 2014 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-25928459

RESUMEN

BACKGROUND: Increased disease resistance is a key target of cereal breeding programs, with disease outbreaks continuing to threaten global food production, particularly in Africa. Of the disease resistance gene families, the nucleotide-binding site plus leucine-rich repeat (NBS-LRR) family is the most prevalent and ancient and is also one of the largest gene families known in plants. The sequence diversity in NBS-encoding genes was explored in sorghum, a critical food staple in Africa, with comparisons to rice and maize and with comparisons to fungal pathogen resistance QTL. RESULTS: In sorghum, NBS-encoding genes had significantly higher diversity in comparison to non NBS-encoding genes and were significantly enriched in regions of the genome under purifying and balancing selection, both through domestication and improvement. Ancestral genes, pre-dating species divergence, were more abundant in regions with signatures of selection than in regions not under selection. Sorghum NBS-encoding genes were also significantly enriched in the regions of the genome containing fungal pathogen disease resistance QTL; with the diversity of the NBS-encoding genes influenced by the type of co-locating biotic stress resistance QTL. CONCLUSIONS: NBS-encoding genes are under strong selection pressure in sorghum, through the contrasting evolutionary processes of purifying and balancing selection. Such contrasting evolutionary processes have impacted ancestral genes more than species-specific genes. Fungal disease resistance hot-spots in the genome, with resistance against multiple pathogens, provides further insight into the mechanisms that cereals use in the "arms race" with rapidly evolving pathogens in addition to providing plant breeders with selection targets for fast-tracking the development of high performing varieties with more durable pathogen resistance.


Asunto(s)
Resistencia a la Enfermedad/genética , Evolución Molecular , Sorghum/genética , Familia de Multigenes , Polimorfismo Genético , Sorghum/inmunología
10.
J Adv Res ; 42: 1-16, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-35988902

RESUMEN

INTRODUCTION: Rice, Oryza sativa L. (Os), is one of the oldest domesticated cereals that has also gone through extensive improvement in modern breeding. OBJECTIVES: How rice was domesticated and impacted by modern breeding. METHODS: We performed comprehensive analyses of genomic sequences of 504 accessions of Os and 456 accessions of O. rufipogon/O. nivara (Or). RESULTS: The natural selection on Or before domestication and the natural and artificial selection during domestication together shaped the well-differentiated genomes of two subspecies, geng(j) (japonica) and xian(i) (indica), while breeding has made apparent genomic imprints between landrace and modern varieties of each subspecies, and also between primary modern and advanced modern varieties of xian(i). Selection during domestication and breeding left genome-wide selective signals covering âˆ¼ 22.8 % and âˆ¼ 8.6 % of the Os genome, significantly reduced within-population genomic diversity by âˆ¼ 22 % in xian(i) and âˆ¼ 53 % in geng(j) plus more pronounced subspecific differentiation. Only âˆ¼ 10 % reduction in the total genomic diversity was observed between the Os and Or populations, indicating domestication did not suffer severe genetic bottleneck. CONCLUSION: Our results revealed clear differentiation of the Or accessions into three large populations, two of which correspond to the well-differentiated Os subspecies, geng(j) and xian(i). Improved productivity and common changes in the same suit of adaptive traits in xian(i) and geng(j) during domestication and breeding resulted apparently from compensatory and convergent selections for different genes/alleles acting in the common KEGG terms and/or same gene families, and thus maintaining or even increasing the within population diversity and subspecific differentiation of Os, while more genes/alleles of novel function were selected during domestication than modern breeding. Our results supported the multiple independent domestication of Os in Asia and suggest the more efficient utilization of the rich diversity within Os by exploiting inter-subspecific and among population diversity in future rice improvement.


Asunto(s)
Oryza , Oryza/genética , Domesticación , Productos Agrícolas/genética , Fitomejoramiento , Genómica
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA