Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
1.
Bioinformatics ; 37(7): 1021-1023, 2021 05 17.
Artículo en Inglés | MEDLINE | ID: mdl-32766813

RESUMEN

SUMMARY: Rapid increase of the data size in metagenome researches has raised the demand for new tools to process large datasets efficiently. To accelerate the metagenome profiling process in the scenario of big data, we developed SOAPMetaS, a marker gene-based multiple-sample metagenome profiling tool built on Apache Spark. SOAPMetaS demonstrates high performance and scalability to process large datasets. It can process 80 samples of FASTQ data, summing up to 416 GiB, in around half an hour; and the accuracy of species profiling results of SOAPMetaS is similar to that of MetaPhlAn2. SOAPMetaS can deal with a large volume of metagenome data more efficiently than common-used single-machine tools. AVAILABILITY AND IMPLEMENTATION: Source code is implemented in Java and freely available at https://github.com/BGI-flexlab/SOAPMetaS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Metagenoma , Programas Informáticos , Algoritmos , Macrodatos
2.
BMC Genomics ; 20(1): 260, 2019 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-30940069

RESUMEN

BACKGROUND: The different leaf type associated traits of soybean (Glycine max L.) including leaf area, leaf length, leaf width, leaf shape and petiole length are considered to be associated with seed yield. In order to identify quantitative trait loci (QTLs) affecting leaf type traits, two advanced recombinant inbred line (RIL, ZH, Zhonghuang 24 × Huaxia 3; GB, Guizao 1 × Brazil 13) populations were introduced to score phenotypic values in plants across nine different environments (years, seasons, locations and soybean growth stages). Two restriction site-associated DNA sequencing (RAD-seq) based high-density genetic linkage maps with an average distance of 1.00 centimorgan (cM) between adjacent bin markers were utilized for QTL fine mapping. RESULTS: Correlation analysis showed that most of the traits were correlated with each other and regulated both by hereditary and environmental factors. A total of 190 QTLs were identified for leaf type associated traits in the two populations, of which 14 loci were found to be environmentally stable. Moreover, these detected QTLs were categorized into 34 QTL hotspots, and four important QTL hotspots with phenotypic variance ranging from 3.89-23.13% were highlighted. Furthermore, Glyma04g05840, Glyma19g37820, Glyma14g07140 and Glyma19g39340 were predicted in the intervals of the stable loci and important QTL hotspots for leaf type traits by adopting Gene Ontology (GO) enrichment analysis. CONCLUSIONS: Our findings of the QTLs and the putative genes will be beneficial to gain new insights into the genetic basis for soybean leaf type traits and may further accelerate the breeding process for reasonable leaf type soybean.


Asunto(s)
Glycine max/genética , Hojas de la Planta/genética , Sitios de Carácter Cuantitativo , Mapeo Cromosómico , Cromosomas de las Plantas , Genotipo , Fenotipo , Hojas de la Planta/fisiología , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN
3.
New Phytol ; 222(2): 1012-1029, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30609067

RESUMEN

Verticillium dahliae is a broad host-range pathogen that causes vascular wilts in plants. Interactions between three hosts and specific V. dahliae genotypes result in severe defoliation. The underlying mechanisms of defoliation are unresolved. Genome resequencing, gene deletion and complementation, gene expression analysis, sequence divergence, defoliating phenotype identification, virulence analysis, and quantification of V. dahliae secondary metabolites were performed. Population genomics previously revealed that G-LSR2 was horizontally transferred from the fungus Fusarium oxysporum f. sp. vasinfectum to V. dahliae and is exclusively found in the genomes of defoliating (D) strains. Deletion of seven genes within G-LSR2, designated as VdDf genes, produced the nondefoliation phenotype on cotton, olive, and okra but complementation of two genes restored the defoliation phenotype. Genes VdDf5 and VdDf6 associated with defoliation shared homology with polyketide synthases involved in secondary metabolism, whereas VdDf7 shared homology with proteins involved in the biosynthesis of N-lauroylethanolamine (N-acylethanolamine (NAE) 12:0), a compound that induces defoliation. NAE overbiosynthesis by D strains also appears to disrupt NAE metabolism in cotton by inducing overexpression of fatty acid amide hydrolase. The VdDfs modulate the synthesis and overproduction of secondary metabolites, such as NAE 12:0, that cause defoliation either by altering abscisic acid sensitivity, hormone disruption, or sensitivity to the pathogen.


Asunto(s)
Genómica , Enfermedades de las Plantas/genética , Enfermedades de las Plantas/microbiología , Verticillium/genética , Verticillium/patogenicidad , Secuencia de Bases , Etanolaminas/metabolismo , Genes Fúngicos , Variación Genética , Genoma Fúngico , Gossypium/genética , Ácidos Láuricos/metabolismo , Modelos Biológicos , Familia de Multigenes , Fenotipo , Metabolismo Secundario/genética
4.
Theor Appl Genet ; 132(6): 1833-1845, 2019 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-30826863

RESUMEN

KEY MESSAGE: The Mendelian locus conferring resistance to powdery mildew in soybean was precisely mapped using a combination of phenotypic screening, genetic analyses, and high-throughput genome-wide sequencing. Powdery mildew (PMD), caused by the fungus Microsphaera diffusa Cooke & Peck, leads to considerable yield losses in soybean [Glycine max (L.) Merr.] under favourable environmental conditions and can be controlled by identifying germplasm resources with resistance genes. In this study, resistance to M. diffusa among resistant varieties B3, Fudou234, and B13 is mapped as a single Mendelian locus using three mapping populations derived from crossing susceptible with resistant cultivars. The position of the PMD resistance locus in B3 is located between simple sequence repeat (SSR) markers GMES6959 and Satt_393 on chromosome 16, at genetic distances of 7.1 cM and 4.6 cM, respectively. To more finely map the PMD resistance gene, a high-density genetic map was constructed using 248 F8 recombinant inbred lines derived from a cross of Guizao1 × B13. The final map includes 3748 bins and is 3031.9 cM in length, with an average distance of 0.81 cM between adjacent markers. This genotypic analysis resulted in the precise delineation of the B13 PMD resistance locus to a 188.06-kb genomic region on chromosome 16 that harbours 28 genes, including 17 disease resistance (R)-like genes in the reference Williams 82 genome. Quantitative real-time PCR assays of possible candidate genes revealed differences in the expression levels of 9 R-like genes between the resistant and susceptible parents. These results provide useful information for marker-assisted breeding and gene cloning for PMD resistance.


Asunto(s)
Actinomycetales/patogenicidad , Resistencia a la Enfermedad/genética , Genoma de Planta , Estudio de Asociación del Genoma Completo , Glycine max/genética , Enfermedades de las Plantas/genética , Proteínas de Plantas/metabolismo , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Cromosomas de las Plantas/metabolismo , Desequilibrio de Ligamiento , Repeticiones de Microsatélite , Fenotipo , Enfermedades de las Plantas/microbiología , Proteínas de Plantas/genética , Polimorfismo de Nucleótido Simple , Glycine max/metabolismo
5.
Int J Mol Sci ; 20(12)2019 Jun 20.
Artículo en Inglés | MEDLINE | ID: mdl-31226841

RESUMEN

Selenoproteins that contain selenocysteine (Sec) are found in all kingdoms of life. Although they constitute a small proportion of the proteome, selenoproteins play essential roles in many organisms. In photosynthetic eukaryotes, selenoproteins have been found in algae but are missing in land plants (embryophytes). In this study, we explored the evolutionary dynamics of Sec incorporation by conveying a genomic search for the Sec machinery and selenoproteins across Archaeplastida. We identified a complete Sec machinery and variable sizes of selenoproteomes in the main algal lineages. However, the entire Sec machinery was missing in the Bangiophyceae-Florideophyceae clade (BV) of Rhodoplantae (red algae) and only partial machinery was found in three species of Archaeplastida, indicating parallel loss of Sec incorporation in different groups of algae. Further analysis of genome and transcriptome data suggests that all major lineages of streptophyte algae display a complete Sec machinery, although the number of selenoproteins is low in this group, especially in subaerial taxa. We conclude that selenoproteins tend to be lost in Archaeplastida upon adaptation to a subaerial or acidic environment. The high number of redox-active selenoproteins found in some bloom-forming marine microalgae may be related to defense against viral infections. Some of the selenoproteins in these organisms may have been gained by horizontal gene transfer from bacteria.


Asunto(s)
Chlorophyta/genética , Proteínas de Plantas/genética , Rhodophyta/genética , Selenoproteínas/genética , Streptophyta/genética , Evolución Molecular , Transferencia de Gen Horizontal , Genómica , Filogenia , Selenocisteína/genética , Transcriptoma
6.
BMC Genomics ; 19(1): 451, 2018 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-29895260

RESUMEN

BACKGROUND: Cottonseed is one of the most important raw materials for plant protein, oil and alternative biofuel for diesel engines. Understanding the complex genetic basis of cottonseed traits is requisite for achieving efficient genetic improvement of the traits. However, it is not yet clear about their genetic architecture in genomic level. GWAS has been an effective way to explore genetic basis of quantitative traits in human and many crops. This study aims to dissect genetic mechanism seven cottonseed traits by a GWAS for genetic improvement. RESULTS: A genome-wide association study (GWAS) based on a full gene model with gene effects as fixed and gene-environment interaction as random, was conducted for protein, oil and 5 fatty acids using 316 accessions and ~ 390 K SNPs. Totally, 124 significant quantitative trait SNPs (QTSs), consisting of 16, 21, 87 for protein, oil and fatty acids (palmitic, linoleic, oleic, myristic, stearic), respectively, were identified and the broad-sense heritability was estimated from 71.62 to 93.43%; no QTS-environment interaction was detected for the protein, the palmitic and the oleic contents; the protein content was predominantly controlled by epistatic effects accounting for 65.18% of the total variation, but the oil content and the fatty acids except the palmitic were mainly determined by gene main effects and no epistasis was detected for the myristic and the stearic. Prediction of superior pure line and hybrid revealed the potential of the QTSs in the improvement of cottonseed traits, and the hybrid could achieve higher or lower genetic values compared with pure lines. CONCLUSIONS: This study revealed complex genetic architecture of seven cottonseed traits at whole genome-wide by mixed linear model approach; the identified genetic variants and estimated genetic component effects of gene, gene-gene and gene-environment interaction provide cotton geneticist or breeders new knowledge on the genetic mechanism of the traits and the potential molecular breeding design strategy.


Asunto(s)
Gossypium/genética , Semillas/genética , Ácidos Grasos/análisis , Genes de Plantas , Estudio de Asociación del Genoma Completo , Genotipo , Gossypium/química , Fitomejoramiento , Proteínas de Plantas/genética , Carácter Cuantitativo Heredable , Semillas/química
7.
New Phytol ; 217(2): 756-770, 2018 01.
Artículo en Inglés | MEDLINE | ID: mdl-29084346

RESUMEN

Verticillium dahliae isolates are most virulent on the host from which they were originally isolated. Mechanisms underlying these dominant host adaptations are currently unknown. We sequenced the genome of V. dahliae Vd991, which is highly virulent on its original host, cotton, and performed comparisons with the reference genomes of JR2 (from tomato) and VdLs.17 (from lettuce). Pathogenicity-related factor prediction, orthology and multigene family classification, transcriptome analyses, phylogenetic analyses, and pathogenicity experiments were performed. The Vd991 genome harbored several exclusive, lineage-specific (LS) genes within LS regions (LSRs). Deletion mutants of the seven genes within one LSR (G-LSR2) in Vd991 were less virulent only on cotton. Integration of G-LSR2 genes individually into JR2 and VdLs.17 resulted in significantly enhanced virulence on cotton but did not affect virulence on tomato or lettuce. Transcription levels of the seven LS genes in Vd991 were higher during the early stages of cotton infection, as compared with other hosts. Phylogenetic analyses suggested that G-LSR2 was acquired from Fusarium oxysporum f. sp. vasinfectum through horizontal gene transfer. Our results provide evidence that horizontal gene transfer from Fusarium to Vd991 contributed significantly to its adaptation to cotton and may represent a significant mechanism in the evolution of an asexual plant pathogen.


Asunto(s)
Fusarium/genética , Transferencia de Gen Horizontal , Genoma Fúngico , Genómica , Gossypium/microbiología , Verticillium/genética , Verticillium/patogenicidad , Factores de Virulencia/metabolismo , Secuencia de Bases , Evolución Molecular , Interacciones Huésped-Patógeno/genética , Lactuca/microbiología , Solanum lycopersicum/microbiología , Familia de Multigenes , Filogenia , Especificidad de la Especie , Sintenía/genética , Virulencia/genética
8.
Theor Appl Genet ; 131(3): 555-568, 2018 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-29159422

RESUMEN

KEY MESSAGE: Fifteen stable QTLs were identified using a high-density soybean genetic map across multiple environments. One major QTL, qIF5-1, contributing to total isoflavone content explained phenotypic variance 49.38, 43.27, 46.59, 45.15 and 52.50%, respectively. Soybeans (Glycine max L.) are a major source of dietary isoflavones. To identify novel quantitative trait loci (QTL) underlying isoflavone content, and to improve the accuracy of marker-assisted breeding in soybean, a valuable mapping population comprised of 196 F7:8-10 recombinant inbred lines (RILs, Huachun 2 × Wayao) was utilized to evaluate individual and total isoflavone content in plants grown in four different environments in Guangdong. A high-density genetic linkage map containing 3469 recombination bin markers based on 0.2 × restriction site-associated DNA tag sequencing (RAD-seq) technology was used to finely map QTLs for both individual and total isoflavone contents. Correlation analyses showed that total isoflavone content, and that of five individual isoflavone, was significantly correlated across the four environments. Based on the high-density genetic linkage map, a total of 15 stable quantitative trait loci (QTLs) associated with isoflavone content across multiple environments were mapped onto chromosomes 02, 05, 07, 09, 10, 11, 13, 16, 17, and 19. Further, one of them, qIF5-1, localized to chromosomes 05 (38,434,171-39,045,620 bp) contributed to almost all isoflavone components across all environments, and explained 6.37-59.95% of the phenotypic variance, especially explained 49.38, 43.27, 46.59, 45.15 and 52.50% for total isoflavone. The results obtained in the present study will pave the way for a better understanding of the genetics of isoflavone accumulation and reveals the scope available for improvement of isoflavone content through marker-assisted selection.


Asunto(s)
Glycine max/genética , Isoflavonas/análisis , Sitios de Carácter Cuantitativo , Semillas/química , Mapeo Cromosómico , Ligamiento Genético , Genotipo , Fenotipo , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN , Glycine max/química
9.
Theor Appl Genet ; 131(8): 1715-1728, 2018 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-29754326

RESUMEN

KEY MESSAGE: Map-based cloning identified GmHAD1, a gene which encodes a HAD-like acid phosphatase, associated with soybean tolerance to low phosphorus stress. Phosphorus (P) deficiency in soils is a major limiting factor for crop growth worldwide. Plants may adapt to low phosphorus (LP) conditions via changes to root morphology, including the number, length, orientation, and branching of the principal root classes. To elucidate the genetic mechanisms for LP tolerance in soybean, quantitative trait loci (QTL) related to root morphology responses to LP were identified via hydroponic experiments. In total, we identified 14 major loci associated with these traits in a RIL population. The log-likelihood scores ranged from 2.81 to 7.43, explaining 4.23-13.98% of phenotypic variance. A major locus on chromosome 08, named qP8-2, was co-localized with an important P efficiency QTL (qPE8), containing phosphatase genes GmACP1 and GmACP2. Another major locus on chromosome 10 named qP10-2 explained 4.80-13.98% of the total phenotypic variance in root morphology. The qP10-2 contains GmHAD1, a gene which encodes an acid phosphatase. In the transgenic soybean hairy roots, GmHAD1 overexpression increased P efficiency by 8.4-16.5% relative to the control. Transgenic Arabidopsis plants had higher biomass than wild-type plants across both short- and long-term P reduction. These results suggest that GmHAD1, an acid phosphatase gene, improved the utilization of organic phosphate by soybean and Arabidopsis plants.


Asunto(s)
Fosfatasa Ácida/genética , Glycine max/genética , Fósforo/metabolismo , Sitios de Carácter Cuantitativo , Arabidopsis , Biomasa , Mapeo Cromosómico , Clonación Molecular , Genes de Plantas , Fenotipo , Raíces de Plantas/crecimiento & desarrollo , Plantas Modificadas Genéticamente , Glycine max/enzimología
10.
Molecules ; 23(5)2018 May 11.
Artículo en Inglés | MEDLINE | ID: mdl-29751655

RESUMEN

Bile salt hydrolase (BSH) is a well-known enzyme that has been commonly characterized in probiotic bacteria, as it has cholesterol-lowering effects. However, its molecular investigations are scarce. Here, we build a local database of BSH sequences from Lactobacillaceae (BSH⁻SDL), and phylogenetic analysis and homology searches were employed to elucidate their comparability and distinctiveness among species. Evolutionary study demonstrates that BSH sequences in BSH⁻SDL are divided into five groups, named BSH A, B, C, D and E here, which can be the genetic basis for BSH classification and nomenclature. Sequence analysis suggests the differences between BSH-active and BSH-inactive proteins clearly, especially on site 82. In addition, a total of 551 BSHs from 107 species are identified from 451 genomes of 158 Lactobacillaceae species. Interestingly, those bacteria carrying various copies of BSH A or B can be predicted to be potential cholesterol-lowering probiotics, based on the results of phylogenetic analysis and the subtypes that those previously reported BSH-active probiotics possess. In summary, this study elaborates the molecular basis of BSH in Lactobacillaceae systematically, and provides a novel methodology as well as a consistent standard for the identification of the BSH subtype. We believe that high-throughput screening can be efficiently applied to the selection of promising candidate BSH-active probiotics, which will advance the development of healthcare products in cholesterol metabolism.


Asunto(s)
Amidohidrolasas/genética , Amidohidrolasas/metabolismo , Genoma Bacteriano , Genómica , Lactobacillaceae/enzimología , Lactobacillaceae/genética , Amidohidrolasas/química , Secuencia de Aminoácidos , Proteínas Bacterianas/química , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Dominio Catalítico , Activación Enzimática , Genómica/métodos , Lactobacillaceae/clasificación , Filogenia
11.
BMC Genomics ; 18(1): 466, 2017 06 19.
Artículo en Inglés | MEDLINE | ID: mdl-28629322

RESUMEN

BACKGROUND: One of the overarching goals of soybean breeding is to develop lines that combine increased yield with improved quality characteristics. High-density-marker QTL mapping can serve as an effective strategy to identify novel genomic information to facilitate crop improvement. In this study, we genotyped a recombinant inbred line (RIL) population (Zhonghuang 24 × Huaxia 3) using a restriction-site associated DNA sequencing (RAD-seq) approach. A high-density soybean genetic map was constructed and used to identify several QTLs that were shown to influence six yield-related and two quality traits. RESULTS: A total of 47,472 single-nucleotide polymorphisms (SNPs) were detected for the RILs that were integrated into 2639 recombination bin units, with an average distance of 1.00 cM between adjacent markers. Forty seven QTLs for yield-related traits and 13 QTLs for grain quality traits were found to be distributed on 16 chromosomes in the 2 year studies. Among them, 18 QTLs were stable, and were identified in both analyses. Twenty six QTLs were identified for the first time, with a single QTL (qNN19a) in a 56 kb region explaining 32.56% of phenotypic variation, and an additional 10 of these were novel, stable QTLs. Moreover, 8 QTL hotpots on four different chromosomes were identified for the correlated traits. CONCLUSIONS: With RAD-sequencing, some novel QTLs and important QTL clusters for both yield-related and quality traits were identified based on a new, high-density bin linkage map. Three predicted genes were selected as candidates that likely have a direct or indirect influence on both yield and quality in soybean. Our findings will be helpful for understanding common genetic control mechanisms of co-localized traits and to select cultivars for further analysis to predictably modulate soybean yield and quality simultaneously.


Asunto(s)
Mapeo Cromosómico/métodos , Enzimas de Restricción del ADN/metabolismo , Glycine max/crecimiento & desarrollo , Glycine max/genética , Sitios de Carácter Cuantitativo/genética , Análisis de Secuencia de ADN , Ontología de Genes , Fenotipo , Polimorfismo de Nucleótido Simple
12.
BMC Genomics ; 18(1): 702, 2017 Sep 06.
Artículo en Inglés | MEDLINE | ID: mdl-28877662

RESUMEN

BACKGROUND: Maize was originally domesticated in a tropical environment but is now widely cultivated at temperate latitudes. Temperate and tropical maize populations have diverged both genotypically and phenotypically. Tropical maize lines grown in temperate environments usually exhibit delayed flowering, pollination, and seed set, which reduces their grain yield relative to temperate adapted maize lines. One potential mechanism by which temperate maize may have adapted to a new environment is novel transposable element insertions, which can influence gene regulation. Recent advances in sequencing technology have made it possible to study variation in transposon content and insertion location in large sets of maize lines. RESULTS: In total, 274,408 non-redundant TEs (NRTEs) were identified using resequencing data generated from 83 maize inbred lines. The locations of DNA TEs and copia-superfamily retrotransposons showed significant positive correlations with gene density and genetic recombination rates, whereas gypsy-superfamily retrotransposons showed a negative correlation with these two parameters. Compared to tropical maize, temperate maize had fewer unique NRTEs but higher insertion frequency, lower background recombination rates, and higher linkage disequilibrium, with more NRTEs close to flowering and stress-related genes in the genome. Association mapping demonstrated that the presence/absence of 48 NRTEs was associated with flowering time and that expression of neighboring genes differed between haplotypes where a NRTE was present or absent. CONCLUSIONS: This study suggests that NRTEs may have played an important role in creating the variation in gene regulation that enabled the rapid adaptation of maize to diverse environments.


Asunto(s)
Elementos Transponibles de ADN/genética , Genómica , Polimorfismo Genético , Clima Tropical , Zea mays/genética , Adaptación Fisiológica/genética , Ecosistema , Recombinación Genética , Zea mays/fisiología
13.
Theor Appl Genet ; 130(5): 1041-1051, 2017 May.
Artículo en Inglés | MEDLINE | ID: mdl-28246754

RESUMEN

KEY MESSAGE: Using a combination of phenotypic screening, genetic and statistical analyses, and high-throughput genome-wide sequencing, we have finely mapped a dominant Phytophthora resistance gene in soybean cultivar Wayao. Phytophthora root rot (PRR) caused by Phytophthora sojae is one of the most important soil-borne diseases in many soybean-production regions in the world. Identification of resistant gene(s) and incorporating them into elite varieties are an effective way for breeding to prevent soybean from being harmed by this disease. Two soybean populations of 191 F2 individuals and 196 F7:8 recombinant inbred lines (RILs) were developed to map Rps gene by crossing a susceptible cultivar Huachun 2 with the resistant cultivar Wayao. Genetic analysis of the F2 population indicated that PRR resistance in Wayao was controlled by a single dominant gene, temporarily named RpsWY, which was mapped on chromosome 3. A high-density genetic linkage bin map was constructed using 3469 recombination bins of the RILs to explore the candidate genes by the high-throughput genome-wide sequencing. The results of genotypic analysis showed that the RpsWY gene was located in bin 401 between 4466230 and 4502773 bp on chromosome 3 through line 71 and 100 of the RILs. Four predicted genes (Glyma03g04350, Glyma03g04360, Glyma03g04370, and Glyma03g04380) were found at the narrowed region of 36.5 kb in bin 401. These results suggest that the high-throughput genome-wide resequencing is an effective method to fine map PRR candidate genes.


Asunto(s)
Resistencia a la Enfermedad/genética , Genes de Plantas , Glycine max/genética , Phytophthora , Enfermedades de las Plantas/genética , Mapeo Cromosómico , Genes Dominantes , Ligamiento Genético , Secuenciación de Nucleótidos de Alto Rendimiento , Repeticiones de Microsatélite , Fenotipo , Enfermedades de las Plantas/microbiología , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ADN , Glycine max/microbiología
14.
Nature ; 475(7355): 189-95, 2011 Jul 10.
Artículo en Inglés | MEDLINE | ID: mdl-21743474

RESUMEN

Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.


Asunto(s)
Genoma de Planta/genética , Genómica , Solanum tuberosum/genética , Evolución Molecular , Duplicación de Gen , Regulación de la Expresión Génica de las Plantas , Genes de Plantas/genética , Variación Genética , Haplotipos/genética , Heterocigoto , Homocigoto , Inmunidad Innata , Endogamia , Anotación de Secuencia Molecular , Datos de Secuencia Molecular , Enfermedades de las Plantas/genética , Ploidias , Solanum tuberosum/fisiología
15.
Nucleic Acids Res ; 43(Database issue): D1023-7, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25429973

RESUMEN

We have identified about 20 million rice SNPs by aligning reads from the 3000 rice genomes project with the Nipponbare genome. The SNPs and allele information are organized into a SNP-Seek system (http://www.oryzasnp.org/iric-portal/), which consists of Oracle database having a total number of rows with SNP genotypes close to 60 billion (20 M SNPs × 3 K rice lines) and web interface for convenient querying. The database allows quick retrieving of SNP alleles for all varieties in a given genome region, finding different alleles from predefined varieties and querying basic passport and morphological phenotypic information about sequenced rice lines. SNPs can be visualized together with the gene structures in JBrowse genome browser. Evolutionary relationships between rice varieties can be explored using phylogenetic trees or multidimensional scaling plots.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma de Planta , Oryza/genética , Polimorfismo de Nucleótido Simple , Oryza/anatomía & histología
16.
Mol Genet Genomics ; 291(1): 297-307, 2016 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-26334612

RESUMEN

The future of rice breeding will likely be built on the basis of the further utilization of heterosis between elite cultivars and genetic resources from distant subspecies of rice. Previous studies have proved that exogenous genomic DNA transformation methods can be used to transfer genetic information from distant relatives (donor) into cultivated rice (recipient). However, the mechanism underlying this form of genetic transfer is poorly characterized, and the genes that cause the phenotypic changes in these variants are typically difficult to identify. This study examined YVB, a stable variant line with greatly improved grain quality traits that was derived from an indica variety (V20B) by transferring genomic DNA of O.minuta through the "spike-stalk injection method (SIM)". We used restriction-site associated DNA sequencing technology (RAD-seq) to evaluate a population of BC1F5 backcross lines (YVB × V20B); the RAD-seq data were used to construct a genetic linkage map with high-density SNPs for use in association analysis exploring genotype-phenotype relationships at the whole-genome level. A total of 17 quantitative trait loci (QTLs) for rice quality traits were mapped to chromosomes 3, 5, 6, 8, and 9. 8 major QTLs controlling different phenotypic variations were mapped to the same region of chromosome 5. This region contained the GS5 gene for grain weight and the qSW5/GW5 gene for grain width. This study provides new resources and insights into the molecular mechanisms of grain trait phenotypic variation and the transmission of genetic information via the introduction of genomic DNA to a distantly related crop relative species.


Asunto(s)
Grano Comestible/genética , Oryza/genética , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética , Cruzamiento/métodos , Mapeo Cromosómico/métodos , Cromosomas de las Plantas/genética , Pruebas Genéticas/métodos , Genotipo , Endogamia/métodos , Fenotipo , Análisis de Secuencia de ADN/métodos
17.
Plant Cell ; 25(8): 2813-30, 2013 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-23983221

RESUMEN

The Brassicaceae, including Arabidopsis thaliana and Brassica crops, is unmatched among plants in its wealth of genomic and functional molecular data and has long served as a model for understanding gene, genome, and trait evolution. However, genome information from a phylogenetic outgroup that is essential for inferring directionality of evolutionary change has been lacking. We therefore sequenced the genome of the spider flower (Tarenaya hassleriana) from the Brassicaceae sister family, the Cleomaceae. By comparative analysis of the two lineages, we show that genome evolution following ancient polyploidy and gene duplication events affect reproductively important traits. We found an ancient genome triplication in Tarenaya (Th-α) that is independent of the Brassicaceae-specific duplication (At-α) and nested Brassica (Br-α) triplication. To showcase the potential of sister lineage genome analysis, we investigated the state of floral developmental genes and show Brassica retains twice as many floral MADS (for minichromosome maintenance1, AGAMOUS, DEFICIENS and serum response factor) genes as Tarenaya that likely contribute to morphological diversity in Brassica. We also performed synteny analysis of gene families that confer self-incompatibility in Brassicaceae and found that the critical serine receptor kinase receptor gene is derived from a lineage-specific tandem duplication. The T. hassleriana genome will facilitate future research toward elucidating the evolutionary history of Brassicaceae genomes.


Asunto(s)
Brassicaceae/genética , Evolución Molecular , Genoma de Planta/genética , Carácter Cuantitativo Heredable , Flores/genética , Flores/crecimiento & desarrollo , Regulación de la Expresión Génica de las Plantas , Genes de Plantas/genética , Proteínas de Dominio MADS/genética , Proteínas de Dominio MADS/metabolismo , Anotación de Secuencia Molecular , Filogenia , Mapeo Físico de Cromosoma , Poliploidía , Reproducción/genética , Autoincompatibilidad en las Plantas con Flores/genética , Análisis de Secuencia de ADN , Sintenía/genética , Factores de Tiempo
18.
Proc Natl Acad Sci U S A ; 110(35): 14492-7, 2013 Aug 27.
Artículo en Inglés | MEDLINE | ID: mdl-23940322

RESUMEN

The growing world population and shrinkage of arable land demand yield improvement of rice, one of the most important staple crops. To elucidate the genetic basis of yield and uncover its associated loci in rice, we resequenced the core recombinant inbred lines of Liang-You-Pei-Jiu, the widely cultivated super hybrid rice, and constructed a high-resolution linkage map. We detected 43 yield-associated quantitative trait loci, of which 20 are unique. Based on the high-density physical map, the genome sequences of paternal variety 93-11 and maternal cultivar PA64s of Liang-You-Pei-Jiu were significantly improved. The large recombinant inbred line population combined with plentiful high-quality single nucleotide polymorphisms and insertions/deletions between parental genomes allowed us to fine-map two quantitative trait loci, qSN8 and qSPB1, and to identify days to heading8 and lax panicle1 as candidate genes, respectively. The quantitative trait locus qSN8 was further confirmed to be days to heading8 by a complementation test. Our study provided an ideal platform for molecular breeding by targeting and dissecting yield-associated loci in rice.


Asunto(s)
Genoma de Planta , Hibridación Genética , Oryza/genética , Recombinación Genética , Ligamiento Genético , Sitios de Carácter Cuantitativo
19.
Plant J ; 80(1): 136-48, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25039268

RESUMEN

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies.


Asunto(s)
Variación Genética , Genoma de Planta/genética , Solanum lycopersicum/genética , Cruzamiento , Mapeo Cromosómico , ADN de Plantas/química , ADN de Plantas/genética , Frutas/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Datos de Secuencia Molecular , Fenotipo , Filogenia , Polimorfismo de Nucleótido Simple , Alineación de Secuencia , Análisis de Secuencia de ADN , Especificidad de la Especie
20.
BMC Plant Biol ; 14: 83, 2014 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-24684805

RESUMEN

BACKGROUND: Drought stress is one of the major limiting factors for maize production. With the availability of maize B73 reference genome and whole-genome resequencing of 15 maize inbreds, common variants (CV) and clustering analyses were applied to identify non-synonymous SNPs (nsSNPs) and corresponding candidate genes for drought tolerance. RESULTS: A total of 524 nsSNPs that were associated with 271 candidate genes involved in plant hormone regulation, carbohydrate and sugar metabolism, signaling molecules regulation, redox reaction and acclimation of photosynthesis to environment were detected by CV and cluster analyses. Most of the nsSNPs identified were clustered in bin 1.07 region that harbored six previously reported QTL with relatively high phenotypic variation explained for drought tolerance. Genes Ontology (GO) analysis of candidate genes revealed that there were 35 GO terms related to biotic stimulus and membrane-bounded organelle, showing significant differences between the candidate genes and the reference B73 background. Changes of expression level in these candidate genes for drought tolerance were detected using RNA sequencing for fertilized ovary, basal leaf meristem tissue and roots collected under drought stressed and well-watered conditions. The results indicated that 70% of candidate genes showed significantly expression changes under two water treatments and our strategies for mining candidate genes are feasible and relatively efficient. CONCLUSIONS: Our results successfully revealed candidate nsSNPs and associated genes for drought tolerance by comparative sequence analysis of 16 maize inbred lines. Both methods we applied were proved to be efficient for identifying candidate genes for complex traits through the next-generation sequencing technologies (NGS). These selected genes will not only facilitate understanding of genetic basis of drought stress response, but also accelerate genetic improvement through marker-assisted selection in maize.


Asunto(s)
Adaptación Fisiológica/genética , Estudios de Asociación Genética , Genoma de Planta/genética , Análisis de Secuencia de ADN/métodos , Zea mays/genética , Zea mays/fisiología , Cromosomas de las Plantas/genética , Análisis por Conglomerados , Deshidratación , Sequías , Ontología de Genes , Genes de Plantas , Genotipo , Endogamia , Desnaturalización de Ácido Nucleico/genética , Reacción en Cadena de la Polimerasa , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , Reproducibilidad de los Resultados
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA