Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
BMC Res Notes ; 16(1): 248, 2023 Oct 02.
Artículo en Inglés | MEDLINE | ID: mdl-37784104

RESUMEN

OBJECTIVE: Black poplar (Populus nigra L.) is a species native to Eurasia with a wide distribution area. It is an ecologically important species from riparian ecosystems, that is used as a parent of interspecific (P. deltoides x P. nigra) cultivated poplar hybrids. Variant detection from transcriptomics sequences of 241 P. nigra individuals, sampled in natural populations from 11 river catchments (in four European countries) is described here. These data provide new valuable resources for population structure analysis, population genomics and genome-wide association studies. DATA DESCRIPTION: We generated transcriptomics data from a mixture of young differentiating xylem and cambium tissues of 480 Populus nigra trees sampled in a common garden experiment located at Orléans (France), corresponding to 241 genotypes (2 clonal replicates per genotype, at maximum) by using RNAseq technology. We launched on the resulting sequences an in-silico pipeline that allowed us to obtain 878,957 biallelic polymorphisms without missing data. More than 99% of these positions are annotated and 98.8% are located on the 19 chromosomes of the P. trichocarpa reference genome. The raw RNAseq sequences are available at the NCBI Sequence Read Archive SPR188754 and the variant dataset at the Recherche Data Gouv repository under https://doi.org/10.15454/8DQXK5 .


Asunto(s)
Populus , Humanos , Populus/genética , Ecosistema , Estudio de Asociación del Genoma Completo , Genotipo , Francia
2.
Biomolecules ; 13(9)2023 09 16.
Artículo en Inglés | MEDLINE | ID: mdl-37759800

RESUMEN

The Douglas fir (Pseudotsuga menziesii) is a conifer native to North America that has become increasingly popular in plantations in France due to its many advantages as timber: rapid growth, quality wood, and good adaptation to climate change. Tree genetic improvement programs require knowledge of a species' genetic structure and history and the development of genetic markers. The very slow progress in this field, for Douglas fir as well as the entire genus Pinus, can be explained using the very large size of their genomes, as well as by the presence of numerous highly repeated sequences. Proteomics, therefore, provides a powerful way to access genomic information of otherwise challenging species. Here, we present the first Douglas fir proteomes acquired using nLC-MS/MS from 12 different plant organs or tissues. We identified 3975 different proteins and quantified 3462 of them, then examined the distribution of specific proteins across plant organs/tissues and their implications in various molecular processes. As the first large proteomic study of a resinous tree species with organ-specific profiling, this short note provides an important foundation for future genomic annotations of conifers and other trees.


Asunto(s)
Pseudotsuga , Tracheophyta , Proteoma/genética , Pseudotsuga/genética , Proteómica , Espectrometría de Masas en Tándem , Cambio Climático
3.
BMC Genomics ; 21(1): 416, 2020 Jun 22.
Artículo en Inglés | MEDLINE | ID: mdl-32571208

RESUMEN

BACKGROUND: Recent literature on the differential role of genes within networks distinguishes core from peripheral genes. If previous works have shown contrasting features between them, whether such categorization matters for phenotype prediction remains to be studied. RESULTS: We measured 17 phenotypic traits for 241 cloned genotypes from a Populus nigra collection, covering growth, phenology, chemical and physical properties. We also sequenced RNA for each genotype and built co-expression networks to define core and peripheral genes. We found that cores were more differentiated between populations than peripherals while being less variable, suggesting that they have been constrained through potentially divergent selection. We also showed that while cores were overrepresented in a subset of genes statistically selected for their capacity to predict the phenotypes (by Boruta algorithm), they did not systematically predict better than peripherals or even random genes. CONCLUSION: Our work is the first attempt to assess the importance of co-expression network connectivity in phenotype prediction. While highly connected core genes appear to be important, they do not bear enough information to systematically predict better quantitative traits than other gene sets.


Asunto(s)
Biología Computacional/métodos , Perfilación de la Expresión Génica/métodos , Redes Reguladoras de Genes , Populus/crecimiento & desarrollo , Regulación del Desarrollo de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Genotipo , Aprendizaje Automático , Fenotipo , Proteínas de Plantas/genética , Populus/genética , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ARN
4.
BMC Genomics ; 20(1): 302, 2019 Apr 18.
Artículo en Inglés | MEDLINE | ID: mdl-30999856

RESUMEN

BACKGROUND: Genomic selection accuracy increases with the use of high SNP (single nucleotide polymorphism) coverage. However, such gains in coverage come at high costs, preventing their prompt operational implementation by breeders. Low density panels imputed to higher densities offer a cheaper alternative during the first stages of genomic resources development. Our study is the first to explore the imputation in a tree species: black poplar. About 1000 pure-breed Populus nigra trees from a breeding population were selected and genotyped with a 12K custom Infinium Bead-Chip. Forty-three of those individuals corresponding to nodal trees in the pedigree were fully sequenced (reference), while the remaining majority (target) was imputed from 8K to 1.4 million SNPs using FImpute. Each SNP and individual was evaluated for imputation errors by leave-one-out cross validation in the training sample of 43 sequenced trees. Some summary statistics such as Hardy-Weinberg Equilibrium exact test p-value, quality of sequencing, depth of sequencing per site and per individual, minor allele frequency, marker density ratio or SNP information redundancy were calculated. Principal component and Boruta analyses were used on all these parameters to rank the factors affecting the quality of imputation. Additionally, we characterize the impact of the relatedness between reference population and target population. RESULTS: During the imputation process, we used 7540 SNPs from the chip to impute 1,438,827 SNPs from sequences. At the individual level, imputation accuracy was high with a proportion of SNPs correctly imputed between 0.84 and 0.99. The variation in accuracies was mostly due to differences in relatedness between individuals. At a SNP level, the imputation quality depended on genotyped SNP density and on the original minor allele frequency. The imputation did not appear to result in an increase of linkage disequilibrium. The genotype densification not only brought a better distribution of markers all along the genome, but also we did not detect any substantial bias in annotation categories. CONCLUSIONS: This study shows that it is possible to impute low-density marker panels to whole genome sequence with good accuracy under certain conditions that could be common to many breeding populations.


Asunto(s)
Cruzamiento , Polimorfismo de Nucleótido Simple , Populus/genética , Análisis de Secuencia , Desequilibrio de Ligamiento , Anotación de Secuencia Molecular
5.
BMC Genomics ; 19(1): 909, 2018 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-30541448

RESUMEN

BACKGROUD: Populus nigra is a major tree species of ecological and economic importance for which several initiatives have been set up to create genomic resources. In order to access the large number of Single Nucleotide Polymorphisms (SNPs) typically needed to carry out a genome scan, the present study aimed at evaluating RNA sequencing as a tool to discover and type SNPs in genes within natural populations of P. nigra. RESULTS: We have devised a bioinformatics pipeline to call and type SNPs from RNAseq reads and applied it to P. nigra transcriptomic data. The accuracy of the resulting RNAseq-based SNP calling and typing has been evaluated by (i) comparing their position and alleles to those previously reported in candidate genes, (ii) assessing their genotyping accuracy with respect to a previously available SNP chip and (iii) evaluating their inter-annual repeatability. We found that a combination of several callers yields a good compromise between the number of variants type and the accuracy of genotyping. We further used the resulting genotypic data to carry out basic genetic analyses whose results confirm the quality of the RNAseq-based SNP dataset. CONCLUSIONS: We demonstrated the potential and accuracy of RNAseq as an efficient way to genotype SNPs in P. nigra. These results open prospects towards the use of this technology for quantitative and population genomics studies.


Asunto(s)
Genes de Plantas , Polimorfismo de Nucleótido Simple , Populus/genética , Regiones no Traducidas 3' , Regiones no Traducidas 5' , Mapeo Cromosómico , Análisis por Conglomerados , Exones , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , ARN de Planta/química , ARN de Planta/aislamiento & purificación , ARN de Planta/metabolismo , Análisis de Secuencia de ARN
6.
Brief Bioinform ; 19(1): 65-76, 2018 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-27742662

RESUMEN

Numerous statistical pipelines are now available for the differential analysis of gene expression measured with RNA-sequencing technology. Most of them are based on similar statistical frameworks after normalization, differing primarily in the choice of data distribution, mean and variance estimation strategy and data filtering. We propose an evaluation of the impact of these choices when few biological replicates are available through the use of synthetic data sets. This framework is based on real data sets and allows the exploration of various scenarios differing in the proportion of non-differentially expressed genes. Hence, it provides an evaluation of the key ingredients of the differential analysis, free of the biases associated with the simulation of data using parametric models. Our results show the relevance of a proper modeling of the mean by using linear or generalized linear modeling. Once the mean is properly modeled, the impact of the other parameters on the performance of the test is much less important. Finally, we propose to use the simple visualization of the raw P-value histogram as a practical evaluation criterion of the performance of differential analysis methods on real data sets.


Asunto(s)
Proteínas de Arabidopsis/genética , Perfilación de la Expresión Génica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , ARN/genética , Análisis de Secuencia de ARN/métodos , Transcriptoma , Arabidopsis/genética , Simulación por Computador , Conjuntos de Datos como Asunto , Humanos , Modelos Estadísticos , Programas Informáticos
7.
Genome Biol ; 9(12): R175, 2008.
Artículo en Inglés | MEDLINE | ID: mdl-19087247

RESUMEN

Next generation technologies enable massive-scale cDNA sequencing (so-called RNA-Seq). Mainly because of the difficulty of aligning short reads on exon-exon junctions, no attempts have been made so far to use RNA-Seq for building gene models de novo, that is, in the absence of a set of known genes and/or splicing events. We present G-Mo.R-Se (Gene Modelling using RNA-Seq), an approach aimed at building gene models directly from RNA-Seq and demonstrate its utility on the grapevine genome.


Asunto(s)
Genoma de Planta , Modelos Genéticos , Análisis de Secuencia de ARN/métodos , Vitis/genética , ADN Complementario/genética , ARN de Planta/genética , Análisis de Secuencia de ARN/economía
8.
BMC Genomics ; 9: 603, 2008 Dec 16.
Artículo en Inglés | MEDLINE | ID: mdl-19087275

RESUMEN

BACKGROUND: Massively parallel DNA sequencing instruments are enabling the decoding of whole genomes at significantly lower cost and higher throughput than classical Sanger technology. Each of these technologies have been estimated to yield assemblies with more problematic features than the standard method. These problems are of a different nature depending on the techniques used. So, an appropriate mix of technologies may help resolve most difficulties, and eventually provide assemblies of high quality without requiring any Sanger-based input. RESULTS: We compared assemblies obtained using Sanger data with those from different inputs from New Sequencing Technologies. The assemblies were systematically compared with a reference finished sequence. We found that the 454 GSFLX can efficiently produce high continuity when used at high coverage. The potential to enhance continuity by scaffolding was tested using 454 sequences from circularized genomic fragments. Finally, we explore the use of Solexa-Illumina short reads to polish the genome draft by implementing a technique to correct 454 consensus errors. CONCLUSION: High quality drafts can be produced for small genomes without any Sanger data input. We found that 454 GSFLX and Solexa/Illumina show great complementarity in producing large contigs and supercontigs with a low error rate.


Asunto(s)
Genoma Bacteriano , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Biología Computacional/métodos , Mapeo Contig , Biblioteca de Genes , Análisis de Secuencia de ADN/instrumentación
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA