Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Comput Struct Biotechnol J ; 21: 5676-5685, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38058296

RESUMEN

Long non-coding ribonucleic acids (lncRNAs) have been shown to play an important role in plant gene regulation, involving both epigenetic and transcript regulation. LncRNAs are transcripts longer than 200 nucleotides that are not translated into functional proteins but can be translated into small peptides. Machine learning models have predominantly used transcriptome data with manually defined features to detect lncRNAs, however, they often underrepresent the abundance of lncRNAs and can be biased in their detection. Here we present a study using Natural Language Processing (NLP) models to identify plant lncRNAs from genomic sequences rather than transcriptomic data. The NLP models were trained to predict lncRNAs for seven model and crop species (Zea mays, Arabidopsis thaliana, Brassica napus, Brassica oleracea, Brassica rapa, Glycine max and Oryza sativa) using publicly available genomic references. We demonstrated that lncRNAs can be accurately predicted from genomic sequences with the highest accuracy of 83.4% for Z. mays and the lowest accuracy of 57.9% for B. rapa, revealing that genome assembly quality might affect the accuracy of lncRNA identification. Furthermore, we demonstrated the potential of using NLP models for cross-species prediction with an average of 63.1% accuracy using target species not previously seen by the model. As more species are incorporated into the training datasets, we expect the accuracy to increase, becoming a more reliable tool for uncovering novel lncRNAs. Finally, we show that the models can be interpreted using explainable artificial intelligence to identify motifs important to lncRNA prediction and that these motifs frequently flanked the lncRNA sequence.

2.
Plants (Basel) ; 11(20)2022 Oct 17.
Artículo en Inglés | MEDLINE | ID: mdl-36297764

RESUMEN

The global demand for oilseeds is increasing along with the human population. The family of Brassicaceae crops are no exception, typically harvested as a valuable source of oil, rich in beneficial molecules important for human health. The global capacity for improving Brassica yield has steadily risen over the last 50 years, with the major crop Brassica napus (rapeseed, canola) production increasing to ~72 Gt in 2020. In contrast, the production of Brassica mustard crops has fluctuated, rarely improving in farming efficiency. The drastic increase in global yield of B. napus is largely due to the demand for a stable source of cooking oil. Furthermore, with the adoption of highly efficient farming techniques, yield enhancement programs, breeding programs, the integration of high-throughput phenotyping technology and establishing the underlying genetics, B. napus yields have increased by >450 fold since 1978. Yield stability has been improved with new management strategies targeting diseases and pests, as well as by understanding the complex interaction of environment, phenotype and genotype. This review assesses the global yield and yield stability of agriculturally important oilseed Brassica species and discusses how contemporary farming and genetic techniques have driven improvements.

3.
Front Genet ; 13: 822173, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35664329

RESUMEN

Genomic prediction tools support crop breeding based on statistical methods, such as the genomic best linear unbiased prediction (GBLUP). However, these tools are not designed to capture non-linear relationships within multi-dimensional datasets, or deal with high dimension datasets such as imagery collected by unmanned aerial vehicles. Machine learning (ML) algorithms have the potential to surpass the prediction accuracy of current tools used for genotype to phenotype prediction, due to their capacity to autonomously extract data features and represent their relationships at multiple levels of abstraction. This review addresses the challenges of applying statistical and machine learning methods for predicting phenotypic traits based on genetic markers, environment data, and imagery for crop breeding. We present the advantages and disadvantages of explainable model structures, discuss the potential of machine learning models for genotype to phenotype prediction in crop breeding, and the challenges, including the scarcity of high-quality datasets, inconsistent metadata annotation and the requirements of ML models.

4.
Plants (Basel) ; 11(9)2022 Apr 27.
Artículo en Inglés | MEDLINE | ID: mdl-35567182

RESUMEN

Soybean (Glycine max) is a legume species of significant economic and nutritional value. The yield of soybean continues to increase with the breeding of improved varieties, and this is likely to continue with the application of advanced genetic and genomic approaches for breeding. Genome technologies continue to advance rapidly, with an increasing number of high-quality genome assemblies becoming available. With accumulating data from marker arrays and whole-genome resequencing, studying variations between individuals and populations is becoming increasingly accessible. Furthermore, the recent development of soybean pangenomes has highlighted the significant structural variation between individuals, together with knowledge of what has been selected for or lost during domestication and breeding, information that can be applied for the breeding of improved cultivars. Because of this, resources such as genome assemblies, SNP datasets, pangenomes and associated databases are becoming increasingly important for research underlying soybean crop improvement.

5.
Methods Mol Biol ; 2481: 153-159, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35641763

RESUMEN

Single-nucleotide polymorphisms (SNPs) have become the primary type of molecular genetic marker used in a diverse range of genetic and genomic studies. SNPs can be used to identify genomic regions linked to traits such as disease in genome-wide association studies, to understand population structure and diversity, or to understand mechanisms of genome evolution. One of the first steps of any SNP-based workflow, following SNP discovery, is quality control of SNP data. The protocol described here details how to perform quality control on SNP data to minimise errors in downstream analysis.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Exactitud de los Datos , Genoma , Genómica
6.
Int J Mol Sci ; 23(4)2022 Feb 18.
Artículo en Inglés | MEDLINE | ID: mdl-35216392

RESUMEN

Pangenomes aim to represent the complete repertoire of the genome diversity present within a species or cohort of species, capturing the genomic structural variance between individuals. This genomic information coupled with phenotypic data can be applied to identify genes and alleles involved with abiotic stress tolerance, disease resistance, and other desirable traits. The characterisation of novel structural variants from pangenomes can support genome editing approaches such as Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR associated protein Cas (CRISPR-Cas), providing functional information on gene sequences and new target sites in variant-specific genes with increased efficiency. This review discusses the application of pangenomes in genome editing and crop improvement, focusing on the potential of pangenomes to accurately identify target genes for CRISPR-Cas editing of plant genomes while avoiding adverse off-target effects. We consider the limitations of applying CRISPR-Cas editing with pangenome references and potential solutions to overcome these limitations.


Asunto(s)
Sistemas CRISPR-Cas/genética , Productos Agrícolas/genética , Genoma de Planta/genética , Edición Génica/métodos , Fenotipo , Fitomejoramiento/métodos , Plantas Modificadas Genéticamente/genética
7.
Plant Physiol ; 187(2): 699-715, 2021 10 05.
Artículo en Inglés | MEDLINE | ID: mdl-34608963

RESUMEN

High-throughput phenotyping (HTP) platforms are capable of monitoring the phenotypic variation of plants through multiple types of sensors, such as red green and blue (RGB) cameras, hyperspectral sensors, and computed tomography, which can be associated with environmental and genotypic data. Because of the wide range of information provided, HTP datasets represent a valuable asset to characterize crop phenotypes. As HTP becomes widely employed with more tools and data being released, it is important that researchers are aware of these resources and how they can be applied to accelerate crop improvement. Researchers may exploit these datasets either for phenotype comparison or employ them as a benchmark to assess tool performance and to support the development of tools that are better at generalizing between different crops and environments. In this review, we describe the use of image-based HTP for yield prediction, root phenotyping, development of climate-resilient crops, detecting pathogen and pest infestation, and quantitative trait measurement. We emphasize the need for researchers to share phenotypic data, and offer a comprehensive list of available datasets to assist crop breeders and tool developers to leverage these resources in order to accelerate crop breeding.


Asunto(s)
Productos Agrícolas/genética , Genómica/métodos , Ensayos Analíticos de Alto Rendimiento/métodos , Difusión de la Información/métodos , Fenotipo , Fitomejoramiento/métodos
8.
Methods Mol Biol ; 2222: 149-166, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33301093

RESUMEN

Molecular markers provide researchers with a powerful tool for variation analysis between plant genomes. They are heritable and widely distributed across the genome and for this reason have many applications in plant taxonomy and genotyping. Over the last decade, molecular marker technology has developed rapidly and is now a crucial component for genetic linkage analysis, trait mapping, diversity analysis, and association studies. This chapter focuses on molecular marker discovery, its application, and future perspectives for plant genotyping through pangenome assemblies. Included are descriptions of automated methods for genome and sequence distance estimation, genome contaminant analysis in sequence reads, genome structural variation, and SNP discovery methods.


Asunto(s)
Código de Barras del ADN Taxonómico , Técnicas de Genotipaje , Ensayos Analíticos de Alto Rendimiento , Plantas/clasificación , Plantas/genética , Biología Computacional/métodos , Código de Barras del ADN Taxonómico/métodos , Código de Barras del ADN Taxonómico/normas , Contaminación de ADN , Evolución Molecular , Marcadores Genéticos , Genoma de Planta , Genómica/métodos , Genotipo , Ensayos Analíticos de Alto Rendimiento/normas , Filogenia , Polimorfismo de Nucleótido Simple
9.
Noncoding RNA ; 4(4)2018 Oct 08.
Artículo en Inglés | MEDLINE | ID: mdl-30297664

RESUMEN

Long non-coding RNAs (lncRNAs) are involved in multiple regulatory pathways and its versatile form of action has disclosed a new layer in gene regulation. LncRNAs have their expression levels modulated during plant development, and in response to stresses with tissue-specific functions. In this study, we analyzed lncRNA from leaf samples collected from the legume Copaifera langsdorffii Desf. (copaíba) present in two divergent ecosystems: Cerrado (CER; Ecological Station of Botanical Garden in Brasília, Brazil) and Atlantic Rain Forest (ARF; Rio de Janeiro, Brazil). We identified 8020 novel lncRNAs, and they were compared to seven Fabaceae genomes and transcriptomes, to which 1747 and 2194 copaíba lncRNAs were mapped, respectively, to at least one species. The secondary structures of the lncRNAs that were conserved and differentially expressed between the populations were predicted using in silico methods. A few selected lncRNA were confirmed by RT-qPCR in the samples from both biomes; Additionally, the analysis of the lncRNA sequences predicted that some might act as microRNA (miRNA) targets or decoys. The emerging studies involving lncRNAs function and conservation have shown their involvement in several types of biotic and abiotic stresses. Thus, the conservation of lncRNAs among Fabaceae species considering their rapid turnover, suggests they are likely to have been under functional conservation pressure. Our results indicate the potential involvement of lncRNAs in the adaptation of C. langsdorffii in two different biomes.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...