Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Nat Genet ; 56(4): 721-731, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38622339

RESUMO

Coffea arabica, an allotetraploid hybrid of Coffea eugenioides and Coffea canephora, is the source of approximately 60% of coffee products worldwide, and its cultivated accessions have undergone several population bottlenecks. We present chromosome-level assemblies of a di-haploid C. arabica accession and modern representatives of its diploid progenitors, C. eugenioides and C. canephora. The three species exhibit largely conserved genome structures between diploid parents and descendant subgenomes, with no obvious global subgenome dominance. We find evidence for a founding polyploidy event 350,000-610,000 years ago, followed by several pre-domestication bottlenecks, resulting in narrow genetic variation. A split between wild accessions and cultivar progenitors occurred ~30.5 thousand years ago, followed by a period of migration between the two populations. Analysis of modern varieties, including lines historically introgressed with C. canephora, highlights their breeding histories and loci that may contribute to pathogen resistance, laying the groundwork for future genomics-based breeding of C. arabica.


Assuntos
Coffea , Coffea/genética , Café , Genoma de Planta/genética , Metagenômica , Melhoramento Vegetal
2.
Brief Funct Genomics ; 22(5): 428-441, 2023 11 10.
Artigo em Inglês | MEDLINE | ID: mdl-37119295

RESUMO

Artificial intelligence is revolutionizing all fields that affect people's lives and health. One of the most critical applications is in the study of tumors. It is the case of glioblastoma (GBM) that has behaviors that need to be understood to develop effective therapies. Due to advances in single-cell RNA sequencing (scRNA-seq), it is possible to understand the cellular and molecular heterogeneity in the GBM. Given that there are different cell groups in these tumors, there is a need to apply Machine Learning (ML) algorithms. It will allow extracting information to understand how cancer changes and broaden the search for effective treatments. We proposed multiple comparisons of ML algorithms to classify cell groups based on the GBM scRNA-seq data. This broad comparison spectrum can show the scientific-medical community which models can achieve the best performance in this task. In this work are classified the following cell groups: Tumor Core (TC), Tumor Periphery (TP) and Normal Periphery (NP), in binary and multi-class scenarios. This work presents the biomarker candidates found for the models with the best results. The analyses presented here allow us to verify the biomarker candidates to understand the genetic characteristics of GBM, which may be affected by a suitable identification of GBM heterogeneity. This work obtained for the four scenarios covered cross-validation results of $93.03\% \pm 5.37\%$, $97.42\% \pm 3.94\%$, $98.27\% \pm 1.81\%$ and $93.04\% \pm 6.88\%$ for the classification of TP versus TC, TP versus NP, NP versus TP and TC (TPC) and NP versus TP versus TC, respectively.


Assuntos
Glioblastoma , Humanos , Glioblastoma/genética , Glioblastoma/patologia , Inteligência Artificial , Biomarcadores , Aprendizado de Máquina , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos
3.
Arch Virol ; 168(4): 125, 2023 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-36988711

RESUMO

Human endogenous retroviruses (HERVs) are LTR retrotransposons that are present in the human genome. Among them, members of the HERV-K (HML-2) group are suspected to play a role in the development of different types of cancer, including lung, ovarian, and prostate cancer, as well as leukemia. Acute myeloid leukemia (AML) is an important disease that causes 1% of cancer deaths in the United States and has a survival rate of 28.7%. Here, we describe a method for assessing the statistical association between HERV-K (HML-2) transposable element insertion polymorphisms (or TIPs) and AML, using whole-genome sequencing and read mapping using TIP_finder software. Our results suggest that 101 polymorphisms involving HERV-K (HML-2) elements were correlated with AML, with a percentage between 44.4 to 56.6%, most of which (70) were located in the region from 8q24.13 to 8q24.21. Moreover, it was found that the TRIB1, LRATD2, POU5F1B, MYC, PCAT1, PVT1, and CCDC26 genes could be displaced or fragmented by TIPs. Furthermore, a general method was devised to facilitate analysis of the correlation between transposable element insertions and specific diseases. Finally, although the relationship between HERV-K (HML-2) TIPs and AML remains unclear, the data reported in this study indicate a statistical correlation, as supported by the χ2 test with p-values < 0.05.


Assuntos
Retrovirus Endógenos , Leucemia Mieloide Aguda , Masculino , Humanos , Retrovirus Endógenos/genética , Elementos de DNA Transponíveis , Polimorfismo Genético , Genoma Humano , Leucemia Mieloide Aguda/genética , Proteínas Serina-Treonina Quinases , Peptídeos e Proteínas de Sinalização Intracelular/genética
4.
Biology (Basel) ; 9(9)2020 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-32917036

RESUMO

Transposable elements (TEs) are non-static genomic units capable of moving indistinctly from one chromosomal location to another. Their insertion polymorphisms may cause beneficial mutations, such as the creation of new gene function, or deleterious in eukaryotes, e.g., different types of cancer in humans. A particular type of TE called LTR-retrotransposons comprises almost 8% of the human genome. Among LTR retrotransposons, human endogenous retroviruses (HERVs) bear structural and functional similarities to retroviruses. Several tools allow the detection of transposon insertion polymorphisms (TIPs) but fail to efficiently analyze large genomes or large datasets. Here, we developed a computational tool, named TIP_finder, able to detect mobile element insertions in very large genomes, through high-performance computing (HPC) and parallel programming, using the inference of discordant read pair analysis. TIP_finder inputs are (i) short pair reads such as those obtained by Illumina, (ii) a chromosome-level reference genome sequence, and (iii) a database of consensus TE sequences. The HPC strategy we propose adds scalability and provides a useful tool to analyze huge genomic datasets in a decent running time. TIP_finder accelerates the detection of transposon insertion polymorphisms (TIPs) by up to 55 times in breast cancer datasets and 46 times in cancer-free datasets compared to the fastest available algorithms. TIP_finder applies a validated strategy to find TIPs, accelerates the process through HPC, and addresses the issues of runtime for large-scale analyses in the post-genomic era. TIP_finder version 1.0 is available at https://github.com/simonorozcoarias/TIP_finder.

5.
PeerJ Comput Sci ; 6: e270, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33816921

RESUMO

Cancer classification is a topic of major interest in medicine since it allows accurate and efficient diagnosis and facilitates a successful outcome in medical treatments. Previous studies have classified human tumors using a large-scale RNA profiling and supervised Machine Learning (ML) algorithms to construct a molecular-based classification of carcinoma cells from breast, bladder, adenocarcinoma, colorectal, gastro esophagus, kidney, liver, lung, ovarian, pancreas, and prostate tumors. These datasets are collectively known as the 11_tumor database, although this database has been used in several works in the ML field, no comparative studies of different algorithms can be found in the literature. On the other hand, advances in both hardware and software technologies have fostered considerable improvements in the precision of solutions that use ML, such as Deep Learning (DL). In this study, we compare the most widely used algorithms in classical ML and DL to classify the tumors described in the 11_tumor database. We obtained tumor identification accuracies between 90.6% (Logistic Regression) and 94.43% (Convolutional Neural Networks) using k-fold cross-validation. Also, we show how a tuning process may or may not significantly improve algorithms' accuracies. Our results demonstrate an efficient and accurate classification method based on gene expression (microarray data) and ML/DL algorithms, which facilitates tumor type prediction in a multi-cancer-type scenario.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA