Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 60
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
J Hum Genet ; 69(10): 541-547, 2024 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-39014190

RESUMEN

Genome-wide association studies have enabled the identification of important genetic factors in many trait studies. However, only a fraction of the heritability can be explained by known genetic factors, even in the most common diseases. Genetic loci combinations, or epistatic contributions expressed by combinations of single nucleotide polymorphisms (SNPs), have been argued to be one of the critical factors explaining some of the missing heritability, especially in oligogenic/polygenic diseases. Rheumatoid arthritis (RA) is a complex disease with more than 100 reported SNP associations, as well as various HLA haplotypes and amino acids; however, many associations between RA and inter-chromosomal SNP combinations are unknown. To discover novel associations of epistatic interactions with high odds ratios in RA, we applied the LAMPLINK method, a systematic enumerative procedure for identifying high-order SNP combinations, to a Japanese RA cohort (discovery cohort; 4024 patients with RA and 7731 controls). We validated the identified associations in a different Japanese cohort (validation cohort; 810 RA patients and 6303 controls). In this study, we identified 90 significant genetic associations in the discovery cohort. Among these, 74 (82.2%) associations were replicated in the validation cohort, and eight combinations were inter-chromosomal, all of which comprised rs7765379 or rs35265698 located in the HLA region. These two SNPs exhibited strong correlations with valine at amino acid position 11 in HLA-DRB1 (HLA-DRB1-11-Val). Finally, we discovered that rs9624 showed an association with RA through an epistatic interaction with HLA-DRB1-11-Val. Overall, LAMPLINK showed high reliability for identifying epistatic genetic contributions hidden in complex traits.


Asunto(s)
Artritis Reumatoide , Epistasis Genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Femenino , Humanos , Masculino , Persona de Mediana Edad , Artritis Reumatoide/genética , Estudios de Cohortes , Pueblos del Este de Asia/genética , Haplotipos , Japón/epidemiología
2.
Genes Cells ; 26(8): 555-569, 2021 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-33993586

RESUMEN

Ribonucleotides incorporated in the genome are a source of endogenous DNA damage and also serve as signals for repair. Although recent advances of ribonucleotide detection by sequencing, the balance between incorporation and repair of ribonucleotides has not been elucidated. Here, we describe a competitive sequencing method, Ribonucleotide Scanning Quantification sequencing (RiSQ-seq), which enables absolute quantification of misincorporated ribonucleotides throughout the genome by background normalization and standard adjustment within a single sample. RiSQ-seq analysis of cells harboring wild-type DNA polymerases revealed that ribonucleotides were incorporated nonuniformly in the genome with a 3'-shifted distribution and preference for GC sequences. Although ribonucleotide profiles in wild-type and repair-deficient mutant strains showed a similar pattern, direct comparison of distinct ribonucleotide levels in the strains by RiSQ-seq enabled evaluation of ribonucleotide excision repair activity at base resolution and revealed the strand bias of repair. The distinct preferences of ribonucleotide incorporation and repair create vulnerable regions associated with indel hotspots, suggesting that repair at sites of ribonucleotide misincorporation serves to maintain genome integrity and that RiSQ-seq can provide an estimate of indel risk.


Asunto(s)
Reparación del ADN , Ribonucleótidos/genética , ADN/química , ADN/genética , Genoma Fúngico , Tasa de Mutación , Ribonucleótidos/análisis , Saccharomyces cerevisiae
3.
Brief Bioinform ; 21(2): 395-407, 2020 03 23.
Artículo en Inglés | MEDLINE | ID: mdl-30590436

RESUMEN

Genome duplication with hybridization, or allopolyploidization, occurs in animals, fungi and plants, and is especially common in crop plants. There is an increasing interest in the study of allopolyploids because of advances in polyploid genome assembly; however, the high level of sequence similarity in duplicated gene copies (homeologs) poses many challenges. Here we compared standard RNA-seq expression quantification approaches used currently for diploid species against subgenome-classification approaches which maps reads to each subgenome separately. We examined mapping error using our previous and new RNA-seq data in which a subgenome is experimentally added (synthetic allotetraploid Arabidopsis kamchatica) or reduced (allohexaploid wheat Triticum aestivum versus extracted allotetraploid) as ground truth. The error rates in the two species were very similar. The standard approaches showed higher error rates (>10% using pseudo-alignment with Kallisto) while subgenome-classification approaches showed much lower error rates (<1% using EAGLE-RC, <2% using HomeoRoq). Although downstream analysis may partly mitigate mapping errors, the difference in methods was substantial in hexaploid wheat, where Kallisto appeared to have systematic differences relative to other methods. Only approximately half of the differentially expressed homeologs detected using Kallisto overlapped with those by any other method in wheat. In general, disagreement in low-expression genes was responsible for most of the discordance between methods, which is consistent with known biases in Kallisto. We also observed that there exist uncertainties in genome sequences and annotation which can affect each method differently. Overall, subgenome-classification approaches tend to perform better than standard approaches with EAGLE-RC having the highest precision.


Asunto(s)
Poliploidía , Triticum/genética , Cromosomas de las Plantas , Regulación de la Expresión Génica de las Plantas , Análisis de Secuencia de ARN/métodos
4.
PLoS Biol ; 17(11): e3000476, 2019 11.
Artículo en Inglés | MEDLINE | ID: mdl-31721761

RESUMEN

Learning of most motor skills is constrained in a species-specific manner. However, the proximate mechanisms underlying species-specific learned behaviors remain poorly understood. Songbirds acquire species-specific songs through learning, which is hypothesized to depend on species-specific patterns of gene expression in functionally specialized brain regions for vocal learning and production, called song nuclei. Here, we leveraged two closely related songbird species, zebra finch, owl finch, and their interspecific first-generation (F1) hybrids, to relate transcriptional regulatory divergence between species with the production of species-specific songs. We quantified genome-wide gene expression in both species and compared this with allele-specific expression in F1 hybrids to identify genes whose expression in song nuclei is regulated by species divergence in either cis- or trans-regulation. We found that divergence in transcriptional regulation altered the expression of approximately 10% of total transcribed genes and was linked to differential gene expression between the two species. Furthermore, trans-regulatory changes were more prevalent than cis-regulatory and were associated with synaptic formation and transmission in song nucleus RA, the avian analog of the mammalian laryngeal motor cortex. We identified brain-derived neurotrophic factor (BDNF) as an upstream mediator of trans-regulated genes in RA, with a significant correlation between individual variation in BDNF expression level and species-specific song phenotypes in F1 hybrids. This was supported by the fact that the pharmacological overactivation of BDNF receptors altered the expression of its trans-regulated genes in the RA, thus disrupting the learned song structures of adult zebra finch songs at the acoustic and sequence levels. These results demonstrate functional neurogenetic associations between divergence in region-specific transcriptional regulation and species-specific learned behaviors.


Asunto(s)
Pinzones/genética , Factores de Transcripción/fisiología , Vocalización Animal/fisiología , Animales , Encéfalo/metabolismo , Factor Neurotrófico Derivado del Encéfalo/genética , Factor Neurotrófico Derivado del Encéfalo/metabolismo , Pinzones/fisiología , Regulación de la Expresión Génica/genética , Variación Genética/genética , Aprendizaje/fisiología , Neuronas/metabolismo , Pájaros Cantores/genética , Pájaros Cantores/fisiología , Especificidad de la Especie , Factores de Transcripción/genética , Transcriptoma
5.
Surg Today ; 52(12): 1753-1758, 2022 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-35511359

RESUMEN

PURPOSE: We are attempting to develop a navigation system for safe and effective peripancreatic lymphadenectomy in gastric cancer surgery. As a preliminary study, we examined whether or not the peripancreatic dissection line could be learned by a machine learning model (MLM). METHODS: Among the 41 patients with gastric cancer who underwent radical gastrectomy between April 2019 and January 2020, we selected 6 in whom the pancreatic contour was relatively easy to trace. The pancreatic contour was annotated by a trainer surgeon in 1242 images captured from the video recordings. The MLM was trained using the annotated images from five of the six patients. The pancreatic contour was then segmented by the trained MLM using images from the remaining patient. The same procedure was repeated for all six combinations. RESULTS: The median maximum intersection over union of each image was 0.708, which was higher than the threshold (0.5). However, the pancreatic contour was misidentified in parts where fatty tissue or thin vessels overlaid the pancreas in some cases. CONCLUSION: The contour of the pancreas could be traced relatively well using the trained MLM. Further investigations and training of the system are needed to develop a practical navigation system.


Asunto(s)
Laparoscopía , Neoplasias Gástricas , Humanos , Neoplasias Gástricas/cirugía , Inteligencia Artificial , Laparoscopía/métodos , Gastrectomía/métodos , Escisión del Ganglio Linfático/métodos
6.
BMC Genomics ; 22(1): 547, 2021 Jul 17.
Artículo en Inglés | MEDLINE | ID: mdl-34273949

RESUMEN

BACKGROUND: Whole genome duplication (WGD) events are common in the evolutionary history of many living organisms. For decades, researchers have been trying to understand the genetic and epigenetic impact of WGD and its underlying molecular mechanisms. Particular attention was given to allopolyploid study systems, species resulting from an hybridization event accompanied by WGD. Investigating the mechanisms behind the survival of a newly formed allopolyploid highlighted the key role of DNA methylation. With the improvement of high-throughput methods, such as whole genome bisulfite sequencing (WGBS), an opportunity opened to further understand the role of DNA methylation at a larger scale and higher resolution. However, only a few studies have applied WGBS to allopolyploids, which might be due to lack of genomic resources combined with a burdensome data analysis process. To overcome these problems, we developed the Automated Reproducible Polyploid EpiGenetic GuIdance workflOw (ARPEGGIO): the first workflow for the analysis of epigenetic data in polyploids. This workflow analyzes WGBS data from allopolyploid species via the genome assemblies of the allopolyploid's parent species. ARPEGGIO utilizes an updated read classification algorithm (EAGLE-RC), to tackle the challenge of sequence similarity amongst parental genomes. ARPEGGIO offers automation, but more importantly, a complete set of analyses including spot checks starting from raw WGBS data: quality checks, trimming, alignment, methylation extraction, statistical analyses and downstream analyses. A full run of ARPEGGIO outputs a list of genes showing differential methylation. ARPEGGIO was made simple to set up, run and interpret, and its implementation ensures reproducibility by including both package management and containerization. RESULTS: We evaluated ARPEGGIO in two ways. First, we tested EAGLE-RC's performance with publicly available datasets given a ground truth, and we show that EAGLE-RC decreases the error rate by 3 to 4 times compared to standard approaches. Second, using the same initial dataset, we show agreement between ARPEGGIO's output and published results. Compared to other similar workflows, ARPEGGIO is the only one supporting polyploid data. CONCLUSIONS: The goal of ARPEGGIO is to promote, support and improve polyploid research with a reproducible and automated set of analyses in a convenient implementation. ARPEGGIO is available at https://github.com/supermaxiste/ARPEGGIO .


Asunto(s)
Metilación de ADN , Programas Informáticos , Epigénesis Genética , Humanos , Poliploidía , Reproducibilidad de los Resultados , Flujo de Trabajo
7.
Plant Cell Physiol ; 62(1): 8-27, 2021 Mar 25.
Artículo en Inglés | MEDLINE | ID: mdl-33244607

RESUMEN

Bread wheat is a major crop that has long been the focus of basic and breeding research. Assembly of its genome has been difficult because of its large size and allohexaploid nature (AABBDD genome). Following the first reported assembly of the genome of the experimental strain Chinese Spring (CS), the 10+ Wheat Genomes Project was launched to produce multiple assemblies of worldwide modern cultivars. The only Asian cultivar in the project is Norin 61, a representative Japanese cultivar adapted to grow across a broad latitudinal range, mostly characterized by a wet climate and a short growing season. Here, we characterize the key aspects of its chromosome-scale genome assembly spanning 15 Gb with a raw scaffold N50 of 22 Mb. Analysis of the repetitive elements identified chromosomal regions unique to Norin 61 that encompass a tandem array of the pathogenesis-related 13 family. We report novel copy-number variations in the B homeolog of the florigen gene FT1/VRN3, pseudogenization of its D homeolog and the association of its A homeologous alleles with the spring/winter growth habit. Furthermore, the Norin 61 genome carries typical East Asian functional variants different from CS, ranging from a single nucleotide to multi-Mb scale. Examples of such variation are the Fhb1 locus, which confers Fusarium head-blight resistance, Ppd-D1a, which confers early flowering, Glu-D1f for Asian noodle quality and Rht-D1b, which introduced semi-dwarfism during the green revolution. The adoption of Norin 61 as a reference assembly for functional and evolutionary studies will enable comprehensive characterization of the underexploited Asian bread wheat diversity.


Asunto(s)
Resistencia a la Enfermedad/genética , Flores/crecimiento & desarrollo , Genes de Plantas/genética , Genoma de Planta/genética , Triticum/genética , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Citogenética , Asia Oriental , Flores/genética , Fusarium , Genes de Plantas/fisiología , Estudios de Asociación Genética , Variación Genética/genética , Variación Genética/fisiología , Genoma de Planta/fisiología , Genotipo , Filogenia , Alineación de Secuencia , Análisis de Secuencia de ADN , Triticum/crecimiento & desarrollo , Triticum/inmunología , Triticum/fisiología
8.
Genes Cells ; 25(2): 124-138, 2020 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-31917895

RESUMEN

Translesion synthesis (TLS) polymerases mediate DNA damage bypass during replication. The TLS polymerase Rev1 has two important functions in the TLS pathway, including dCMP transferase activity and acting as a scaffolding protein for other TLS polymerases at the C-terminus. Because of the former activity, Rev1 bypasses apurinic/apyrimidinic sites by incorporating dCMP, whereas the latter activity mediates assembly of multipolymerase complexes at the DNA lesions. We generated rev1 mutants lacking each of these two activities in Oryzias latipes (medaka) fish and analyzed cytotoxicity and mutagenicity in response to the alkylating agent diethylnitrosamine (DENA). Mutant lacking the C-terminus was highly sensitive to DENA cytotoxicity, whereas mutant with reduced dCMP transferase activity was slightly sensitive to DENA cytotoxicity, but exhibited a higher tumorigenic rate than wild-type fish. There was no significant difference in the frequency of DENA-induced mutations between mutant with reduced dCMP transferase activity and wild-type cultured cell. However, loss of heterozygosity (LOH) occurred frequently in cells with reduced dCMP transferase activity. LOH is a common genetic event in many cancer types and plays an important role on carcinogenesis. To our knowledge, this is the first report to identify the involvement of the catalytic activity of Rev1 in suppression of LOH.


Asunto(s)
Pérdida de Heterocigocidad , Nucleotidiltransferasas/genética , Nucleotidiltransferasas/metabolismo , Oryzias/genética , Animales , Animales Modificados Genéticamente , Carcinogénesis , Línea Celular , Daño del ADN , Reparación del ADN , Replicación del ADN , ADN Polimerasa Dirigida por ADN , Femenino , Regulación de la Expresión Génica , Hígado/patología , Masculino , Mutagénesis , Mutación , Proteínas Recombinantes , Transcriptoma
9.
New Phytol ; 229(6): 3587-3601, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33222195

RESUMEN

Polyploidization is pervasive in plants, but little is known about the niche divergence of wild allopolyploids (species that harbor polyploid genomes originating from different diploid species) relative to their diploid progenitor species and the gene expression patterns that may underlie such ecological divergence. We conducted a fine-scale empirical study on habitat and gene expression of an allopolyploid and its diploid progenitors. We quantified soil properties and light availability of habitats of an allotetraploid Cardamine flexuosa and its diploid progenitors Cardamine amara and Cardamine hirsuta in two seasons. We analyzed expression patterns of genes and homeologs (homeologous gene copies in allopolyploids) using RNA sequencing. We detected niche divergence between the allopolyploid and its diploid progenitors along water availability gradient at a fine scale: the diploids in opposite extremes and the allopolyploid in a broader range between diploids, with limited overlap with diploids at both ends. Most of the genes whose homeolog expression ratio changed among habitats in C. flexuosa varied spatially and temporally. These findings provide empirical evidence for niche divergence between an allopolyploid and its diploid progenitor species at a fine scale and suggest that divergent expression patterns of homeologs in an allopolyploid may underlie its persistence in diverse habitats.


Asunto(s)
Cardamine , Diploidia , Ecosistema , Poliploidía
10.
Bioinformatics ; 35(2): 309-318, 2019 01 15.
Artículo en Inglés | MEDLINE | ID: mdl-29982330

RESUMEN

Motivation: In bioinformatics, machine learning-based methods that predict the compound-protein interactions (CPIs) play an important role in the virtual screening for drug discovery. Recently, end-to-end representation learning for discrete symbolic data (e.g. words in natural language processing) using deep neural networks has demonstrated excellent performance on various difficult problems. For the CPI problem, data are provided as discrete symbolic data, i.e. compounds are represented as graphs where the vertices are atoms, the edges are chemical bonds, and proteins are sequences in which the characters are amino acids. In this study, we investigate the use of end-to-end representation learning for compounds and proteins, integrate the representations, and develop a new CPI prediction approach by combining a graph neural network (GNN) for compounds and a convolutional neural network (CNN) for proteins. Results: Our experiments using three CPI datasets demonstrated that the proposed end-to-end approach achieves competitive or higher performance as compared to various existing CPI prediction methods. In addition, the proposed approach significantly outperformed existing methods on an unbalanced dataset. This suggests that data-driven representations of compounds and proteins obtained by end-to-end GNNs and CNNs are more robust than traditional chemical and biological features obtained from databases. Although analyzing deep learning models is difficult due to their black-box nature, we address this issue using a neural attention mechanism, which allows us to consider which subsequences in a protein are more important for a drug compound when predicting its interaction. The neural attention mechanism also provides effective visualization, which makes it easier to analyze a model even when modeling is performed using real-valued representations instead of discrete features. Availability and implementation: https://github.com/masashitsubaki. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Aprendizaje Profundo , Redes Neurales de la Computación , Mapeo de Interacción de Proteínas , Proteínas/química , Descubrimiento de Drogas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA