Results 1 - 20 of 138
2.
Genome Res ; 31(6): 968-980, 2021 06.
Article in English | MEDLINE | ID: mdl-34006570

ABSTRACT

Chromatin looping plays an important role in genome regulation. However, because ChIP-seq and loop-resolution Hi-C (DNA-DNA proximity ligation) are extremely challenging in mammalian early embryos, the developmental stage at which cohesin-mediated loops form remains unknown. Here, we study early development in medaka (the Japanese killifish, Oryzias latipes) at 12 time points before, during, and after gastrulation (the onset of cell differentiation) and characterize transcription, protein binding, and genome architecture. We find that gastrulation is associated with drastic changes in genome architecture, including the formation of the first loops between sites bound by the insulator protein CTCF and a large increase in the size of contact domains. In contrast, the binding of the CTCF is fixed throughout embryogenesis. Loops form long after genome-wide transcriptional activation, and long after domain formation seen in mouse embryos. These results suggest that, although loops may play a role in differentiation, they are not required for zygotic transcription. When we repeated our experiments in zebrafish, loops did not emerge until gastrulation, that is, well after zygotic genome activation. We observe that loop positions are highly conserved in synteny blocks of medaka and zebrafish, indicating that the 3D genome architecture has been maintained for >110-200 million years of evolution.


Subject(s)
Oryzias , Animals , CCCTC-Binding Factor/genetics , CCCTC-Binding Factor/metabolism , Cell Cycle Proteins/genetics , Chromatin/genetics , Gastrulation/genetics , Mice , Oryzias/genetics , Zebrafish/genetics
3.
Bioinformatics ; 39(4)2023 04 03.
Article in English | MEDLINE | ID: mdl-37039842

ABSTRACT

MOTIVATION: Over the past 30 years, extended tandem repeats (TRs) have been correlated with ∼60 diseases with high odds ratios, and most known TRs consist of single repeat units. However, in the last few years, mosaic TRs composed of different units have been found to be associated with several brain disorders by long-read sequencing techniques. Mosaic TRs are difficult-to-characterize sequence configurations that are usually confirmed by manual inspection. Widely used tools are not designed to solve the mosaic TR problem and often fail to properly decompose mosaic TRs. RESULTS: We propose an efficient algorithm that can decompose mosaic TRs in the input string with high sensitivity. Using synthetic benchmark data, we demonstrate that our program, named uTR, outperforms TRF and RepeatMasker in terms of prediction accuracy, especially when mosaic TRs are more complex, and that uTR is faster than TRF and RepeatMasker in most cases. AVAILABILITY AND IMPLEMENTATION: The software program uTR that implements the proposed algorithm is available at https://github.com/morisUtokyo/uTR.
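
To make the decomposition task described above concrete, the following is a minimal Python sketch of splitting a string into runs of repeat units. It is not the uTR algorithm: it assumes the candidate units are already known and that every copy is exact, whereas the abstract targets error-containing reads and unknown units. The function name `decompose_exact` and its inputs are hypothetical.

```python
def decompose_exact(seq, units):
    """Greedily split seq into (unit, copies) runs of exact unit copies.

    Illustrative assumption: `units` is a known list of non-empty repeat units.
    """
    # Try longer units first so the longest matching unit wins at each position.
    ordered = sorted((u for u in units if u), key=len, reverse=True)
    runs = []
    pos = 0
    while pos < len(seq):
        for unit in ordered:
            count = 0
            # Count consecutive exact copies of this unit starting at pos.
            while seq.startswith(unit, pos + count * len(unit)):
                count += 1
            if count > 0:
                runs.append((unit, count))
                pos += count * len(unit)
                break
        else:
            # No unit matches here: keep the base as an unexplained singleton.
            runs.append((seq[pos], 1))
            pos += 1
    return runs
```

For example, a string built from three copies of AAAG followed by four copies of AG would be reported as two runs. A practical tool would additionally need to tolerate sequencing errors and to infer the units themselves.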


Subject(s)
Software , Tandem Repeat Sequences , Sequence Analysis, DNA/methods , Algorithms , High-Throughput Nucleotide Sequencing
4.
Bioinformatics ; 39(7)2023 07 01.
Article in English | MEDLINE | ID: mdl-37354526

ABSTRACT

MOTIVATION: Diploid assembly, or determining sequences of homologous chromosomes separately, is essential to elucidate genetic differences between haplotypes. One approach is to call and phase single nucleotide variants (SNVs) on a reference sequence. However, this approach becomes unstable on large segmental duplications (SDs) or structural variations (SVs) because the alignments of reads deriving from these regions tend to be unreliable. Another approach is to use highly accurate PacBio HiFi reads to output diploid assembly directly. Nonetheless, HiFi reads cannot phase homozygous regions longer than their length and require Oxford Nanopore Technologies (ONT) reads or Hi-C to produce a fully phased assembly. Is a single long-read sequencing technology sufficient to create an accurate diploid assembly? RESULTS: Here, we present JTK, a megabase-scale diploid genome assembler. It first randomly samples kilobase-scale sequences (called 'chunks') from the long reads, phases variants found on them, and produces two haplotypes. The novel idea of JTK is to utilize chunks to capture SNVs and SVs simultaneously. From 60-fold ONT reads on HG002 and a Japanese sample, it fully assembled two haplotypes with approximately 99.9% accuracy on the major histocompatibility complex (MHC) and the leukocyte receptor complex (LRC) regions, which was impossible by the reference-based approach. In addition, in the LRC region on a Japanese sample, JTK output an assembly of better contiguity than those built from high-coverage HiFi+Hi-C. In the coming age of pan-genomics, JTK would complement the reference-based phasing method to assemble the difficult-to-assemble but medically important regions. AVAILABILITY AND IMPLEMENTATION: JTK is available at https://github.com/ban-m/jtk, and the datasets are available at https://doi.org/10.5281/zenodo.7790310 or JGAS000580 in DDBJ.


Subject(s)
Diploidy , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA , Genome , Genomics , Haplotypes
5.
BMC Genomics ; 24(1): 472, 2023 Aug 21.
Article in English | MEDLINE | ID: mdl-37605229

ABSTRACT

BACKGROUND: The nutritional status during early life can have enduring effects on an animal's metabolism, although the mechanisms underlying these long-term effects are still unclear. Epigenetic modifications are considered a prime candidate mechanism for encoding early-life nutritional memories during this critical developmental period. However, the extent to which these epigenetic changes occur and persist over time remains uncertain, in part due to challenges associated with directly stimulating the fetus with specific nutrients in viviparous mammalian systems. RESULTS: In this study, we used medaka as an oviparous vertebrate model to establish an early-life high-fat diet (HFD) model. Larvae were fed with HFD from the hatching stages (one week after fertilization) for six weeks, followed by normal chow (NC) for eight weeks until the adult stage. We examined the changes in the transcriptomic and epigenetic state of the liver over this period. We found that HFD induces simple liver steatosis, accompanied by drastic changes in the hepatic transcriptome, chromatin accessibility, and histone modifications, especially in metabolic genes. These changes were largely reversed after the long-term NC, demonstrating the high plasticity of the epigenetic state in hepatocytes. However, we found a certain number of genomic loci showing non-reversible epigenetic changes, especially around genes related to cell signaling, liver fibrosis, and hepatocellular carcinoma, implying persistent changes in the cellular state of the liver triggered by early-life HFD feeding. CONCLUSION: In summary, our data show that early-life HFD feeding triggers both reversible and persistent epigenetic changes in medaka hepatocytes. Our data provide novel insights into the epigenetic mechanism of nutritional programming and a comprehensive atlas of the long-term epigenetic state in an early-life HFD model of non-mammalian vertebrates.


Subject(s)
Epigenesis, Genetic , Animals , Oryzias/genetics , Oryzias/growth & development , Diet, High-Fat , Gene Expression Regulation, Developmental , Genetic Loci
6.
Cancer Sci ; 114(4): 1672-1685, 2023 Apr.
Article in English | MEDLINE | ID: mdl-36511816

ABSTRACT

The molecular subtypes of pancreatic cancer (PC), either classical/progenitor-like or basal/squamous-like, are currently a major topic of research because of their direct association with clinical outcomes. Some transcription factors (TFs) have been reported to be associated with these subtypes. However, the mechanisms by which these molecular signatures of PCs are established remain unknown. Epigenetic regulatory processes, supported by dynamic changes in the chromatin structure, are essential for transcriptional profiles. Previously, we reported the importance of open chromatin profiles in the biological features and transcriptional status of PCs. Here, we aimed to analyze the relationships between three-dimensional (3D) genome structures and the molecular subtypes of human PCs using Hi-C analysis. We observed a correlation of the specific elements of 3D genome modules, including compartments, topologically associating domains, and enhancer-promoter loops, with the expression of related genes. We focused on HNF1B, a TF that is implicated in the progenitor subtype. Forced expression of HNF1B in squamous-type PC organoids induced the upregulation and downregulation of genes associated with progenitor and squamous subtypes, respectively. Long-range genomic interactions induced by HNF1B were accompanied by compartment modulation and H3K27ac redistribution. We also found that these HNF1B-induced changes in subtype-related gene expression required an intrinsically disordered region, suggesting a possible involvement of phase separation in compartment modulation. Thus, mapping of 3D structural changes induced by TFs, such as HNF1B, may become a useful resource for further understanding the molecular features of PCs.


Subject(s)
Carcinoma, Squamous Cell , Genome , Humans , Chromatin/genetics , Transcription Factors/genetics , Epigenesis, Genetic , Carcinoma, Squamous Cell/genetics , Hepatocyte Nuclear Factor 1-beta/genetics , Hepatocyte Nuclear Factor 1-beta/metabolism
7.
Angiogenesis ; 26(1): 37-52, 2023 02.
Article in English | MEDLINE | ID: mdl-35902510

ABSTRACT

Orbital cavernous venous malformation (OCVM) is a sporadic vascular anomaly of uncertain etiology characterized by abnormally dilated vascular channels. Here, we identify a somatic missense mutation, c.121G>T (p.Gly41Cys) in GJA4, which encodes a transmembrane protein that is a component of gap junctions and hemichannels in the vascular system, in OCVM tissues from 25/26 (96.2%) individuals with OCVM. GJA4 expression was detected in OCVM tissue, including endothelial cells and the stroma, through immunohistochemistry. Within OCVM tissue, the mutation allele frequency was higher in endothelial cell-enriched fractions obtained using magnetic-activated cell sorting. Whole-cell voltage clamp analysis in Xenopus oocytes revealed that GJA4 c.121G>T (p.Gly41Cys) is a gain-of-function mutation that leads to the formation of a hyperactive hemichannel. Overexpression of the mutant protein in human umbilical vein endothelial cells led to a loss of cellular integrity, which was rescued by carbenoxolone, a non-specific gap junction/hemichannel inhibitor. Our data suggest that GJA4 c.121G>T (p.Gly41Cys) is a potential driver gene mutation for OCVM. We propose that a hyperactive hemichannel plays a role in the development of this vascular phenotype.


Subject(s)
Gain of Function Mutation , Vascular Malformations , Humans , Endothelial Cells , Gap Junctions/genetics , Mutation , Veins , Vascular Malformations/metabolism
8.
Gastroenterology ; 162(4): 1272-1287.e16, 2022 04.
Article in English | MEDLINE | ID: mdl-34953915

ABSTRACT

BACKGROUND & AIMS: Chromatin architecture governs cell lineages by regulating specific gene expression; however, its role in the diversity of cancer development remains unknown. Among pancreatic cancers, pancreatic ductal adenocarcinoma (PDAC) and intraductal papillary mucinous neoplasms (IPMN) with an associated invasive carcinoma (IPMNinv) arise from 2 distinct precursors, and their fundamental differences remain obscure. Here, we aimed to assess the differences in chromatin architecture regulating the transcriptional signatures or biological features in pancreatic cancers. METHODS: We established 28 human organoids from distinct subtypes of pancreatic tumors, including IPMN, IPMNinv, and PDAC. We performed exome sequencing (seq), RNA-seq, assay for transposase-accessible chromatin-seq, chromatin immunoprecipitation-seq, high-throughput chromosome conformation capture, and phenotypic analyses with short hairpin RNA or clustered regularly interspaced short palindromic repeats interference. RESULTS: Established organoids successfully reproduced the histology of primary tumors. IPMN and IPMNinv organoids harbored GNAS, RNF43, or KLF4 mutations and showed distinct expression profiles compared with PDAC. Chromatin accessibility profiles revealed the gain of stomach-specific open regions in IPMN and the pattern of diverse gastrointestinal tissues in IPMNinv. In contrast, PDAC presented an impressive loss of accessible regions compared with normal pancreatic ducts. Transcription factor footprint analysis and functional assays identified that MNX1 and HNF1B were biologically indispensable for IPMN lineages. The upregulation of MNX1 was specifically marked in the human IPMN lineage tissues. The MNX1-HNF1B axis governed a set of genes, including MYC, SOX9, and OLFM4, which are known to be essential for gastrointestinal stem cells. High-throughput chromosome conformation capture analysis suggested the HNF1B target genes to be 3-dimensionally connected in the genome of IPMNinv. CONCLUSIONS: Our organoid analyses identified the MNX1-HNF1B axis to be biologically significant in IPMN lineages.


Subject(s)
Adenocarcinoma, Mucinous , Carcinoma, Pancreatic Ductal , Hepatocyte Nuclear Factor 1-beta , Homeodomain Proteins , Pancreatic Intraductal Neoplasms , Transcription Factors , Adenocarcinoma, Mucinous/genetics , Carcinoma, Pancreatic Ductal/pathology , Chromatin , Hepatocyte Nuclear Factor 1-beta/genetics , Homeodomain Proteins/genetics , Humans , Pancreatic Intraductal Neoplasms/genetics , Transcription Factors/genetics , Pancreatic Neoplasms
9.
J Peripher Nerv Syst ; 28(3): 518-521, 2023 09.
Article in English | MEDLINE | ID: mdl-37249095

ABSTRACT

BACKGROUND AND AIMS: Pathogenic variants of HSPB1, the gene encoding the small heat shock protein 27, have been reported to cause autosomal dominant distal hereditary motor neuropathy (dHMN) type II and autosomal dominant Charcot-Marie-Tooth (CMT) disease with minimal sensory involvement (CMT2F). This study aimed to describe the clinical features of patients in a family with late-onset dHMN carrying the Pro39Leu variant of HSPB1. METHODS: Whole-exome sequence analysis identified a heterozygous pathogenic variant (Pro39Leu) of HSPB1 in the proband. The presence of the HSPB1 Pro39Leu variant in two affected individuals was confirmed using direct nucleotide sequence analysis. RESULTS: Both patients exhibited distal muscle weakness with lower extremity predominance and no obvious sensory deficits, leading to a clinical diagnosis of late-onset dHMN. Nerve conduction studies (NCSs) revealed a subclinical complication of sensory disturbance in one of the patients. The clinical and electrophysiological findings of patients with the HSPB1 Pro39Leu variant in this study and previous reports are summarized. INTERPRETATION: This study suggests that the clinical spectrum of patients carrying HSPB1 Pro39Leu variants, especially the disease onset, might be broader than expected, and HSPB1 variants should be considered in patients diagnosed with late-onset dHMN. Furthermore, patients with dHMN may have concomitant sensory deficits that should be evaluated using NCSs.


Subject(s)
Charcot-Marie-Tooth Disease , Hereditary Sensory and Motor Neuropathy , Humans , Mutation , Charcot-Marie-Tooth Disease/genetics , Lower Extremity , Heat-Shock Proteins/genetics , Molecular Chaperones/genetics
10.
Nucleic Acids Res ; 49(5): 2700-2720, 2021 03 18.
Article in English | MEDLINE | ID: mdl-33590099

ABSTRACT

In animal gonads, transposable elements are actively repressed to preserve genome integrity through the PIWI-interacting RNA (piRNA) pathway. In mice, piRNAs are abundantly expressed in male germ cells, and form effector complexes with three distinct PIWIs. The depletion of individual Piwi genes causes male-specific sterility with no discernible phenotype in female mice. Unlike mice, most other mammals have four PIWI genes, some of which are expressed in the ovary. Here, purification of PIWI complexes from oocytes of the golden hamster revealed that the size of the PIWIL1-associated piRNAs changed during oocyte maturation. In contrast, PIWIL3, an ovary-specific PIWI in most mammals, associates with short piRNAs only in metaphase II oocytes, which coincides with intense phosphorylation of the protein. An improved high-quality genome assembly and annotation revealed that PIWIL1- and PIWIL3-associated piRNAs appear to share the 5'-ends of common piRNA precursors and are mostly derived from unannotated sequences with a diminished contribution from TE-derived sequences, most of which correspond to endogenous retroviruses. Our findings show the complex and dynamic nature of biogenesis of piRNAs in hamster oocytes, and together with the new genome sequence generated, serve as the foundation for developing useful models to study the piRNA pathway in mammalian oocytes.


Subject(s)
Argonaute Proteins/metabolism , Oocytes/growth & development , Oocytes/metabolism , RNA, Small Interfering/metabolism , Animals , Argonaute Proteins/genetics , Female , Genomics , Male , Mesocricetus , Metaphase , Phosphorylation , RNA, Small Interfering/genetics , Testis/metabolism
11.
BMC Genomics ; 23(1): 249, 2022 Mar 31.
Article in English | MEDLINE | ID: mdl-35361121

ABSTRACT

BACKGROUND: Single molecule measurements of DNA polymerization kinetics provide a sensitive means to detect both secondary structures in DNA and deviations from primary chemical structure as a result of modified bases. In one approach to such analysis, deviations can be inferred by monitoring the behavior of DNA polymerase using single-molecule, real-time sequencing with zero-mode waveguide. This approach uses a Single Molecule Real Time (SMRT)-sequencing measurement of time between fluorescence pulse signals from consecutive nucleosides incorporated during DNA replication, called the interpulse duration (IPD). RESULTS: In this paper we present an analysis of loci with high IPDs in two genomes, a bacterial genome (E. coli) and a eukaryotic genome (C. elegans). To distinguish the potential effects of DNA modification on DNA polymerization speed, we paired an analysis of native genomic DNA with whole-genome amplified (WGA) material in which DNA modifications were effectively removed. Adenine modification sites for E. coli are known and we observed the expected IPD shifts at these sites in the native but not WGA samples. For C. elegans, such differences were not observed. Instead, we found a number of novel sequence contexts where IPDs were raised relative to the average IPDs for each of the four nucleotides, but for which the raised IPD was present in both native and WGA samples. CONCLUSION: The latter results argue strongly against DNA modification as the underlying driver for high IPD segments for C. elegans, and provide a framework for separating effects of DNA modification from context-dependent DNA polymerase kinetic patterns inherent in underlying DNA sequence for a complex eukaryotic genome.


Subject(s)
Caenorhabditis elegans , Escherichia coli , Animals , Caenorhabditis elegans/genetics , DNA/chemistry , DNA/genetics , Escherichia coli/genetics , Polymerization , Sequence Analysis, DNA/methods
12.
Genome Res ; 29(6): 1009-1022, 2019 06.
Article in English | MEDLINE | ID: mdl-31123080

ABSTRACT

Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology.


Subject(s)
Caenorhabditis elegans/genetics , Genome, Helminth , Genomics , Animals , Caenorhabditis elegans Proteins/genetics , Computational Biology/methods , Genomics/methods , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Reproducibility of Results
13.
Bioinformatics ; 37(5): 612-621, 2021 05 05.
Article in English | MEDLINE | ID: mdl-33031558

ABSTRACT

MOTIVATION: Long tandem repeat expansions of more than 1000 nt have been suggested to be associated with diseases, but remain largely unexplored in individual human genomes because read lengths have been too short. However, new long-read sequencing technologies can produce single reads of 10 000 nt or more that can span such repeat expansions, although these long reads have high error rates, of 10-20%, which complicates the detection of repetitive elements. Moreover, most traditional algorithms for finding tandem repeats are designed to find short tandem repeats (<1000 nt) and cannot effectively handle the high error rate of long reads in a reasonable amount of time. RESULTS: Here, we report an efficient algorithm for solving this problem that takes advantage of the length of the repeat. Namely, a long tandem repeat has hundreds or thousands of approximate copies of the repeated unit, so despite the error rate, many short k-mers will be error-free in many copies of the unit. We exploited this characteristic to develop a method for first estimating regions that could contain a tandem repeat, by analyzing the k-mer frequency distributions of fixed-size windows across the target read, followed by an algorithm that assembles the k-mers of a putative region into the consensus repeat unit by greedily traversing a de Bruijn graph. Experimental results indicated that the proposed algorithm largely outperformed Tandem Repeats Finder, a widely used program for finding tandem repeats, in terms of sensitivity. AVAILABILITY AND IMPLEMENTATION: https://github.com/morisUtokyo/mTR.
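
The first step described above, scanning fixed-size windows for skewed k-mer frequency distributions, can be illustrated with a short Python sketch. This is a simplified illustration under stated assumptions and not the mTR implementation: the score (one minus the fraction of distinct k-mers), the parameter defaults, and the names `window_repeat_scores` and `make_demo_read` are all hypothetical, and the de Bruijn graph step is not shown.

```python
import random
from collections import Counter


def window_repeat_scores(read, k=5, window=50, step=25):
    """Return (start, score) per window; a score near 1 means few distinct k-mers."""
    scores = []
    last_start = max(len(read) - window, 0)
    for start in range(0, last_start + 1, step):
        chunk = read[start:start + window]
        kmers = [chunk[i:i + k] for i in range(len(chunk) - k + 1)]
        if not kmers:
            continue
        # In a tandem repeat, a handful of k-mers account for the whole window.
        counts = Counter(kmers)
        scores.append((start, 1.0 - len(counts) / len(kmers)))
    return scores


def make_demo_read(seed=0):
    """Synthetic, error-free read: random flank + (CAG)x40 + random flank."""
    rng = random.Random(seed)

    def flank(n):
        return "".join(rng.choice("ACGT") for _ in range(n))

    return flank(100) + "CAG" * 40 + flank(100)
```

Calling `window_repeat_scores(make_demo_read())` should give high scores for windows inside the CAG block and low scores in the random flanks. Real long reads carry errors, so a practical threshold would need tuning against the error rate.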


Subject(s)
Algorithms , High-Throughput Nucleotide Sequencing , Genome, Human , Humans , Microsatellite Repeats , Sequence Analysis, DNA
14.
PLoS Comput Biol ; 17(1): e1008597, 2021 01.
Article in English | MEDLINE | ID: mdl-33434206

ABSTRACT

Plant mitochondrial genomes have distinctive features compared to those of animals; namely, they are large and divergent, with sizes ranging from hundreds of thousands to a few million bases. Recombination among repetitive regions is thought to produce similar structures that differ slightly, known as "multipartite structures," which contribute to different phenotypes. Although many reference plant mitochondrial genomes represent almost all the genes in mitochondria, the full spectrum of their structures remains largely unknown. The emergence of long-read sequencing technology is expected to yield this landscape; however, many studies aimed to assemble only one representative circular genome, because properly understanding multipartite structures using existing assemblers is not feasible. To elucidate multipartite structures, we leveraged the information in existing reference genomes and classified long reads according to their corresponding structures. We developed a method that exploits two classic algorithms, partial order alignment (POA) and the hidden Markov model (HMM), to construct a sensitive read classifier. This method enables us to represent a set of reads as a POA graph and analyze it using the HMM. We can then calculate the likelihood of a read occurring in a given cluster, resulting in an iterative clustering algorithm. For synthetic data, our proposed method reliably detected one variation site out of 9,000-bp synthetic long reads with a 15% sequencing-error rate and produced accurate clustering. It was also capable of clustering long reads from six very similar sequences containing only slight differences. For real data, we assembled putative multipartite structures of mitochondrial genomes of Arabidopsis thaliana from nine accessions sequenced using PacBio Sequel. The results indicated that there are recurrent and strain-specific structures in A. thaliana mitochondrial genomes.


Subject(s)
Arabidopsis/genetics , Genome, Mitochondrial/genetics , Genome, Plant/genetics , Sequence Analysis, DNA/methods , Algorithms , Markov Chains
15.
J Med Genet ; 58(10): 701-711, 2021 10.
Article in English | MEDLINE | ID: mdl-33067351

ABSTRACT

BACKGROUND: Although 60% of patients with de novo neurofibromatosis type 2 (NF2) are presumed to have mosaic NF2, the actual diagnostic rate of this condition remains low at around 20% because of the existing difficulties in detecting NF2 variants with low variant allele frequency (VAF). Here, we examined the correlation between the genotype and phenotype of mosaic NF2 after improving the diagnostic rate of mosaic NF2. METHODS: We performed targeted deep sequencing of 36 genes including NF2 using DNA samples from multiple tissues (blood, buccal mucosa, hair follicle and tumour) of 53 patients with de novo NF2 and elucidated their genotype-phenotype correlation. RESULTS: Twenty-four patients (45.2%) had the NF2 germline variant, and 20 patients with NF2 (37.7%) had mosaic NF2. The mosaic NF2 phenotype was significantly different from that in patients with NF2 germline variant in terms of distribution of NF2-related disease, tumour growth rate and hearing outcome. Unlike that of meningioma, the behaviour of schwannoma correlated with the VAF of the NF2 variant in normal tissues. CONCLUSION: We have improved the diagnostic rate of mosaic NF2 compared with that of previous studies by targeted deep sequencing of DNA from multiple tissues. Many atypical patients with NF2 diagnosed with 'unilateral vestibular schwannoma' or 'multiple meningiomas' presumably have mosaic NF2. Finally, we suggest that the highly diverse phenotype of NF2 could result not only from the type and location of the NF2 variant but also from the VAF of the NF2 variant in normal tissue DNA.


Subject(s)
Genes, Neurofibromatosis 2 , High-Throughput Nucleotide Sequencing , Mosaicism , Mutation , Neurofibromatosis 2/diagnosis , Neurofibromatosis 2/genetics , Phenotype , Computational Biology/methods , DNA Mutational Analysis , Genetic Association Studies , Genetic Predisposition to Disease , Humans , Nucleic Acid Amplification Techniques , Sequence Analysis, DNA
16.
Neurogenetics ; 22(1): 11-17, 2021 03.
Article in English | MEDLINE | ID: mdl-32815063

ABSTRACT

Our objective was to investigate the frequency of KIF5A variants in amyotrophic lateral sclerosis (ALS) and the clinical characteristics of familial ALS (FALS) associated with variants in KIF5A. Whole-exome sequence analysis was performed for a Japanese series of 43 families with FALS and 444 patients with sporadic ALS (SALS), in whom causative variants had not been identified. We compared the frequencies of rare variants (MAF < 0.01) in KIF5A, including missense and loss of function (LoF) variants, between ALS and control subjects (n = 1163). Clinical characteristics of patients with FALS carrying pathogenic variants in KIF5A were also described. LoF variants were identified only in the probands of two families with FALS, both of which were 3' splice-site variants leading to exon skipping and an altered C-terminal domain, located in the mutational hotspot causing FALS, and were considered to be pathogenic for FALS. Rare missense variants in KIF5A were identified in five patients with SALS (1.13%) and 11 control subjects (0.95%, carrier frequency); these frequencies were not significantly different. Consequently, the pathogenic LoF variants in KIF5A accounted for 2.1% of all FALS families in this study. These patients suffered from ALS characteristically associated with the predominant involvement of upper motor neurons. In conclusion, we identified two pathogenic splice-site variants in KIF5A in the probands in two Japanese families with FALS, which altered the C-terminal region of KIF5A. Our findings broaden the phenotype spectrum of ALS associated with variants in KIF5A in the Japanese series.


Subject(s)
Amyotrophic Lateral Sclerosis/genetics , Genetic Predisposition to Disease/genetics , Kinesins/genetics , Mutation/genetics , Adult , Amyotrophic Lateral Sclerosis/diagnosis , Asian People/genetics , Female , Genetic Association Studies , Humans , Japan , Male , Middle Aged
17.
Bioinformatics ; 36(13): 3966-3974, 2020 07 01.
Article in English | MEDLINE | ID: mdl-32369554

ABSTRACT

MOTIVATION: De novo assembly of reference-quality genomes used to require enormously laborious tasks. In particular, it is extremely time-consuming to build genome markers for ordering assembled contigs along chromosomes; thus, they are only available for well-established model organisms. To resolve this issue, recent studies demonstrated that Hi-C could be a powerful and cost-effective means to output chromosome-length scaffolds for non-model species with no genome marker resources, because the Hi-C contact frequency between a pair of two loci can be a good estimator of their genomic distance, even if there is a large gap between them. Indeed, state-of-the-art methods such as 3D-DNA are now widely used for locating contigs in chromosomes. However, it remains challenging to reduce errors in contig orientation because shorter contigs have fewer contacts with their neighboring contigs. These orientation errors lower the accuracy of gene prediction, read alignment, and synteny block estimation in comparative genomics. RESULTS: To reduce these contig orientation errors, we propose a new algorithm, named HiC-Hiker, which has a firm grounding in probabilistic theory, rigorously models Hi-C contacts across contigs, and effectively infers the most probable orientations via the Viterbi algorithm. We compared HiC-Hiker and 3D-DNA using human and worm genome contigs generated from short reads, evaluated their performances, and observed a remarkable reduction in the contig orientation error rate from 4.3% (3D-DNA) to 1.7% (HiC-Hiker). Our algorithm can consider long-range information between distal contigs and precisely estimates Hi-C read contact probabilities among contigs, which may also be useful for determining the ordering of contigs. AVAILABILITY AND IMPLEMENTATION: HiC-Hiker is freely available at: https://github.com/ryought/hic_hiker.
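
The idea of inferring the most probable orientations with the Viterbi algorithm can be sketched in Python as follows. This is a reduced illustration, not HiC-Hiker itself: it assumes that only adjacent contigs interact and that a log-likelihood for each pair of orientations has already been computed from Hi-C contacts, whereas the abstract describes a model that also uses contacts between distal contigs. The function name and the input layout are hypothetical.

```python
def most_probable_orientations(pair_loglik):
    """Viterbi decoding of contig orientations ("+" or "-").

    pair_loglik[i][(a, b)] is an assumed, precomputed log-likelihood that
    contig i has orientation a and contig i + 1 has orientation b.
    """
    states = ("+", "-")
    best = {s: 0.0 for s in states}  # best score of a path ending in state s
    back = []                        # back[i][s] = best orientation of contig i
    for scores in pair_loglik:
        new_best, pointers = {}, {}
        for cur in states:
            prev = max(states, key=lambda p: best[p] + scores[(p, cur)])
            new_best[cur] = best[prev] + scores[(prev, cur)]
            pointers[cur] = prev
        best = new_best
        back.append(pointers)
    # Trace back from the best final state to recover the full path.
    path = [max(states, key=lambda s: best[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

With n contigs the input has n - 1 score tables and the run time is linear in n, which is the main appeal of dynamic programming over enumerating all 2^n orientation assignments.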


Subject(s)
Genome , Genomics , Algorithms , Chromosomes/genetics , Contig Mapping , Humans , Models, Statistical , Sequence Analysis, DNA
18.
J Hum Genet ; 66(3): 237-241, 2021 Mar.
Article in English | MEDLINE | ID: mdl-32920598

ABSTRACT

Loss-of-function (LoF) variants in NEK1 have recently been reported to be associated with amyotrophic lateral sclerosis (ALS). In this study, we investigated the association of NEK1 LoF variants with an increased risk of sporadic ALS (SALS) and the clinical characteristics of patients with SALS carrying LoF variants in a Japanese case series. Whole-exome sequencing analysis was performed for a series of 446 SALS patients in whom pathogenic variants in familial ALS-causative genes have not been identified and 1163 healthy control subjects in our Japanese series. We evaluated LoF variants, defined as nonsense, splice-site disrupting single-nucleotide variants (SNVs), or short insertion/deletion (indel) variants predicted to cause frameshifts in NEK1. We identified seven NEK1 LoF variants in patients with SALS (1.57%), whereas only one was identified in control subjects (0.086%) (P = 0.00073, Fisher's exact test). This finding is consistent with those in recent reports from other regions in the world. In conclusion, we demonstrated that NEK1 LoF variants are also associated with an increased risk of SALS in the Japanese population.
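
The comparison reported above can be reproduced in outline with a two-sided Fisher's exact test on a 2x2 table. The Python sketch below uses only the standard library. It assumes that each of the seven variants corresponds to one carrier among the 446 patients and that there was one carrier among the 1163 controls; the function name is hypothetical, and the two-sided rule used here (summing all tables no more probable than the observed one) is one common convention that may differ from the authors' software.

```python
from fractions import Fraction
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability of k in the top-left cell, margins fixed.
        return Fraction(comb(row1, k) * comb(row2, col1 - k), denom)

    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    observed = prob(a)
    # Sum every table that is no more probable than the observed one.
    total = sum(p for p in (prob(k) for k in range(lo, hi + 1)) if p <= observed)
    return float(total)


# Carriers / non-carriers: SALS patients (7 / 439) vs. controls (1 / 1162).
p_value = fisher_exact_two_sided(7, 439, 1, 1162)
```

Under these assumptions, `p_value` should be of the same order as the reported P = 0.00073.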


Subject(s)
Amyotrophic Lateral Sclerosis/genetics; Asian People/genetics; Loss of Function Mutation; NIMA-Related Kinase 1/deficiency; Age of Onset; Aged; Amyotrophic Lateral Sclerosis/ethnology; Amyotrophic Lateral Sclerosis/psychology; Codon, Nonsense; Cognition Disorders/genetics; Female; Genetic Association Studies; Genetic Predisposition to Disease; Humans; INDEL Mutation; Male; Middle Aged; Mutation; NIMA-Related Kinase 1/genetics; NIMA-Related Kinase 1/physiology; Polymorphism, Single Nucleotide; Protein Isoforms/genetics; RNA Splice Sites/genetics; Exome Sequencing
19.
Hum Mutat ; 41(8): 1447-1460, 2020 08.
Article in English | MEDLINE | ID: mdl-32485067

ABSTRACT

A heterozygous deletion at Xq27.3q28 including FMR1, AFF2, and IDS, which causes intellectual disability and characteristic facial features, is very rare in females, with only 10 patients having been reported. Here, we examined two female patients with different clinical features harboring the Xq27.3q28 deletion and determined the chromosomal breakpoints. Moreover, we assessed X chromosome inactivation (XCI) in peripheral blood from both patients. Both patients had almost overlapping deletions at Xq27.3q28; however, the more severely affected patient (Patient 1) showed skewed XCI of the normal X chromosome (79:21), whereas the more mildly affected patient (Patient 2) showed random XCI. Therefore, deletion at Xq27.3q28 critically affected brain development, and the ratio of XCI of the normal X chromosome greatly affected the clinical characteristics of patients with deletion at Xq27.3q28. Having determined the chromosomal breakpoints, we analyzed changes in chromatin domains termed topologically associated domains (TADs) using published Hi-C data on the Xq27.3q28 region, and found that only Patient 1 had a possibility of a drastic change in TADs. The altered chromatin topologies in the Xq27.3q28 region might affect the clinical features of Patient 1 by changing the expression of genes just outside the deletion and/or the establishment of XCI during embryogenesis, resulting in skewed XCI.


Subject(s)
Chromosome Deletion; Intellectual Disability/genetics; X Chromosome Inactivation; Child, Preschool; Chromosomes, Human, X; Cytogenetic Analysis; Female; Humans; Infant; Japan; X-linked Nuclear Protein/genetics
20.
Bioinformatics ; 35(4): 584-592, 2019 02 15.
Article in English | MEDLINE | ID: mdl-30776078

ABSTRACT

MOTIVATION: Detection of DNA at low abundance relative to the entire sample is an important problem in areas such as epidemiology and field research, as these samples are highly contaminated with non-target DNA. Many methods have been developed to solve this problem, but all require additional time-consuming and costly procedures. Meanwhile, the MinION sequencer developed by Oxford Nanopore Technologies (ONT) is considered a powerful tool for tackling this problem, as it allows selective sequencing of target DNA. The main technology employed, referred to as 'Read Until', rejects an undesirable read from a specific pore by inverting the voltage of that pore. Despite its usefulness, several issues remain to be solved in real situations. First, limited computational resources are available in field research and epidemiological applications. In addition, a high-speed online classification algorithm is required to make a prompt decision. Lastly, the lack of a theoretical approach for modeling selective sequencing makes it difficult to analyze and justify a given algorithm. RESULTS: In this paper, we introduce a statistical model of selective sequencing, propose an efficient constant-time classifier for any background DNA profile, and validate its optimal precision. To confirm the feasibility of the proposed method in practice, we demonstrate on a pre-recorded mock sample that the method can selectively sequence a 100 kb region, constituting 0.1% of the entire read pool, and achieve approximately 500-fold amplification. Furthermore, the algorithm is shown to process 26 queries per second on a $500 palm-sized Next Unit of Computing (NUC) box with an Intel® Core™ i7 CPU, without additional computing resources such as a GPU or high-performance computing. Next, we prepared a mixed DNA pool composed of Saccharomyces cerevisiae and lambda phage, in which any 200 kb region of S. cerevisiae constitutes 0.1% of the whole sample. From this sample, a 30-230 kb region of S. cerevisiae chromosome 1 was amplified approximately 30-fold. In addition, this method allowed on-the-fly changing of the amplified region according to the uncovered characteristics of a given DNA sample. AVAILABILITY AND IMPLEMENTATION: The source code is available at: https://bitbucket.org/ban-m/dyss.


Subject(s)
Algorithms; DNA/analysis; Sequence Analysis, DNA; Software