Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
PLoS Genet ; 18(8): e1010342, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35926060

RESUMEN

Genes generate transcripts of various functions by alternative splicing. However, in most transcriptome studies, short-reads sequencing technologies (next-generation sequencers) have been used, leaving full-length transcripts unobserved directly. Although long-reads sequencing technologies would enable the sequencing of full-length transcripts, the data analysis is difficult. In this study, we developed an analysis pipeline named SPLICE and analyzed cDNA sequences from 42 pairs of hepatocellular carcinoma (HCC) and matched non-cancerous livers with an Oxford Nanopore sequencer. Our analysis detected 46,663 transcripts from the protein-coding genes in the HCCs and the matched non-cancerous livers, of which 5,366 (11.5%) were novel. A comparison of expression levels identified 9,933 differentially expressed transcripts (DETs) in 4,744 genes. Interestingly, 746 genes with DETs, including the LINE1-MET transcript, were not found by a gene-level analysis. We also found that fusion transcripts of transposable elements and hepatitis B virus (HBV) were overexpressed in HCCs. In vitro experiments on DETs showed that LINE1-MET and HBV-human transposable elements promoted cell growth. Furthermore, fusion gene detection showed novel recurrent fusion events that were not detected in the short-reads. These results suggest the efficiency of full-length transcriptome studies and the importance of splicing variants in carcinogenesis.


Asunto(s)
Carcinoma Hepatocelular , Neoplasias Hepáticas , Empalme Alternativo/genética , Carcinogénesis/genética , Carcinoma Hepatocelular/genética , Elementos Transponibles de ADN , Virus de la Hepatitis B/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Neoplasias Hepáticas/genética , Empalme del ARN/genética , Transcriptoma/genética
2.
J Med Genet ; 59(11): 1133-1138, 2022 11.
Artículo en Inglés | MEDLINE | ID: mdl-35710107

RESUMEN

Despite the successful identification of causative genes and genetic variants of retinitis pigmentosa (RP), many patients have not been molecularly diagnosed. Our recent study using targeted short-read sequencing showed that the proportion of carriers of pathogenic variants in EYS, the cause of autosomal recessive RP, was unexpectedly high in Japanese patients with unsolved RP. This result suggested that causative genetic variants, which are difficult to detect by short-read sequencing, exist in such patients. Using long-read sequencing technology (Oxford Nanopore), we analysed the whole genomes of 15 patients with RP with one heterozygous pathogenic variant in EYS detected in our previous study along with structural variants (SVs) in EYS and another 88 RP-associated genes. Two large exon-overlapping deletions involving six exons were identified in EYS in two patients with unsolved RP. An analysis of an independent patient set (n=1189) suggested that these two deletions are not founder mutations. Our results suggest that searching for SVs by long-read sequencing in genetically unsolved cases benefits the molecular diagnosis of RP.


Asunto(s)
Proteínas del Ojo , Retinitis Pigmentosa , Humanos , Genes Recesivos , Linaje , Proteínas del Ojo/genética , Mutación/genética , Retinitis Pigmentosa/diagnóstico , Retinitis Pigmentosa/genética , Retinitis Pigmentosa/patología , Análisis Mutacional de ADN
3.
Hum Genet ; 140(8): 1201-1216, 2021 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-33978893

RESUMEN

Intermediate-sized insertions are one of the structural variants contributing to genome diversity. However, due to technical difficulties in identifying them, their importance in disease pathogenicity and gene expression regulation remains unclear. We used whole-genome sequencing data of 174 Japanese samples to characterize intermediate-sized insertions using a highly-accurate insertion calling method (IMSindel software and joint-call recovery) and obtained a catalogue of 4254 insertions. We constructed an imputation panel comprising of insertions and SNVs from all samples, and conducted imputation of intermediate-sized insertions for 82 publicly-available Japanese samples. Positive Predictive Value of imputation, evaluated using Nanopore long-read sequencing data, was 97%. Subsequent eQTL analysis predicted 128 (~ 3.0%) insertions as causative for gene expression level changes. Enrichment analysis of causal insertions for genome regulatory elements showed significant associations with CTCF-binding sites, super-enhancers, and promoters. Among 17 causal insertions found in the same causal set with GWAS hits, there were insertions associated with changes in expression of cancer-related genes such as BRCA1, ZNF222, and ABCB10. Analysis of insertions sequences revealed that 461 insertions were short tandem duplications frequently found in early-replicating regions of genome. Furthermore, comparison of functional importance of intermediate-sized insertions with that of intermediate-sized deletions detected in the same sample set in our previous study showed that insertions were more frequent in genic regions, and proportion of functional candidates was smaller in insertions. Here, we characterize a high-confidence set of intermediate-sized insertions and indicate their importance in gene expression regulation. Our results emphasize the importance of considering intermediate-sized insertions in trait association studies.


Asunto(s)
Regulación de la Expresión Génica , Genoma Humano , Mutagénesis Insercional , Proteínas de Neoplasias/genética , Neoplasias/genética , Osteoartritis/genética , Transportadoras de Casetes de Unión a ATP/genética , Transportadoras de Casetes de Unión a ATP/metabolismo , Proteína BRCA1/genética , Proteína BRCA1/metabolismo , Secuencia de Bases , Sitios de Unión , Factor de Unión a CCCTC/genética , Factor de Unión a CCCTC/metabolismo , Mapeo Cromosómico , Bases de Datos Genéticas , Elementos de Facilitación Genéticos , Perfilación de la Expresión Génica , Humanos , Japón , Anotación de Secuencia Molecular , Proteínas de Neoplasias/metabolismo , Neoplasias/metabolismo , Neoplasias/patología , Osteoartritis/metabolismo , Osteoartritis/patología , Regiones Promotoras Genéticas , Sitios de Carácter Cuantitativo , Proteínas Represoras/genética , Proteínas Represoras/metabolismo , Programas Informáticos , Secuenciación Completa del Genoma
4.
Genome Med ; 13(1): 65, 2021 04 29.
Artículo en Inglés | MEDLINE | ID: mdl-33910608

RESUMEN

BACKGROUND: Identification of germline variation and somatic mutations is a major issue in human genetics. However, due to the limitations of DNA sequencing technologies and computational algorithms, our understanding of genetic variation and somatic mutations is far from complete. METHODS: In the present study, we performed whole-genome sequencing using long-read sequencing technology (Oxford Nanopore) for 11 Japanese liver cancers and matched normal samples which were previously sequenced for the International Cancer Genome Consortium (ICGC). We constructed an analysis pipeline for the long-read data and identified germline and somatic structural variations (SVs). RESULTS: In polymorphic germline SVs, our analysis identified 8004 insertions, 6389 deletions, 27 inversions, and 32 intra-chromosomal translocations. By comparing to the chimpanzee genome, we correctly inferred events that caused insertions and deletions and found that most insertions were caused by transposons and Alu is the most predominant source, while other types of insertions, such as tandem duplications and processed pseudogenes, are rare. We inferred mechanisms of deletion generations and found that most non-allelic homolog recombination (NAHR) events were caused by recombination errors in SINEs. Analysis of somatic mutations in liver cancers showed that long reads could detect larger numbers of SVs than a previous short-read study and that mechanisms of cancer SV generation were different from that of germline deletions. CONCLUSIONS: Our analysis provides a comprehensive catalog of polymorphic and somatic SVs, as well as their possible causes. Our software are available at https://github.com/afujimoto/CAMPHOR and https://github.com/afujimoto/CAMPHORsomatic .


Asunto(s)
Genoma Humano , Variación Estructural del Genoma , Mutación/genética , Neoplasias/genética , Secuenciación Completa del Genoma , Secuencia de Bases , Metilación de ADN/genética , Mutación de Línea Germinal/genética , Humanos , Mutación INDEL/genética , Regiones Promotoras Genéticas/genética , Telomerasa/genética , Virus/metabolismo
5.
Commun Biol ; 3(1): 724, 2020 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-33247206

RESUMEN

(Pro)renin receptor [(P)RR] has a role in various diseases, such as cardiovascular and renal disorders and cancer. Aberrant (P)RR expression is prevalent in pancreatic ductal adenocarcinoma (PDAC) which is the most common pancreatic cancer. Here we show whether aberrant expression of (P)RR directly leads to genomic instability in human pancreatic ductal epithelial (HPDE) cells. (P)RR-expressing HPDE cells show obvious cellular atypia. Whole genome sequencing reveals that aberrant (P)RR expression induces large numbers of point mutations and structural variations at the genome level. A (P)RR-expressing cell population exhibits tumour-forming ability, showing both atypical nuclei characterised by distinctive nuclear bodies and chromosomal abnormalities. (P)RR overexpression upregulates SWItch/Sucrose Non-Fermentable (SWI/SNF)-related, matrix-associated, actin-dependent regulator of chromatin, subfamily a, member 5 (SMARCA5) through a direct molecular interaction, which results in the failure of several genomic stability pathways. These data reveal that aberrant (P)RR expression contributes to the early carcinogenesis of PDAC.


Asunto(s)
Adenosina Trifosfatasas/metabolismo , Carcinoma Ductal Pancreático/metabolismo , Proteínas Cromosómicas no Histona/metabolismo , Neoplasias Pancreáticas/metabolismo , Receptores de Superficie Celular/metabolismo , ATPasas de Translocación de Protón Vacuolares/metabolismo , Adenosina Trifosfatasas/genética , Animales , Carcinoma Ductal Pancreático/genética , Línea Celular , Transformación Celular Neoplásica , Ensamble y Desensamble de Cromatina , Proteínas Cromosómicas no Histona/genética , Regulación Neoplásica de la Expresión Génica , Inestabilidad Genómica , Humanos , Masculino , Ratones , Ratones Endogámicos BALB C , Neoplasias Pancreáticas/genética , Regulación hacia Arriba , Secuenciación Completa del Genoma
6.
Genome Res ; 2020 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-32209592

RESUMEN

Microsatellites are repeats of 1- to 6-bp units, and approximately 10 million microsatellites have been identified across the human genome. Microsatellites are vulnerable to DNA mismatch errors and have thus been used to detect cancers with mismatch repair deficiency. To reveal the mutational landscape of microsatellite repeat regions at the genome level, we analyzed approximately 20.1 billion microsatellites in 2717 whole genomes of pan-cancer samples across 21 tissue types. First, we developed a new insertion and deletion caller (MIMcall) that takes into consideration the error patterns of different types of microsatellites. Among the 2717 pan-cancer samples, our analysis identified 31 samples, including colorectal, uterus, and stomach cancers, with a higher proportion of mutated microsatellite (≥0.03), which we defined as microsatellite instability (MSI) cancers of genome-wide level. Next, we found 20 highly mutated microsatellites that can be used to detect MSI cancers with high sensitivity. Third, we found that replication timing and DNA shape were significantly associated with mutation rates of microsatellites. Last, analysis of mutations in mismatch repair genes showed that somatic SNVs and short indels had larger functional impacts than germline mutations and structural variations. Our analysis provides a comprehensive picture of mutations in the microsatellite regions and reveals possible causes of mutations, as well as provides a useful marker set for MSI detection.

7.
Sci Rep ; 9(1): 15017, 2019 10 22.
Artículo en Inglés | MEDLINE | ID: mdl-31641155

RESUMEN

Plasma cell-free DNA (cfDNA) testing plays an increasingly important role in precision medicine for cancer. However, circulating cell-free tumor DNA (ctDNA) is highly diluted by cfDNA from non-cancer cells, complicating ctDNA detection and analysis. To identify low-frequency variants, we developed a program, eVIDENCE, which is a workflow for filtering candidate variants detected by using the ThruPLEX tag-seq (Takara Bio), a commercially-available molecular barcoding kit. We analyzed 27 cfDNA samples from hepatocellular carcinoma patients. Sequencing libraries were constructed and hybridized to our custom panel targeting about 80 genes. An initial variant calling identified 36,500 single nucleotide variants (SNVs) and 9,300 insertions and deletions (indels) across the 27 samples, but the number was much greater than expected when compared with previous cancer genome studies. eVIDENCE was applied to the candidate variants and finally 70 SNVs and 7 indels remained. Of the 77 variants, 49 (63.6%) showed VAF of < 1% (0.20-0.98%). Twenty-five variants were selected in an unbiased manner and all were successfully validated, suggesting that eVIDENCE can identify variants with VAF of ≥ 0.2%. Additionally, this study is the first to detect hepatitis B virus integration sites and genomic rearrangements in the TERT region from cfDNA of HCC patients. We consider that our method can be applied in the examination of cfDNA from other types of malignancies using specific custom gene panels and will contribute to comprehensive ctDNA analysis.


Asunto(s)
Biomarcadores de Tumor/genética , Carcinoma Hepatocelular/genética , Ácidos Nucleicos Libres de Células/genética , Neoplasias Hepáticas/genética , Tasa de Mutación , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Carcinoma Hepatocelular/diagnóstico , Humanos , Neoplasias Hepáticas/diagnóstico
8.
Genome Med ; 11(1): 44, 2019 07 24.
Artículo en Inglés | MEDLINE | ID: mdl-31340865

RESUMEN

BACKGROUND: Next-generation sequencing has allowed for the identification of different genetic variations, which are known to contribute to diseases. Of these, insertions and deletions are the second most abundant type of variations in the genome, but their biological importance or disease association is not well-studied, especially for deletions of intermediate sizes. METHODS: We identified intermediate-sized deletions from whole-genome sequencing (WGS) data of Japanese samples (n = 174) with a novel deletion calling method which considered multiple samples. These deletions were used to construct a reference panel for use in imputation. Imputation was then conducted using the reference panel and data from 82 publically available Japanese samples with gene expression data. The accuracy of the deletion calling and imputation was examined with Nanopore long-read sequencing technology. We also conducted an expression quantitative trait loci (eQTL) association analysis using the deletions to infer their functional impacts on genes, before characterizing the deletions causal for gene expression level changes. RESULTS: We obtained a set of polymorphic 4378 high-confidence deletions and constructed a reference panel. The deletions were successfully imputed into the Japanese samples with high accuracy (97.3%). The eQTL analysis identified 181 deletions (4.1%) suggested as causal for gene expression level changes. The causal deletion candidates were significantly enriched in promoters, super-enhancers, and transcription elongation chromatin states. Generation of deletions in a cell line with the CRISPR-Cas9 system confirmed that they were indeed causative variants for gene expression change. Furthermore, one of the deletions was observed to affect the gene expression levels of a gene it was not located in. CONCLUSIONS: This paper reports an accurate deletion calling method for genotype imputation at the whole genome level and shows the importance of intermediate-sized deletions in the human population.


Asunto(s)
Epistasis Genética , Regulación de la Expresión Génica , Genética de Población , Eliminación de Secuencia , Sistemas CRISPR-Cas , Biología Computacional/métodos , Genoma Humano , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Anotación de Secuencia Molecular , Filogenia , Sitios de Carácter Cuantitativo , Secuencias Reguladoras de Ácidos Nucleicos , Reproducibilidad de los Resultados , Secuenciación Completa del Genoma
9.
Artículo en Inglés | MEDLINE | ID: mdl-30477169

RESUMEN

Although it has been a half-century since dioxin-contaminated herbicides were used to defoliate the landscape during the Vietnam War, dioxin contamination "hotspots" still remain in Vietnam. Environmental and health impacts of these hotspots need to be evaluated. Intellectual disability (ID) is one of the diseases found in the children of people exposed to the herbicides. This study aims to identify genetic alterations of a patient whose family lived in a dioxin hotspot. The patient's father had a highly elevated dioxin concentration. He was affected with undiagnosed moderate ID. To analyze de novo mutations and genetic variations, and to identify causal gene(s) for ID, we performed whole genome sequencing (WGS) of the proband and his parents. Two de novo missense mutations were detected, each one in ETS2 and ZNF408 genes, respectively. Compound heterozygosity was identified in CENPF and TTN genes. Existing knowledge on the genes and bioinformatics analyses suggest that EST2, ZNF408, and CENPF might be promising candidates for ID causative genes.


Asunto(s)
Dioxinas , Exposición a Riesgos Ambientales/efectos adversos , Herbicidas , Discapacidad Intelectual/genética , Biología Computacional , Familia , Femenino , Variación Genética , Humanos , Masculino , Mutación , Vietnam , Secuenciación Completa del Genoma
10.
Hum Mutat ; 39(10): 1384-1392, 2018 10.
Artículo en Inglés | MEDLINE | ID: mdl-29969170

RESUMEN

2,3,7,8-Tetrachlorodibenzo-p-dioxin (TCDD) or dioxin, is commonly considered the most toxic man-made substance. Dioxin exposure impacts human health and diseases, birth defects and teratogenesis were frequently observed in children of persons who have been exposed to dioxin. However, the impact of dioxin on human mutation rate in trios has not yet been elucidated at the whole genome level. To identify and characterize the genetic alterations in the individuals exposed to dioxin, we performed whole genome sequencing (WGS) of nine Vietnamese trios whose fathers were exposed to dioxin. In total, 846 de novo point mutations, 26 de novo insertions and deletions, 4 de novo structural variations, and 1 de novo copy number variation were identified. The number of point mutations and dioxin concentrations were positively correlated (P-value < 0.05). Considering the substitution pattern, the number of A > T/T > A mutation and the dioxin concentration was positively correlated (P-value < 0.05). Our analysis also identified one possible disease-related mutation in LAMA5 in one trio. These findings suggested that dioxin exposure might affect father genomes of trios leading to de novo mutations in their children. Further analysis with larger sample sizes would be required to better clarify mutation rates and substitution patterns in trios caused by dioxin.


Asunto(s)
Dioxinas/efectos adversos , Estudio de Asociación del Genoma Completo , Mutación , Exposición Paterna/efectos adversos , Secuenciación Completa del Genoma , Alelos , Niño , Dioxinas/sangre , Femenino , Células Germinativas/metabolismo , Mutación de Línea Germinal , Humanos , Masculino , Espectrometría de Masas , Tasa de Mutación , Polimorfismo de Nucleótido Simple , Veteranos
11.
Hum Genome Var ; 3: 15067, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-27081565

RESUMEN

Tuberculosis (TB) is a complex disease, and both genetic and environmental factors contribute to disease progression. A previous genome-wide linkage study in Thailand determined that chromosome 20p13-12.3 may contain risk factors for young-onset disease. The present study aimed to identify novel susceptibility genes for young-onset TB within a 1-Mbp target region adjacent to the top-ranking risk marker in Chr.20p13-12.3. We performed next-generation sequencing (NGS) of the region in 13 young patients from multi-case families in Thailand. We then selected the functionally interesting single-nucleotide polymorphisms as candidates for subsequent analyses. The detected candidates rs13830 and rs1127354 in ITPA showed an association with young (<45 years old) TB patients. However, there was no association in old (⩾45 years old) patients. These findings confirm that stratifying patients based on age of TB onset can be important for identifying genetic risk factors for TB susceptibility. In addition, in silico expression quantitative trait loci analyses indicated that ITPA expression was associated with rs13830 genotype. This is the first study to use NGS resequencing to gain insight into host genetic factors associated with TB and to report a significant association for ITPA with host susceptibility in young-onset TB. The study also demonstrated the effectiveness of NGS in identifying susceptibility genes in common diseases.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...