Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
PLoS Genet ; 18(8): e1010342, 2022 08.
Artigo em Inglês | MEDLINE | ID: mdl-35926060

RESUMO

Genes generate transcripts of various functions by alternative splicing. However, in most transcriptome studies, short-reads sequencing technologies (next-generation sequencers) have been used, leaving full-length transcripts unobserved directly. Although long-reads sequencing technologies would enable the sequencing of full-length transcripts, the data analysis is difficult. In this study, we developed an analysis pipeline named SPLICE and analyzed cDNA sequences from 42 pairs of hepatocellular carcinoma (HCC) and matched non-cancerous livers with an Oxford Nanopore sequencer. Our analysis detected 46,663 transcripts from the protein-coding genes in the HCCs and the matched non-cancerous livers, of which 5,366 (11.5%) were novel. A comparison of expression levels identified 9,933 differentially expressed transcripts (DETs) in 4,744 genes. Interestingly, 746 genes with DETs, including the LINE1-MET transcript, were not found by a gene-level analysis. We also found that fusion transcripts of transposable elements and hepatitis B virus (HBV) were overexpressed in HCCs. In vitro experiments on DETs showed that LINE1-MET and HBV-human transposable elements promoted cell growth. Furthermore, fusion gene detection showed novel recurrent fusion events that were not detected in the short-reads. These results suggest the efficiency of full-length transcriptome studies and the importance of splicing variants in carcinogenesis.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Processamento Alternativo/genética , Carcinogênese/genética , Carcinoma Hepatocelular/genética , Elementos de DNA Transponíveis , Vírus da Hepatite B/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Neoplasias Hepáticas/genética , Splicing de RNA/genética , Transcriptoma/genética
2.
Genome Res ; 2020 Mar 24.
Artigo em Inglês | MEDLINE | ID: mdl-32209592

RESUMO

Microsatellites are repeats of 1- to 6-bp units, and approximately 10 million microsatellites have been identified across the human genome. Microsatellites are vulnerable to DNA mismatch errors and have thus been used to detect cancers with mismatch repair deficiency. To reveal the mutational landscape of microsatellite repeat regions at the genome level, we analyzed approximately 20.1 billion microsatellites in 2717 whole genomes of pan-cancer samples across 21 tissue types. First, we developed a new insertion and deletion caller (MIMcall) that takes into consideration the error patterns of different types of microsatellites. Among the 2717 pan-cancer samples, our analysis identified 31 samples, including colorectal, uterus, and stomach cancers, with a higher proportion of mutated microsatellite (≥0.03), which we defined as microsatellite instability (MSI) cancers of genome-wide level. Next, we found 20 highly mutated microsatellites that can be used to detect MSI cancers with high sensitivity. Third, we found that replication timing and DNA shape were significantly associated with mutation rates of microsatellites. Last, analysis of mutations in mismatch repair genes showed that somatic SNVs and short indels had larger functional impacts than germline mutations and structural variations. Our analysis provides a comprehensive picture of mutations in the microsatellite regions and reveals possible causes of mutations, as well as provides a useful marker set for MSI detection.

3.
J Med Genet ; 59(11): 1133-1138, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-35710107

RESUMO

Despite the successful identification of causative genes and genetic variants of retinitis pigmentosa (RP), many patients have not been molecularly diagnosed. Our recent study using targeted short-read sequencing showed that the proportion of carriers of pathogenic variants in EYS, the cause of autosomal recessive RP, was unexpectedly high in Japanese patients with unsolved RP. This result suggested that causative genetic variants, which are difficult to detect by short-read sequencing, exist in such patients. Using long-read sequencing technology (Oxford Nanopore), we analysed the whole genomes of 15 patients with RP with one heterozygous pathogenic variant in EYS detected in our previous study along with structural variants (SVs) in EYS and another 88 RP-associated genes. Two large exon-overlapping deletions involving six exons were identified in EYS in two patients with unsolved RP. An analysis of an independent patient set (n=1189) suggested that these two deletions are not founder mutations. Our results suggest that searching for SVs by long-read sequencing in genetically unsolved cases benefits the molecular diagnosis of RP.


Assuntos
Proteínas do Olho , Retinose Pigmentar , Humanos , Genes Recessivos , Linhagem , Proteínas do Olho/genética , Mutação/genética , Retinose Pigmentar/diagnóstico , Retinose Pigmentar/genética , Retinose Pigmentar/patologia , Análise Mutacional de DNA
4.
Hum Genet ; 140(8): 1201-1216, 2021 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-33978893

RESUMO

Intermediate-sized insertions are one of the structural variants contributing to genome diversity. However, due to technical difficulties in identifying them, their importance in disease pathogenicity and gene expression regulation remains unclear. We used whole-genome sequencing data of 174 Japanese samples to characterize intermediate-sized insertions using a highly-accurate insertion calling method (IMSindel software and joint-call recovery) and obtained a catalogue of 4254 insertions. We constructed an imputation panel comprising of insertions and SNVs from all samples, and conducted imputation of intermediate-sized insertions for 82 publicly-available Japanese samples. Positive Predictive Value of imputation, evaluated using Nanopore long-read sequencing data, was 97%. Subsequent eQTL analysis predicted 128 (~ 3.0%) insertions as causative for gene expression level changes. Enrichment analysis of causal insertions for genome regulatory elements showed significant associations with CTCF-binding sites, super-enhancers, and promoters. Among 17 causal insertions found in the same causal set with GWAS hits, there were insertions associated with changes in expression of cancer-related genes such as BRCA1, ZNF222, and ABCB10. Analysis of insertions sequences revealed that 461 insertions were short tandem duplications frequently found in early-replicating regions of genome. Furthermore, comparison of functional importance of intermediate-sized insertions with that of intermediate-sized deletions detected in the same sample set in our previous study showed that insertions were more frequent in genic regions, and proportion of functional candidates was smaller in insertions. Here, we characterize a high-confidence set of intermediate-sized insertions and indicate their importance in gene expression regulation. Our results emphasize the importance of considering intermediate-sized insertions in trait association studies.


Assuntos
Regulação da Expressão Gênica , Genoma Humano , Mutagênese Insercional , Proteínas de Neoplasias/genética , Neoplasias/genética , Osteoartrite/genética , Transportadores de Cassetes de Ligação de ATP/genética , Transportadores de Cassetes de Ligação de ATP/metabolismo , Proteína BRCA1/genética , Proteína BRCA1/metabolismo , Sequência de Bases , Sítios de Ligação , Fator de Ligação a CCCTC/genética , Fator de Ligação a CCCTC/metabolismo , Mapeamento Cromossômico , Bases de Dados Genéticas , Elementos Facilitadores Genéticos , Perfilação da Expressão Gênica , Humanos , Japão , Anotação de Sequência Molecular , Proteínas de Neoplasias/metabolismo , Neoplasias/metabolismo , Neoplasias/patologia , Osteoartrite/metabolismo , Osteoartrite/patologia , Regiões Promotoras Genéticas , Locos de Características Quantitativas , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Software , Sequenciamento Completo do Genoma
5.
Hum Mutat ; 39(10): 1384-1392, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-29969170

RESUMO

2,3,7,8-Tetrachlorodibenzo-p-dioxin (TCDD) or dioxin, is commonly considered the most toxic man-made substance. Dioxin exposure impacts human health and diseases, birth defects and teratogenesis were frequently observed in children of persons who have been exposed to dioxin. However, the impact of dioxin on human mutation rate in trios has not yet been elucidated at the whole genome level. To identify and characterize the genetic alterations in the individuals exposed to dioxin, we performed whole genome sequencing (WGS) of nine Vietnamese trios whose fathers were exposed to dioxin. In total, 846 de novo point mutations, 26 de novo insertions and deletions, 4 de novo structural variations, and 1 de novo copy number variation were identified. The number of point mutations and dioxin concentrations were positively correlated (P-value < 0.05). Considering the substitution pattern, the number of A > T/T > A mutation and the dioxin concentration was positively correlated (P-value < 0.05). Our analysis also identified one possible disease-related mutation in LAMA5 in one trio. These findings suggested that dioxin exposure might affect father genomes of trios leading to de novo mutations in their children. Further analysis with larger sample sizes would be required to better clarify mutation rates and substitution patterns in trios caused by dioxin.


Assuntos
Dioxinas/efeitos adversos , Estudo de Associação Genômica Ampla , Mutação , Exposição Paterna/efeitos adversos , Sequenciamento Completo do Genoma , Alelos , Criança , Dioxinas/sangue , Feminino , Células Germinativas/metabolismo , Mutação em Linhagem Germinativa , Humanos , Masculino , Espectrometria de Massas , Taxa de Mutação , Polimorfismo de Nucleotídeo Único , Veteranos
6.
Genome Med ; 13(1): 65, 2021 04 29.
Artigo em Inglês | MEDLINE | ID: mdl-33910608

RESUMO

BACKGROUND: Identification of germline variation and somatic mutations is a major issue in human genetics. However, due to the limitations of DNA sequencing technologies and computational algorithms, our understanding of genetic variation and somatic mutations is far from complete. METHODS: In the present study, we performed whole-genome sequencing using long-read sequencing technology (Oxford Nanopore) for 11 Japanese liver cancers and matched normal samples which were previously sequenced for the International Cancer Genome Consortium (ICGC). We constructed an analysis pipeline for the long-read data and identified germline and somatic structural variations (SVs). RESULTS: In polymorphic germline SVs, our analysis identified 8004 insertions, 6389 deletions, 27 inversions, and 32 intra-chromosomal translocations. By comparing to the chimpanzee genome, we correctly inferred events that caused insertions and deletions and found that most insertions were caused by transposons and Alu is the most predominant source, while other types of insertions, such as tandem duplications and processed pseudogenes, are rare. We inferred mechanisms of deletion generations and found that most non-allelic homolog recombination (NAHR) events were caused by recombination errors in SINEs. Analysis of somatic mutations in liver cancers showed that long reads could detect larger numbers of SVs than a previous short-read study and that mechanisms of cancer SV generation were different from that of germline deletions. CONCLUSIONS: Our analysis provides a comprehensive catalog of polymorphic and somatic SVs, as well as their possible causes. Our software are available at https://github.com/afujimoto/CAMPHOR and https://github.com/afujimoto/CAMPHORsomatic .


Assuntos
Genoma Humano , Variação Estrutural do Genoma , Mutação/genética , Neoplasias/genética , Sequenciamento Completo do Genoma , Sequência de Bases , Metilação de DNA/genética , Mutação em Linhagem Germinativa/genética , Humanos , Mutação INDEL/genética , Regiões Promotoras Genéticas/genética , Telomerase/genética , Vírus/metabolismo
7.
Commun Biol ; 3(1): 724, 2020 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-33247206

RESUMO

(Pro)renin receptor [(P)RR] has a role in various diseases, such as cardiovascular and renal disorders and cancer. Aberrant (P)RR expression is prevalent in pancreatic ductal adenocarcinoma (PDAC) which is the most common pancreatic cancer. Here we show whether aberrant expression of (P)RR directly leads to genomic instability in human pancreatic ductal epithelial (HPDE) cells. (P)RR-expressing HPDE cells show obvious cellular atypia. Whole genome sequencing reveals that aberrant (P)RR expression induces large numbers of point mutations and structural variations at the genome level. A (P)RR-expressing cell population exhibits tumour-forming ability, showing both atypical nuclei characterised by distinctive nuclear bodies and chromosomal abnormalities. (P)RR overexpression upregulates SWItch/Sucrose Non-Fermentable (SWI/SNF)-related, matrix-associated, actin-dependent regulator of chromatin, subfamily a, member 5 (SMARCA5) through a direct molecular interaction, which results in the failure of several genomic stability pathways. These data reveal that aberrant (P)RR expression contributes to the early carcinogenesis of PDAC.


Assuntos
Adenosina Trifosfatases/metabolismo , Carcinoma Ductal Pancreático/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Neoplasias Pancreáticas/metabolismo , Receptores de Superfície Celular/metabolismo , ATPases Vacuolares Próton-Translocadoras/metabolismo , Adenosina Trifosfatases/genética , Animais , Carcinoma Ductal Pancreático/genética , Linhagem Celular , Transformação Celular Neoplásica , Montagem e Desmontagem da Cromatina , Proteínas Cromossômicas não Histona/genética , Regulação Neoplásica da Expressão Gênica , Instabilidade Genômica , Humanos , Masculino , Camundongos , Camundongos Endogâmicos BALB C , Neoplasias Pancreáticas/genética , Regulação para Cima , Sequenciamento Completo do Genoma
8.
Genome Med ; 11(1): 44, 2019 07 24.
Artigo em Inglês | MEDLINE | ID: mdl-31340865

RESUMO

BACKGROUND: Next-generation sequencing has allowed for the identification of different genetic variations, which are known to contribute to diseases. Of these, insertions and deletions are the second most abundant type of variations in the genome, but their biological importance or disease association is not well-studied, especially for deletions of intermediate sizes. METHODS: We identified intermediate-sized deletions from whole-genome sequencing (WGS) data of Japanese samples (n = 174) with a novel deletion calling method which considered multiple samples. These deletions were used to construct a reference panel for use in imputation. Imputation was then conducted using the reference panel and data from 82 publically available Japanese samples with gene expression data. The accuracy of the deletion calling and imputation was examined with Nanopore long-read sequencing technology. We also conducted an expression quantitative trait loci (eQTL) association analysis using the deletions to infer their functional impacts on genes, before characterizing the deletions causal for gene expression level changes. RESULTS: We obtained a set of polymorphic 4378 high-confidence deletions and constructed a reference panel. The deletions were successfully imputed into the Japanese samples with high accuracy (97.3%). The eQTL analysis identified 181 deletions (4.1%) suggested as causal for gene expression level changes. The causal deletion candidates were significantly enriched in promoters, super-enhancers, and transcription elongation chromatin states. Generation of deletions in a cell line with the CRISPR-Cas9 system confirmed that they were indeed causative variants for gene expression change. Furthermore, one of the deletions was observed to affect the gene expression levels of a gene it was not located in. CONCLUSIONS: This paper reports an accurate deletion calling method for genotype imputation at the whole genome level and shows the importance of intermediate-sized deletions in the human population.


Assuntos
Epistasia Genética , Regulação da Expressão Gênica , Genética Populacional , Deleção de Sequência , Sistemas CRISPR-Cas , Biologia Computacional/métodos , Genoma Humano , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Anotação de Sequência Molecular , Filogenia , Locos de Características Quantitativas , Sequências Reguladoras de Ácido Nucleico , Reprodutibilidade dos Testes , Sequenciamento Completo do Genoma
9.
Sci Rep ; 9(1): 15017, 2019 10 22.
Artigo em Inglês | MEDLINE | ID: mdl-31641155

RESUMO

Plasma cell-free DNA (cfDNA) testing plays an increasingly important role in precision medicine for cancer. However, circulating cell-free tumor DNA (ctDNA) is highly diluted by cfDNA from non-cancer cells, complicating ctDNA detection and analysis. To identify low-frequency variants, we developed a program, eVIDENCE, which is a workflow for filtering candidate variants detected by using the ThruPLEX tag-seq (Takara Bio), a commercially-available molecular barcoding kit. We analyzed 27 cfDNA samples from hepatocellular carcinoma patients. Sequencing libraries were constructed and hybridized to our custom panel targeting about 80 genes. An initial variant calling identified 36,500 single nucleotide variants (SNVs) and 9,300 insertions and deletions (indels) across the 27 samples, but the number was much greater than expected when compared with previous cancer genome studies. eVIDENCE was applied to the candidate variants and finally 70 SNVs and 7 indels remained. Of the 77 variants, 49 (63.6%) showed VAF of < 1% (0.20-0.98%). Twenty-five variants were selected in an unbiased manner and all were successfully validated, suggesting that eVIDENCE can identify variants with VAF of ≥ 0.2%. Additionally, this study is the first to detect hepatitis B virus integration sites and genomic rearrangements in the TERT region from cfDNA of HCC patients. We consider that our method can be applied in the examination of cfDNA from other types of malignancies using specific custom gene panels and will contribute to comprehensive ctDNA analysis.


Assuntos
Biomarcadores Tumorais/genética , Carcinoma Hepatocelular/genética , Ácidos Nucleicos Livres/genética , Neoplasias Hepáticas/genética , Taxa de Mutação , Análise de Sequência de DNA/métodos , Software , Carcinoma Hepatocelular/diagnóstico , Humanos , Neoplasias Hepáticas/diagnóstico
10.
Artigo em Inglês | MEDLINE | ID: mdl-30477169

RESUMO

Although it has been a half-century since dioxin-contaminated herbicides were used to defoliate the landscape during the Vietnam War, dioxin contamination "hotspots" still remain in Vietnam. Environmental and health impacts of these hotspots need to be evaluated. Intellectual disability (ID) is one of the diseases found in the children of people exposed to the herbicides. This study aims to identify genetic alterations of a patient whose family lived in a dioxin hotspot. The patient's father had a highly elevated dioxin concentration. He was affected with undiagnosed moderate ID. To analyze de novo mutations and genetic variations, and to identify causal gene(s) for ID, we performed whole genome sequencing (WGS) of the proband and his parents. Two de novo missense mutations were detected, each one in ETS2 and ZNF408 genes, respectively. Compound heterozygosity was identified in CENPF and TTN genes. Existing knowledge on the genes and bioinformatics analyses suggest that EST2, ZNF408, and CENPF might be promising candidates for ID causative genes.


Assuntos
Dioxinas , Exposição Ambiental/efeitos adversos , Herbicidas , Deficiência Intelectual/genética , Biologia Computacional , Família , Feminino , Variação Genética , Humanos , Masculino , Mutação , Vietnã , Sequenciamento Completo do Genoma
11.
Hum Genome Var ; 3: 15067, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27081565

RESUMO

Tuberculosis (TB) is a complex disease, and both genetic and environmental factors contribute to disease progression. A previous genome-wide linkage study in Thailand determined that chromosome 20p13-12.3 may contain risk factors for young-onset disease. The present study aimed to identify novel susceptibility genes for young-onset TB within a 1-Mbp target region adjacent to the top-ranking risk marker in Chr.20p13-12.3. We performed next-generation sequencing (NGS) of the region in 13 young patients from multi-case families in Thailand. We then selected the functionally interesting single-nucleotide polymorphisms as candidates for subsequent analyses. The detected candidates rs13830 and rs1127354 in ITPA showed an association with young (<45 years old) TB patients. However, there was no association in old (⩾45 years old) patients. These findings confirm that stratifying patients based on age of TB onset can be important for identifying genetic risk factors for TB susceptibility. In addition, in silico expression quantitative trait loci analyses indicated that ITPA expression was associated with rs13830 genotype. This is the first study to use NGS resequencing to gain insight into host genetic factors associated with TB and to report a significant association for ITPA with host susceptibility in young-onset TB. The study also demonstrated the effectiveness of NGS in identifying susceptibility genes in common diseases.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA