Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 19 de 19
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Mol Cell Probes ; 66: 101873, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36379302

RESUMEN

Early detection is critical for minimizing mortality from cancer. Plasma cell-free DNA (cfDNA) contains the signatures of tumor DNA, allowing us to quantify the signature and diagnose early-stage tumors. Here, we report a novel tumor fragment quantification method, TOF (Tumor Originated Fragment) for the diagnosis of lung cancer by quantifying and analyzing both the plasma cfDNA methylation patterns and fragmentomic signatures. TOF utilizes the amount of ctDNA predicted from the methylation density information of each cfDNA read mapped on 6243 lung-tumor-specific CpG markers. The 6243 tumor-specific markers were derived from lung tumor tissues by comparing them with corresponding normal tissues and healthy blood from public methylation data. TOF also utilizes two cfDNA fragmentomic signatures: 1) the short fragment ratio, and 2) the 5' end-motif profile. We used 298 plasma samples to analyze cfDNA signatures using enzymatic methyl-sequencing data from 201 lung cancer patients and 97 healthy controls. The TOF score showed 0.98 of the area under the curve in correctly classifying lung cancer from normal samples. The TOF score resolution was high enough to clearly differentiate even the early-stage non-small cell lung cancer patients from the healthy controls. The same was true for small cell lung cancer patients.


Asunto(s)
Carcinoma de Pulmón de Células no Pequeñas , Ácidos Nucleicos Libres de Células , Neoplasias Pulmonares , Humanos , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patología , Carcinoma de Pulmón de Células no Pequeñas/genética , Epigenoma , Detección Precoz del Cáncer , ADN de Neoplasias/genética , Biomarcadores de Tumor/genética , Ácidos Nucleicos Libres de Células/genética , Metilación de ADN/genética
2.
Mol Cells ; 43(1): 86-95, 2020 Jan 31.
Artículo en Inglés | MEDLINE | ID: mdl-31940721

RESUMEN

The red-crowned crane (Grus japonensis) is an endangered, large-bodied crane native to East Asia. It is a traditional symbol of longevity and its long lifespan has been confirmed both in captivity and in the wild. Lifespan in birds is known to be positively correlated with body size and negatively correlated with metabolic rate, though the genetic mechanisms for the red-crowned crane's long lifespan have not previously been investigated. Using whole genome sequencing and comparative evolutionary analyses against the grey-crowned crane and other avian genomes, including the long-lived common ostrich, we identified redcrowned crane candidate genes with known associations with longevity. Among these are positively selected genes in metabolism and immunity pathways (NDUFA5, NDUFA8, NUDT12, SOD3, CTH , RPA1, PHAX, HNMT , HS2ST1 , PPCDC , PSTK CD8B, GP9, IL-9R, and PTPRC). Our analyses provide genetic evidence for low metabolic rate and longevity, accompanied by possible convergent adaptation signatures among distantly related large and long-lived birds. Finally, we identified low genetic diversity in the red-crowned crane, consistent with its listing as an endangered species, and this genome should provide a useful genetic resource for future conservation studies of this rare and iconic species.


Asunto(s)
Proteínas Aviares/genética , Aves/fisiología , Animales , Especies en Peligro de Extinción , Inmunidad/genética , Longevidad/genética , Polimorfismo Genético , Especificidad de la Especie , Transcriptoma , Secuenciación Completa del Genoma
3.
Sci Rep ; 8(1): 5677, 2018 04 04.
Artículo en Inglés | MEDLINE | ID: mdl-29618732

RESUMEN

High-coverage whole-genome sequencing data of a single ethnicity can provide a useful catalogue of population-specific genetic variations, and provides a critical resource that can be used to more accurately identify pathogenic genetic variants. We report a comprehensive analysis of the Korean population, and present the Korean National Standard Reference Variome (KoVariome). As a part of the Korean Personal Genome Project (KPGP), we constructed the KoVariome database using 5.5 terabases of whole genome sequence data from 50 healthy Korean individuals in order to characterize the benign ethnicity-relevant genetic variation present in the Korean population. In total, KoVariome includes 12.7M single-nucleotide variants (SNVs), 1.7M short insertions and deletions (indels), 4K structural variations (SVs), and 3.6K copy number variations (CNVs). Among them, 2.4M (19%) SNVs and 0.4M (24%) indels were identified as novel. We also discovered selective enrichment of 3.8M SNVs and 0.5M indels in Korean individuals, which were used to filter out 1,271 coding-SNVs not originally removed from the 1,000 Genomes Project when prioritizing disease-causing variants. KoVariome health records were used to identify novel disease-causing variants in the Korean population, demonstrating the value of high-quality ethnic variation databases for the accurate interpretation of individual genomes and the precise characterization of genetic variations.


Asunto(s)
Variaciones en el Número de Copia de ADN , Enfermedad/genética , Genética de Población , Genoma Humano , Mutación INDEL , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma/métodos , Bases de Datos Genéticas , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Mutación , Estándares de Referencia , República de Corea , Análisis de Secuencia de ADN
5.
PLoS One ; 12(7): e0180418, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28678835

RESUMEN

Myotis rufoniger is a vesper bat in the genus Myotis. Here we report the whole genome sequence and analyses of the M. rufoniger. We generated 124 Gb of short-read DNA sequences with an estimated genome size of 1.88 Gb at a sequencing depth of 66× fold. The sequences were aligned to M. brandtii bat reference genome at a mapping rate of 96.50% covering 95.71% coding sequence region at 10× coverage. The divergence time of Myotis bat family is estimated to be 11.5 million years, and the divergence time between M. rufoniger and its closest species M. davidii is estimated to be 10.4 million years. We found 1,239 function-altering M. rufoniger specific amino acid sequences from 929 genes compared to other Myotis bat and mammalian genomes. The functional enrichment test of the 929 genes detected amino acid changes in melanin associated DCT, SLC45A2, TYRP1, and OCA2 genes possibly responsible for the M. rufoniger's red fur color and a general coloration in Myotis. N6AMT1 gene, associated with arsenic resistance, showed a high degree of function alteration in M. rufoniger. We further confirmed that the M. rufoniger also has bat-specific sequences within FSHB, GHR, IGF1R, TP53, MDM2, SLC45A2, RGS7BP, RHO, OPN1SW, and CNGB3 genes that have already been published to be related to bat's reproduction, lifespan, flight, low vision, and echolocation. Additionally, our demographic history analysis found that the effective population size of Myotis clade has been consistently decreasing since ~30k years ago. M. rufoniger's effective population size was the lowest in Myotis bats, confirming its relatively low genetic diversity.


Asunto(s)
Quirópteros/genética , Genoma , Sustitución de Aminoácidos , Animales , Quirópteros/clasificación , Variación Genética , Mutación , Filogenia
6.
Genome Biol ; 17(1): 211, 2016 10 11.
Artículo en Inglés | MEDLINE | ID: mdl-27802837

RESUMEN

BACKGROUND: There are three main dietary groups in mammals: carnivores, omnivores, and herbivores. Currently, there is limited comparative genomics insight into the evolution of dietary specializations in mammals. Due to recent advances in sequencing technologies, we were able to perform in-depth whole genome analyses of representatives of these three dietary groups. RESULTS: We investigated the evolution of carnivory by comparing 18 representative genomes from across Mammalia with carnivorous, omnivorous, and herbivorous dietary specializations, focusing on Felidae (domestic cat, tiger, lion, cheetah, and leopard), Hominidae, and Bovidae genomes. We generated a new high-quality leopard genome assembly, as well as two wild Amur leopard whole genomes. In addition to a clear contraction in gene families for starch and sucrose metabolism, the carnivore genomes showed evidence of shared evolutionary adaptations in genes associated with diet, muscle strength, agility, and other traits responsible for successful hunting and meat consumption. Additionally, an analysis of highly conserved regions at the family level revealed molecular signatures of dietary adaptation in each of Felidae, Hominidae, and Bovidae. However, unlike carnivores, omnivores and herbivores showed fewer shared adaptive signatures, indicating that carnivores are under strong selective pressure related to diet. Finally, felids showed recent reductions in genetic diversity associated with decreased population sizes, which may be due to the inflexible nature of their strict diet, highlighting their vulnerability and critical conservation status. CONCLUSIONS: Our study provides a large-scale family level comparative genomic analysis to address genomic changes associated with dietary specialization. Our genomic analyses also provide useful resources for diet-related genetic and health research.


Asunto(s)
Variación Genética , Genoma , Panthera/genética , Análisis de Secuencia de ADN , Adaptación Fisiológica/genética , Animales , Evolución Biológica , Gatos , Herbivoria/genética , Mamíferos/genética , Anotación de Secuencia Molecular , Filogenia
7.
Nat Commun ; 7: 13637, 2016 11 24.
Artículo en Inglés | MEDLINE | ID: mdl-27882922

RESUMEN

Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity.


Asunto(s)
Pueblo Asiatico/genética , Genoma Humano/genética , Mapeo Cromosómico , Consenso , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , República de Corea , Análisis de Secuencia de ADN
8.
Genome Biol ; 16: 215, 2015 Oct 21.
Artículo en Inglés | MEDLINE | ID: mdl-26486310

RESUMEN

BACKGROUND: The cinereous vulture, Aegypius monachus, is the largest bird of prey and plays a key role in the ecosystem by removing carcasses, thus preventing the spread of diseases. Its feeding habits force it to cope with constant exposure to pathogens, making this species an interesting target for discovering functionally selected genetic variants. Furthermore, the presence of two independently evolved vulture groups, Old World and New World vultures, provides a natural experiment in which to investigate convergent evolution due to obligate scavenging. RESULTS: We sequenced the genome of a cinereous vulture, and mapped it to the bald eagle reference genome, a close relative with a divergence time of 18 million years. By comparing the cinereous vulture to other avian genomes, we find positively selected genetic variations in this species associated with respiration, likely linked to their ability of immune defense responses and gastric acid secretion, consistent with their ability to digest carcasses. Comparisons between the Old World and New World vulture groups suggest convergent gene evolution. We assemble the cinereous vulture blood transcriptome from a second individual, and annotate genes. Finally, we infer the demographic history of the cinereous vulture which shows marked fluctuations in effective population size during the late Pleistocene. CONCLUSIONS: We present the first genome and transcriptome analyses of the cinereous vulture compared to other avian genomes and transcriptomes, revealing genetic signatures of dietary and environmental adaptations accompanied by possible convergent evolution between the Old World and New World vultures.


Asunto(s)
Evolución Molecular , Falconiformes/genética , Adaptación Biológica/genética , Animales , Fenómenos Fisiológicos del Sistema Digestivo/genética , Variación Genética , Genoma , Inmunidad/genética , Análisis de Secuencia de ADN , Transcriptoma
9.
BMC Genomics ; 15 Suppl 9: S4, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25521865

RESUMEN

BACKGROUND: The horse (Equus ferus caballus) is one of the earliest domesticated species and has played an important role in the development of human societies over the past 5,000 years. In this study, we characterized the genome of the Marwari horse, a rare breed with unique phenotypic characteristics, including inwardly turned ear tips. It is thought to have originated from the crossbreeding of local Indian ponies with Arabian horses beginning in the 12th century. RESULTS: We generated 101 Gb (~30 × coverage) of whole genome sequences from a Marwari horse using the Illumina HiSeq2000 sequencer. The sequences were mapped to the horse reference genome at a mapping rate of ~98% and with ~95% of the genome having at least 10 × coverage. A total of 5.9 million single nucleotide variations, 0.6 million small insertions or deletions, and 2,569 copy number variation blocks were identified. We confirmed a strong Arabian and Mongolian component in the Marwari genome. Novel variants from the Marwari sequences were annotated, and were found to be enriched in olfactory functions. Additionally, we suggest a potential functional genetic variant in the TSHZ1 gene (p.Ala344>Val) associated with the inward-turning ear tip shape of the Marwari horses. CONCLUSIONS: Here, we present an analysis of the Marwari horse genome. This is the first genomic data for an Asian breed, and is an invaluable resource for future studies of genetic variation associated with phenotypes and diseases in horses.


Asunto(s)
Genoma/genética , Genómica , Caballos/genética , Análisis de Secuencia de ADN , Secuencia de Aminoácidos , Animales , Evolución Molecular , Variación Genética , Genotipo , Humanos , Hibridación Genética , Masculino , Datos de Secuencia Molecular , Fenotipo , Selección Genética , Especificidad de la Especie
10.
BMC Genomics ; 15: 477, 2014 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-24929792

RESUMEN

BACKGROUND: In contrast with wild species, cultivated crop genomes consist of reshuffled recombination blocks, which occurred by crossing and selection processes. Accordingly, recombination block-based genomics analysis can be an effective approach for the screening of target loci for agricultural traits. RESULTS: We propose the variation block method, which is a three-step process for recombination block detection and comparison. The first step is to detect variations by comparing the short-read DNA sequences of the cultivar to the reference genome of the target crop. Next, sequence blocks with variation patterns are examined and defined. The boundaries between the variation-containing sequence blocks are regarded as recombination sites. All the assumed recombination sites in the cultivar set are used to split the genomes, and the resulting sequence regions are termed variation blocks. Finally, the genomes are compared using the variation blocks. The variation block method identified recurring recombination blocks accurately and successfully represented block-level diversities in the publicly available genomes of 31 soybean and 23 rice accessions. The practicality of this approach was demonstrated by the identification of a putative locus determining soybean hilum color. CONCLUSIONS: We suggest that the variation block method is an efficient genomics method for the recombination block-level comparison of crop genomes. We expect that this method will facilitate the development of crop genomics by bringing genomics technologies to the field of crop breeding.


Asunto(s)
Productos Agrícolas/genética , Genoma de Planta , Glycine max/genética , Secuencia de Bases , Mapeo Cromosómico , Proteínas de Plantas/genética , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , Análisis de Secuencia de ADN
11.
Genome Biol ; 15(4): R55, 2014 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-24690483

RESUMEN

BACKGROUND: Stomach cancer is the third deadliest among all cancers worldwide. Although incidence of the intestinal-type gastric cancer has decreased, the incidence of diffuse-type is still increasing and its progression is notoriously aggressive. There is insufficient information on genome variations of diffuse-type gastric cancer because its cells are usually mixed with normal cells, and this low cellularity has made it difficult to analyze the genome. RESULTS: We analyze whole genomes and corresponding exomes of diffuse-type gastric cancer, using matched tumor and normal samples from 14 diffuse-type and five intestinal-type gastric cancer patients. Somatic variations found in the diffuse-type gastric cancer are compared to those of the intestinal-type and to previously reported variants. We determine the average exonic somatic mutation rate of the two types. We find associated candidate driver genes, and identify seven novel somatic mutations in CDH1, which is a well-known gastric cancer-associated gene. Three-dimensional structure analysis of the mutated E-cadherin protein suggests that these new somatic mutations could cause significant functional perturbations of critical calcium-binding sites in the EC1-2 junction. Chromosomal instability analysis shows that the MDM2 gene is amplified. After thorough structural analysis, a novel fusion gene TSC2-RNF216 is identified, which may simultaneously disrupt tumor-suppressive pathways and activate tumorigenesis. CONCLUSIONS: We report the genomic profile of diffuse-type gastric cancers including new somatic variations, a novel fusion gene, and amplification and deletion of certain chromosomal regions that contain oncogenes and tumor suppressors.


Asunto(s)
Genoma Humano , Neoplasias Gástricas/genética , Adulto , Secuencia de Aminoácidos , Antígenos CD , Cadherinas/química , Cadherinas/genética , Cadherinas/metabolismo , Estudios de Casos y Controles , Femenino , Amplificación de Genes , Fusión Génica , Humanos , Masculino , Datos de Secuencia Molecular , Mutación , Proteínas Proto-Oncogénicas c-mdm2/genética , Neoplasias Gástricas/diagnóstico , Proteína 2 del Complejo de la Esclerosis Tuberosa , Proteínas Supresoras de Tumor/química , Proteínas Supresoras de Tumor/genética , Ubiquitina-Proteína Ligasas/química , Ubiquitina-Proteína Ligasas/genética
12.
Nat Genet ; 46(1): 88-92, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24270359

RESUMEN

The shift from terrestrial to aquatic life by whales was a substantial evolutionary event. Here we report the whole-genome sequencing and de novo assembly of the minke whale genome, as well as the whole-genome sequences of three minke whales, a fin whale, a bottlenose dolphin and a finless porpoise. Our comparative genomic analysis identified an expansion in the whale lineage of gene families associated with stress-responsive proteins and anaerobic metabolism, whereas gene families related to body hair and sensory receptors were contracted. Our analysis also identified whale-specific mutations in genes encoding antioxidants and enzymes controlling blood pressure and salt concentration. Overall the whale-genome sequences exhibited distinct features that are associated with the physiological and morphological changes needed for life in an aquatic environment, marked by resistance to physiological stresses caused by a lack of oxygen, increased amounts of reactive oxygen species and high salt levels.


Asunto(s)
Adaptación Fisiológica/genética , Genoma , Ballena Minke/genética , Animales , Presión Sanguínea/genética , Glutatión/metabolismo , Haptoglobinas/genética , Masculino , Ballena Minke/metabolismo , Familia de Multigenes , Mutación , Océano Pacífico , Filogenia , Densidad de Población , Tolerancia a la Sal , Estrés Fisiológico
13.
Sci Rep ; 3: 2998, 2013 Oct 21.
Artículo en Inglés | MEDLINE | ID: mdl-24141358

RESUMEN

Cloning is a process that produces genetically identical organisms. However, the genomic degree of genetic resemblance in clones needs to be determined. In this report, the genomes of a cloned dog and its donor were compared. Compared with a human monozygotic twin, the genome of the cloned dog showed little difference from the genome of the nuclear donor dog in terms of single nucleotide variations, chromosomal instability, and telomere lengths. These findings suggest that cloning by somatic cell nuclear transfer produced an almost identical genome. The whole genome sequence data of donor and cloned dogs can provide a resource for further investigations on epigenetic contributions in phenotypic differences.


Asunto(s)
Clonación de Organismos/veterinaria , Genoma , Animales , Perros , Inestabilidad Genómica , Masculino , Mutación , Análisis de Secuencia de ADN , Homeostasis del Telómero , Gemelos Monocigóticos
14.
Nat Commun ; 4: 2433, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-24045858

RESUMEN

Tigers and their close relatives (Panthera) are some of the world's most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats' hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species.


Asunto(s)
Genoma/genética , Leones/genética , Panthera/genética , Tigres/genética , Adaptación Fisiológica/genética , Secuencia de Aminoácidos , Animales , Variación Genética , Datos de Secuencia Molecular , Mutación/genética , Densidad de Población , Sintenía/genética
15.
Genome Res ; 23(7): 1109-17, 2013 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-23737375

RESUMEN

Microsatellite instability (MSI) is a critical mechanism that drives genetic aberrations in cancer. To identify the entire MS mutation, we performed the first comprehensive genome- and transcriptome-wide analyses of mutations associated with MSI in Korean gastric cancer cell lines and primary tissues. We identified 18,377 MS mutations of five or more repeat nucleotides in coding sequences and untranslated regions of genes, and discovered 139 individual genes whose expression was down-regulated in association with UTR MS mutation. In addition, we found that 90.5% of MS mutations with deletions in gene regions occurred in UTRs. This analysis emphasizes the genetic diversity of MSI-H gastric tumors and provides clues to the mechanistic basis of instability in microsatellite unstable gastric cancers.


Asunto(s)
Pueblo Asiatico/genética , Estudio de Asociación del Genoma Completo , Inestabilidad de Microsatélites , Mutación , Neoplasias Gástricas/genética , Transcriptoma , Línea Celular Tumoral , Mutación del Sistema de Lectura , Regulación Neoplásica de la Expresión Génica , Frecuencia de los Genes , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Repeticiones de Microsatélite , Procesamiento Postranscripcional del ARN , Estabilidad del ARN , República de Corea , Eliminación de Secuencia , Regiones no Traducidas
16.
BMC Genomics ; 13: 473, 2012 Sep 12.
Artículo en Inglés | MEDLINE | ID: mdl-22971240

RESUMEN

BACKGROUND: Thoroughbred horses are the most expensive domestic animals, and their running ability and knowledge about their muscle-related diseases are important in animal genetics. While the horse reference genome is available, there has been no large-scale functional annotation of the genome using expressed genes derived from transcriptomes. RESULTS: We present a large-scale analysis of whole transcriptome data. We sequenced the whole mRNA from the blood and muscle tissues of six thoroughbred horses before and after exercise. By comparing current genome annotations, we identified 32,361 unigene clusters spanning 51.83 Mb that contained 11,933 (36.87%) annotated genes. More than 60% (20,428) of the unigene clusters did not match any current equine gene model. We also identified 189,973 single nucleotide variations (SNVs) from the sequences aligned against the horse reference genome. Most SNVs (171,558 SNVs; 90.31%) were novel when compared with over 1.1 million equine SNPs from two SNP databases. Using differential expression analysis, we further identified a number of exercise-regulated genes: 62 up-regulated and 80 down-regulated genes in the blood, and 878 up-regulated and 285 down-regulated genes in the muscle. Six of 28 previously-known exercise-related genes were over-expressed in the muscle after exercise. Among the differentially expressed genes, there were 91 transcription factor-encoding genes, which included 56 functionally unknown transcription factor candidates that are probably associated with an early regulatory exercise mechanism. In addition, we found interesting RNA expression patterns where different alternative splicing forms of the same gene showed reversed expressions before and after exercising. CONCLUSION: The first sequencing-based horse transcriptome data, extensive analyses results, deferentially expressed genes before and after exercise, and candidate genes that are related to the exercise are provided in this study.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Caballos/genética , Caballos/fisiología , Condicionamiento Físico Animal/fisiología , ARN/genética , Animales
17.
BMC Genomics ; 10 Suppl 3: S32, 2009 Dec 03.
Artículo en Inglés | MEDLINE | ID: mdl-19958497

RESUMEN

BACKGROUND: Parkinson's disease (PD) is one of the most common neurodegenerative disorders, clinically characterized by impaired motor function. Since the etiology of PD is diverse and complex, many researchers have created PD-related research resources. However, resources for brain and PD studies are still lacking. Therefore, we have constructed a database of PD-related gene and genetic variations using the substantia nigra (SN) in PD and normal tissues. In addition, we integrated PD-related information from several resources. RESULTS: We collected the 6,130 SN expressed sequenced tags (ESTs) from brain SN normal tissues and PD patients SN tissues using full-cDNA library and normalized cDNA library construction methods from our previous study. The SN ESTs were clustered in 2,951 unigene clusters and assigned in 2,678 genes. We then found up-regulated 57 genes and down-regulated 48 genes by comparing normal and PD SN ESTs frequencies with over 0.9 cut-off probability of differential expression based on the Audic and Claverie method. In addition, we integrated disease-related information from public resources. To examine the characteristics of these PD-related genes, we analyzed alternative splicing events, single nucleotide polymorphism (SNP) markers located in the gene regions, repeat elements, gene regulation elements, and pathways and protein-protein interaction networks. CONCLUSION: We constructed the PDbase database to capture the PD-related gene, genetic variation, and functional elements. This database contains 2,698 PD-related genes through ESTs discovered from human normal and PD patients SN tissues, and through integrating several public resources. PDbase provides the mitochondrion proteins, microRNA gene regulation elements, single nucleotide polymorphisms (SNPs) markers within PD-related gene structures, repeat elements, and pathways and networks with protein-protein interaction information. The PDbase information can aid in understanding the causation of PD. It is available at http://bioportal.kobic.re.kr/PDbase/. Supplementary data is available at http://bioportal.kobic.re.kr/PDbase/suppl.jsp.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Etiquetas de Secuencia Expresada , Variación Genética , Enfermedad de Parkinson/genética , Sustancia Negra/química , ADN Complementario/química , ADN Complementario/genética , Regulación hacia Abajo , Biblioteca de Genes , Humanos , Internet , Regulación hacia Arriba
18.
BMC Genomics ; 10 Suppl 3: S35, 2009 Dec 03.
Artículo en Inglés | MEDLINE | ID: mdl-19958500

RESUMEN

BACKGROUND: A disease-causing mutation refers to a heritable genetic change that is associated with a specific phenotype (disease). The detection of a mutation from a patient's sample is critical for the diagnosis, treatment, and prognosis of the disease. There are numerous databases and applications with which to archive mutation data. However, none of them have been implemented with any automated bioinformatics tools for mutation detection and analysis starting from raw data materials from patients. We present a Locus Specific mutation DB (LSDB) construction system that supports both mutation detection and deposition in one package. RESULTS: COMUS (Clinician-Oriented locus specific MUtation detection and deposition System) is a mutation detection and deposition system for developing specific LSDBs. COMUS contains 1) a DNA sequence mutation analysis method for clinicians' mutation data identification and deposition and 2) a curation system for variation detection from clinicians' input data. To embody the COMUS system and to validate its clinical utility, we have chosen the disease hemophilia as a test database. A set of data files from bench experiments and clinical information from hemophilia patients were tested on the LSDB, KoHemGene http://www.kohemgene.org, which has proven to be a clinician-friendly interface for mutation detection and deposition. CONCLUSION: COMUS is a bioinformatics system for detecting and depositing new mutations from patient DNA with a clinician-friendly interface. LSDBs made using COMUS will promote the clinical utility of LSDBs. COMUS is available at http://www.comus.info.


Asunto(s)
Bases de Datos Genéticas , Sitios Genéticos , Hemofilia A/genética , Mutación , Diseño de Software , Secuencia de Bases , Biología Computacional , Humanos , Internet , Datos de Secuencia Molecular , Alineación de Secuencia
19.
Genome Res ; 19(9): 1622-9, 2009 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-19470904

RESUMEN

We present the first Korean individual genome sequence (SJK) and analysis results. The diploid genome of a Korean male was sequenced to 28.95-fold redundancy using the Illumina paired-end sequencing method. SJK covered 99.9% of the NCBI human reference genome. We identified 420,083 novel single nucleotide polymorphisms (SNPs) that are not in the dbSNP database. Despite a close similarity, significant differences were observed between the Chinese genome (YH), the only other Asian genome available, and SJK: (1) 39.87% (1,371,239 out of 3,439,107) SNPs were SJK-specific (49.51% against Venter's, 46.94% against Watson's, and 44.17% against the Yoruba genomes); (2) 99.5% (22,495 out of 22,605) of short indels (< 4 bp) discovered on the same loci had the same size and type as YH; and (3) 11.3% (331 out of 2920) deletion structural variants were SJK-specific. Even after attempting to map unmapped reads of SJK to unanchored NCBI scaffolds, HGSV, and available personal genomes, there were still 5.77% SJK reads that could not be mapped. All these findings indicate that the overall genetic differences among individuals from closely related ethnic groups may be significant. Hence, constructing reference genomes for minor socio-ethnic groups will be useful for massive individual genome sequencing.


Asunto(s)
Pueblo Asiatico/genética , Genoma Humano/genética , Análisis de Secuencia de ADN/métodos , Biología Computacional/métodos , Bases de Datos Genéticas , Femenino , Genómica/métodos , Humanos , Mutación INDEL , Corea (Geográfico) , Masculino , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple/genética , Estándares de Referencia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...