Búsqueda | Portal de Búsqueda de la BVS Colombia

KGVDB: a population-based genomic map of CNVs tagged by SNPs in Koreans.

Moon, Sanghoon; Jung, Kwang Su; Kim, Young Jin; Hwang, Mi Yeong; Han, Kyungsook; Lee, Jong-Young; Park, Kiejung; Kim, Bong-Jo.

Bioinformatics ; 29(11): 1481-3, 2013 Jun 01.

Artículo en Inglés | MEDLINE | ID: mdl-23626002

RESUMEN

SUMMARY: Despite a growing interest in a correlation between copy number variations (CNVs) and flanking single nucleotide polymorphisms, few databases provide such information. In particular, most information on CNV available so far was obtained in Caucasian and Yoruba populations, and little is known about CNV in Asian populations. This article presents a database that provides CNV regions tagged by single nucleotide polymorphisms in about 4700 Koreans, which were detected under strict quality control, manually curated and experimentally validated. AVAILABILITY: KGVDB is freely available for non-commercial use at http://biomi.cdc.go.kr/KGVDB. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Pueblo Asiatico/genética , Variaciones en el Número de Copia de ADN , Bases de Datos de Ácidos Nucleicos , Polimorfismo de Nucleótido Simple , Mapeo Cromosómico , Genoma Humano , Estudio de Asociación del Genoma Completo , Genómica/métodos , Humanos , Corea (Geográfico)

Genovar: a detection and visualization tool for genomic variants.

Jung, Kwang Su; Moon, Sanghoon; Kim, Young Jin; Kim, Bong-Jo; Park, Kiejung.

BMC Bioinformatics ; 13 Suppl 7: S12, 2012 May 08.

Artículo en Inglés | MEDLINE | ID: mdl-22594998

RESUMEN

BACKGROUND: Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals. RESULTS: A JAVA-based stand-alone program called Genovar was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files. CONCLUSIONS: Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. AVAILABILITY: http://genovar.sourceforge.net/.

Asunto(s)

Variaciones en el Número de Copia de ADN , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Programas Informáticos , Cromosomas Humanos , Hibridación Genómica Comparativa , Genómica , Humanos

QSE: A new 3-D solvent exposure measure for the analysis of protein structure.

Li, Peipei; Pok, Gouchol; Jung, Kwang Su; Shon, Ho Sun; Ryu, Keun Ho.

Proteomics ; 11(19): 3793-801, 2011 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-21761564

RESUMEN

Solvent exposure of amino acids measures how deep residues are buried in tertiary structure of proteins, and hence it provides important information for analyzing and predicting protein structure and functions. Existing methods of calculating solvent exposure such as accessible surface area, relative accessible surface area, residue depth, contact number, and half-sphere exposure still have some limitations. In this article, we propose a novel solvent exposure measure named quadrant-sphere exposure (QSE) based on eight quadrants derived from spherical neighborhood. The proposed measure forms a microenvironment around Cα atom as a sphere with a radius of 13 Å, and subdivides it into eight quadrants according to a rectangular coordinate system constructed based on geometric relationships of backbone atoms. The number of neighboring Cα atoms whose labels are the same is given as the QSE value of the center Cα atom at hand. As evidenced by histograms that show very different distributions for different structure configurations, the proposed measure captures local properties that are characteristic for a residue's eight-directional neighborhood within a sphere. Compared with other measures, QSE provides a different view of solvent exposure, and provides information that is specific for different tertiary structure. As the experimental results show, QSE measure can potentially be used in protein structure analysis and predictions.

Asunto(s)

Proteínas/química , Análisis de Secuencia de Proteína/métodos , Solventes/química , Secuencia de Aminoácidos , Aminoácidos/química , Bases de Datos de Proteínas , Datos de Secuencia Molecular , Conformación Proteica

KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing.

Jung, Kwang Su; Hong, Kyung-Won; Jo, Hyun Youn; Choi, Jongpill; Ban, Hyo-Jeong; Cho, Seong Beom; Chung, Myungguen.

Database (Oxford) ; 20202020 01 01.

Artículo en Inglés | MEDLINE | ID: mdl-32133509

RESUMEN

Since 2012, the Center for Genome Science of the Korea National Institute of Health (KNIH) has been sequencing complete genomes of 1722 Korean individuals. As a result, more than 32 million variant sites have been identified, and a large proportion of the variant sites have been detected for the first time. In this article, we describe the Korean Reference Genome Database (KRGDB) and its genome browser. The current version of our database contains both single nucleotide and short insertion/deletion variants. The DNA samples were obtained from four different origins and sequenced in different sequencing depths (10× coverage of 63 individuals, 20× coverage of 194 individuals, combined 10× and 20× coverage of 135 individuals, 30× coverage of 230 individuals and 30× coverage of 1100 individuals). The major features of the KRGDB are that it contains information on the Korean genomic variant frequency, frequency difference between the Korean and other populations and the variant functional annotation (such as regulatory elements in ENCODE regions and coding variant functions) of the variant sites. Additionally, we performed the genome-wide association study (GWAS) between Korean genome variant sites for the 30×230 individuals and three major common diseases (diabetes, hypertension and metabolic syndrome). The association results are displayed on our browser. The KRGDB uses the MySQL database and Apache-Tomcat web server adopted with Java Server Page (JSP) and is freely available at http://coda.nih.go.kr/coda/KRGDB/index.jsp. Availability: http://coda.nih.go.kr/coda/KRGDB/index.jsp.

Asunto(s)

Biología Computacional/métodos , Bases de Datos Genéticas , Genoma Humano/genética , Estudio de Asociación del Genoma Completo/métodos , Genómica/métodos , Secuenciación Completa del Genoma/métodos , Pueblo Asiatico/genética , Minería de Datos/métodos , Femenino , Humanos , Mutación INDEL , Internet , Masculino , Polimorfismo de Nucleótido Simple , República de Corea

KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing.

Jung, Kwang Su; Hong, Kyung-Won; Jo, Hyun Youn; Choi, Jongpill; Ban, Hyo-Jeong; Cho, Seong Beom; Chung, Myungguen.

Database (Oxford) ; 20202020 01 01.

Artículo en Inglés | MEDLINE | ID: mdl-32348452

A 9-state hidden Markov model using protein secondary structure information for protein fold recognition.

Lee, Sun Young; Lee, Jong Yun; Jung, Kwang Su; Ryu, Keun Ho.

Comput Biol Med ; 39(6): 527-34, 2009 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-19394594

RESUMEN

In protein fold recognition, the main disadvantage of hidden Markov models (HMMs) is the employment of large-scale model architectures which require large data sets and high computational resources for training. Also, HMMs must consider sequential information about secondary structures of proteins, to improve prediction performance and reduce model parameters. Therefore, we propose a novel method for protein fold recognition based on a hidden Markov model, called a 9-state HMM. The method can (i) reduce the number of states using secondary structure information about proteins for each fold and (ii) recognize protein folds more accurately than other HMMs.

Asunto(s)

Cadenas de Markov , Pliegue de Proteína , Estructura Secundaria de Proteína , Modelos Moleculares

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA