RESUMO
MOTIVATION: The use of dense single nucleotide polymorphism (SNP) data in genetic linkage analysis of large pedigrees is impeded by significant technical, methodological and computational challenges. Here we describe Superlink-Online SNP, a new powerful online system that streamlines the linkage analysis of SNP data. It features a fully integrated flexible processing workflow comprising both well-known and novel data analysis tools, including SNP clustering, erroneous data filtering, exact and approximate LOD calculations and maximum-likelihood haplotyping. The system draws its power from thousands of CPUs, performing data analysis tasks orders of magnitude faster than a single computer. By providing an intuitive interface to sophisticated state-of-the-art analysis tools coupled with high computing capacity, Superlink-Online SNP helps geneticists unleash the potential of SNP data for detecting disease genes. RESULTS: Computations performed by Superlink-Online SNP are automatically parallelized using novel paradigms, and executed on unlimited number of private or public CPUs. One novel service is large-scale approximate Markov Chain-Monte Carlo (MCMC) analysis. The accuracy of the results is reliably estimated by running the same computation on multiple CPUs and evaluating the Gelman-Rubin Score to set aside unreliable results. Another service within the workflow is a novel parallelized exact algorithm for inferring maximum-likelihood haplotyping. The reported system enables genetic analyses that were previously infeasible. We demonstrate the system capabilities through a study of a large complex pedigree affected with metabolic syndrome. AVAILABILITY: Superlink-Online SNP is freely available for researchers at http://cbl-hap.cs.technion.ac.il/superlink-snp. The system source code can also be downloaded from the system website. CONTACT: omerw@cs.technion.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Ligação Genética , Linhagem , Polimorfismo de Nucleotídeo Único , Software , Algoritmos , Análise por Conglomerados , Haplótipos , Humanos , Cadeias de Markov , Método de Monte CarloRESUMO
We have recently described a novel autosomal recessive disorder, lethal congenital contractural syndrome type 2 (LCCS2) (OMIM 607598), in a large Israeli Bedouin kindred. The phenotype, which is lethal in the neonatal period, is distinguished by the presence of a markedly distended urinary bladder. Association of LCCS2 to the known loci associated with arthogryposis was excluded. In the present study, we set out to determine the genetic locus harboring the gene defective in this disease. We performed genome-wide linkage analysis, demonstrating linkage to a approximately 6 cM (corresponding to approximately 7.2 Mb) homozygosity region on chromosome 12q13 between markers D12S1604 and D12S83. Based on recombination events, the interval harboring the disease-associated locus was further narrowed to a region spanning approximately 6 cM ( approximately 6.4 Mb) between D12S325 and D12S1072. Linkage of LCCS2 to that locus was established, with two significant maximum peaks at markers D12S1604 (Z(max) = 10.56 at theta = 0.01) and D12S1700 (Z(max) = 9.23 at theta = 0.00).