Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 40
Filter
1.
PLoS Genet ; 15(5): e1008124, 2019 05.
Article in English | MEDLINE | ID: mdl-31071088

ABSTRACT

The rapid digitization of genealogical and medical records enables the assembly of extremely large pedigree records spanning millions of individuals and trillions of pairs of relatives. Such pedigrees provide the opportunity to investigate the sociological and epidemiological history of human populations in scales much larger than previously possible. Linear mixed models (LMMs) are routinely used to analyze extremely large animal and plant pedigrees for the purposes of selective breeding. However, LMMs have not been previously applied to analyze population-scale human family trees. Here, we present Sparse Cholesky factorIzation LMM (Sci-LMM), a modeling framework for studying population-scale family trees that combines techniques from the animal and plant breeding literature and from human genetics literature. The proposed framework can construct a matrix of relationships between trillions of pairs of individuals and fit the corresponding LMM in several hours. We demonstrate the capabilities of Sci-LMM via simulation studies and by estimating the heritability of longevity and of reproductive fitness (quantified via number of children) in a large pedigree spanning millions of individuals and over five centuries of human history. Sci-LMM provides a unified framework for investigating the epidemiological history of human populations via genealogical records.


Subject(s)
Genealogy and Heraldry , Genetics, Population , Longevity/genetics , Models, Genetic , Pedigree , Animals , Computer Simulation , Female , Genetic Fitness , Humans , Linear Models , Male , Plants/genetics
2.
J Med Genet ; 56(7): 481-490, 2019 07.
Article in English | MEDLINE | ID: mdl-30894412

ABSTRACT

BACKGROUND: Mapping the genetic component of molecular mechanisms responsible for the reduced penetrance (RP) of rare disorders constitutes one of the most challenging problems in human genetics. Heritable pulmonary arterial hypertension (PAH) is one such disorder characterised by rare mutations mostly occurring in the bone morphogenetic protein receptor type 2 (BMPR2) gene and a wide heterogeneity of penetrance modifier mechanisms. Here, we analyse 32 genotyped individuals from a large Iberian family of 65 members, including 22 carriers of the pathogenic BMPR2 mutation c.1472G>A (p.Arg491Gln), 8 of them diagnosed with PAH by right-heart catheterisation, leading to an RP rate of 36.4%. METHODS: We performed a linkage analysis on the genotyping data to search for genetic modifiers of penetrance. Using functional genomics data, we characterised the candidate region identified by linkage analysis. We also predicted the haplotype segregation within the family. RESULTS: We identified a candidate chromosome region in 2q24.3, 38 Mb upstream from BMPR2, with significant linkage (LOD=4.09) under a PAH susceptibility model. This region contains common variants associated with vascular aetiology and shows functional evidence that the putative genetic modifier is located in the upstream distal promoter of the fidgetin (FIGN) gene. CONCLUSION: Our results suggest that the genetic modifier acts through FIGN transcriptional regulation, whose expression variability would contribute to modulating heritable PAH. This finding may help to advance our understanding of RP in PAH across families sharing the p.Arg491Gln pathogenic mutation in BMPR2.


Subject(s)
ATPases Associated with Diverse Cellular Activities/genetics , Familial Primary Pulmonary Hypertension/diagnosis , Familial Primary Pulmonary Hypertension/genetics , Genetic Linkage , Genetic Predisposition to Disease , Microtubule-Associated Proteins/genetics , Penetrance , Alleles , Amino Acid Substitution , Blood Pressure , Chromosomes, Human, Pair 2 , Family , Genetic Association Studies , Genome-Wide Association Study , Genotype , Hemodynamics , Heterozygote , Humans , Linkage Disequilibrium , Mutation , Pedigree , Phenotype , Polymorphism, Single Nucleotide
3.
Orphanet J Rare Dis ; 13(1): 41, 2018 03 15.
Article in English | MEDLINE | ID: mdl-29544503

ABSTRACT

BACKGROUND: Thoracic and abdominal aortic aneurysms and dissection often develop in hypertensive elderly patients. At higher risk are smokers and those who have a family history of aortic aneurysms. In most affected families, the aortic aneurysms and dissection is inherited in an autosomal dominant manner with decreased penetrance and variable expressivity. Mutations at two chromosomal loci, TAA1 at 11q23 and the TAA2 at 5q13-14, and eight genes, MYLK, MYH11, TGFBR2, TGFBR1, ACTA2, SMAD3, TGFB2, and MAT2A, have been identified as being responsible for the disease in 23% of affected families. RESULTS: Herein, we inform on the clinical, genetic and pathological characteristics of nine living and deceased members of a large consanguineous Arab family with thoracic aortic aneurysm and dissection who carry a missense mutation c.4471G > T (Ala1491Ser), in exon 27 of MYLK gene. We show a reduced kinase activity of the Ala1491Ser protein compared to wildtype protein. This mutation is expressed as aortic aneurysm and dissection in one of two distinct phenotypes. A severe fatal and early onset symptom in homozygous or mild late onset in heterozygous genotypes. CONCLUSIONS: We found that MYLK gene Ala1491Ser mutation affect the kinase activity and clinically, it presents with vascular aneurysms and dissection. We describe a distinct genotype phenotype correlation where; heterozygous patients have mild late onset and incomplete penetrance disease compared with the early onset severe and generally fatal outcome in homozygous patients.


Subject(s)
Aortic Aneurysm, Thoracic/diagnosis , Aortic Aneurysm, Thoracic/genetics , Calcium-Binding Proteins/genetics , Myosin-Light-Chain Kinase/genetics , Adult , Aged , Family , Female , Genotype , Humans , Male , Middle Aged , Mutation, Missense , Phenotype
4.
Science ; 360(6385): 171-175, 2018 04 13.
Article in English | MEDLINE | ID: mdl-29496957

ABSTRACT

Family trees have vast applications in fields as diverse as genetics, anthropology, and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. We collected 86 million profiles from publicly available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of human longevity and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other data sets with our resource.


Subject(s)
Family , Genealogy and Heraldry , Models, Genetic , Pedigree , Datasets as Topic , Humans , Longevity , Population
6.
Brain ; 140(2): 370-386, 2017 02.
Article in English | MEDLINE | ID: mdl-28007986

ABSTRACT

Leukoencephalopathies are a group of white matter disorders related to abnormal formation, maintenance, and turnover of myelin in the central nervous system. These disorders of the brain are categorized according to neuroradiological and pathophysiological criteria. Herein, we have identified a unique form of leukoencephalopathy in seven patients presenting at ages 2 to 4 months with progressive microcephaly, spastic quadriparesis, and global developmental delay. Clinical, metabolic, and imaging characterization of seven patients followed by homozygosity mapping and linkage analysis were performed. Next generation sequencing, bioinformatics, and segregation analyses followed, to determine a loss of function sequence variation in the phospholipase A2-activating protein encoding gene (PLAA). Expression and functional studies of the encoded protein were performed and included measurement of prostaglandin E2 and cytosolic phospholipase A2 activity in membrane fractions of fibroblasts derived from patients and healthy controls. Plaa-null mice were generated and prostaglandin E2 levels were measured in different tissues. The novel phenotype of our patients segregated with a homozygous loss-of-function sequence variant, causing the substitution of leucine at position 752 to phenylalanine, in PLAA, which causes disruption of the protein's ability to induce prostaglandin E2 and cytosolic phospholipase A2 synthesis in patients' fibroblasts. Plaa-null mice were perinatal lethal with reduced brain levels of prostaglandin E2 The non-functional phospholipase A2-activating protein and the associated neurological phenotype, reported herein for the first time, join other complex phospholipid defects that cause leukoencephalopathies in humans, emphasizing the importance of this axis in white matter development and maintenance.


Subject(s)
Leukoencephalopathies/genetics , Leukoencephalopathies/metabolism , Leukoencephalopathies/physiopathology , Proteins/genetics , Proteins/metabolism , Adolescent , Animals , Brain/embryology , Brain/growth & development , Brain/metabolism , Brain/pathology , Child , Consanguinity , Dinoprostone/metabolism , Embryo, Mammalian , Family Health , Female , Fibroblasts/drug effects , Fibroblasts/metabolism , Fibroblasts/ultrastructure , Gene Expression Regulation/genetics , Humans , Leukoencephalopathies/diagnostic imaging , Lung/pathology , Male , Mice , Mice, Transgenic , Models, Molecular , NF-kappa B/metabolism , Phospholipases A2/metabolism , Skin/pathology
7.
Genome Res ; 26(7): 969-79, 2016 07.
Article in English | MEDLINE | ID: mdl-27302636

ABSTRACT

Linear mixed models (LMMs) and their extensions have recently become the method of choice in phenotype prediction for complex traits. However, LMM use to date has typically been limited by assuming simple genetic architectures. Here, we present multikernel linear mixed model (MKLMM), a predictive modeling framework that extends the standard LMM using multiple-kernel machine learning approaches. MKLMM can model genetic interactions and is particularly suitable for modeling complex local interactions between nearby variants. We additionally present MKLMM-Adapt, which automatically infers interaction types across multiple genomic regions. In an analysis of eight case-control data sets from the Wellcome Trust Case Control Consortium and more than a hundred mouse phenotypes, MKLMM-Adapt consistently outperforms competing methods in phenotype prediction. MKLMM is as computationally efficient as standard LMMs and does not require storage of genotypes, thus achieving state-of-the-art predictive power without compromising computational feasibility or genomic privacy.


Subject(s)
Models, Genetic , Algorithms , Animals , Case-Control Studies , Colitis, Ulcerative/genetics , Computer Simulation , Humans , Linear Models , Mice , Phenotype , Software
8.
Nat Methods ; 12(4): 332-4, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25664543

ABSTRACT

Linear mixed models (LMMs) have emerged as the method of choice for confounded genome-wide association studies. However, the performance of LMMs in nonrandomly ascertained case-control studies deteriorates with increasing sample size. We propose a framework called LEAP (liability estimator as a phenotype; https://github.com/omerwe/LEAP) that tests for association with estimated latent values corresponding to severity of phenotype, and we demonstrate that this can lead to a substantial power increase.


Subject(s)
Case-Control Studies , Genome-Wide Association Study/methods , Biostatistics , Humans , Models, Theoretical , Multiple Sclerosis/genetics , Sample Size
9.
Hum Mol Genet ; 23(14): 3629-40, 2014 Jul 15.
Article in English | MEDLINE | ID: mdl-24549038

ABSTRACT

Human telomeric regions are packaged as constitutive heterochromatin, characterized by extensive subtelomeric DNA methylation and specific histone modifications. ICF (immunodeficiency, centromeric instability, facial anomalies) type I patients carry mutations in DNA methyltransferase 3B (DNMT3B) that methylates de novo repetitive sequences during early embryonic development. ICF type I patient fibroblasts display hypomethylated subtelomeres, abnormally short telomeres and premature senescence. In order to study the molecular mechanism by which the failure to de novo methylate subtelomeres results in accelerated telomere shortening, we generated induced pluripotent stem cells (iPSCs) from 3 ICF type I patients. Telomeres were elongated in ICF-iPSCs during reprogramming, and the senescence phenotype was abolished despite sustained subtelomeric hypomethylation and high TERRA levels. Fibroblast-like cells (FLs) isolated from differentiated ICF-iPSCs maintained abnormally high TERRA levels, and telomeres in these cells shortened at an accelerated rate, leading to early senescence, thus recapitulating the telomeric phenotype of the parental fibroblasts. These findings demonstrate that the abnormal telomere phenotype associated with subtelomeric hypomethylation is overridden in cells expressing telomerase, therefore excluding telomerase inhibition by TERRA as a central mechanism responsible for telomere shortening in ICF syndrome. The data in the current study lend support to the use of ICF-iPSCs for modeling of phenotypic and molecular defects in ICF syndrome and for unraveling the mechanism whereby subtelomeric hypomethylation is linked to accelerated telomeric loss in this syndrome.


Subject(s)
DNA (Cytosine-5-)-Methyltransferases/metabolism , Face/abnormalities , Immunologic Deficiency Syndromes/genetics , Induced Pluripotent Stem Cells/metabolism , Models, Biological , Telomere Shortening , Telomere/metabolism , Cells, Cultured , Cellular Reprogramming , Cellular Senescence , Chromosome Aberrations , Chromosomes, Human , DNA Methylation , DNA-Binding Proteins/metabolism , Face/physiopathology , Female , Fibroblasts/physiology , Humans , Immunologic Deficiency Syndromes/physiopathology , Induced Pluripotent Stem Cells/cytology , Male , Primary Immunodeficiency Diseases , DNA Methyltransferase 3B
10.
Int J Cardiol ; 171(1): 24-30, 2014 Jan 15.
Article in English | MEDLINE | ID: mdl-24315344

ABSTRACT

BACKGROUND: Familial restrictive cardiomyopathy (RCM) caused by a single gene mutation is the least common of the inherited cardiomyopathies. Only a few RCM-causing mutations have been described. Most mutations causing RCM are located in sarcomere protein genes which also cause hypertrophic cardiomyopathy (HCM). Other genes associated with RCM include the desmin and familial amyloidosis genes. In the present study we describe familial RCM with severe heart failure triggered by a de novo mutation in TTN, encoding the huge muscle filament protein titin. METHODS AND RESULTS: Family members underwent physical examination, ECG and Doppler echocardiogram studies. The family comprised 6 affected individuals aged 12-35 years. Linkage to candidate loci was performed, followed by gene sequencing. Candidate loci/gene analysis excluded 18 candidate genes but showed segregation with a common haplotype surrounding the TTN locus. Sequence analysis identified a de novo mutation within exon 266 of the TTN gene, resulting in the replacement of tyrosine by cysteine. p.Y7621C affects a highly conserved region in the protein within a fibronectin-3 domain, belonging to the A/I junction region of titin. No other disease-causing mutation was identified in cardiomyopathy genes by whole exome sequencing. CONCLUSIONS: Our study shows, for the first time, that mutations in TTN can cause restrictive cardiomyopathy. The giant filament titin is considered to be a determinant of a resting tension of the sarcomere and this report provides genetic evidence of its crucial role in diastolic function.


Subject(s)
Cardiomyopathy, Restrictive/diagnosis , Cardiomyopathy, Restrictive/genetics , Connectin/genetics , Mutation/genetics , Adolescent , Adult , Amino Acid Sequence , Child , Connectin/chemistry , Female , Humans , Male , Molecular Sequence Data , Pedigree , Protein Structure, Secondary , Young Adult
11.
Bioinformatics ; 29(2): 197-205, 2013 Jan 15.
Article in English | MEDLINE | ID: mdl-23162081

ABSTRACT

MOTIVATION: The use of dense single nucleotide polymorphism (SNP) data in genetic linkage analysis of large pedigrees is impeded by significant technical, methodological and computational challenges. Here we describe Superlink-Online SNP, a new powerful online system that streamlines the linkage analysis of SNP data. It features a fully integrated flexible processing workflow comprising both well-known and novel data analysis tools, including SNP clustering, erroneous data filtering, exact and approximate LOD calculations and maximum-likelihood haplotyping. The system draws its power from thousands of CPUs, performing data analysis tasks orders of magnitude faster than a single computer. By providing an intuitive interface to sophisticated state-of-the-art analysis tools coupled with high computing capacity, Superlink-Online SNP helps geneticists unleash the potential of SNP data for detecting disease genes. RESULTS: Computations performed by Superlink-Online SNP are automatically parallelized using novel paradigms, and executed on unlimited number of private or public CPUs. One novel service is large-scale approximate Markov Chain-Monte Carlo (MCMC) analysis. The accuracy of the results is reliably estimated by running the same computation on multiple CPUs and evaluating the Gelman-Rubin Score to set aside unreliable results. Another service within the workflow is a novel parallelized exact algorithm for inferring maximum-likelihood haplotyping. The reported system enables genetic analyses that were previously infeasible. We demonstrate the system capabilities through a study of a large complex pedigree affected with metabolic syndrome. AVAILABILITY: Superlink-Online SNP is freely available for researchers at http://cbl-hap.cs.technion.ac.il/superlink-snp. The system source code can also be downloaded from the system website. CONTACT: omerw@cs.technion.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genetic Linkage , Pedigree , Polymorphism, Single Nucleotide , Software , Algorithms , Cluster Analysis , Haplotypes , Humans , Markov Chains , Monte Carlo Method
12.
J Med Genet ; 49(7): 462-72, 2012 Jul.
Article in English | MEDLINE | ID: mdl-22717650

ABSTRACT

BACKGROUND: Members of two seemingly unrelated kindreds of Arab Moslem origin presented with pronounced early onset spastic paraparesis of upper and lower limbs, mild intellectual disability, kyphosis, pectus carinatum and hypertrichosis. METHODS: The authors performed neurological and developmental examinations on the affected individuals. The authors conducted whole genome linkage and haplotype analyses, followed by sequencing of candidate genes; RNA and protein expression studies; and finally proof of principle investigations on knockdown morpholino oligonucleotide injected zebrafish. RESULTS: The authors characterise a novel form of autosomal recessive complex hereditary spastic paraparesis (CHSP). MRI studies of brain and spinal cord were normal. Within a single significantly linked locus the authors ultimately identified a homozygous missense mutation c.1146A>T (p.K382N) in the vacuolar protein sorting 37A (Vps37A) gene, fully penetrant and segregating with the disease in both families. Mobility was significantly reduced in Vps37A knockdown morpholino oligonucleotide injected zebrafish, supporting the causal relationship between mutations in this gene and the phenotype described in the patients of this study. CONCLUSIONS: The authors provide evidence for the involvement of Vps37A, a member of the endosomal sorting complex required for transport (ESCRT) system, in upper motor neuron disease. The ESCRT system has been shown to play a central role in intracellular trafficking, in the maturation of multivesicular bodies and the sorting of ubiquitinated membrane proteins into internal luminal vesicles. Further investigation of mechanisms by which dysfunction of this gene causes CHSP will contribute to the understanding of intracellular trafficking of vesicles by the ESCRT machinery and its relevance to CHSP.


Subject(s)
Endosomal Sorting Complexes Required for Transport/genetics , Genes, Recessive , Mutation, Missense , Spastic Paraplegia, Hereditary/genetics , Animals , Brain/metabolism , Child , Child, Preschool , Chromosome Mapping , Chromosomes, Human, Pair 8/genetics , Chromosomes, Human, Pair 8/metabolism , Endosomal Sorting Complexes Required for Transport/metabolism , Female , Founder Effect , Gene Knockdown Techniques , Genetic Linkage , Haplotypes , Homozygote , Humans , Magnetic Resonance Imaging , Male , Pedigree , Phenotype , Selection, Genetic , Spastic Paraplegia, Hereditary/physiopathology , Zebrafish
13.
J Invest Dermatol ; 132(7): 1798-805, 2012 Jul.
Article in English | MEDLINE | ID: mdl-22437316

ABSTRACT

Pemphigus vulgaris (PV) is a severe autoimmune blistering disease caused by anti-epithelial antibodies, leading to disruption of cell-cell adhesion. Although the disease is exceedingly rare worldwide, it is known to be relatively prevalent in Jewish populations. The low prevalence of the disease represents a significant obstacle to a genome-wide approach to the mapping of susceptibility genes. We reasoned that the study of a genetically homogeneous cohort characterized by a high prevalence of PV may help exposing associated signals while reducing spurious results due to population sub-structure. We performed a genome-wide association study using 300K single-nucleotide polymorphisms (SNPs) in a case-control study of 100 PV patients of Jewish descent and 397 matched control individuals, followed by replication of significantly associated SNPs in three additional cohorts of Jewish, Egyptian, and German origin. In addition to the major histocompatibility complex locus, a genomic segment on 8q11.23 that spans the ST18 gene was also found to be significantly associated with PV. This association was confirmed in the Jewish and Egyptian replication sets but not in the German sample, suggesting that ST18-associated variants may predispose to PV in a population-specific manner. ST18 regulates apoptosis and inflammation, two processes of direct relevance to the pathogenesis of PV. Further supporting the relevance of ST18 to PV, we found this gene to be overexpressed in the skin of PV patients as compared with healthy individuals.


Subject(s)
Pemphigus/genetics , Polymorphism, Single Nucleotide , Repressor Proteins/genetics , Aged , Female , Genetics, Population , Genome-Wide Association Study , Humans , Linkage Disequilibrium , Male , Middle Aged , Skin/metabolism
14.
Bioinformatics ; 27(20): 2880-7, 2011 Oct 15.
Article in English | MEDLINE | ID: mdl-21862568

ABSTRACT

MOTIVATION: High-throughput single nucleotide polymorphism (SNP) arrays have become the standard platform for linkage and association analyses. The high SNP density of these platforms allows high-resolution identification of ancestral recombination events even for distant relatives many generations apart. However, such inference is sensitive to marker mistyping and current error detection methods rely on the genotyping of additional close relatives. Genotyping algorithms provide a confidence score for each marker call that is currently not integrated in existing methods. There is a need for a model that incorporates this prior information within the standard identical by descent (IBD) and association analyses. RESULTS: We propose a novel model that incorporates marker confidence scores within IBD methods based on the Lander-Green Hidden Markov Model. The novel parameter of this model is the joint distribution of confidence scores and error status per array. We estimate this probability distribution by applying a modified expectation-maximization (EM) procedure on data from nuclear families genotyped with Affymetrix 250K SNP arrays. The converged tables from two different genotyping algorithms are shown for a wide range of error rates. We demonstrate the efficacy of our method in refining the detection of IBD signals using nuclear pedigrees and distant relatives. AVAILABILITY: Plinke, a new version of Plink with an extended pairwise IBD inference model allowing per marker error probabilities is freely available at: http://bioinfo.bgu.ac.il/bsu/software/plinke. CONTACT: obirk@bgu.ac.il; markusb@bgu.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genotyping Techniques , Polymorphism, Single Nucleotide , Algorithms , Genotype , Humans , Markov Chains , Models, Statistical , Pedigree
15.
J Comput Biol ; 18(3): 237-49, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21385031

ABSTRACT

Abstract Admixture mapping is a gene mapping approach used for the identification of genomic regions harboring disease susceptibility genes in the case of recently admixed populations such as African Americans. We present a novel method for admixture mapping, called admixture aberration analysis (AAA) that uses a DNA pool of affected admixed individuals. We demonstrate through simulations that AAA is a powerful and economical mapping method under a range of scenarios, capturing complex human diseases such as hypertension and end-stage kidney disease. The method has a low false-positive rate and is robust to deviation from model assumptions. Finally, we apply AAA on 600 prostate cancer-affected African Americans, replicating a known risk locus. Simulation results indicate that the method can yield over 96% reduction in genotyping. Our method is implemented as a Java program called AAAmap and is freely available at http://bioinfo.cs.technion.ac.il/AAAmap.


Subject(s)
Chromosome Mapping/methods , DNA/genetics , Genetic Predisposition to Disease , Prostatic Neoplasms/genetics , Black or African American/genetics , Chromosome Mapping/economics , Computer Simulation , Humans , Hypertension/genetics , Kidney Failure, Chronic/genetics , Linkage Disequilibrium , Male , Models, Genetic
16.
Stat Appl Genet Mol Biol ; 10(1)2011 Oct 04.
Article in English | MEDLINE | ID: mdl-23089820

ABSTRACT

Germline mosaicism is a genetic condition in which some germ cells of an individual contain a mutation. This condition violates the assumptions underlying classic genetic analysis and may lead to failure of such analysis. In this work we extend the statistical model used for genetic linkage analysis in order to incorporate germline mosaicism. We develop a likelihood ratio test for detecting whether a genetic trait has been introduced into a pedigree by germline mosaicism. We analyze the statistical properties of this test and evaluate its performance via computer simulations. We demonstrate that genetic linkage analysis has high power to identify linkage in the presence of germline mosaicism when our extended model is used. We further use this extended model to provide solid statistical evidence that the MDN syndrome studied by Genzer-Nir et al. has been introduced by germline mosaicism.


Subject(s)
Genetic Diseases, Inborn/diagnosis , Genetic Linkage , Genetic Predisposition to Disease , Germ-Line Mutation , Alleles , Computational Biology/methods , Computer Simulation , Genetic Diseases, Inborn/genetics , Genetic Loci , Humans , Models, Genetic , Pedigree , Polymorphism, Single Nucleotide , Reproducibility of Results , Sensitivity and Specificity
17.
BMC Med Genomics ; 3: 47, 2010 Oct 18.
Article in English | MEDLINE | ID: mdl-20955568

ABSTRACT

BACKGROUND: The question of a genetic contribution to the higher prevalence and incidence of end stage kidney disease (ESKD) among African Americans (AA) remained unresolved, until recent findings using admixture mapping pointed to the association of a genomic locus on chromosome 22 with this disease phenotype. In the current study we utilize this example to demonstrate the utility of applying a multi-step admixture mapping approach. METHODS: A multi-step case only admixture mapping study, consisted of the following steps was designed: 1) Assembly of the sample dataset (ESKD AA); 2) Design of the estimated mutual information ancestry informative markers (n = 2016) screening panel 3); Genotyping the sample set whose size was determined by a power analysis (n = 576) appropriate for the initial screening panel; 4) Inference of local ancestry for each individual and identification of regions with increased AA ancestry using two different ancestry inference statistical approaches; 5) Enrichment of the initial screening panel; 6) Power analysis of the enriched panel 7) Genotyping of additional samples. 8) Re-analysis of the genotyping results to identify a genetic risk locus. RESULTS: The initial screening phase yielded a significant peak using the ADMIXMAP ancestry inference program applying case only statistics. Subgroup analysis of 299 ESKD patients with no history of diabetes yielded peaks using both the ANCESTRYMAP and ADMIXMAP ancestry inference programs. The significant peak was found on chromosome 22. Genotyping of additional ancestry informative markers on chromosome 22 that took into account linkage disequilibrium in the ancestral populations, and the addition of samples increased the statistical significance of the finding. CONCLUSIONS: A multi-step admixture mapping analysis of AA ESKD patients replicated the finding of a candidate risk locus on chromosome 22, contributing to the heightened susceptibility of African Americans to develop non-diabetic ESKD, and underscores the importance of using mutual information and multiple ancestry inference approaches to achieve a robust analysis, using relatively small datasets of "affected" only individuals. The current study suggests solutions to some limitations of existing admixture mapping methodologies, such as considerations regarding the distribution of ancestry information along the genome and its effects on power calculations and sample size.


Subject(s)
Chromosome Mapping/methods , Genetic Predisposition to Disease , Kidney Failure, Chronic/genetics , Female , Genetic Loci/genetics , Genetic Markers/genetics , Humans , Kidney Failure, Chronic/epidemiology , Male , Middle Aged , Reproducibility of Results , Sample Size
18.
Blood ; 116(23): 4990-5001, 2010 Dec 02.
Article in English | MEDLINE | ID: mdl-20709904

ABSTRACT

Gray platelet syndrome (GPS) is an inherited bleeding disorder characterized by macrothrombocytopenia and absence of platelet α-granules resulting in typical gray platelets on peripheral smears. GPS is associated with a bleeding tendency, myelofibrosis, and splenomegaly. Reports on GPS are limited to case presentations. The causative gene and underlying pathophysiology are largely unknown. We present the results of molecular genetic analysis of 116 individuals including 25 GPS patients from 14 independent families as well as novel clinical data on the natural history of the disease. The mode of inheritance was autosomal recessive (AR) in 11 and indeterminate in 3 families. Using genome-wide linkage analysis, we mapped the AR-GPS gene to a 9.4-Mb interval on 3p21.1-3p22.1, containing 197 protein-coding genes. Sequencing of 1423 (69%) of the 2075 exons in the interval did not identify the GPS gene. Long-term follow-up data demonstrated the progressive nature of the thrombocytopenia and myelofibrosis of GPS resulting in fatal hemorrhages in some patients. We identified high serum vitamin B(12) as a consistent, novel finding in GPS. Chromosome 3p21.1-3p22.1 has not been previously linked to a platelet disorder; identification of the GPS gene will likely lead to the discovery of novel components of platelet organelle biogenesis. This study is registered at www.clinicaltrials.gov as NCT00069680 and NCT00369421.


Subject(s)
Chromosomes, Human, Pair 3/genetics , Gray Platelet Syndrome/genetics , Gray Platelet Syndrome/physiopathology , Adolescent , Adult , Blood Platelets/ultrastructure , Cell Separation , Child , Child, Preschool , DNA Mutational Analysis , Female , Flow Cytometry , Genetic Linkage , Genome-Wide Association Study , Gray Platelet Syndrome/blood , Humans , Male , Microsatellite Repeats , Microscopy, Electron, Transmission , Middle Aged , Neutrophils/ultrastructure , Pedigree , Polymerase Chain Reaction , Polymorphism, Single Nucleotide , Vitamin B 12/blood , Young Adult
19.
Bioinformatics ; 26(12): i175-82, 2010 Jun 15.
Article in English | MEDLINE | ID: mdl-20529903

ABSTRACT

MOTIVATION: Association analysis is the method of choice for studying complex multifactorial diseases. The premise of this method is that affected persons contain some common genomic regions with similar SNP alleles and such areas will be found in this analysis. An important disadvantage of GWA studies is that it does not distinguish between genomic areas that are inherited from a common ancestor [identical by descent (IBD)] and areas that are identical merely by state [identical by state (IBS)]. Clearly, areas that can be marked with higher probability as IBD and have the same correlation with the disease status of identical areas that are more probably only IBS, are better candidates to be causative, and yet this distinction is not encoded in standard association analysis. RESULTS: We develop a factorial hidden Markov model-based algorithm for computing genome-wide IBD sharing. The algorithm accepts as input SNP data of measured individuals and estimates the probability of IBD at each locus for every pair of individuals. For two g-degree relatives, when g > or = 8, the computation yields a precision of IBD tagging of over 50% higher than previous methods for 95% recall. Our algorithm uses a first-order Markovian model for the linkage disequilibrium process and employs a reduction of the state space of the inheritance vector from being exponential in g to quadratic. The higher accuracy along with the reduced time complexity marks our method as a feasible means for IBD mapping in practical scenarios. AVAILABILITY: A software implementation, called IBDMAP, is freely available at http://bioinfo.cs.technion.ac.il/IBDmap.


Subject(s)
Chromosome Mapping/methods , Genome , Genomics/methods , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Algorithms , Genome-Wide Association Study , Markov Chains , Models, Genetic
20.
Eur J Hum Genet ; 18(6): 662-7, 2010 Jun.
Article in English | MEDLINE | ID: mdl-20145678

ABSTRACT

Mammary-digital-nail syndrome is a novel phenotypic association consisting of anonychia onychodystrophy with hypoplasia or absence of distal phalanges in males and females, accompanied by juvenile hypertrophy of the breast in affected females. This newly described genetic trait presents an autosomal dominant inheritance pattern, with either reduced penetrance or germ-line mosaicism. Analysis of the pedigree, linkage studies followed by a genome-wide screen and by haplotype analysis defined the locus for the phenotype within a 12 cM (4.3 Mb) interval on chromosome 22q12.3-13.1. This chromosomal region has not been implicated before in genetic disorders of the mammary tissue or limbs. These data suggest a possibly novel signaling pathway affecting the organogenesis of limbs and mammary glands in humans.


Subject(s)
Bone Diseases/genetics , Breast Diseases/genetics , Chromosomes, Human, Pair 22 , Nails, Malformed/genetics , Abnormalities, Multiple/genetics , Bone Diseases/complications , Bone Diseases/congenital , Breast/pathology , Breast Diseases/complications , Breast Diseases/congenital , Chromosome Mapping , Chromosomes, Human, Pair 22/genetics , Female , Fingers/pathology , Genetic Association Studies , Humans , Hypertrophy/genetics , Male , Nails, Malformed/complications , Pedigree , Phenotype , Syndrome
SELECTION OF CITATIONS
SEARCH DETAIL
...