Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
1.
Am J Med Genet A ; 188(10): 3089-3095, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-35946377

RESUMO

Alternative use of short distance tandem sites such as NAGNn AG are a common mechanism of alternative splicing; however, single nucleotide variants are rarely reported as likely to generate or to disrupt tandem splice sites. We identify a pathogenic intron 5 STK11 variant (NM_000455.4:c.[735-6A>G];[=]) segregating with the mucocutaneous features but not the hamartomatous polyps of Peutz-Jeghers syndrome in two individuals. By RNAseq analysis of peripheral blood mRNA, this variant was shown to generate a novel and preferentially used tandem proximal splice acceptor (AAGTGAAG). The variant transcript (NM_000455.4:c.734_734 + 1insTGAAG), which encodes a frameshift (p.[Tyr246Glufs*43]) constituted 36%-43% of STK11 transcripts suggesting partial escape from nonsense mediated mRNA decay and translation of a truncated protein. A review of the ClinVar database identified other similar variants. We suggest that nucleotide changes creating or disrupting tandem alternative splice sites are a pertinent disease mechanism and require contextualization for clinical reporting. Additionally, we hypothesize that some pathogenic STK11 variants cause an attenuated phenotype.


Assuntos
Síndrome de Peutz-Jeghers , Quinases Proteína-Quinases Ativadas por AMP , Processamento Alternativo , Códon sem Sentido , Humanos , Nucleotídeos , Síndrome de Peutz-Jeghers/genética , Síndrome de Peutz-Jeghers/patologia
2.
J Cell Mol Med ; 24(1): 418-430, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31730267

RESUMO

Numerous genetic and epigenetic alterations cause functional changes in cell biology underlying cancer. These hallmark functional changes constitute potentially tissue-independent anticancer therapeutic targets. We hypothesized that RNA-Seq identifies gene expression changes that underly those hallmarks, and thereby defines relevant therapeutic targets. To test this hypothesis, we analysed the publicly available TCGA-TARGET-GTEx gene expression data set from the University of California Santa CruzToil recompute project using WGCNA to delineate co-correlated 'modules' from tumour gene expression profiles and functional enrichment of these modules to hierarchically cluster tumours. This stratified tumours according to T cell activation, NK-cell activation, complement cascade, ATM, Rb, angiogenic, MAPK, ECM receptor and histone modification signalling. These correspond to the cancer hallmarks of avoiding immune destruction, tumour-promoting inflammation, evading growth suppressors, inducing angiogenesis, sustained proliferative signalling, activating invasion and metastasis, and genome instability and mutation. This approach did not detect pathways corresponding to the cancer enabling replicative immortality, resisting cell death or deregulating cellular energetics hallmarks. We conclude that RNA-Seq stratifies tumours along some, but not all, hallmarks of cancer and, therefore, could be used in conjunction with other analyses collectively to inform precision therapy.


Assuntos
Neoplasias/genética , RNA-Seq , Regulação Neoplásica da Expressão Gênica , Humanos , Especificidade de Órgãos/genética , Transdução de Sinais/genética
3.
PLoS Genet ; 10(3): e1004190, 2014 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-24603370

RESUMO

Although a considerable proportion of serum lipids loci identified in European ancestry individuals (EA) replicate in African Americans (AA), interethnic differences in the distribution of serum lipids suggest that some genetic determinants differ by ethnicity. We conducted a comprehensive evaluation of five lipid candidate genes to identify variants with ethnicity-specific effects. We sequenced ABCA1, LCAT, LPL, PON1, and SERPINE1 in 48 AA individuals with extreme serum lipid concentrations (high HDLC/low TG or low HDLC/high TG). Identified variants were genotyped in the full population-based sample of AA (n = 1694) and tested for an association with serum lipids. rs328 (LPL) and correlated variants were associated with higher HDLC and lower TG. Interestingly, a stronger effect was observed on a "European" vs. "African" genetic background at this locus. To investigate this effect, we evaluated the region among West Africans (WA). For TG, the effect size among WA was the same in AA with only African local ancestry (2-3% lower TG), while the larger association among AA with local European ancestry matched previous reports in EA (10%). For HDLC, there was no association with rs328 in AA with only African local ancestry or in WA, while the association among AA with European local ancestry was much greater than what has been observed for EA (15 vs. ∼ 5 mg/dl), suggesting an interaction with an environmental or genetic factor that differs by ethnicity. Beyond this ancestry effect, the importance of African ancestry-focused, sequence-based work was also highlighted by serum lipid associations of variants that were in higher frequency (or present only) among those of African ancestry. By beginning our study with the sequence variation present in AA individuals, investigating local ancestry effects, and seeking replication in WA, we were able to comprehensively evaluate the role of a set of candidate genes in serum lipids in AA.


Assuntos
Negro ou Afro-Americano/genética , Etnicidade/genética , Estudo de Associação Genômica Ampla , Lipídeos/genética , Variação Genética , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Desequilíbrio de Ligação , Lipídeos/sangue , Polimorfismo de Nucleotídeo Único , População Branca/genética
4.
BMC Genomics ; 16: 998, 2015 Nov 25.
Artigo em Inglês | MEDLINE | ID: mdl-26602380

RESUMO

BACKGROUND: Whole-exome sequencing (WES) is rapidly evolving into a tool of choice for rapid, and inexpensive identification of molecular genetic lesions within targeted regions of the human genome. While biases in WES coverage of nucleotides in targeted regions are recognized, it is not well understood how repetition of WES improves the interpretation of sequencing results in a clinical diagnostic setting. METHOD: To address this, we compared independently generated exome-capture of six individuals from three-generations sequenced in triplicate. This generated between 48x-86x mean target depth of high-quality mapped bases (>Q20) for each technical replicate library. Cumulatively, we achieved 179 - 208x average target coverage for each individual in the pedigree. Using this experimental design, we evaluated stochastics in WES interpretation, genotyping sensitivity, and accuracy to detect de novo variants. RESULTS: In this study, we show that repetition of WES improved the interpretation of the capture target regions after aggregating the data (93.5 - 93.9 %). Compared to 81.2 - 89.6 % (50.2-55.4 Mb of 61.7 M) coverage of targeted bases at ≥20x in the individual technical replicates, the aggregated data covered 93.5 - 93.9 % of targeted bases (57.7 - 58.0 of 61.7 M) at ≥20x threshold, suggesting a 4.3 - 12.7 % improvement in coverage. Each individual's aggregate dataset recovered 3.4 - 6.4 million bases within variable targeted regions. We uncovered technical variability (2-5 %) inherent to WES technique. We also show improved interpretation in assessing clinically important regions that lack interpretation under current conditions, affecting 12-16 of the 56 genes recommended for secondary analysis by American College of Medical Genetics (ACMG). We demonstrate that comparing technical replicate WES datasets and their derived aggregate data can effectively address overall WES genotyping discrepancies. CONCLUSION: We describe a method to evaluate the reproducibility and stochastics in exome library preparation, and delineate the advantages of aggregating the data derived from technical replicates. The implications of this study are directly applicable to improved experimental design and provide an opportunity to rapidly, efficiently, and accurately arrive at reliable candidate nucleotide variants.


Assuntos
Exoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Biblioteca Gênica , Humanos , Linhagem
5.
Proc Natl Acad Sci U S A ; 109(33): 13319-24, 2012 Aug 14.
Artigo em Inglês | MEDLINE | ID: mdl-22864914

RESUMO

Long noncoding RNAs (lncRNAs) play a broad range of biological roles, including regulation of expression of genes and chromosomes. Here, we present evidence that lncRNAs are involved in vertebrate circadian biology. Differential night/day expression of 112 lncRNAs (0.3 to >50 kb) occurs in the rat pineal gland, which is the source of melatonin, the hormone of the night. Approximately one-half of these changes reflect nocturnal increases. Studies of eight lncRNAs with 2- to >100-fold daily rhythms indicate that, in most cases, the change results from neural stimulation from the central circadian oscillator in the suprachiasmatic nucleus (doubling time = 0.5-1.3 h). Light exposure at night rapidly reverses (halving time = 9-32 min) levels of some of these lncRNAs. Organ culture studies indicate that expression of these lncRNAs is regulated by norepinephrine acting through cAMP. These findings point to a dynamic role of lncRNAs in the circadian system.


Assuntos
Ritmo Circadiano/genética , Glândula Pineal/metabolismo , RNA não Traduzido/genética , Animais , Bucladesina/farmacologia , Ritmo Circadiano/efeitos dos fármacos , Biologia Computacional , Regulação da Expressão Gênica/efeitos dos fármacos , Sistema Nervoso/efeitos dos fármacos , Sistema Nervoso/metabolismo , Norepinefrina/farmacologia , Glândula Pineal/efeitos dos fármacos , RNA não Traduzido/metabolismo , Ratos , Ratos Sprague-Dawley , Sinapses/efeitos dos fármacos , Sinapses/metabolismo
6.
BMC Genomics ; 15: 198, 2014 Mar 14.
Artigo em Inglês | MEDLINE | ID: mdl-24628908

RESUMO

BACKGROUND: Massively-parallel cDNA sequencing (RNA-Seq) is a new technique that holds great promise for cardiovascular genomics. Here, we used RNA-Seq to study the transcriptomes of matched coronary artery disease cases and controls in the ClinSeq® study, using cell lines as tissue surrogates. RESULTS: Lymphoblastoid cell lines (LCLs) from 16 cases and controls representing phenotypic extremes for coronary calcification were cultured and analyzed using RNA-Seq. All cell lines were then independently re-cultured and along with another set of 16 independent cases and controls, were profiled with Affymetrix microarrays to perform a technical validation of the RNA-Seq results. Statistically significant changes (p < 0.05) were detected in 186 transcripts, many of which are expressed at extremely low levels (5-10 copies/cell), which we confirmed through a separate spike-in control RNA-Seq experiment. Next, by fitting a linear model to exon-level RNA-Seq read counts, we detected signals of alternative splicing in 18 transcripts. Finally, we used the RNA-Seq data to identify differential expression (p < 0.0001) in eight previously unannotated regions that may represent novel transcripts. Overall, differentially expressed genes showed strong enrichment (p = 0.0002) for prior association with cardiovascular disease. At the network level, we found evidence for perturbation in pathways involving both cardiovascular system development and function as well as lipid metabolism. CONCLUSIONS: We present a pilot study for transcriptome involvement in coronary artery calcification and demonstrate how RNA-Seq analyses using LCLs as a tissue surrogate may yield fruitful results in a clinical sequencing project. In addition to canonical gene expression, we present candidate variants from alternative splicing and novel transcript detection, which have been unexplored in the context of this disease.


Assuntos
Vasos Coronários/metabolismo , Vasos Coronários/patologia , Perfilação da Expressão Gênica , Transcriptoma , Calcificação Vascular/genética , Processamento Alternativo , Estudos de Casos e Controles , Linhagem Celular , Biologia Computacional/métodos , Regulação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Pessoa de Meia-Idade , Anotação de Sequência Molecular , Reprodutibilidade dos Testes
7.
Mol Biol Evol ; 30(4): 938-50, 2013 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23292442

RESUMO

The whey acidic protein (WAP) four-disulfide core domain (WFDC) locus located on human chromosome 20q13 spans 19 genes with WAP and/or Kunitz domains. These genes participate in antimicrobial, immune, and tissue homoeostasis activities. Neighboring SEMG genes encode seminal proteins Semenogelin 1 and 2 (SEMG1 and SEMG2). WFDC and SEMG genes have a strikingly high rate of amino acid replacement (dN/dS), indicative of responses to adaptive pressures during vertebrate evolution. To better understand the selection pressures acting on WFDC genes in human populations, we resequenced 18 genes and 54 noncoding segments in 71 European (CEU), African (YRI), and Asian (CHB + JPT) individuals. Overall, we identified 484 single-nucleotide polymorphisms (SNPs), including 65 coding variants (of which 49 are nonsynonymous differences). Using classic neutrality tests, we confirmed the signature of short-term balancing selection on WFDC8 in Europeans and a signature of positive selection spanning genes PI3, SEMG1, SEMG2, and SLPI. Associated with the latter signal, we identified an unusually homogeneous-derived 100-kb haplotype with a frequency of 88% in Asian populations. A putative candidate variant targeted by selection is Thr56Ser in SEMG1, which may alter the proteolytic profile of SEMG1 and antimicrobial activities of semen. All the well-characterized genes residing in the WDFC locus encode proteins that appear to have a role in immunity and/or fertility, two processes that are often associated with adaptive evolution. This study provides further evidence that the WFDC and SEMG loci have been under strong adaptive pressure within the short timescale of modern humans.


Assuntos
Cromossomos Humanos Par 20/genética , Fertilidade/genética , Imunidade/genética , Polimorfismo de Nucleotídeo Único , Seleção Genética , Adaptação Biológica/genética , Substituição de Aminoácidos , Evolução Molecular , Frequência do Gene , Haplótipos , Humanos , Desequilíbrio de Ligação , Modelos Genéticos , Análise de Componente Principal , Proteínas/genética , Proteínas Secretadas pela Vesícula Seminal/genética , Análise de Sequência de DNA
8.
PLoS Genet ; 7(10): e1002325, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22022284

RESUMO

We report an early onset spastic ataxia-neuropathy syndrome in two brothers of a consanguineous family characterized clinically by lower extremity spasticity, peripheral neuropathy, ptosis, oculomotor apraxia, dystonia, cerebellar atrophy, and progressive myoclonic epilepsy. Whole-exome sequencing identified a homozygous missense mutation (c.1847G>A; p.Y616C) in AFG3L2, encoding a subunit of an m-AAA protease. m-AAA proteases reside in the mitochondrial inner membrane and are responsible for removal of damaged or misfolded proteins and proteolytic activation of essential mitochondrial proteins. AFG3L2 forms either a homo-oligomeric isoenzyme or a hetero-oligomeric complex with paraplegin, a homologous protein mutated in hereditary spastic paraplegia type 7 (SPG7). Heterozygous loss-of-function mutations in AFG3L2 cause autosomal-dominant spinocerebellar ataxia type 28 (SCA28), a disorder whose phenotype is strikingly different from that of our patients. As defined in yeast complementation assays, the AFG3L2(Y616C) gene product is a hypomorphic variant that exhibited oligomerization defects in yeast as well as in patient fibroblasts. Specifically, the formation of AFG3L2(Y616C) complexes was impaired, both with itself and to a greater extent with paraplegin. This produced an early-onset clinical syndrome that combines the severe phenotypes of SPG7 and SCA28, in additional to other "mitochondrial" features such as oculomotor apraxia, extrapyramidal dysfunction, and myoclonic epilepsy. These findings expand the phenotype associated with AFG3L2 mutations and suggest that AFG3L2-related disease should be considered in the differential diagnosis of spastic ataxias.


Assuntos
Proteases Dependentes de ATP/genética , Encéfalo/anormalidades , Metaloendopeptidases/genética , Mitocôndrias/enzimologia , Paraplegia Espástica Hereditária/genética , Degenerações Espinocerebelares/genética , ATPases Associadas a Diversas Atividades Celulares , Adolescente , Sequência de Aminoácidos , Animais , Encéfalo/patologia , Criança , Diagnóstico Diferencial , Exoma/genética , Genótipo , Células HeLa , Homozigoto , Humanos , Masculino , Metaloendopeptidases/metabolismo , Camundongos , Dados de Sequência Molecular , Mutação de Sentido Incorreto , Paraplegia , Dobramento de Proteína , Irmãos , Paraplegia Espástica Hereditária/patologia , Ataxias Espinocerebelares/congênito , Degenerações Espinocerebelares/patologia , Leveduras/genética
9.
PLoS Genet ; 7(8): e1002245, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21901109

RESUMO

ATAD5, the human ortholog of yeast Elg1, plays a role in PCNA deubiquitination. Since PCNA modification is important to regulate DNA damage bypass, ATAD5 may be important for suppression of genomic instability in mammals in vivo. To test this hypothesis, we generated heterozygous (Atad5(+/m)) mice that were haploinsuffficient for Atad5. Atad5(+/m) mice displayed high levels of genomic instability in vivo, and Atad5(+/m) mouse embryonic fibroblasts (MEFs) exhibited molecular defects in PCNA deubiquitination in response to DNA damage, as well as DNA damage hypersensitivity and high levels of genomic instability, apoptosis, and aneuploidy. Importantly, 90% of haploinsufficient Atad5(+/m) mice developed tumors, including sarcomas, carcinomas, and adenocarcinomas, between 11 and 20 months of age. High levels of genomic alterations were evident in tumors that arose in the Atad5(+/m) mice. Consistent with a role for Atad5 in suppressing tumorigenesis, we also identified somatic mutations of ATAD5 in 4.6% of sporadic human endometrial tumors, including two nonsense mutations that resulted in loss of proper ATAD5 function. Taken together, our findings indicate that loss-of-function mutations in mammalian Atad5 are sufficient to cause genomic instability and tumorigenesis.


Assuntos
Adenosina Trifosfatases/genética , Transformação Celular Neoplásica/genética , Proteínas de Ligação a DNA/genética , Neoplasias/genética , ATPases Associadas a Diversas Atividades Celulares , Adenosina Trifosfatases/metabolismo , Aneuploidia , Animais , Linhagem Celular , Dano ao DNA/genética , Proteínas de Ligação a DNA/metabolismo , Neoplasias do Endométrio/genética , Feminino , Predisposição Genética para Doença , Instabilidade Genômica , Humanos , Masculino , Camundongos , Mutação/genética , Antígeno Nuclear de Célula em Proliferação/metabolismo , Ubiquitinação
10.
Am J Hum Genet ; 86(5): 743-8, 2010 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-20451169

RESUMO

Micrognathia, glossoptosis, and cleft palate comprise one of the most common malformation sequences, Robin sequence. It is a component of the TARP syndrome, talipes equinovarus, atrial septal defect, Robin sequence, and persistent left superior vena cava. This disorder is X-linked and severe, with apparently 100% pre- or postnatal lethality in affected males. Here we characterize a second family with TARP syndrome, confirm linkage to Xp11.23-q13.3, perform massively parallel sequencing of X chromosome exons, filter the results via a number of criteria including the linkage region, use a unique algorithm to characterize sequence changes, and show that TARP syndrome is caused by mutations in the RBM10 gene, which encodes RNA binding motif 10. We further show that this previously uncharacterized gene is expressed in midgestation mouse embryos in the branchial arches and limbs, consistent with the human phenotype. We conclude that massively parallel sequencing is useful to characterize large candidate linkage intervals and that it can be used successfully to allow identification of disease-causing gene mutations.


Assuntos
Cromossomos Humanos X , Fissura Palatina/genética , Éxons , Síndrome de Pierre Robin/genética , Proteínas de Ligação a RNA/genética , Animais , Sequência de Bases , Humanos , Masculino , Camundongos , Mutação , Fenótipo , Síndrome
11.
Am J Hum Genet ; 87(4): 560-6, 2010 Oct 08.
Artigo em Inglês | MEDLINE | ID: mdl-20920668

RESUMO

Charcot-Marie-Tooth (CMT) disease comprises a genetically and clinically heterogeneous group of peripheral nerve disorders characterized by impaired distal motor and sensory function. Mutations in three genes encoding aminoacyl-tRNA synthetases (ARSs) have been implicated in CMT disease primarily associated with an axonal pathology. ARSs are ubiquitously expressed, essential enzymes responsible for charging tRNA molecules with their cognate amino acids. To further explore the role of ARSs in CMT disease, we performed a large-scale mutation screen of the 37 human ARS genes in a cohort of 355 patients with a phenotype consistent with CMT. Here we describe three variants (p.Leu133His, p.Tyr173SerfsX7, and p.Ile302Met) in the lysyl-tRNA synthetase (KARS) gene in two patients from this cohort. Functional analyses revealed that two of these mutations (p.Leu133His and p.Tyr173SerfsX7) severely affect enzyme activity. Interestingly, both functional variants were found in a single patient with CMT disease and additional neurological and non-neurological sequelae. Based on these data, KARS becomes the fourth ARS gene associated with CMT disease, indicating that this family of enzymes is specifically critical for axon function.


Assuntos
Doença de Charcot-Marie-Tooth/genética , Heterozigoto , Lisina-tRNA Ligase/genética , Modelos Moleculares , Doenças do Sistema Nervoso Periférico/genética , Conformação Proteica , Sequência de Aminoácidos , Sequência de Bases , Estudos de Coortes , Análise Mutacional de DNA , Humanos , Dados de Sequência Molecular , Mutação de Sentido Incorreto/genética
12.
Blood ; 118(17): e139-48, 2011 Oct 27.
Artigo em Inglês | MEDLINE | ID: mdl-21900194

RESUMO

Erythropoiesis is dependent on the activity of transcription factors, including the erythroid-specific erythroid Kruppel-like factor (EKLF). ChIP followed by massively parallel sequencing (ChIP-Seq) is a powerful, unbiased method to map trans-factor occupancy. We used ChIP-Seq to study the interactome of EKLF in mouse erythroid progenitor cells and more differentiated erythroblasts. We correlated these results with the nuclear distribution of EKLF, RNA-Seq analysis of the transcriptome, and the occupancy of other erythroid transcription factors. In progenitor cells, EKLF is found predominantly at the periphery of the nucleus, where EKLF primarily occupies the promoter regions of genes and acts as a transcriptional activator. In erythroblasts, EKLF is distributed throughout the nucleus, and erythroblast-specific EKLF occupancy is predominantly in intragenic regions. In progenitor cells, EKLF modulates general cell growth and cell cycle regulatory pathways, whereas in erythroblasts EKLF is associated with repression of these pathways. The EKLF interactome shows very little overlap with the interactomes of GATA1, GATA2, or TAL1, leading to a model in which EKLF directs programs that are independent of those regulated by the GATA factors or TAL1.


Assuntos
Imunoprecipitação da Cromatina , Mapeamento Cromossômico/métodos , Eritrócitos/fisiologia , Células Precursoras Eritroides/fisiologia , Fatores de Transcrição Kruppel-Like/fisiologia , Animais , Sítios de Ligação/genética , Células Cultivadas , Imunoprecipitação da Cromatina/métodos , Embrião de Mamíferos , Eritrócitos/metabolismo , Células Precursoras Eritroides/metabolismo , Eritropoese/genética , Eritropoese/fisiologia , Fatores de Transcrição Kruppel-Like/genética , Fatores de Transcrição Kruppel-Like/metabolismo , Camundongos , Camundongos Transgênicos , Ligação Proteica , Análise de Sequência de DNA/métodos , Fatores de Transcrição/metabolismo
13.
Hum Mutat ; 33(4): 614-26, 2012 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-22311686

RESUMO

In this study, we assess exome sequencing (ES) as a diagnostic alternative for genetically heterogeneous disorders. Because ES readily identified a previously reported homozygous mutation in the CAPN3 gene for an individual with an undiagnosed limb girdle muscular dystrophy, we evaluated ES as a generalizable clinical diagnostic tool by assessing the targeting efficiency and sequencing coverage of 88 genes associated with muscle disease (MD) and spastic paraplegia (SPG). We used three exome-capture kits on 125 individuals. Exons constituting each gene were defined using the UCSC and CCDS databases. The three exome-capture kits targeted 47-92% of bases within the UCSC-defined exons and 97-99% of bases within the CCDS-defined exons. An average of 61.2-99.5% and 19.1-99.5% of targeted bases per gene were sequenced to 20X coverage within the CCDS-defined MD and SPG coding exons, respectively. Greater than 95-99% of targeted known mutation positions were sequenced to ≥1X coverage and 55-87% to ≥20X coverage in every exome. We conclude, therefore, that ES is a rapid and efficient first-tier method to screen for mutations, particularly within the CCDS annotated exons, although its application requires disclosure of the extent of coverage for each targeted gene and supplementation with second-tier Sanger sequencing for full coverage.


Assuntos
Exoma , Doenças Musculares/genética , Paraplegia/genética , Análise de Sequência de DNA/métodos , Calpaína/genética , Feminino , Humanos , Proteínas Musculares/genética , Distrofia Muscular do Cíngulo dos Membros/genética , Mutação , Polimorfismo de Nucleotídeo Único , Adulto Jovem
14.
Genome Res ; 19(9): 1665-74, 2009 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-19602640

RESUMO

ClinSeq is a pilot project to investigate the use of whole-genome sequencing as a tool for clinical research. By piloting the acquisition of large amounts of DNA sequence data from individual human subjects, we are fostering the development of hypothesis-generating approaches for performing research in genomic medicine, including the exploration of issues related to the genetic architecture of disease, implementation of genomic technology, informed consent, disclosure of genetic information, and archiving, analyzing, and displaying sequence data. In the initial phase of ClinSeq, we are enrolling roughly 1000 participants; the evaluation of each includes obtaining a detailed family and medical history, as well as a clinical evaluation. The participants are being consented broadly for research on many traits and for whole-genome sequencing. Initially, Sanger-based sequencing of 300-400 genes thought to be relevant to atherosclerosis is being performed, with the resulting data analyzed for rare, high-penetrance variants associated with specific clinical traits. The participants are also being consented to allow the contact of family members for additional studies of sequence variants to explore their potential association with specific phenotypes. Here, we present the general considerations in designing ClinSeq, preliminary results based on the generation of an initial 826 Mb of sequence data, the findings for several genes that serve as positive controls for the project, and our views about the potential implications of ClinSeq. The early experiences with ClinSeq illustrate how large-scale medical sequencing can be a practical, productive, and critical component of research in genomic medicine.


Assuntos
Aterosclerose/genética , Pesquisa Biomédica , Doenças Cardiovasculares/genética , Genoma Humano , Genômica , Projetos Piloto , Análise de Sequência de DNA/métodos , Idoso , Estudos de Coortes , Feminino , Humanos , Masculino , Linhagem , Fenótipo
15.
Blood ; 116(23): 4990-5001, 2010 Dec 02.
Artigo em Inglês | MEDLINE | ID: mdl-20709904

RESUMO

Gray platelet syndrome (GPS) is an inherited bleeding disorder characterized by macrothrombocytopenia and absence of platelet α-granules resulting in typical gray platelets on peripheral smears. GPS is associated with a bleeding tendency, myelofibrosis, and splenomegaly. Reports on GPS are limited to case presentations. The causative gene and underlying pathophysiology are largely unknown. We present the results of molecular genetic analysis of 116 individuals including 25 GPS patients from 14 independent families as well as novel clinical data on the natural history of the disease. The mode of inheritance was autosomal recessive (AR) in 11 and indeterminate in 3 families. Using genome-wide linkage analysis, we mapped the AR-GPS gene to a 9.4-Mb interval on 3p21.1-3p22.1, containing 197 protein-coding genes. Sequencing of 1423 (69%) of the 2075 exons in the interval did not identify the GPS gene. Long-term follow-up data demonstrated the progressive nature of the thrombocytopenia and myelofibrosis of GPS resulting in fatal hemorrhages in some patients. We identified high serum vitamin B(12) as a consistent, novel finding in GPS. Chromosome 3p21.1-3p22.1 has not been previously linked to a platelet disorder; identification of the GPS gene will likely lead to the discovery of novel components of platelet organelle biogenesis. This study is registered at www.clinicaltrials.gov as NCT00069680 and NCT00369421.


Assuntos
Cromossomos Humanos Par 3/genética , Síndrome da Plaqueta Cinza/genética , Síndrome da Plaqueta Cinza/fisiopatologia , Adolescente , Adulto , Plaquetas/ultraestrutura , Separação Celular , Criança , Pré-Escolar , Análise Mutacional de DNA , Feminino , Citometria de Fluxo , Ligação Genética , Estudo de Associação Genômica Ampla , Síndrome da Plaqueta Cinza/sangue , Humanos , Masculino , Repetições de Microssatélites , Microscopia Eletrônica de Transmissão , Pessoa de Meia-Idade , Neutrófilos/ultraestrutura , Linhagem , Reação em Cadeia da Polimerase , Polimorfismo de Nucleotídeo Único , Vitamina B 12/sangue , Adulto Jovem
16.
BMC Med Genomics ; 15(1): 56, 2022 03 14.
Artigo em Inglês | MEDLINE | ID: mdl-35287663

RESUMO

BACKGROUND: Clinical use of genotype data requires high positive predictive value (PPV) and thorough understanding of the genotyping platform characteristics. BeadChip arrays, such as the Global Screening Array (GSA), potentially offer a high-throughput, low-cost clinical screen for known variants. We hypothesize that quality assessment and comparison to whole-genome sequence and benchmark data establish the analytical validity of GSA genotyping. METHODS: To test this hypothesis, we selected 263 samples from Coriell, generated GSA genotypes in triplicate, generated whole genome sequence (rWGS) genotypes, assessed the quality of each set of genotypes, and compared each set of genotypes to each other and to the 1000 Genomes Phase 3 (1KG) genotypes, a performance benchmark. For 59 genes (MAP59), we also performed theoretical and empirical evaluation of variants deemed medically actionable predispositions. RESULTS: Quality analyses detected sample contamination and increased assay failure along the chip margins. Comparison to benchmark data demonstrated that > 82% of the GSA assays had a PPV of 1. GSA assays targeting transitions, genomic regions of high complexity, and common variants performed better than those targeting transversions, regions of low complexity, and rare variants. Comparison of GSA data to rWGS and 1KG data showed > 99% performance across all measured parameters. Consistent with predictions from prior studies, the GSA detection of variation within the MAP59 genes was 3/261. CONCLUSION: We establish the analytical validity of GSA assays using quality analytics and comparison to benchmark and rWGS data. GSA assays meet the standards of a clinical screen although assays interrogating rare variants, transversions, and variants within low-complexity regions require careful evaluation.


Assuntos
Benchmarking , Sequenciamento de Nucleotídeos em Larga Escala , Genoma , Genótipo , Polimorfismo de Nucleotídeo Único
17.
Mol Genet Metab ; 104(1-2): 189-91, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21767969

RESUMO

While genomic sequencing methods are powerful tools in the discovery of the genetic underpinnings of human disease, incidentally-revealed novel genomic risk factors may be equally important, both scientifically, and as relates to direct patient care. We performed whole-exome sequencing on a child with VACTERL association who suffered severe post-surgical neonatal pulmonary hypertension, and identified a potential novel genetic risk factor for this complication: a heterozygous mutation in CPSI. Newborn screening results from this patient's monozygotic twin provided evidence that this mutation, in combination with an environmental trigger (in this case, surgery), may have resulted in pulmonary artery hypertension due to inadequate nitric oxide production. Identification of this genetic risk factor allows for targeted medical preventative measures in this patient as well as relatives with the same mutation, and illustrates the power of incidental medical information unearthed by whole-exome sequencing.


Assuntos
Exoma/genética , Genômica , Medicina de Precisão , Hipertensão Pulmonar Primária Familiar , Humanos , Hipertensão Pulmonar/genética , Lactente , Recém-Nascido , Masculino , Reprodutibilidade dos Testes , Análise de Sequência de DNA
18.
Proteins ; 67(4): 811-20, 2007 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-17357158

RESUMO

The knowledge of protein and domain interactions provide crucial insights into their function within a cell. Several computational methods have been proposed to detect interactions between proteins and their constitutive domains. In this work, we focus on approaches based on correlated evolution (coevolution) of sequences of interacting proteins. In this type of approach, often referred to as the mirrortree method, a high correlation of evolutionary histories of two proteins is used as an indicator to predict protein interactions. Recently, it has been observed that subtracting the underlying speciation process by separating coevolution due to common speciation divergence from that due to common function of interacting pairs greatly improves the predictive power of the mirrortree approach. In this article, we investigate possible improvements and limitations of this method. In particular, we demonstrate that the performance of the mirrortree method that can be further improved by restricting the coevolution analysis to the relatively conserved regions in the protein domain sequences (disregarding highly divergent regions). We provide a theoretical validation of our results leading to new insights into the interplay between coevolution and speciation of interacting proteins.


Assuntos
Estrutura Terciária de Proteína , Proteínas/química , Proteínas/metabolismo , Sequência de Aminoácidos , Biologia Computacional , Entropia , Evolução Molecular , Modelos Moleculares , Proteínas/genética
19.
J Mol Biol ; 362(4): 861-75, 2006 Sep 29.
Artigo em Inglês | MEDLINE | ID: mdl-16949097

RESUMO

Recent advances in functional genomics have helped generate large-scale high-throughput protein interaction data. Such networks, though extremely valuable towards molecular level understanding of cells, do not provide any direct information about the regions (domains) in the proteins that mediate the interaction. Here, we performed co-evolutionary analysis of domains in interacting proteins in order to understand the degree of co-evolution of interacting and non-interacting domains. Using a combination of sequence and structural analysis, we analyzed protein-protein interactions in F1-ATPase, Sec23p/Sec24p, DNA-directed RNA polymerase and nuclear pore complexes, and found that interacting domain pair(s) for a given interaction exhibits higher level of co-evolution than the non-interacting domain pairs. Motivated by this finding, we developed a computational method to test the generality of the observed trend, and to predict large-scale domain-domain interactions. Given a protein-protein interaction, the proposed method predicts the domain pair(s) that is most likely to mediate the protein interaction. We applied this method on the yeast interactome to predict domain-domain interactions, and used known domain-domain interactions found in PDB crystal structures to validate our predictions. Our results show that the prediction accuracy of the proposed method is statistically significant. Comparison of our prediction results with those from two other methods reveals that only a fraction of predictions are shared by all the three methods, indicating that the proposed method can detect known interactions missed by other methods. We believe that the proposed method can be used with other methods to help identify previously unrecognized domain-domain interactions on a genome scale, and could potentially help reduce the search space for identifying interaction sites.


Assuntos
Evolução Molecular , Mapeamento de Interação de Proteínas , Sequência de Aminoácidos , Vesículas Revestidas pelo Complexo de Proteína do Envoltório , RNA Polimerases Dirigidas por DNA/química , RNA Polimerases Dirigidas por DNA/metabolismo , Dimerização , Proteínas Ativadoras de GTPase , Carioferinas/química , Proteínas de Membrana/química , Proteínas de Membrana/metabolismo , Mutação/genética , Proteínas Nucleares/química , Proteínas Nucleares/metabolismo , Proteínas de Transporte Nucleocitoplasmático , Ligação Proteica , Estrutura Terciária de Proteína , ATPases Translocadoras de Prótons/química , ATPases Translocadoras de Prótons/metabolismo , Reprodutibilidade dos Testes , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/metabolismo , Leveduras/química
20.
Nucleic Acids Res ; 33(Database issue): D192-6, 2005 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-15608175

RESUMO

The Conserved Domain Database (CDD) is the protein classification component of NCBI's Entrez query and retrieval system. CDD is linked to other Entrez databases such as Proteins, Taxonomy and PubMed, and can be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=cdd. CD-Search, which is available at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, is a fast, interactive tool to identify conserved domains in new protein sequences. CD-Search results for protein sequences in Entrez are pre-computed to provide links between proteins and domain models, and computational annotation visible upon request. Protein-protein queries submitted to NCBI's BLAST search service at http://www.ncbi.nlm.nih.gov/BLAST are scanned for the presence of conserved domains by default. While CDD started out as essentially a mirror of publicly available domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update, and in some cases replace these models with domain hierarchies curated at the NCBI. Here, we report on the progress of the curation effort and associated improvements in the functionality of the CDD information retrieval system.


Assuntos
Bases de Dados de Proteínas , Estrutura Terciária de Proteína , Proteínas/classificação , Sequência de Aminoácidos , Sequência Conservada , Filogenia , Alinhamento de Sequência , Análise de Sequência de Proteína , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa