Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
Mais filtros












Base de dados
Intervalo de ano de publicação
1.
Bone ; 94: 65-74, 2017 01.
Artigo em Inglês | MEDLINE | ID: mdl-27789416

RESUMO

BACKGROUND: Osteogenesis imperfecta (OI), the commonest inherited bone fragility disorder, affects 1 in 15,000 live births resulting in frequent fractures and reduced mobility, with significant impact on quality of life. Early diagnosis is important, as therapeutic advances can lead to improved clinical outcome and patient benefit. REPORT: Whole exome sequencing in patients with OI identified, in two patients with a multi-system phenotype, compound heterozygous variants in NBAS (neuroblastoma amplified sequence). Patient 1: NBAS c.5741G>A p.(Arg1914His); c.3010C>T p.(Arg1004*) in a 10-year old boy with significant short stature, bone fragility requiring treatment with bisphosphonates, developmental delay and immunodeficiency. Patient 2: NBAS c.5741G>A p.(Arg1914His); c.2032C>T p.(Gln678*) in a 5-year old boy with similar presenting features, bone fragility, mild developmental delay, abnormal liver function tests and immunodeficiency. DISCUSSION: Homozygous missense NBAS variants cause SOPH syndrome (short stature; optic atrophy; Pelger-Huet anomaly), the same missense variant was found in our patients on one allele and a nonsense variant in the other allele. Recent literature suggests a multi-system phenotype. In this study, patient fibroblasts have shown reduced collagen expression, compared to control cells and RNAseq studies, in bone cells show that NBAS is expressed in osteoblasts and osteocytes of rodents and primates. These findings provide proof-of-concept that NBAS mutations have mechanistic effects in bone, and that NBAS variants are a novel cause of bone fragility, which is distinguishable from 'Classical' OI. CONCLUSIONS: Here we report on variants in NBAS, as a cause of bone fragility in humans, and expand the phenotypic spectrum associated with NBAS. We explore the mechanism underlying NBAS and the striking skeletal phenotype in our patients.


Assuntos
Mutação/genética , Proteínas de Neoplasias/genética , Osteogênese Imperfeita/genética , Sequência de Bases , Células Cultivadas , Criança , Pré-Escolar , Fibroblastos/patologia , Heterozigoto , Humanos , Lactente , Recém-Nascido , Masculino , Proteínas de Neoplasias/química , Osteogênese Imperfeita/diagnóstico por imagem , Domínios Proteicos , Pele/patologia , Pele/ultraestrutura
2.
Oncogene ; 35(2): 158-72, 2016 Jan 14.
Artigo em Inglês | MEDLINE | ID: mdl-25798844

RESUMO

The miR-200 family promotes the epithelial state by suppressing the Zeb1/Zeb2 epithelial gene transcriptional repressors. To identify other miR-200-regulated genes, we isolated mRNAs bound to transfected biotinylated miR-200c in mouse breast cancer cells. In all, 520 mRNAs were significantly enriched in miR-200c binding at least twofold. Putative miR-200-regulated genes included Zeb2, enriched 3.5-fold in the pull down. However, Zeb2 knockdown does not fully recapitulate miR-200c overexpression, suggesting that regulating other miR-200 targets contributes to miR-200's enhancement of epithelial gene expression. Candidate genes were highly enriched for miR-200c seed pairing in their 3'UTR and coding sequence and for genes that were downregulated by miR-200c overexpression. Epidermal growth factor receptor and downstream MAPK signaling pathways were the most enriched pathways. Genes whose products mediate transforming growth factor (TGF)-ß signaling were also significantly overrepresented, and miR-200 counteracted the suppressive effects of TGF-ß and bone morphogenic protein 2 (BMP-2) on epithelial gene expression. miR-200c regulated the 3'UTRs of 12 of 14 putative miR-200c-binding mRNAs tested. The extent of mRNA binding to miR-200c strongly correlated with gene suppression. Twelve targets of miR-200c (Crtap, Fhod1, Smad2, Map3k1, Tob1, Ywhag/14-3-3γ, Ywhab/14-3-3ß, Smad5, Zfp36, Xbp1, Mapk12, Snail1) were experimentally validated by identifying their 3'UTR miR-200 recognition elements. Smad2 and Smad5 form a complex with Zeb2 and Ywhab/14-3-3ß and Ywhag/14-3-3γ form a complex with Snail1. These complexes that repress transcription assemble on epithelial gene promoters. miR-200 overexpression induced RNA polymerase II localization and reduced Zeb2 and Snail1 binding to epithelial gene promoters. Expression of miR-200-resistant Smad5 modestly, but significantly, reduced epithelial gene induction by miR-200. miR-200 expression and Zeb2 knockdown are known to inhibit cell invasion in in vitro assays. Knockdown of each of three novel miR-200 target genes identified here, Smad5, Ywhag and Crtap, also profoundly suppressed cell invasion. Thus, miR-200 suppresses TGF-ß/BMP signaling, promotes epithelial gene expression and suppresses cell invasion by regulating a network of genes.


Assuntos
Neoplasias da Mama/genética , Transição Epitelial-Mesenquimal/genética , Proteínas de Homeodomínio/genética , MicroRNAs/genética , Proteínas Repressoras/genética , Fatores de Transcrição/genética , Regiões 3' não Traduzidas , Animais , Proteína Morfogenética Óssea 2/metabolismo , Proteína Morfogenética Óssea 2/farmacologia , Neoplasias da Mama/patologia , Caderinas/genética , Caderinas/metabolismo , Linhagem Celular Tumoral , Transição Epitelial-Mesenquimal/efeitos dos fármacos , Feminino , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Proteínas de Homeodomínio/metabolismo , Camundongos , Células NIH 3T3 , Proteínas Repressoras/metabolismo , Fatores de Transcrição da Família Snail , Fatores de Transcrição/metabolismo , Fator de Crescimento Transformador beta/metabolismo , Fator de Crescimento Transformador beta/farmacologia , Homeobox 2 de Ligação a E-box com Dedos de Zinco
3.
Virology ; 396(2): 213-25, 2010 Jan 20.
Artigo em Inglês | MEDLINE | ID: mdl-19913270

RESUMO

It is unresolved whether recently transmitted human immunodeficiency viruses (HIV) have genetic features that specifically favour their transmissibility. To identify potential "transmission signatures", we compared 20 full-length HIV-1 subtype C genomes from primary infections, with 66 sampled from ethnically and geographically matched individuals with chronic infections. Controlling for recombination and phylogenetic relatedness, we identified 39 sites at which amino acid frequency spectra differed significantly between groups. These sites were predominantly located within Env, Pol and Gag (14/39, 9/39 and 6/39 respectively) and were significantly clustered (33/39) within known immunoreactive peptides. Within 6 months of infection, we detected reversion-to-consensus mutations at 14 sites and potential CTL escape mutations at seven. Here we provide evidence that frequent reversion mutations probably allows the virus to recover replicative fitness which, together with immune escape driven by the HLA alleles of the new hosts, differentiate sequences from chronic infections from those sampled shortly after transmission.


Assuntos
Infecções por HIV/virologia , HIV-1/genética , Antígenos HLA/imunologia , Evasão da Resposta Imune/genética , Substituição de Aminoácidos , Progressão da Doença , Feminino , Genoma Viral/genética , Infecções por HIV/imunologia , HIV-1/imunologia , Humanos , Evasão da Resposta Imune/imunologia , Dados de Sequência Molecular , Mutação/genética , Alinhamento de Sequência , Análise de Sequência de Proteína , Proteínas do Envelope Viral/genética , Proteínas do Envelope Viral/imunologia
4.
J Virol ; 83(8): 3556-67, 2009 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-19193811

RESUMO

Identifying the specific genetic characteristics of successfully transmitted variants may prove central to the development of effective vaccine and microbicide interventions. Although human immunodeficiency virus transmission is associated with a population bottleneck, the extent to which different factors influence the diversity of transmitted viruses is unclear. We estimate here the number of transmitted variants in 69 heterosexual men and women with primary subtype C infections. From 1,505 env sequences obtained using a single genome amplification approach we show that 78% of infections involved single variant transmission and 22% involved multiple variant transmissions (median of 3). We found evidence for mutations selected for cytotoxic-T-lymphocyte or antibody escape and a high prevalence of recombination in individuals infected with multiple variants representing another potential escape pathway in these individuals. In a combined analysis of 171 subtype B and C transmission events, we found that infection with more than one variant does not follow a Poisson distribution, indicating that transmission of individual virions cannot be seen as independent events, each occurring with low probability. While most transmissions resulted from a single infectious unit, multiple variant transmissions represent a significant fraction of transmission events, suggesting that there may be important mechanistic differences between these groups that are not yet understood.


Assuntos
Variação Genética , Infecções por HIV/transmissão , Infecções por HIV/virologia , HIV-1/fisiologia , Adulto , Análise por Conglomerados , Feminino , HIV-1/classificação , HIV-1/genética , Humanos , Masculino , Dados de Sequência Molecular , Filogenia , RNA Viral/genética , Análise de Sequência de DNA , Homologia de Sequência , Adulto Jovem
5.
Mol Biochem Parasitol ; 118(2): 201-10, 2001 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-11738710

RESUMO

We have undertaken the first comparative pilot gene discovery analysis of approximately 25,000 random genomic and expressed sequence tags (ESTs) from three species of Plasmodium, the infectious agent that causes malaria. A total of 5482 genome survey sequences (GSSs) and 5582 ESTs were generated from mung bean nuclease (MBN) and cDNA libraries, respectively, of the ANKA line of the rodent malaria parasite Plasmodium berghei, and 10,874 GSSs generated from MBN libraries of the Salvador I and Belem lines of Plasmodium vivax, the most geographically wide-spread human malaria pathogen. These tags, together with 2438 Plasmodium falciparum sequences present in GenBank, were used to perform first-pass assembly and transcript reconstruction, and non-redundant consensus sequence datasets created. The datasets were compared against public protein databases and more than 1000 putative new Plasmodium proteins identified based on sequence similarity. Homologs of previously characterized Plasmodium genes were also identified, increasing the number of P. vivax and P. berghei sequences in public databases at least 10-fold. Comparative studies with other species of Apicomplexa identified interesting homologs of possible therapeutic or diagnostic value. A gene prediction program, Phat, was used to predict probable open reading frames for proteins in all three datasets. Predicted and non-redundant BLAST-matched proteins were submitted to InterPro, an integrated database of protein domains, signatures and families, for functional classification. Thus a partial predicted proteome was created for each species. This first comparative analysis of Plasmodium protein coding sequences represents a valuable resource for further studies on the biology of this important pathogen.


Assuntos
Biologia Computacional/métodos , Genoma de Protozoário , Genômica , Malária/parasitologia , Plasmodium/genética , Proteínas de Protozoários/genética , Animais , Apicomplexa/classificação , Apicomplexa/genética , DNA Complementar/genética , Bases de Dados de Ácidos Nucleicos , Humanos , Dados de Sequência Molecular , Plasmodium/classificação , Plasmodium berghei/genética , Plasmodium falciparum/genética , Plasmodium vivax/genética , Proteoma , Proteínas de Protozoários/metabolismo , Análise de Sequência de DNA
6.
Genome Res ; 11(11): 1848-53, 2001 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-11691849

RESUMO

Completion of the human genome sequence provides evidence for a gene count with lower bound 30,000-40,000. Significant protein complexity may derive in part from multiple transcript isoforms. Recent EST based studies have revealed that alternate transcription, including alternative splicing, polyadenylation and transcription start sites, occurs within at least 30-40% of human genes. Transcript form surveys have yet to integrate the genomic context, expression, frequency, and contribution to protein diversity of isoform variation. We determine here the degree to which protein coding diversity may be influenced by alternate expression of transcripts by exhaustive manual confirmation of genome sequence annotation, and comparison to available transcript data to accurately associate skipped exon isoforms with genomic sequence. Relative expression levels of transcripts are estimated from EST database representation. The rigorous in silico method accurately identifies exon skipping using verified genome sequence. 545 genes have been studied in this first hand-curated assessment of exon skipping on chromosome 22. Combining manual assessment with software screening of exon boundaries provides a highly accurate and internally consistent indication of skipping frequency. 57 of 62 exon skipping events occur in the protein coding regions of 52 genes. A single gene, (FBXO7) expresses an exon repetition. 59% of highly represented multi-exon genes are likely to express exon-skipped isoforms in ratios that vary from 1:1 to 1:>100. The proportion of all transcripts corresponding to multi-exon genes that exhibit an exon skip is estimated to be 5%.


Assuntos
Processamento Alternativo , Cromossomos Humanos Par 22/genética , Códon/genética , Éxons/genética , Variação Genética/genética , Proteínas/genética , Humanos , Isoformas de Proteínas/genética , Sítios de Splice de RNA/genética
7.
Genome Biol ; 2(10): RESEARCH0044, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11597336

RESUMO

BACKGROUND: The genome of Mycobacterium tuberculosis H37Rv has five copies of a cluster of genes known as the ESAT-6 loci. These clusters contain members of the CFP-10 (lhp) and ESAT-6 (esat-6) gene families (encoding secreted T-cell antigens that lack detectable secretion signals) as well as genes encoding secreted, cell-wall-associated subtilisin-like serine proteases, putative ABC transporters, ATP-binding proteins and other membrane-associated proteins. These membrane-associated and energy-providing proteins may function to secrete members of the ESAT-6 and CFP-10 protein families, and the proteases may be involved in processing the secreted peptide. RESULTS: Finished and unfinished genome sequencing data of 98 publicly available microbial genomes has been analyzed for the presence of orthologs of the ESAT-6 loci. The multiple duplicates of the ESAT-6 gene cluster found in the genome of M. tuberculosis H37Rv are also conserved in the genomes of other mycobacteria, for example M. tuberculosis CDC1551, M. tuberculosis 210, M. bovis, M. leprae, M. avium, and the avirulent strain M. smegmatis. Phylogenetic analyses of the resulting sequences have established the duplication order of the gene clusters and demonstrated that the gene cluster known as region 4 (Rv3444c-3450c) is ancestral. Region 4 is also the only region for which an ortholog could be found in the genomes of Corynebacterium diphtheriae and Streptomyces coelicolor. CONCLUSIONS: Comparative genomic analysis revealed that the presence of the ESAT-6 gene cluster is a feature of some high-G+C Gram-positive bacteria. Multiple duplications of this cluster have occurred and are maintained only within the genomes of members of the genus Mycobacterium.


Assuntos
Antígenos de Bactérias/genética , Família Multigênica , Mycobacterium tuberculosis/genética , Mycobacterium/genética , Proteínas de Bactérias , Corynebacterium diphtheriae/genética , DNA Bacteriano/genética , Sequência Rica em GC , Genoma Bacteriano , Bactérias Gram-Positivas/genética , Mycobacterium avium/genética , Mycobacterium avium subsp. paratuberculosis/genética , Mycobacterium bovis/genética , Mycobacterium leprae/genética , Mycobacterium smegmatis/genética , Filogenia , Análise de Sequência de DNA , Streptomyces/genética
8.
Tuberculosis (Edinb) ; 81(4): 291-302, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11584597

RESUMO

SETTING: M. tuberculosis isolates were collected from patients attending health clinics in a high incidence urban community and in a low incidence rural setting in South Africa. OBJECTIVE: To reconstruct the evolutionary history of a group of closely related M. tuberculosis isolates using IS6110, DRr and MTB484(1) restriction fragment length polymorphism (RFLP) data. DESIGN: Mycobacterium tuberculosis isolates containing an average of ten IS6110 elements, with a similarity index of > or = 65% were genotypically classified by DNA fingerprinting using the IS6110 derived probes IS-3' and IS-5', as well as the DRr and MTB484(1) probes, in combination with PvuII or Hinfl endonuclease digestion. These RFLP data were subjected to phylogenetic analysis using both genetic distance and parsimony algorithms. RESULTS: Phylogenetic analysis predicted the existence of two independently evolving lineages, possibly evolving from a common ancestral strain. The topology of the phylogenetic tree was supported by comprehensive bootstrapping and the specific partitioning of DNA methylation phenotypes. The observed difference in the branch lengths of the two lineages may suggest differential evolutionary rates. Isolates collected from different geographical regions demonstrate independent evolution, suggesting that it is highly unlikely that strains have been recently transmitted between the two regions. The number of evolutionary events identified in this strain family differs significantly from that of previously characterized strain families, implying that evolutionary rate may be strain family dependent. CONCLUSION: Based on this analysis we propose that the algorithm used to calculate recent epidemiological events should be revised to incorporate the evolutionary characteristics of individual strain families, thereby enhancing the accuracy of molecular epidemiological calculations.


Assuntos
Evolução Molecular , Mycobacterium tuberculosis/genética , Filogenia , Algoritmos , Southern Blotting , Impressões Digitais de DNA/métodos , Análise Mutacional de DNA , DNA Bacteriano/genética , Humanos , Mycobacterium tuberculosis/classificação , Polimorfismo de Fragmento de Restrição
10.
Nucleic Acids Res ; 29(1): 234-8, 2001 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-11125101

RESUMO

STACK is a tool for detection and visualisation of expressed transcript variation in the context of developmental and pathological states. The datasystem organizes and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index and is accessible via WWW at http://www.sanbi.ac.za/Dbases.html. STACK represents a broadly applicable resource, as it is the only reconstructed transcript database for which the tools for its generation are also broadly available (http://www.sanbi.ac.za/CODES).


Assuntos
Bases de Dados Factuais , Etiquetas de Sequências Expressas , Sequência de Bases , Sequência Consenso , Expressão Gênica , Humanos , Internet , Dados de Sequência Molecular , Alinhamento de Sequência
12.
AIDS Res Hum Retroviruses ; 17(17): 1681-7, 2001 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-11779357

RESUMO

We have characterized 43 nef sequences from subtype C HIV-1-infected South Africans and compared deduced amino acid sequences with other subtypes to identify areas of conservation. Our Nef amino acid sequences were aligned with a consensus subtype B, HXB2 reference strain and a consensus subtype C sequence. All were found to be highly homologous to subtype B in the central region of Nef, but more variable at the N and C termini of the molecule. Alignment of a consensus amino acid sequence generated from South African subtype C Nef with subtypes A, B, and D underscores cross-clade conservation in the central domain of the molecule. This domain is also rich in previously described cytotoxic T lymphocyte (CTL) epitopes that are restricted by commonly found HLA molecules in the South African population.


Assuntos
Sequência Conservada , Epitopos de Linfócito T/genética , Produtos do Gene nef/genética , HIV-1/classificação , Linfócitos T Citotóxicos/imunologia , Sequência de Aminoácidos , Epitopos de Linfócito T/imunologia , Produtos do Gene nef/química , Produtos do Gene nef/imunologia , Infecções por HIV/virologia , HIV-1/genética , Dados de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNA , África do Sul , Produtos do Gene nef do Vírus da Imunodeficiência Humana
13.
Genome Res ; 9(11): 1135-42, 1999 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-10568753

RESUMO

Several efforts are under way to condense single-read expressed sequence tags (ESTs) and full-length transcript data on a large scale by means of clustering or assembly. One goal of these projects is the construction of gene indices where transcripts are partitioned into index classes (or clusters) such that they are put into the same index class if and only if they represent the same gene. Accurate gene indexing facilitates gene expression studies and inexpensive and early partial gene sequence discovery through the assembly of ESTs that are derived from genes that have yet to be positionally cloned or obtained directly through genomic sequencing. We describe d2_cluster, an agglomerative algorithm for rapidly and accurately partitioning transcript databases into index classes by clustering sequences according to minimal linkage or "transitive closure" rules. We then evaluate the relative efficiency of d2_cluster with respect to other clustering tools. UniGene is chosen for comparison because of its high quality and wide acceptance. It is shown that although d2_cluster and UniGene produce results that are between 83% and 90% identical, the joining rate of d2_cluster is between 8% and 20% greater than UniGene. Finally, we present the first published rigorous evaluation of under and over clustering (in other words, of type I and type II errors) of a sequence clustering algorithm, although the existence of highly identical gene paralogs means that care must be taken in the interpretation of the type II error. Upper bounds for these d2_cluster error rates are estimated at 0.4% and 0.8%, respectively. In other words, the sensitivity and selectivity of d2_cluster are estimated to be >99.6% and 99.2%.


Assuntos
Algoritmos , DNA Complementar/genética , Etiquetas de Sequências Expressas , Análise de Sequência de DNA , Análise por Conglomerados , Bases de Dados Factuais , Humanos , Análise de Sequência de DNA/métodos
14.
Genome Res ; 9(11): 1143-55, 1999 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-10568754

RESUMO

The expressed human genome is being sequenced and analyzed by disparate groups producing disparate data. The majority of the identified coding portion is in the form of expressed sequence tags (ESTs). The need to discover exonic representation and expression forms of full-length cDNAs for each human gene is frustrated by the partial and variable quality nature of this data delivery. A highly redundant human EST data set has been processed into integrated and unified expressed transcript indices that consist of hierarchically organized human transcript consensi reflecting gene expression forms and genetic polymorphism within an index class. The expression index and its intermediate outputs include cleaned transcript sequence, expression, and alignment information and a higher fidelity subset, SANIGENE. The STACK_PACK clustering system has been applied to dbEST release 121598 (GenBank version 110). Sixty-four percent of 1,313, 103 Homo sapiens ESTs are condensed into 143,885 tissue level multiple sequence clusters; linking through clone-ID annotations produces 68,701 total assemblies, such that 81% of the original input set is captured in a STACK multiple sequence or linked cluster. Indexing of alignments by substituent EST accession allows browsing of the data structure and its cross-links to UniGene. STACK metaclusters consolidate a greater number of ESTs by a factor of 1. 86 with respect to the corresponding UniGene build. Fidelity comparison with genome reference sequence AC004106 demonstrates consensus expression clusters that reflect significantly lower spurious repeat sequence content and capture alternate splicing within a whole body index cluster and three STACK v.2.3 tissue-level clusters. Statistics of a staggered release whole body index build of STACK v.2.0 are presented.


Assuntos
Análise por Conglomerados , Sequência Consenso , Etiquetas de Sequências Expressas , Expressão Gênica , Algoritmos , Bases de Dados Factuais , Genoma Humano , Humanos , Polimorfismo de Nucleotídeo Único/genética , Alinhamento de Sequência
15.
Nat Genet ; 22(3): 255-9, 1999 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-10391212

RESUMO

Inherited retinal diseases are a common cause of visual impairment in children and young adults, often resulting in severe loss of vision in later life. The most frequent form of inherited retinopathy is retinitis pigmentosa (RP), with an approximate incidence of 1 in 3,500 individuals worldwide. RP is characterized by night blindness and progressive degeneration of the midperipheral retina, accompanied by bone spicule-like pigmentary deposits and a reduced or absent electroretinogram (ERG). The disease process culminates in severe reduction of visual fields or blindness. RP is genetically heterogeneous, with autosomal dominant, autosomal recessive and X-linked forms. Here we have identified two mutations in a novel retina-specific gene from chromosome 8q that cause the RP1 form of autosomal dominant RP in three unrelated families. The protein encoded by this gene is 2,156 amino acids and its function is currently unknown, although the amino terminus has similarity to that of the doublecortin protein, whose gene (DCX) has been implicated in lissencephaly in humans. Two families have a nonsense mutation in codon 677 of this gene (Arg677stop), whereas the third family has a nonsense mutation in codon 679 (Gln679stop). In one family, two individuals homozygous for the mutant gene have more severe retinal disease compared with heterozygotes.


Assuntos
Mutação , Retina/metabolismo , Retinose Pigmentar/genética , Adulto , Sequência de Aminoácidos , Substituição de Aminoácidos , Criança , Cromossomos Humanos Par 8/genética , Proteínas do Olho/genética , Feminino , Genes Dominantes , Heterozigoto , Homozigoto , Humanos , Masculino , Proteínas Associadas aos Microtúbulos , Dados de Sequência Molecular , Linhagem , Polimorfismo Genético , Homologia de Sequência de Aminoácidos
16.
Genome Res ; 8(3): 276-90, 1998 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-9521931

RESUMO

Several efforts are under way to partition single-read expressed sequence tag (EST), as well as full-length transcript data, into large-scale gene indices, where transcripts are in common index classes if and only if they share a common progenitor gene. Accurate gene indexing facilitates gene expression studies, as well as inexpensive and early gene sequence discovery through assembly of ESTs that are derived from genes that have not been sequenced by classical methods. We extend, correct, and enhance the information obtained from index groups by splitting index classes into subclasses based on sequence dissimilarity (diversity). Two applications of this are highlighted in this report. First it is shown that our method can ameliorate the damage that artifacts, such as chimerism, inflict on index integrity. Additionally, we demonstrate how the organization imposed by an effective subpartition can greatly increase the sensitivity of gene expression studies by accounting for the existence and tissue- or pathology-specific regulation of novel gene isoforms and polymorphisms. We apply our subpartitioning treatment to the UniGene gene indexing project to measure a marked increase in information quality and abundance (in terms of assembly length and insertion/deletion error) after treatment and demonstrate cases where new levels of information concerning differential expression of alternate gene forms, such as regulated alternative splicing, are discovered. [Tables 2 and 3 can be viewed in their entirety as Online Supplements at http://www.genome.org.]


Assuntos
Processamento Alternativo , Biologia Computacional/métodos , Armazenamento e Recuperação da Informação , Análise de Sequência de DNA/métodos , Animais , Artefatos , Clonagem Molecular , Drosophila , Expressão Gênica , Homologia de Genes , Humanos , Camundongos , Família Multigênica , Reprodutibilidade dos Testes , Análise de Sequência de DNA/estatística & dados numéricos , Software , Transcrição Gênica
18.
J Comput Biol ; 1(3): 199-215, 1994.
Artigo em Inglês | MEDLINE | ID: mdl-8790465

RESUMO

A number of algorithms exist for searching sequence databases for biologically significant similarities based on the primary sequence similarity of aligned sequences. We have determined the biological sensitivity and selectivity of d2, a high-performance comparison algorithm that rapidly determines the relative dissimilarity of large datasets of genetic sequences. d2 uses sequence-word multiplicity as a simple measure of dissimilarity. It is not constrained by the comparison of direct sequence alignments and so can use word contexts to yield new information on relationships. It is extremely efficient, comparing a query of length 884 bases (INS1ECLAC) with 19,540,603 bases of the bacterial division of GenBank (release 76.0) in 51.77 CPU seconds on a Cray Y/MP-48 supercomputer. It is unique in that subsequences (words) of biological interest can be weighted to improve the sensitivity and selectivity of a search over existing methods. We have determined the ability of d2 to detect biologically significant matches between a query and large datasets of DNA sequences while varying parameters such as word-length and window size. We have also determined the distribution of dissimilarity scores within eukaryotic and prokaryotic divisions of GenBank. We have optimized parameters of the d2 program using Cray hardware and present an analysis of the sensitivity and selectivity of the algorithm. A theoretical analysis of the expectation for scores is presented. This work demonstrates that d2 is a unique, sensitive, and selective method of rapid sequence comparison that can detect novel sequence relationships which remain undetected by alternate methodologies.


Assuntos
Algoritmos , Computação Matemática , Alinhamento de Sequência/métodos , Homologia de Sequência , Computadores de Grande Porte , Células Eucarióticas , Dados de Sequência Molecular , Células Procarióticas , Fatores de Tempo
19.
Mol Cell Biol ; 12(9): 3827-33, 1992 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-1508186

RESUMO

In contrast to many other cases in microbial development, Aspergillus nidulans conidiophore production initiates primarily as a programmed part of the life cycle rather than as a response to nutrient deprivation. Mutations in the acoD locus result in "fluffy" colonies that appear to grow faster than the wild type and proliferate as undifferentiated masses of vegetative cells. We show that unlike wild-type strains, acoD deletion mutants are unable to make conidiophores under optimal growth conditions but can be induced to conidiate when growth is nutritionally limited. The requirement for acoD in conidiophore development occurs prior to activation of brlA, a primary regulator of development. The acoD transcript is present both in vegetative hyphae prior to developmental induction and in developing cultures. However, the effects of acoD mutations are detectable only after developmental induction. We propose that acoD activity is primarily controlled at the posttranscriptional level and that it is required to direct developmentally specific changes that bring about growth inhibition and activation of brlA expression to result in conidiophore development.


Assuntos
Aspergillus nidulans/genética , Genes Fúngicos , Fatores de Transcrição , Aspergillus nidulans/crescimento & desenvolvimento , Proteínas Fúngicas/genética , Regulação Fúngica da Expressão Gênica , Cinética , Mutação , Fenótipo , Mapeamento por Restrição
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...