Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nat Genet ; 50(11): 1574-1583, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30275530

RESUMO

We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.


Assuntos
Mapeamento Cromossômico , Loci Gênicos , Genoma , Haplótipos , Camundongos Endogâmicos/genética , Animais , Animais de Laboratório , Mapeamento Cromossômico/veterinária , Haplótipos/genética , Camundongos , Camundongos Endogâmicos BALB C/genética , Camundongos Endogâmicos C3H/genética , Camundongos Endogâmicos C57BL/genética , Camundongos Endogâmicos CBA/genética , Camundongos Endogâmicos DBA/genética , Camundongos Endogâmicos NOD/genética , Camundongos Endogâmicos/classificação , Anotação de Sequência Molecular , Filogenia , Polimorfismo de Nucleotídeo Único , Especificidade da Espécie
2.
Genome Res ; 28(4): 448-459, 2018 04.
Artigo em Inglês | MEDLINE | ID: mdl-29563166

RESUMO

Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.


Assuntos
Evolução Molecular , Genoma/genética , Muridae/genética , Filogenia , Animais , Sítios de Ligação , Fator de Ligação a CCCTC/genética , Cromossomos/genética , Cariotipagem/métodos , Elementos Nucleotídeos Longos e Dispersos/genética , Camundongos , Retroelementos/genética , Especificidade da Espécie
3.
Nat Genet ; 48(8): 912-8, 2016 08.
Artigo em Inglês | MEDLINE | ID: mdl-27376238

RESUMO

Two bottlenecks impeding the genetic analysis of complex traits in rodents are access to mapping populations able to deliver gene-level mapping resolution and the need for population-specific genotyping arrays and haplotype reference panels. Here we combine low-coverage (0.15×) sequencing with a new method to impute the ancestral haplotype space in 1,887 commercially available outbred mice. We mapped 156 unique quantitative trait loci for 92 phenotypes at a 5% false discovery rate. Gene-level mapping resolution was achieved at about one-fifth of the loci, implicating Unc13c and Pgc1a at loci for the quality of sleep, Adarb2 for home cage activity, Rtkn2 for intensity of reaction to startle, Bmp2 for wound healing, Il15 and Id2 for several T cell measures and Prkca for bone mineral content. These findings have implications for diverse areas of mammalian biology and demonstrate how genome-wide association studies can be extended via low-coverage sequencing to species with highly recombinant outbred populations.


Assuntos
Animais não Endogâmicos/genética , Mapeamento Cromossômico , Marcadores Genéticos/genética , Estudo de Associação Genômica Ampla , Haplótipos/genética , Herança Multifatorial/genética , Locos de Características Quantitativas/genética , Animais , Genótipo , Camundongos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética
4.
PLoS Genet ; 9(6): e1003570, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23785304

RESUMO

Variation at regulatory elements, identified through hypersensitivity to digestion by DNase I, is believed to contribute to variation in complex traits, but the extent and consequences of this variation are poorly characterized. Analysis of terminally differentiated erythroblasts in eight inbred strains of mice identified reproducible variation at approximately 6% of DNase I hypersensitive sites (DHS). Only 30% of such variable DHS contain a sequence variant predictive of site variation. Nevertheless, sequence variants within variable DHS are more likely to be associated with complex traits than those in non-variant DHS, and variants associated with complex traits preferentially occur in variable DHS. Changes at a small proportion (less than 10%) of variable DHS are associated with changes in nearby transcriptional activity. Our results show that whilst DNA sequence variation is not the major determinant of variation in open chromatin, where such variants exist they are likely to be causal for complex traits.


Assuntos
Cromatina/genética , Desoxirribonuclease I/genética , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Desoxirribonuclease I/metabolismo , Camundongos , Fenótipo
5.
Nature ; 477(7364): 289-94, 2011 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-21921910

RESUMO

We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the laboratory mouse and to examine the functional consequences of allele-specific variation on transcript abundance, revealing that at least 12% of transcripts show a significant tissue-specific expression bias. By identifying candidate functional variants at 718 quantitative trait loci we show that the molecular nature of functional variants and their position relative to genes vary according to the effect size of the locus. These sequences provide a starting point for a new era in the functional analysis of a key model organism.


Assuntos
Regulação da Expressão Gênica/genética , Variação Genética/genética , Genoma/genética , Camundongos Endogâmicos/genética , Camundongos/genética , Fenótipo , Alelos , Animais , Animais de Laboratório/genética , Genômica , Camundongos/classificação , Camundongos Endogâmicos C57BL/genética , Filogenia , Locos de Características Quantitativas/genética
6.
Nature ; 477(7364): 326-9, 2011 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-21921916

RESUMO

Structural variation is widespread in mammalian genomes and is an important cause of disease, but just how abundant and important structural variants (SVs) are in shaping phenotypic variation remains unclear. Without knowing how many SVs there are, and how they arise, it is difficult to discover what they do. Combining experimental with automated analyses, we identified 711,920 SVs at 281,243 sites in the genomes of thirteen classical and four wild-derived inbred mouse strains. The majority of SVs are less than 1 kilobase in size and 98% are deletions or insertions. The breakpoints of 160,000 SVs were mapped to base pair resolution, allowing us to infer that insertion of retrotransposons causes more than half of SVs. Yet, despite their prevalence, SVs are less likely than other sequence variants to cause gene expression or quantitative phenotypic variation. We identified 24 SVs that disrupt coding exons, acting as rare variants of large effect on gene function. One-third of the genes so affected have immunological functions.


Assuntos
Variação Genética/genética , Genoma/genética , Camundongos Endogâmicos/genética , Fenótipo , Animais , Pontos de Quebra do Cromossomo , Éxons/genética , Feminino , Expressão Gênica , Genômica , Genótipo , Masculino , Camundongos , Camundongos Endogâmicos/imunologia , Mutagênese Insercional/genética , Locos de Características Quantitativas/genética , Ratos , Retroelementos/genética , Deleção de Sequência/genética
7.
Genome Res ; 21(8): 1239-48, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21493779

RESUMO

The Collaborative Cross (CC) is a genetic reference panel of recombinant inbred lines of mice, designed for the dissection of complex traits and gene networks. Each line is independently descended from eight genetically diverse founder strains such that the genomes of the CC lines, once fully inbred, are fine-grained homozygous mosaics of the founder haplotypes. We present an analysis of 120 CC lines, from a cohort of the CC bred at Tel Aviv University in collaboration with the University of Oxford, which at the time of this study were between the sixth and 12th generations of inbreeding and substantially homozygous at 170,000 SNPs. We show how CC genomes decompose into mosaics, and we identify loci that carry a deficiency or excess of a founder, many being deficient for the wild-derived strains WSB/EiJ and PWK/PhJ. We phenotyped 371 mice from 66 CC lines for a susceptibility to Aspergillus fumigatus infection. The survival time after infection varied significantly between CC lines. Quantitative trait locus (QTL) mapping identified genome-wide significant QTLs on chromosomes 2, 3, 8, 10 (two QTLs), 15, and 18. Simulations show that QTL mapping resolution (the median distance between the QTL peak and true location) varied between 0.47 and 1.18 Mb. Most of the QTLs involved contrasts between wild-derived founder strains and therefore would not segregate between classical inbred strains. Use of variation data from the genomes of the CC founder strains refined these QTLs further and suggested several candidate genes. These results support the use of the CC for dissecting complex traits.


Assuntos
Aspergilose/genética , Aspergillus fumigatus/fisiologia , Cruzamentos Genéticos , Animais , Aspergilose/microbiologia , Mapeamento Cromossômico/métodos , Predisposição Genética para Doença , Haplótipos , Camundongos , Camundongos Endogâmicos , Fenótipo , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas
8.
Bioinformatics ; 26(21): 2778-9, 2010 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-20847218

RESUMO

SUMMARY: Computational pipelines are common place in scientific research. However, most of the resources for constructing pipelines are heavyweight systems with graphical user interfaces. Ruffus is a library for the creation of computational pipelines. Its lightweight and unobtrusive design recommends it for use even for the most trivial of analyses. At the same time, it is powerful enough to have been used for complex workflows involving more than 50 interdependent stages. AVAILABILITY AND IMPLEMENTATION: Ruffus is written in python. Source code, a short tutorial, examples and a comprehensive user manual are freely available at http://www.ruffus.org.uk. The example program is available at http://www.ruffus.org.uk/examples/bioinformatics


Assuntos
Biologia Computacional/métodos , Software , Bases de Dados Factuais , Internet
9.
Genome Biol ; 11(7): R72, 2010.
Artigo em Inglês | MEDLINE | ID: mdl-20624288

RESUMO

BACKGROUND: Long considered to be the building block of life, it is now apparent that protein is only one of many functional products generated by the eukaryotic genome. Indeed, more of the human genome is transcribed into noncoding sequence than into protein-coding sequence. Nevertheless, whilst we have developed a deep understanding of the relationships between evolutionary constraint and function for protein-coding sequence, little is known about these relationships for non-coding transcribed sequence. This dearth of information is partially attributable to a lack of established non-protein-coding RNA (ncRNA) orthologs among birds and mammals within sequence and expression databases. RESULTS: Here, we performed a multi-disciplinary study of four highly conserved and brain-expressed transcripts selected from a list of mouse long intergenic noncoding RNA (lncRNA) loci that generally show pronounced evolutionary constraint within their putative promoter regions and across exon-intron boundaries. We identify some of the first lncRNA orthologs present in birds (chicken), marsupial (opossum), and eutherian mammals (mouse), and investigate whether they exhibit conservation of brain expression. In contrast to conventional protein-coding genes, the sequences, transcriptional start sites, exon structures, and lengths for these non-coding genes are all highly variable. CONCLUSIONS: The biological relevance of lncRNAs would be highly questionable if they were limited to closely related phyla. Instead, their preservation across diverse amniotes, their apparent conservation in exon structure, and similarities in their pattern of brain expression during embryonic and early postnatal stages together indicate that these are functional RNA molecules, of which some have roles in vertebrate brain development.


Assuntos
Encéfalo/metabolismo , Sequência Conservada/genética , RNA não Traduzido/genética , Vertebrados/genética , Animais , Encéfalo/embriologia , Galinhas/genética , Evolução Molecular , Regulação da Expressão Gênica no Desenvolvimento , Loci Gênicos/genética , Camundongos , Homologia de Sequência do Ácido Nucleico
10.
PLoS Genet ; 5(12): e1000753, 2009 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-19997497

RESUMO

The onset of prezygotic and postzygotic barriers to gene flow between populations is a hallmark of speciation. One of the earliest postzygotic isolating barriers to arise between incipient species is the sterility of the heterogametic sex in interspecies' hybrids. Four genes that underlie hybrid sterility have been identified in animals: Odysseus, JYalpha, and Overdrive in Drosophila and Prdm9 (Meisetz) in mice. Mouse Prdm9 encodes a protein with a KRAB motif, a histone methyltransferase domain and several zinc fingers. The difference of a single zinc finger distinguishes Prdm9 alleles that cause hybrid sterility from those that do not. We find that concerted evolution and positive selection have rapidly altered the number and sequence of Prdm9 zinc fingers across 13 rodent genomes. The patterns of positive selection in Prdm9 zinc fingers imply that rapid evolution has acted on the interface between the Prdm9 protein and the DNA sequences to which it binds. Similar patterns are apparent for Prdm9 zinc fingers for diverse metazoans, including primates. Indeed, allelic variation at the DNA-binding positions of human PRDM9 zinc fingers show significant association with decreased risk of infertility. Prdm9 thus plays a role in determining male sterility both between species (mouse) and within species (human). The recurrent episodes of positive selection acting on Prdm9 suggest that the DNA sequences to which it binds must also be evolving rapidly. Our findings do not identify the nature of the underlying DNA sequences, but argue against the proposed role of Prdm9 as an essential transcription factor in mouse meiosis. We propose a hypothetical model in which incompatibilities between Prdm9-binding specificity and satellite DNAs provide the molecular basis for Prdm9-mediated hybrid sterility. We suggest that Prdm9 should be investigated as a candidate gene in other instances of hybrid sterility in metazoans.


Assuntos
Evolução Molecular , Especiação Genética , Histona-Lisina N-Metiltransferase/genética , Sequência de Aminoácidos , Animais , Sequência de Bases , DNA Satélite/genética , Histona-Lisina N-Metiltransferase/química , Humanos , Modelos Biológicos , Dados de Sequência Molecular , Filogenia , Primatas/genética , Roedores/genética , Seleção Genética , Dedos de Zinco/genética
11.
Biochem Soc Trans ; 37(Pt 4): 734-9, 2009 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-19614585

RESUMO

To take full advantage of the mouse as a model organism, it is essential to distinguish lineage-specific biology from what is shared between human and mouse. Investigations into shared genetic elements common to both have been well served by the draft human and mouse genome sequences. More recently, the virtually complete euchromatic sequences of the two reference genomes have been finished. These reveal a high ( approximately 5%) level of sequence duplications that had previously been recalcitrant to sequencing and assembly. Within these duplications lie large numbers of rodent- or primate-specific genes. In the present paper, we review the sequence properties of the two genomes, dwelling most on the duplications, deletions and insertions that separate each of them from their most recent common ancestor, approx. 90 million years ago. We consider the differences in gene numbers and repertoires between the two species, and speculate on their contributions to lineage-specific biology. Loss of ancient single-copy genes are rare, as are gains of new functional genes through retrotransposition. Instead, most changes to the gene repertoire have occurred in large multicopy families. It has been proposed that numbers of such 'environmental genes' rise and fall, and their sequences change, as adaptive responses to infection and other environmental pressures, including conspecific competition. Nevertheless, many such genes may be under little or no selection.


Assuntos
Evolução Molecular , Genoma Humano/genética , Genoma/genética , Animais , Duplicação Gênica , Humanos , Camundongos , Análise de Sequência de DNA
12.
PLoS Biol ; 7(5): e1000112, 2009 May 05.
Artigo em Inglês | MEDLINE | ID: mdl-19468303

RESUMO

The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non-protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not.


Assuntos
Biologia Computacional/métodos , Genoma/genética , Animais , Bases de Dados Genéticas , Duplicação Gênica , Genoma/fisiologia , Humanos , Camundongos
13.
Nat Genet ; 40(11): 1285-7, 2008 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-18836446

RESUMO

Using a positional cloning approach supported by comparative genomics, we have identified a previously unreported gene, EYS, at the RP25 locus on chromosome 6q12 commonly mutated in autosomal recessive retinitis pigmentosa. Spanning over 2 Mb, this is the largest eye-specific gene identified so far. EYS is independently disrupted in four other mammalian lineages, including that of rodents, but is well conserved from Drosophila to man and is likely to have a role in the modeling of retinal architecture.


Assuntos
Proteínas de Drosophila/química , Drosophila melanogaster/química , Proteínas do Olho/genética , Genes Recessivos , Mutação/genética , Retinose Pigmentar/genética , Homologia de Sequência de Aminoácidos , Animais , Linhagem Celular , Cromossomos Humanos Par 6/genética , Proteínas do Olho/química , Proteínas do Olho/metabolismo , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Estrutura Terciária de Proteína , Transporte Proteico
14.
Genome Res ; 17(7): 969-81, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17495010

RESUMO

The newly sequenced genome of Monodelphis domestica not only provides the out-group necessary to better understand our own eutherian lineage, but it enables insights into the innovative biology of metatherians. Here, we compare Monodelphis with Homo sequences from alignments of single nucleotides, genes, and whole chromosomes. Using PhyOP, we have established orthologs in Homo for 82% (15,250) of Monodelphis gene predictions. Those with single orthologs in each species exhibited a high median synonymous substitution rate (d(S) = 1.02), thereby explaining the relative paucity of aligned regions outside of coding sequences. Orthology assignments were used to construct a synteny map that illustrates the considerable fragmentation of Monodelphis and Homo karyotypes since their therian last common ancestor. Fifteen percent of Monodelphis genes are predicted, from their low divergence at synonymous sites, to have been duplicated in the metatherian lineage. The majority of Monodelphis-specific genes possess predicted roles in chemosensation, reproduction, adaptation to specific diets, and immunity. Using alignments of Monodelphis genes to sequences from either Homo or Trichosurus vulpecula (an Australian marsupial), we show that metatherian X chromosomes have elevated silent substitution rates and high G+C contents in comparison with both metatherian autosomes and eutherian chromosomes. Each of these elevations is also a feature of subtelomeric chromosomal regions. We attribute these observations to high rates of female-specific recombination near the chromosomal ends and within the X chromosome, which act to sustain or increase G+C levels by biased gene conversion. In particular, we propose that the higher G+C content of the Monodelphis X chromosome is a direct consequence of its small size relative to the giant autosomes.


Assuntos
Mapeamento Cromossômico , Evolução Molecular , Monodelphis/genética , Animais , Humanos , Engenharia de Proteínas , Pseudogenes , Homologia de Sequência do Ácido Nucleico
15.
Nature ; 447(7141): 167-77, 2007 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-17495919

RESUMO

We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.


Assuntos
Evolução Molecular , Genoma/genética , Genômica , Gambás/genética , Animais , Composição de Bases , Sequência Conservada/genética , Elementos de DNA Transponíveis/genética , Humanos , Polimorfismo de Nucleotídeo Único/genética , Biossíntese de Proteínas , Sintenia/genética , Inativação do Cromossomo X/genética
16.
PLoS Comput Biol ; 2(9): e133, 2006 Sep 29.
Artigo em Inglês | MEDLINE | ID: mdl-17009864

RESUMO

Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46%) of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or "in-paralogues," are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a fast and robust approach to orthology prediction that will be applicable to whole genomes from multiple closely related species. PhyOP will be particularly useful in predicting orthology for mammalian genomes that have been incompletely sequenced, and for large families of rapidly duplicating genes.


Assuntos
Cães/genética , Filogenia , Sintenia/genética , Animais , Cromossomos de Mamíferos/genética , Biologia Computacional , Feminino , Genoma/genética , Humanos , Masculino , Linhagem , Transcrição Gênica/genética
17.
Int J Med Microbiol ; 296 Suppl 40: 177-84, 2006 May.
Artigo em Inglês | MEDLINE | ID: mdl-16530476

RESUMO

Borrelia burgdorferi, a spirochaete transmitted to human hosts during feeding of infected Ixodes ticks, is the causative agent of Lyme disease, the most frequent vector-borne disease in Eurasia and North America. Sporadically Lyme disease develops into a chronic, multisystemic disorder. Serum-resistant B. burgdorferi strains bind complement factor H (FH) and FH-like protein 1 (FHL-1) on the spirochaete surface. This binding is dependent on the expression of proteins termed complement-regulator acquiring surface proteins (CRASPs). The atomic structure of BbCRASP-1, the key FHL-1/FH-binding protein of B. burgdorferi, has recently been determined. Our analysis indicates that its protein topology apparently evolved to provide a high affinity interaction site for FH/FHL-1 and leads to an atomic-level hypothesis for the functioning of BbCRASP-1. This work demonstrates that pathogens interact with complement regulators in ways that are distinct from the mechanisms used by the host and are thus obvious targets for drug design.


Assuntos
Proteínas de Bactérias/química , Borrelia burgdorferi/química , Proteínas de Membrana/química , Sequência de Aminoácidos , Sítios de Ligação , Proteínas Inativadoras do Complemento C3b , Fator H do Complemento/metabolismo , Humanos , Modelos Moleculares , Dados de Sequência Molecular , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Alinhamento de Sequência , Relação Estrutura-Atividade
18.
Nature ; 438(7069): 803-19, 2005 Dec 08.
Artigo em Inglês | MEDLINE | ID: mdl-16341006

RESUMO

Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.


Assuntos
Cães/genética , Evolução Molecular , Genoma/genética , Genômica , Haplótipos/genética , Animais , Sequência Conservada/genética , Doenças do Cão/genética , Cães/classificação , Feminino , Humanos , Hibridização Genética , Masculino , Camundongos , Mutagênese/genética , Polimorfismo de Nucleotídeo Único/genética , Ratos , Elementos Nucleotídeos Curtos e Dispersos/genética , Sintenia/genética
19.
BMC Genomics ; 6: 120, 2005 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-16159394

RESUMO

BACKGROUND: The physiological and phenotypic differences between human and chimpanzee are largely specified by our genomic differences. We have been particularly interested in recent duplications in the human genome as examples of relatively large-scale changes to our genome. We performed an in-depth evolutionary analysis of a region of chromosome 1, which is copy number polymorphic among humans, and that contains at least 32 PRAME (Preferentially expressed antigen of melanoma) genes and pseudogenes. PRAME-like genes are expressed in the testis and in a large number of tumours, and are thought to possess roles in spermatogenesis and oogenesis. RESULTS: Using nucleotide substitution rate estimates for exons and introns, we show that two large segmental duplications, of six and seven human PRAME genes respectively, occurred in the last 3 million years. These duplicated genes are thus hominin-specific, having arisen in our genome since the divergence from chimpanzee. This cluster of PRAME genes appears to have arisen initially from a translocation approximately 95-85 million years ago. We identified multiple sites within human or mouse PRAME sequences which exhibit strong evidence of positive selection. These form a pronounced cluster on one face of the predicted PRAME protein structure. CONCLUSION: We predict that PRAME genes evolved adaptively due to strong competition between rapidly-dividing cells during spermatogenesis and oogenesis. We suggest that as PRAME gene copy number is polymorphic among individuals, positive selection of PRAME alleles may still prevail within the human population.


Assuntos
Cromossomos Humanos Par 1 , Regulação Neoplásica da Expressão Gênica , Alelos , Animais , Análise por Conglomerados , Evolução Molecular , Éxons , Duplicação Gênica , Genoma , Genoma Humano , Humanos , Íntrons , Masculino , Melanoma/metabolismo , Modelos Genéticos , Modelos Moleculares , Família Multigênica , Pan troglodytes , Fenótipo , Filogenia , Polimorfismo Genético , Pseudogenes , Seleção Genética , Testículo/metabolismo , Translocação Genética
20.
Trends Biochem Sci ; 29(6): 289-92, 2004 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-15276181

RESUMO

Vitamin K epoxide reductase (VKOR) recycles reduced vitamin K, which is used subsequently as a co-factor in the gamma-carboxylation of glutamic acid residues in blood coagulation enzymes. VKORC1, a subunit of the VKOR complex, has recently been shown to possess this activity. Here, we show that VKORC1 is a member of a large family of predicted enzymes that are present in vertebrates, Drosophila, plants, bacteria and archaea. Four cysteine residues and one residue, which is either serine or threonine, are identified as likely active-site residues. In some plant and bacterial homologues the VKORC1 homologous domain is fused with domains of the thioredoxin family of oxidoreductases. These might reduce disulfide bonds of VKORC1-like enzymes as a prerequisite for their catalytic activities.


Assuntos
Oxigenases de Função Mista/metabolismo , Sequência de Aminoácidos , Animais , Sítios de Ligação , Catálise , Humanos , Dados de Sequência Molecular , Oxirredutases/metabolismo , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Tiorredoxinas/química , Vitamina K/metabolismo , Vitamina K Epóxido Redutases
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...