RESUMO
The Ensembl project (http://www.ensembl.org) is a comprehensive genome information system featuring an integrated set of genome annotation, databases and other information for chordate and selected model organism and disease vector genomes. As of release 47 (October 2007), Ensembl fully supports 35 species, with preliminary support for six additional species. New species in the past year include platypus and horse. Major additions and improvements to Ensembl since our previous report include extensive support for functional genomics data in the form of a specialized functional genomics database, genome-wide maps of protein-DNA interactions and the Ensembl regulatory build; support for customization of the Ensembl web interface through the addition of user accounts and user groups; and increased support for genome resequencing. We have also introduced new comparative genomics-based data mining options and report on the continued development of our software infrastructure.
Assuntos
Bases de Dados Genéticas , Genômica , Animais , Gráficos por Computador , Humanos , Internet , Camundongos , Elementos Reguladores de Transcrição , Software , Interface Usuário-ComputadorRESUMO
SUMMARY: BioJava is a mature open-source project that provides a framework for processing of biological data. BioJava contains powerful analysis and statistical routines, tools for parsing common file formats and packages for manipulating sequences and 3D structures. It enables rapid bioinformatics application development in the Java programming language. AVAILABILITY: BioJava is an open-source project distributed under the Lesser GPL (LGPL). BioJava can be downloaded from the BioJava website (http://www.biojava.org). BioJava requires Java 1.5 or higher. All queries should be directed to the BioJava mailing lists. Details are available at http://biojava.org/wiki/BioJava:MailingLists.
Assuntos
Biologia Computacional/métodos , Linguagens de Programação , Conformação de Ácido Nucleico , Conformação Proteica , Análise de SequênciaRESUMO
The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation of chordate genome sequences. Over the past year the number of genomes available from Ensembl has increased from 15 to 33, with the addition of sites for the mammalian genomes of elephant, rabbit, armadillo, tenrec, platypus, pig, cat, bush baby, common shrew, microbat and european hedgehog; the fish genomes of stickleback and medaka and the second example of the genomes of the sea squirt (Ciona savignyi) and the mosquito (Aedes aegypti). Some of the major features added during the year include the first complete gene sets for genomes with low-sequence coverage, the introduction of new strain variation data and the introduction of new orthology/paralog annotations based on gene trees.
Assuntos
Bases de Dados de Ácidos Nucleicos , Genômica , Animais , Sequência de Bases , Bases de Dados de Ácidos Nucleicos/normas , Variação Genética , Genoma Humano , Humanos , Internet , Camundongos , Proteínas/genética , Padrões de Referência , Alinhamento de Sequência , Integração de Sistemas , Interface Usuário-ComputadorRESUMO
Human placentation displays many similarities with tumourigenesis, including rapid cell division, migration and invasion, overlapping gene expression profiles and escape from immune detection. Recent data have identified promoter methylation in the Ras association factor and adenomatous polyposis coli tumour suppressor genes as part of this process. However, the extent of tumour-associated methylation in the placenta remains unclear. Using whole genome methylation data as a starting point, we have examined this phenomenon in placental tissue. We found no evidence for methylation of the majority of common tumour suppressor genes in term placentas, but identified methylation in several genes previously described in some human tumours. Notably, promoter methylation of four independent negative regulators of Wnt signalling has now been identified in human placental tissue and purified trophoblasts. Methylation is present in baboon, but not in mouse placentas. This supports a role for elevated Wnt signalling in primate trophoblast invasiveness and placentation. Examination of invasive choriocarcinoma cell lines revealed altered methylation patterns consistent with a role of methylation change in gestational trophoblastic disease. This distinct pattern of tumour-associated methylation implicates a coordinated series of epigenetic silencing events, similar to those associated with some tumours, in the distinct features of normal human placental invasion and function.
Assuntos
Metilação de DNA , Placenta/metabolismo , Trofoblastos/metabolismo , Animais , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Proteínas de Transporte/genética , Linhagem Celular Tumoral , Células Cultivadas , Proteínas de Ligação a DNA , Fatores de Transcrição de Resposta de Crescimento Precoce/genética , Feminino , Humanos , Receptores de Hialuronatos/genética , Técnicas In Vitro , Proteínas de Membrana/genética , Camundongos , Neoplasias/genética , Neoplasias/patologia , Papio , Gravidez , Primeiro Trimestre da Gravidez , Proteínas Repressoras/genética , Trofoblastos/citologia , Proteínas Supressoras de Tumor/genéticaRESUMO
The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation of large genome sequences. Over the last year the number of genomes available from the Ensembl site has increased from 4 to 19, with the addition of the mammalian genomes of Rhesus macaque and Opossum, the chordate genome of Ciona intestinalis and the import and integration of the yeast genome. The year has also seen extensive improvements to both data analysis and presentation, with the introduction of a redesigned website, the addition of RNA gene and regulatory annotation and substantial improvements to the integration of human genome variation data.
Assuntos
Bases de Dados de Ácidos Nucleicos , Genômica , Animais , Sequência de Bases , Variação Genética , Genoma Humano , Humanos , Internet , Camundongos , Proteínas/genética , RNA/genética , Ratos , Sequências Reguladoras de Ácido Nucleico , Alinhamento de Sequência , Interface Usuário-ComputadorRESUMO
The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation of large genome sequences. Over the last year the number of genomes available from the Ensembl site has increased by 7 to 16, with the addition of the six vertebrate genomes of chimpanzee, dog, cow, chicken, tetraodon and frog and the insect genome of honeybee. The majority have been annotated automatically using the Ensembl gene build system, showing its flexibility to reliably annotate a wide variety of genomes. With the increased number of vertebrate genomes, the comparative analysis provided to users has been greatly improved, with new website interfaces allowing annotation of different genomes to be directly compared. The Ensembl software system is being increasingly widely reused in different projects showing the benefits of a completely open approach to software development and distribution.
Assuntos
Bases de Dados de Ácidos Nucleicos , Genômica , Animais , Sequência de Bases , Bovinos , Cães , Humanos , Internet , Camundongos , Ratos , Alinhamento de Sequência , Software , Interface Usuário-ComputadorRESUMO
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.
Assuntos
Bases de Dados Genéticas , Genoma Humano , Biologia Computacional , Sistemas de Gerenciamento de Base de Dados , Humanos , Armazenamento e Recuperação da Informação , Internet , Análise de Sequência de DNA , Integração de SistemasRESUMO
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organize biology around the sequences of large genomes. It is a comprehensive and integrated source of annotation of large genome sequences, available via interactive website, web services or flat files. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. The facilities of the system range from sequence analysis to data storage and visualization and installations exist around the world both in companies and at academic sites. With a total of nine genome sequences available from Ensembl and more genomes to follow, recent developments have focused mainly on closer integration between genomes and external data.
Assuntos
Biologia Computacional , Bases de Dados Genéticas , Genoma , Genômica , Animais , Humanos , Armazenamento e Recuperação da Informação , Internet , SoftwareRESUMO
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of human, mouse and other genome sequences, available as either an interactive web site or as flat files. Ensembl also integrates manually annotated gene structures from external sources where available. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. These range from sequence analysis to data storage and visualisation and installations exist around the world in both companies and at academic sites. With both human and mouse genome sequences available and more vertebrate sequences to follow, many of the recent developments in Ensembl have focusing on developing automatic comparative genome analysis and visualisation.
Assuntos
Bases de Dados Genéticas , Genômica , Animais , Biologia Computacional , Genoma Humano , Humanos , Internet , Camundongos , Software , SinteniaRESUMO
An understanding of the relative roles of historical and contemporary factors in structuring genetic variation is a fundamental, but understudied aspect of geographic variation. We examined geographic variation in microsatellite DNA allele frequencies in bull trout (Salvelinus confluentus, Salmonidae) to test hypotheses concerning the relative roles of postglacial dispersal (historical) and current landscape features (contemporary) in structuring genetic variability and population differentiation. Bull trout exhibit relatively low intrapopulation microsatellite variation (average of 1.9 alleles per locus, average He = 0.24), but high levels of interpopulation divergence (F(ST) = 0.39). We found evidence of historical influences on microsatellite variation in the form of a decrease in the number of alleles and heterozygosities in populations on the periphery of the range relative to populations closer to putative glacial refugia. In addition, one region of British Columbia that was colonized later during deglaciation and by more indirect watershed connections showed less developed and more variable patterns of isolation by distance than a similar region colonized earlier and more directly from refugia. Current spatial and drainage interconnectedness among sites and the presence of migration barriers (falls and cascades) within individual streams were found to be important contemporary factors influencing historical patterns of genetic variability and interpopulation divergence. Our work illustrates the limited utility of equilibrium models to delineate population structure and patterns of genetic diversity in recently founded populations or those inhabiting highly heterogeneous environments, and it highlights the need for approaches incorporating a landscape context for population divergence. Substantial microsatellite DNA divergence among bull trout populations may also signal divergence in traits important to population persistence in specific environments.
Assuntos
Evolução Molecular , Variação Genética , Repetições de Microssatélites , Truta/genética , Alberta , Alelos , Animais , Colúmbia Britânica , Meio Ambiente , Água Doce , Geografia , Heterozigoto , FilogeniaRESUMO
During an eight-year period, twenty-four Southwick osteotomies were performed in twenty-one patients with severe chronic slipped capital femoral epiphysis. After follow-ups ranging from two to ten years (average, four and one-half years), correction of the deformity had been maintained in all but one patient, who has loss of fixation of the osteotomy. One patient had joint-space narrowing, avascular necrosis of the femoral head, and significant restriction of motion which was still present three years after operation. We concluded that this osteotomy gives good correction of the deformity but is a major operative procedure that should be recommended only for severe, chronic slips. Preoperative loss of motion and joint-space narrowing are probably contraindications for the procedure, particularly in black patients.
Assuntos
Epifise Deslocada/cirurgia , Cabeça do Fêmur/cirurgia , Osteotomia/métodos , Adolescente , Placas Ósseas , Parafusos Ósseos , Criança , Doença Crônica , Epifise Deslocada/complicações , Feminino , Necrose da Cabeça do Fêmur/etiologia , Seguimentos , Humanos , Masculino , Complicações Pós-Operatórias , CicatrizaçãoRESUMO
This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.