Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 52
Filtrar
1.
Gigascience ; 8(12)2019 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-31782791

RESUMO

BACKGROUND: Sugarcane cultivars are polyploid interspecific hybrids of giant genomes, typically with 10-13 sets of chromosomes from 2 Saccharum species. The ploidy, hybridity, and size of the genome, estimated to have >10 Gb, pose a challenge for sequencing. RESULTS: Here we present a gene space assembly of SP80-3280, including 373,869 putative genes and their potential regulatory regions. The alignment of single-copy genes in diploid grasses to the putative genes indicates that we could resolve 2-6 (up to 15) putative homo(eo)logs that are 99.1% identical within their coding sequences. Dissimilarities increase in their regulatory regions, and gene promoter analysis shows differences in regulatory elements within gene families that are expressed in a species-specific manner. We exemplify these differences for sucrose synthase (SuSy) and phenylalanine ammonia-lyase (PAL), 2 gene families central to carbon partitioning. SP80-3280 has particular regulatory elements involved in sucrose synthesis not found in the ancestor Saccharum spontaneum. PAL regulatory elements are found in co-expressed genes related to fiber synthesis within gene networks defined during plant growth and maturation. Comparison with sorghum reveals predominantly bi-allelic variations in sugarcane, consistent with the formation of 2 "subgenomes" after their divergence ∼3.8-4.6 million years ago and reveals single-nucleotide variants that may underlie their differences. CONCLUSIONS: This assembly represents a large step towards a whole-genome assembly of a commercial sugarcane cultivar. It includes a rich diversity of genes and homo(eo)logous resolution for a representative fraction of the gene space, relevant to improve biomass and food production.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Glucosiltransferases/genética , Fenilalanina Amônia-Liase/genética , Saccharum/crescimento & desenvolvimento , Biomassa , Produtos Agrícolas/genética , Produtos Agrícolas/crescimento & desenvolvimento , Variação Genética , Tamanho do Genoma , Genoma de Planta , Família Multigênica , Proteínas de Plantas/genética , Poliploidia , Regiões Promotoras Genéticas , Saccharum/genética
2.
Gene ; 691: 96-105, 2019 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-30630096

RESUMO

Vriesea carinata is an endemic bromeliad from the Brazilian Atlantic Forest. It has trichome and tank system in their leaves which allows to absorb water and nutrients. It belongs to Bromeliaceae family, which includes several species highly enriched of cysteine-proteases (CysPs). These proteolytic enzymes regulate processes as senescence, cell differentiation, pathogen-linked programmed cell death and mobilization of proteins. Although, their biological importance, there are not genomic resources in V. carinata that can help to identify and understand their molecular mechanisms involved in different biological processes. Thus high-throughput transcriptome sequencing of V. carinata is necessary to generate sequences for the purpose of gene discovery and functional genomic studies. In the present study, we sequenced and assembled the V. carinata transcriptome to the identification of CysPs. A total of 43,232 contigs were assembled for the leaf tissue. BLAST analysis indicated that 23,803 contigs exhibited similarity to non-redundant Viridiplantae proteins. 28.24% of the contigs were classified into the COG database, and gene ontology categorized them into 61 functional groups. A metabolic pathway analysis with KEGG revealed 9679 contigs assigned to 31 metabolic pathways. Among 16 full-length CysPs identified, 11 were evaluated in respect to their expression patterns in the leaf apex, base and inflorescence tissues. The results showed differential expression levels of legumain, metacaspase, pyroglutamyl and papain-like CysPs depending of the leaf region. These results provide a global overview of V. carinata gene functions and expression activities of CysPs in those tissues.


Assuntos
Bromeliaceae/genética , Mapeamento de Sequências Contíguas/métodos , Cisteína Proteases/genética , Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica de Plantas , Sequenciamento de Nucleotídeos em Larga Escala , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Família Multigênica , Folhas de Planta/genética , Proteínas de Plantas/genética , Análise de Sequência de RNA
3.
Genome Res ; 27(5): 885-896, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28420692

RESUMO

Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma de Planta , Anotação de Sequência Molecular/métodos , Proteínas de Plantas/genética , Translocação Genética , Triticum/genética , Algoritmos , Mapeamento de Sequências Contíguas/normas , Anotação de Sequência Molecular/normas , Polimorfismo Genético , Poliploidia
4.
Genome Res ; 27(5): 793-800, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28104618

RESUMO

Achieving complete, accurate, and cost-effective assembly of human genomes is of great importance for realizing the promise of precision medicine. The abundance of repeats and genetic variations in human genomes and the limitations of existing sequencing technologies call for the development of novel assembly methods that can leverage the complementary strengths of multiple technologies. We propose a Hybrid Structural variant Assembly (HySA) approach that integrates sequencing reads from next-generation sequencing and single-molecule sequencing technologies to accurately assemble and detect structural variants (SVs) in human genomes. By identifying homologous SV-containing reads from different technologies through a bipartite-graph-based clustering algorithm, our approach turns a whole genome assembly problem into a set of independent SV assembly problems, each of which can be effectively solved to enhance the assembly of structurally altered regions in human genomes. We used data generated from a haploid hydatidiform mole genome (CHM1) and a diploid human genome (NA12878) to test our approach. The result showed that, compared with existing methods, our approach had a low false discovery rate and substantially improved the detection of many types of SVs, particularly novel large insertions, small indels (10-50 bp), and short tandem repeat expansions and contractions. Our work highlights the strengths and limitations of current approaches and provides an effective solution for extending the power of existing sequencing technologies for SV discovery.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma Humano , Variação Estrutural do Genoma , Genômica/métodos , Análise de Sequência de DNA/métodos , Software , Animais , Mapeamento de Sequências Contíguas/normas , Diploide , Genômica/normas , Haploidia , Humanos , Camundongos , Análise de Sequência de DNA/normas , Sequências de Repetição em Tandem
5.
J Comput Biol ; 23(3): 137-49, 2016 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-26953874

RESUMO

The large amount of short read data that has to be assembled in future applications, such as in metagenomics or cancer genomics, strongly motivates the investigation of disk-based approaches to index next-generation sequencing (NGS) data. Positive results in this direction stimulate the investigation of efficient external memory algorithms for de novo assembly from NGS data. Our article is also motivated by the open problem of designing a space-efficient algorithm to compute a string graph using an indexing procedure based on the Burrows-Wheeler transform (BWT). We have developed a disk-based algorithm for computing string graphs in external memory: the light string graph (LSG). LSG relies on a new representation of the FM-index that is exploited to use an amount of main memory requirement that is independent from the size of the data set. Moreover, we have developed a pipeline for genome assembly from NGS data that integrates LSG with the assembly step of SGA (Simpson and Durbin, 2012 ), a state-of-the-art string graph-based assembler, and uses BEETL for indexing the input data. LSG is open source software and is available online. We have analyzed our implementation on a 875-million read whole-genome dataset, on which LSG has built the string graph using only 1GB of main memory (reducing the memory occupation by a factor of 50 with respect to SGA), while requiring slightly more than twice the time than SGA. The analysis of the entire pipeline shows an important decrease in memory usage, while managing to have only a moderate increase in the running time.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Software , Genoma Humano , Humanos
6.
Methods Mol Biol ; 1227: 55-67, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25239741

RESUMO

Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.


Assuntos
Cromossomos Artificiais Bacterianos/química , Mapeamento de Sequências Contíguas/métodos , DNA Bacteriano/genética , Genoma Bacteriano , Análise de Sequência de DNA/métodos , Software , Cromossomos Artificiais Bacterianos/metabolismo , Impressões Digitais de DNA , Enzimas de Restrição do DNA/metabolismo , DNA Bacteriano/metabolismo , Escherichia coli/efeitos dos fármacos , Escherichia coli/genética , Escherichia coli/metabolismo , Galactosídeos/metabolismo , Biblioteca Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Indóis/metabolismo , Isopropiltiogalactosídeo/farmacologia
7.
Nat Biotechnol ; 31(12): 1119-25, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24185095

RESUMO

Genomes assembled de novo from short reads are highly fragmented relative to the finished chromosomes of Homo sapiens and key model organisms generated by the Human Genome Project. To address this problem, we need scalable, cost-effective methods to obtain assemblies with chromosome-scale contiguity. Here we show that genome-wide chromatin interaction data sets, such as those generated by Hi-C, are a rich source of long-range information for assigning, ordering and orienting genomic sequences to chromosomes, including across centromeres. To exploit this finding, we developed an algorithm that uses Hi-C data for ultra-long-range scaffolding of de novo genome assemblies. We demonstrate the approach by combining shotgun fragment and short jump mate-pair sequences with Hi-C data to generate chromosome-scale de novo assemblies of the human, mouse and Drosophila genomes, achieving--for the human genome--98% accuracy in assigning scaffolds to chromosome groups and 99% accuracy in ordering and orienting scaffolds within chromosome groups. Hi-C data can also be used to validate chromosomal translocations in cancer genomes.


Assuntos
Algoritmos , Cromatina/genética , Mapeamento Cromossômico/métodos , Mapeamento de Sequências Contíguas/métodos , Análise de Sequência de DNA/métodos , Animais , Sequência de Bases , Drosophila , Humanos , Camundongos , Dados de Sequência Molecular
8.
BMC Bioinformatics ; 13: 205, 2012 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-22897923

RESUMO

BACKGROUND: Variations in DNA copy number carry information on the modalities of genome evolution and mis-regulation of DNA replication in cancer cells. Their study can help localize tumor suppressor genes, distinguish different populations of cancerous cells, and identify genomic variations responsible for disease phenotypes. A number of different high throughput technologies can be used to identify copy number variable sites, and the literature documents multiple effective algorithms. We focus here on the specific problem of detecting regions where variation in copy number is relatively common in the sample at hand. This problem encompasses the cases of copy number polymorphisms, related samples, technical replicates, and cancerous sub-populations from the same individual. RESULTS: We present a segmentation method named generalized fused lasso (GFL) to reconstruct copy number variant regions. GFL is based on penalized estimation and is capable of processing multiple signals jointly. Our approach is computationally very attractive and leads to sensitivity and specificity levels comparable to those of state-of-the-art specialized methodologies. We illustrate its applicability with simulated and real data sets. CONCLUSIONS: The flexibility of our framework makes it applicable to data obtained with a wide range of technology. Its versatility and speed make GFL particularly useful in the initial screening stages of large data sets.


Assuntos
Algoritmos , Mapeamento de Sequências Contíguas/métodos , DNA/genética , Dosagem de Genes , Variação Genética , Análise de Sequência de DNA/métodos , Transtorno Bipolar/genética , Duplicação Gênica , Genes Supressores de Tumor , Genoma , Genômica , Humanos , Neoplasias/genética , Linhagem , Análise de Regressão , Sensibilidade e Especificidade
9.
PLoS One ; 7(3): e33644, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22438974

RESUMO

Polyploids account for approximately 70% of flowering plants, including many field, horticulture and forage crops. Cottons are a world-leading fiber and important oilseed crop, and a model species for study of plant polyploidization, cellulose biosynthesis and cell wall biogenesis. This study has addressed the concerns of physical mapping of polyploids with BACs and/or BIBACs by constructing a physical map of the tetraploid cotton, Gossypium hirsutum L. The physical map consists of 3,450 BIBAC contigs with an N50 contig size of 863 kb, collectively spanning 2,244 Mb. We sorted the map contigs according to their origin of subgenome, showing that we assembled physical maps for the A- and D-subgenomes of the tetraploid cotton, separately. We also identified the BIBACs in the map minimal tilling path, which consists of 15,277 clones. Moreover, we have marked the physical map with nearly 10,000 BIBAC ends (BESs), making one BES in approximately 250 kb. This physical map provides a line of evidence and a strategy for physical mapping of polyploids, and a platform for advanced research of the tetraploid cotton genome, particularly fine mapping and cloning the cotton agronomic genes and QTLs, and sequencing and assembling the cotton genome using the modern next-generation sequencing technology.


Assuntos
Genoma de Planta , Gossypium/genética , Tetraploidia , Cromossomos Artificiais Bacterianos/genética , Mapeamento de Sequências Contíguas/métodos
10.
BMC Bioinformatics ; 13 Suppl 19: S9, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23282028

RESUMO

Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. However, nearly all DNA sequencing technologies sequence DNA from multiple cells, thus resulting in measurement of mutations from a mixture of genomes. Genome rearrangements are a major class of somatic mutations in many tumors, and the novel adjacencies (i.e. breakpoints) resulting from these rearrangements are readily detected from DNA sequencing reads. However, the assignment of each rearrangement, or adjacency, to an individual cancer genome in the mixture is not known. Moreover, the quantity of DNA sequence reads may be insufficient to measure all rearrangements in all genomes in the tumor. Motivated by this application, we formulate the k-minimum completion problem (k-MCP). In this problem, we aim to reconstruct k genomes derived from a single reference genome, given partial information about the adjacencies present in the mixture of these genomes. We show that the 1-MCP is solvable in linear time in the cases where: (i) the measured, incomplete genome has a single circular or linear chromosome; (ii) there are no restrictions on the chromosomal content of the measured, incomplete genome. We also show that the k-MCP problem, for k ≥ 3 in general, and the 2-MCP problem with the double-cut-and-join (DCJ) distance are NP-complete, when there are no restriction on the chromosomal structure of the measured, incomplete genome. These results lay the foundation for future algorithmic studies of the k-MCP and the application of these algorithms to real cancer sequencing data.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Rearranjo Gênico , Genoma Humano/genética , Neoplasias/genética , Algoritmos , Sequência de Bases , Humanos , Mutação , Análise de Sequência de DNA
11.
Int J Cancer ; 120(3): 533-8, 2007 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-17096344

RESUMO

The recurrent loss of 3p segments in neuroblastoma suggests the implication of 1 or more tumor suppressor genes but thus far few efforts have been made to pinpoint their detailed chromosomal position. To achieve this goal, array-based comparative genomic hybridization was performed on a panel of 23 neuroblastoma cell lines and 75 primary tumors using a tiling-path bacterial artificial chromosome array for chromosome 3p. A total of 45 chromosome 3 losses were detected, including whole chromosome losses, large terminal deletions and interstitial deletions. The latter, observed in cell lines as well as a number of distal deletions detected in primary tumors, allowed us to demarcate 3 minimal regions of loss of 3.6 Mb [3p21.31-p21.2, shortest regions of overlap (SRO)1], 1.4 Mb (3p22.3-3p22.2, SRO2) and 3.8 Mb (3p25.3-p25.1, SRO3) in size. The present data significantly extend previous findings and now firmly establish critical regions on 3p implicated in neuroblastoma. Interestingly, the 2 proximal regions coincide with previously defined SROs on 3p21.3 in more frequent tumors including lung and breast cancer. As such, similar tumor suppressor genes may play a critical role in development or progression of a variety of neoplasms, including neuroblastoma.


Assuntos
Deleção Cromossômica , Cromossomos Humanos Par 3/genética , Neoplasias/genética , Neuroblastoma/genética , Hibridização de Ácido Nucleico/métodos , Proteínas Supressoras de Tumor/genética , Linhagem Celular Tumoral , Cromossomos Artificiais Bacterianos/genética , Mapeamento de Sequências Contíguas/métodos , Progressão da Doença , Deleção de Genes , Genes Supressores de Tumor/fisiologia , Genoma Humano/genética , Humanos , Neoplasias/patologia , Neoplasias/fisiopatologia , Neuroblastoma/patologia , Neuroblastoma/fisiopatologia
12.
Plant Mol Biol ; 59(1): 63-74, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16217602

RESUMO

Polyploidy (genome duplication) is thought to have contributed to the evolution of the eukaryotic genome, but complex genome structures and massive gene loss during evolution has complicated detection of these ancestral duplication events. The major factors determining the fate of duplicated genes are currently unclear, as are the processes by which duplicated genes evolve after polyploidy. Fine-scale analysis between homologous regions may allow us to better understand post-polyploidy evolution. Here, using gene-by-gene and gene-by-genome strategies, we identified the S5 region and four homologous regions within the japonica genome. Additional phylogenomic analyses of the comparable duplicated blocks indicate that four successive duplication events gave rise to these five regions, allowing us to propose a model for this local chromosomal evolution. According to this model, gene loss may play a major role in post-duplication genetic evolution at the segmental level. Moreover, we found molecular evidence that one of the sister duplicated blocks experienced more gene loss and a more rapid evolution subsequent to two recent duplication events. Given that these two recent duplication events were likely involved in polyploidy, this asymmetric evolution (gene loss and gene divergence) may be one possible mechanism accounting for the diploidization at the segmental level.


Assuntos
Grão Comestível/genética , Evolução Molecular , Poliploidia , Cromossomos Artificiais Bacterianos/genética , Cromossomos de Plantas/genética , Biologia Computacional/métodos , Mapeamento de Sequências Contíguas/métodos , DNA de Plantas/genética , Duplicação Gênica , Variação Genética , Genoma de Planta , Oryza/genética , Filogenia
13.
Cytogenet Genome Res ; 108(1-3): 98-105, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-15545721

RESUMO

Intrachromosomal duplications play a significant role in human genome pathology and evolution. To better understand the molecular basis of evolutionary chromosome rearrangements, we performed molecular cytogenetic and sequence analyses of the breakpoint region that distinguishes human chromosome 3p12.3 and orangutan chromosome 2. FISH with region-specific BAC clones demonstrated that the breakpoint-flanking sequences are duplicated intrachromosomally on orangutan 2 and human 3q21 as well as at many pericentromeric and subtelomeric sites throughout the genomes. Breakage and rearrangement of the human 3p12.3-homologous region in the orangutan lineage were associated with a partial loss of duplicated sequences in the breakpoint region. Consistent with our FISH mapping results, computational analysis of the human chromosome 3 genomic sequence revealed three 3p12.3-paralogous sequence blocks on human chromosome 3q21 and smaller blocks on the short arm end 3p26-->p25. This is consistent with the view that sequences from an ancestral site at 3q21 were duplicated at 3p12.3 in a common ancestor of orangutan and humans. Our results show that evolutionary chromosome rearrangements are associated with microduplications and microdeletions, contributing to the DNA differences between closely related species.


Assuntos
Quebra Cromossômica/genética , Inversão Cromossômica/genética , Cromossomos Humanos Par 3/genética , Evolução Molecular , Pongo pygmaeus/genética , Animais , Linhagem Celular Transformada , Cercopithecidae/genética , Cromossomos de Mamíferos/genética , Mapeamento de Sequências Contíguas/métodos , Herpesvirus Humano 4/genética , Humanos , Células Híbridas/química , Células Híbridas/metabolismo , Hibridização in Situ Fluorescente/métodos , Linfócitos/metabolismo , Linfócitos/virologia , Pan troglodytes/genética , Deleção de Sequência/genética
14.
BMC Genomics ; 5(1): 60, 2004 Aug 27.
Artigo em Inglês | MEDLINE | ID: mdl-15333141

RESUMO

BACKGROUND: Capillary DNA sequencing machines allow the generation of vast amounts of data with little hands-on time. With this expansion of data generation, there is a growing need for automated data processing. Most available software solutions, however, still require user intervention or provide modules that need advanced informatics skills to allow implementation in pipelines. RESULTS: Here we present POSA, a pair of new perl objects that describe DNA sequence traces and Phrap contig assemblies in detail. Methods included in POSA include basecalling with quality scores (by Phred), contig assembly (by Phrap), generation of primer3 input and automated SNP annotation (by PolyPhred). Although easily implemented by users with only limited programming experience, these objects considerabily reduce hands-on analysis time compared to using the Staden package for extracting sequence information from raw sequencing files and for SNP discovery. CONCLUSIONS: The POSA objects allow a flexible and easy design, implementation and usage of perl-based pipelines to handle and analyze DNA sequencing data, while requiring only minor programming skills.


Assuntos
Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/tendências , Software , Mapeamento de Sequências Contíguas/métodos , Mapeamento de Sequências Contíguas/tendências
15.
BMC Genet ; 5: 15, 2004 Jun 09.
Artigo em Inglês | MEDLINE | ID: mdl-15189568

RESUMO

BACKGROUND: Hormones frequently guide animal development via the induction of cascades of gene activities, whose products further amplify an initial hormonal stimulus. In Drosophila the transformation of the larva into the pupa and the subsequent metamorphosis to the adult stage is triggered by changes in the titer of the steroid hormone 20-hydroxyecdysone. singed wings (swi) is the only gene known in Drosophila melanogaster for which mutations specifically interrupt the transmission of the regulatory signal from early to late ecdysone inducible genes. RESULTS: We have characterized singed wings locus, showing it to correspond to EG:171E4.2 (CG3095). swi encodes a predicted 68.5-kDa protein that contains N-terminal histidine-rich and threonine-rich domains, a cysteine-rich C-terminal region and two leucine-rich repeats. The SWI protein has a close homolog in D. melanogaster, defining a new family of SWI-like proteins, and is conserved in D. pseudoobscura. A lethal mutation, swit476, shows a severe disruption of the ecdysone pathway and is a C>Y substitution in one of the two conserved CysXCys motifs that are common to SWI and the Drosophila Toll-4 protein. CONCLUSIONS: It is not entirely clear from the present molecular analysis how the SWI protein may function in the ecdysone induced cascade. Currently all predictions agree in that SWI is very unlikely to be a nuclear protein. Thus it probably exercises its control of "late" ecdysone genes indirectly. Apparently the genetic regulation of ecdysone signaling is much more complex then was previously anticipated.


Assuntos
Proteínas de Drosophila/química , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Marcadores Genéticos/genética , Sequência de Aminoácidos/genética , Animais , Sequência de Bases/genética , Clonagem Molecular/métodos , Mapeamento de Sequências Contíguas/métodos , Feminino , Masculino , Dados de Sequência Molecular , Cromossomo X/genética
16.
Genes Chromosomes Cancer ; 40(3): 266-70, 2004 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-15139005

RESUMO

Deletion of chromosome arm 1p is one of the most frequent genetic alterations in neuroblastoma. However, using conventional comparative genomic hybridization, we have observed amplifications on 1p in 2 neuroblastoma tumors at bands 1p34.2 and 1p36.3, respectively. Using a medium-resolution genomic array containing 178 PACs/BACs from 1p and then 2 high-resolution arrays containing contigs of overlapping PACs/BACs from the amplified regions, we could precisely map and delineate both amplicons. The 1p34.2 amplicon appeared as a homogeneous amplification unit, whereas the 1p36.3 amplicon had a more complex structure, with 2 noncontiguous, highly amplified regions and several moderate amplification units. In this case, fluorescence in situ hybridization analysis confirmed the amplification of several clones and indicated that the 2 highest amplification units corresponded to 2 populations of double minute chromosomes, one of which also contained the MYCN locus. This is the first report of 1p amplifications in primary neuroblastomas.


Assuntos
Mapeamento Cromossômico/métodos , Cromossomos Humanos Par 1/genética , Amplificação de Genes/genética , Neuroblastoma/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Cromossomos Artificiais Bacterianos/genética , Cromossomos Artificiais de Bacteriófago P1/genética , Mapeamento de Sequências Contíguas/métodos , Humanos , Proteína Proto-Oncogênica N-Myc , Proteínas Nucleares/genética , Hibridização de Ácido Nucleico/métodos , Proteínas Oncogênicas/genética
17.
Genes Chromosomes Cancer ; 40(1): 60-5, 2004 May.
Artigo em Inglês | MEDLINE | ID: mdl-15034870

RESUMO

Regional deletions of 6q are frequent karyotypic alterations in malignant lymphoma and are associated with an adverse clinical outcome. One such region of recurrent deletion is 6q16-q21; however, the specific genes affected have not been identified. Our objective in this study was to identify cases with deletion of 6q16-q21 in follicular lymphoma and to define a minimal region of deletion. A physical map of 6q16.2-q21 was constructed using map information from both sequence-based and bacterial artificial chromosome (BAC) fingerprint-based maps. Forty-three BAC clones spanning a 6-Mb region of 6q16.2-q21 were identified and obtained from the RP-11 library. Selected BACs were fluorescence-labeled and hybridized to a series of 34 follicular lymphomas with a regional 6q deletion detected by G banding. Twenty-four cases with deletion of the 6q16.3 region were detected. A minimal deletion of 2.3 Mb was defined. Our study has identified a limited region of deletion of 6q16.3 that may implicate four known genes in follicular lymphoma and possibly in other cancers. A BAC contig spanning a 6-Mb region has been anchored to the 6q16.2-q21 region. This map represents a useful resource for gene identification in this region, not only in lymphoma but also in other neoplasms with 6q alterations.


Assuntos
Deleção Cromossômica , Mapeamento Cromossômico/métodos , Cromossomos Artificiais Bacterianos/genética , Cromossomos Humanos Par 6/genética , Clonagem Molecular/métodos , Mapeamento de Sequências Contíguas/métodos , Linfoma Folicular/genética , Feminino , Humanos , Isocromossomos/genética , Cariotipagem , Masculino
18.
Genes Chromosomes Cancer ; 40(1): 66-71, 2004 May.
Artigo em Inglês | MEDLINE | ID: mdl-15034871

RESUMO

Comparative genomic hybridization (CGH) has proved to be a powerful tool for the detection of genome copy number changes in human cancers and in other diseases caused by segmental aneusomies. Array versions of CGH allow the definition of these aberrations, with resolution determined by the size and distribution of the array elements. Resolution approaching 100 kb can be achieved by use of arrays comprising bacterial artificial chromosomes (BACs) distributed contiguously across regions of interest. We describe here a computer program that automatically assembles contigs of minimally overlapping BAC clones, using information about BAC end-sequences and the normal genome DNA sequence. We demonstrate the characteristics of contigs assembled and annotated by use of this approach for regions of recurrent abnormality in human ovarian and breast cancers at chromosome bands 3q25-q27 and 8q24 and chromosome arm 20q. We also show illustrative analyses of regions of amplification in these regions in breast and ovarian tumor cell lines by use of array CGH with arrays comprising contiguous BACs.


Assuntos
Cromossomos Artificiais Bacterianos/genética , Biologia Computacional/tendências , Mapeamento de Sequências Contíguas/métodos , Mapeamento de Sequências Contíguas/tendências , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Linhagem Celular Tumoral , Cromossomos Humanos Par 20/genética , Cromossomos Humanos Par 3/genética , Biologia Computacional/métodos , DNA de Neoplasias/genética , Feminino , Genes Neoplásicos/genética , Genoma Humano , Humanos , Hibridização de Ácido Nucleico , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/patologia , Software
19.
Genet. mol. res. (Online) ; 3(1): 53-63, Mar. 2004.
Artigo em Inglês | LILACS | ID: lil-417585

RESUMO

In the finishing phase of the Chromobacterium violaceum genome project, the shotgun sequences were assembled into 57 contigs that were then organized into 19 scaffolds, using the information from shotgun and cosmid clones. Among the 38 ends resulting from the 19 scaffolds, 10 ended with sequences corresponding to rRNA genes (seven ended with the 5S rRNA gene and three ended with the 16S rRNA gene). The 28 non-ribosomal ends were extended using the PCR-assisted contig extension (PACE) methodology, which immediately closed 15 real gaps. We then applied PACE to the 16S rRNA gene containing ends, resulting in eight different sequences that were correctly assembled within the C. violaceum genome by combinatory PCR strategy, with primers derived from the non-repetitive genomic region flanking the 16S and 5S rRNA gene. An oriented combinatory PCR was used to correctly position the two versions (copy A and copy B, which differ by the presence or absence of a 100-bp insert); it revealed six copies corresponding to copy A, and two to copy B. We estimate that the use of PACE, followed by combinatory PCR, accelerated the finishing phase of the C. violaceum genome project by at least 40 per cent


Assuntos
Chromobacterium/genética , Genoma Bacteriano , RNA Ribossômico/genética , Reação em Cadeia da Polimerase/métodos , Mapeamento de Sequências Contíguas/métodos
20.
BMC Biotechnol ; 4: 1, 2004 Jan 06.
Artigo em Inglês | MEDLINE | ID: mdl-14709179

RESUMO

BACKGROUND: BAC clones containing entire mammalian genes including all the transcribed region and long range controlling elements are very useful for functional analysis. Sequenced BACs are available for most of the human and mouse genomes and in many cases these contain intact genes. However, large genes often span more than one BAC, and single BACs covering the entire region of interest are not available. Here we describe a system for linking two or more overlapping BACs into a single clone by homologous recombination. RESULTS: The method was used to link a 61-kb insert carrying the final 5 exons of the human CFTR gene onto a 160-kb BAC carrying the first 22 exons. Two rounds of homologous recombination were carried out in the EL350 strain of bacteria which can be induced for the Red genes. In the first round, the inserts of the two overlapping BACs were subcloned into modified BAC vectors using homologous recombination. In the second round, the BAC to be added was linearised with the very rare-cutting enzyme I-PpoI and electroporated into recombination efficient EL350 bacteria carrying the other BAC. Recombined BACs were identified by antibiotic selection and PCR screening and 10% of clones contained the correctly recombined 220-kb BAC. CONCLUSION: The system can be used to link the inserts from any overlapping BAC or PAC clones. The original orientation of the inserts is not important and desired regions of the inserts can be selected. The size limit for the fragments recombined may be larger than the 61 kb used here and multiple BACs in a contig could be combined by alternating use of the two pBACLink vectors. This system should be of use to many investigators wishing to carry out functional analysis on large mammalian genes which are not available in single BAC clones.


Assuntos
Cromossomos Artificiais Bacterianos/genética , Mapeamento de Sequências Contíguas/métodos , Regulador de Condutância Transmembrana em Fibrose Cística/genética , Vetores Genéticos/genética , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA