Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38233091

RESUMO

Structural variations (SVs) are commonly found in cancer genomes. They can cause gene amplification, deletion and fusion, among other functional consequences. With an average read length of hundreds of kilobases, nano-channel-based optical DNA mapping is powerful in detecting large SVs. However, existing SV calling methods are not tailored for cancer samples, which have special properties such as mixed cell types and sub-clones. Here we propose the Cancer Optical Mapping for detecting Structural Variations (COMSV) method that is specifically designed for cancer samples. It shows high sensitivity and specificity in benchmark comparisons. Applying to cancer cell lines and patient samples, COMSV identifies hundreds of novel SVs per sample.


Assuntos
Genoma Humano , Neoplasias , Humanos , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias/genética
2.
Hum Mol Genet ; 31(R1): R97-R104, 2022 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-36018818

RESUMO

Enhancers are pivotal for regulating gene transcription that occurs at promoters. Identification of the interacting enhancer-promoter pairs and understanding the mechanisms behind how they interact and how enhancers modulate transcription can provide fundamental insight into gene regulatory networks. Recently, advances in high-throughput methods in three major areas-chromosome conformation capture assay, such as Hi-C to study basic chromatin architecture, ectopic reporter experiments such as self-transcribing active regulatory region sequencing (STARR-seq) to quantify promoter and enhancer activity, and endogenous perturbations such as clustered regularly interspaced short palindromic repeat interference (CRISPRi) to identify enhancer-promoter compatibility-have further our knowledge about transcription. In this review, we will discuss the major method developments and key findings from these assays.


Assuntos
Elementos Facilitadores Genéticos , Genômica , Elementos Facilitadores Genéticos/genética , Regiões Promotoras Genéticas/genética , Cromatina/genética , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas
3.
Nat Biotechnol ; 40(7): 1056-1065, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35177836

RESUMO

Mounting evidence supports the idea that transcriptional patterns serve as more specific identifiers of active enhancers than histone marks; however, the optimal strategy to identify active enhancers both experimentally and computationally has not been determined. Here, we compared 13 genome-wide RNA sequencing (RNA-seq) assays in K562 cells and show that nuclear run-on followed by cap-selection assay (GRO/PRO-cap) has advantages in enhancer RNA detection and active enhancer identification. We also introduce a tool, peak identifier for nascent transcript starts (PINTS), to identify active promoters and enhancers genome wide and pinpoint the precise location of 5' transcription start sites. Finally, we compiled a comprehensive enhancer candidate compendium based on the detected enhancer RNA (eRNA) transcription start sites (TSSs) available in 120 cell and tissue types, which can be accessed at https://pints.yulab.org . With knowledge of the best available assays and pipelines, this large-scale annotation of candidate enhancers will pave the way for selection and characterization of their functions in a time- and labor-efficient manner.


Assuntos
Elementos Facilitadores Genéticos , RNA , Elementos Facilitadores Genéticos/genética , Regiões Promotoras Genéticas/genética , RNA/genética , Análise de Sequência de RNA/métodos , Sítio de Iniciação de Transcrição
4.
Nat Genet ; 52(10): 1067-1075, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32958950

RESUMO

Distal enhancers play pivotal roles in development and disease yet remain one of the least understood regulatory elements. We used massively parallel reporter assays to perform functional comparisons of two leading enhancer models and find that gene-distal transcription start sites are robust predictors of active enhancers with higher resolution than histone modifications. We show that active enhancer units are precisely delineated by active transcription start sites, validate that these boundaries are sufficient for capturing enhancer function, and confirm that core promoter sequences are necessary for this activity. We assay adjacent enhancers and find that their joint activity is often driven by the stronger unit within the cluster. Finally, we validate these results through functional dissection of a distal enhancer cluster using CRISPR-Cas9 deletions. In summary, definition of high-resolution enhancer boundaries enables deconvolution of complex regulatory loci into modular units.


Assuntos
Elementos Facilitadores Genéticos/genética , Código das Histonas/genética , Sítio de Iniciação de Transcrição , Transcrição Gênica , Linhagem Celular , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Humanos , Regiões Promotoras Genéticas/genética , Processamento de Proteína Pós-Traducional/genética , Iniciação da Transcrição Genética
5.
Mol Cell Proteomics ; 19(3): 554-568, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31839598

RESUMO

Protein-protein interactions play a vital role in nearly all cellular functions. Hence, understanding their interaction patterns and three-dimensional structural conformations can provide crucial insights about various biological processes and underlying molecular mechanisms for many disease phenotypes. Cross-linking mass spectrometry (XL-MS) has the unique capability to detect protein-protein interactions at a large scale along with spatial constraints between interaction partners. The inception of MS-cleavable cross-linkers enabled the MS2-MS3 XL-MS acquisition strategy that provides cross-link information from both MS2 and MS3 level. However, the current cross-link search algorithm available for MS2-MS3 strategy follows a "MS2-centric" approach and suffers from a high rate of mis-identified cross-links. We demonstrate the problem using two new quality assessment metrics ["fraction of mis-identifications" (FMI) and "fraction of interprotein cross-links from known interactions" (FKI)]. We then address this problem, by designing a novel "MS3-centric" approach for cross-link identification and implementing it as a search engine named MaXLinker. MaXLinker outperforms the currently popular search engine with a lower mis-identification rate, and higher sensitivity and specificity. Moreover, we performed human proteome-wide cross-linking mass spectrometry using K562 cells. Employing MaXLinker, we identified a comprehensive set of 9319 unique cross-links at 1% false discovery rate, comprising 8051 intraprotein and 1268 interprotein cross-links. Finally, we experimentally validated the quality of a large number of novel interactions identified in our study, providing a conclusive evidence for MaXLinker's robust performance.


Assuntos
Mapeamento de Interação de Proteínas/métodos , Proteômica/métodos , Humanos , Células K562 , Espectrometria de Massas , Peptídeos/metabolismo , Proteoma , Sensibilidade e Especificidade
6.
Gigascience ; 8(7)2019 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-31289833

RESUMO

BACKGROUND: Optical mapping is an emerging technology that complements sequencing-based methods in genome analysis. It is widely used in improving genome assemblies and detecting structural variations by providing information over much longer (up to 1 Mb) reads. Current standards in optical mapping analysis involve assembling optical maps into contigs and aligning them to a reference, which is limited to pairwise comparison and becomes bias-prone when analyzing multiple samples. FINDINGS: We present a new method, OMMA, that extends optical mapping to the study of complex genomic features by simultaneously interrogating optical maps across many samples in a reference-independent manner. OMMA captures and characterizes complex genomic features, e.g., multiple haplotypes, copy number variations, and subtelomeric structures when applied to 154 human samples across the 26 populations sequenced in the 1000 Genomes Project. For small genomes such as pathogenic bacteria, OMMA accurately reconstructs the phylogenomic relationships and identifies functional elements across 21 Acinetobacter baumannii strains. CONCLUSIONS: With the increasing data throughput of optical mapping system, the use of this technology in comparative genome analysis across many samples will become feasible. OMMA is a timely solution that can address such computational need. The OMMA software is available at https://github.com/TF-Chan-Lab/OMTools.


Assuntos
Genômica/métodos , Mapeamento por Restrição Óptica/métodos , Polimorfismo Genético , População/genética , Análise de Sequência de DNA/métodos , Software , Genoma Humano , Humanos , Filogenia
7.
Nat Commun ; 10(1): 1216, 2019 03 14.
Artigo em Inglês | MEDLINE | ID: mdl-30872580

RESUMO

Efficient crop improvement depends on the application of accurate genetic information contained in diverse germplasm resources. Here we report a reference-grade genome of wild soybean accession W05, with a final assembled genome size of 1013.2 Mb and a contig N50 of 3.3 Mb. The analytical power of the W05 genome is demonstrated by several examples. First, we identify an inversion at the locus determining seed coat color during domestication. Second, a translocation event between chromosomes 11 and 13 of some genotypes is shown to interfere with the assignment of QTLs. Third, we find a region containing copy number variations of the Kunitz trypsin inhibitor (KTI) genes. Such findings illustrate the power of this assembly in the analysis of large structural variations in soybean germplasm collections. The wild soybean genome assembly has wide applications in comparative genomic and evolutionary studies, as well as in crop breeding and improvement programs.


Assuntos
Genoma de Planta/genética , Glycine max/genética , Melhoramento Vegetal/métodos , Locos de Características Quantitativas/genética , Evolução Biológica , Variações do Número de Cópias de DNA , Domesticação , Genômica/métodos , Genótipo , Anotação de Sequência Molecular , Peptídeos/genética , Proteínas de Plantas/genética , Translocação Genética/genética
8.
Genome Biol ; 18(1): 230, 2017 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-29195502

RESUMO

We present a new method, OMSV, for accurately and comprehensively identifying structural variations (SVs) from optical maps. OMSV detects both homozygous and heterozygous SVs, SVs of various types and sizes, and SVs with or without creating or destroying restriction sites. We show that OMSV has high sensitivity and specificity, with clear performance gains over the latest method. Applying OMSV to a human cell line, we identified hundreds of SVs >2 kbp, with 68 % of them missed by sequencing-based callers. Independent experimental validation confirmed the high accuracy of these SVs. The OMSV software is available at http://yiplab.cse.cuhk.edu.hk/omsv/ .


Assuntos
Variação Estrutural do Genoma , Genômica/métodos , Software , Biologia Computacional/métodos , Simulação por Computador , Genoma Humano , Humanos
9.
Bioinformatics ; 33(18): 2933-2935, 2017 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-28505226

RESUMO

SUMMARY: Optical mapping is a molecular technique capturing specific patterns of fluorescent labels along DNA molecules. It has been widely applied in assisted-scaffolding in sequence assemblies, microbial strain typing and detection of structural variations. Various computational methods have been developed to analyze optical mapping data. However, existing tools for processing and visualizing optical map data still have many shortcomings. Here, we present OMTools, an efficient and intuitive data processing and visualization suite to handle and explore large-scale optical mapping profiles. OMTools includes modules for visualization (OMView), data processing and simulation. These modules together form an accessible and convenient pipeline for optical mapping analyses. AVAILABILITY AND IMPLEMENTATION: OMTools is implemented in Java 1.8 and released under a GPL license. OMTools can be downloaded from https://github.com/aldenleung/OMTools and run on any standard desktop computer equipped with a Java virtual machine. CONTACT: kevinyip@cse.cuhk.edu.hk or tf.chan@cuhk.edu.hk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Genômica/métodos , Análise de Sequência de DNA/métodos , Software , Humanos
10.
Bioinformatics ; 33(3): 311-319, 2017 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-28172448

RESUMO

Background: Optical mapping is a technique for capturing fluorescent signal patterns of long DNA molecules (in the range of 0.1­1 Mbp). Recently, it has been complementing the widely used short-read sequencing technology by assisting with scaffolding and detecting large and complex structural variations (SVs). Here, we introduce a fast, robust and accurate tool called OMBlast for aligning optical maps, the set of signal locations on the molecules generated from optical mapping. Our method is based on the seed-and-extend approach from sequence alignment, with modifications specific to optical mapping. Results: Experiments with both synthetic and our real data demonstrate that OMBlast has higher accuracy and faster mapping speed than existing alignment methods. Our tool also shows significant improvement when aligning data with SVs. Availability and Implementation: OMBlast is implemented for Java 1.7 and is released under a GPL license. OMBlast can be downloaded from https://github.com/aldenleung/OMBlast and run directly on machines equipped with a Java virtual machine. Contact: kevinyip@cse.cuhk.edu.hk and tf.chan@cuhk.edu.hk Supplementary Information: Supplementary data are available at Bioinformatics online.


Assuntos
Genoma , Mapeamento por Restrição Óptica/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Software , Animais , Caenorhabditis elegans/genética , Escherichia coli/genética , Genômica/métodos , Humanos , Saccharomyces cerevisiae/genética
11.
Genome Announc ; 3(3)2015 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-25977429

RESUMO

We report the draft genome sequence of an extensively drug-resistant strain of Acinetobacter baumannii, CUAB1, isolated from a patient in a local Hong Kong hospital. MIC testing was performed, and genes previously associated with drug resistance were located.

12.
PLoS One ; 8(9): e74286, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24073205

RESUMO

RNA-sequencing is a powerful tool in studying RNomics. However, the highly abundance of ribosomal RNAs (rRNA) and transfer RNA (tRNA) have predominated in the sequencing reads, thereby hindering the study of lowly expressed genes. Therefore, rRNA depletion prior to sequencing is often performed in order to preserve the subtle alteration in gene expression especially those at relatively low expression levels. One of the commercially available methods is to use DNA or RNA probes to hybridize to the target RNAs. However, there is always a concern with the non-specific binding and unintended removal of messenger RNA (mRNA) when the same set of probes is applied to different organisms. The degree of such unintended mRNA removal varies among organisms due to organism-specific genomic variation. We developed a computer-based method to design probes to deplete rRNA in an organism-specific manner. Based on the computation results, biotinylated-RNA-probes were produced by in vitro transcription and were used to perform rRNA depletion with subtractive hybridization. We demonstrated that the designed probes of 16S rRNAs and 23S rRNAs can efficiently remove rRNAs from Mycobacterium smegmatis. In comparison with a commercial subtractive hybridization-based rRNA removal kit, using organism-specific probes is better in preserving the RNA integrity and abundance. We believe the computer-based design approach can be used as a generic method in preparing RNA of any organisms for next-generation sequencing, particularly for the transcriptome analysis of microbes.


Assuntos
Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Mycobacterium smegmatis/genética , RNA Bacteriano/genética , RNA Mensageiro/genética , RNA Ribossômico/genética , Sondas RNA/genética , RNA Ribossômico/isolamento & purificação , Reação em Cadeia da Polimerase em Tempo Real , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Especificidade da Espécie
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA