Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Mol Genet Genomic Med ; 10(12): e2072, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36251442

RESUMO

BACKGROUND: Some clinically important genetic variants are not easily evaluated with next-generation sequencing (NGS) methods due to technical challenges arising from high- similarity copies (e.g., PMS2, SMN1/SMN2, GBA1, HBA1/HBA2, CYP21A2), repetitive short sequences (e.g., ARX polyalanine repeats, FMR1 AGG interruptions in CGG repeats, CFTR poly-T/TG repeats), and other complexities (e.g., MSH2 Boland inversions). METHODS: We customized our NGS processes to detect the technically challenging variants mentioned above with adaptations including target enrichment and bioinformatic masking of similar sequences. Adaptations were validated with samples of known genotypes. RESULTS: Our adaptations provided high-sensitivity and high-specificity detection for most of the variants and provided a high-sensitivity primary assay to be followed with orthogonal disambiguation for the others. The sensitivity of the NGS adaptations was 100% for all of the technically challenging variants. Specificity was 100% for those in PMS2, GBA1, SMN1/SMN2, and HBA1/HBA2, and for the MSH2 Boland inversion; 97.8%-100% for CYP21A2 variants; and 85.7% for ARX polyalanine repeats. CONCLUSIONS: NGS assays can detect technically challenging variants when chemistries and bioinformatics are jointly refined. The adaptations described support a scalable, cost-effective path to identifying all clinically relevant variants within a single sample.


Assuntos
Proteína do X Frágil da Deficiência Intelectual , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Endonuclease PMS2 de Reparo de Erro de Pareamento , Hemoglobinas Glicadas , Proteína 2 Homóloga a MutS , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genótipo , Esteroide 21-Hidroxilase
2.
RNA ; 19(7): 958-70, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23697550

RESUMO

Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications.


Assuntos
DNA Complementar/biossíntese , Íntrons , DNA Polimerase Dirigida por RNA/metabolismo , Proteínas Recombinantes de Fusão/metabolismo , Análise de Sequência de RNA/métodos , Sequência de Bases , Clonagem Molecular , Sequência Conservada , DNA Complementar/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Perfilação da Expressão Gênica , Biblioteca Gênica , Geobacillus stearothermophilus/genética , Geobacillus stearothermophilus/metabolismo , Células HeLa , Humanos , Células MCF-7 , MicroRNAs/genética , MicroRNAs/metabolismo , Dados de Sequência Molecular , Fases de Leitura Aberta , Proteínas Periplásmicas de Ligação/genética , Proteínas Periplásmicas de Ligação/metabolismo , Plasmídeos/genética , Plasmídeos/metabolismo , Estabilidade Proteica , DNA Polimerase Dirigida por RNA/genética , Proteínas Recombinantes de Fusão/genética , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Temperatura
3.
Proc Natl Acad Sci U S A ; 110(14): 5552-7, 2013 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-23509297

RESUMO

Standard whole-genome genotyping technologies are unable to determine haplotypes. Here we describe a method for rapid and cost-effective long-range haplotyping. Genomic DNA is diluted and distributed into multiple aliquots such that each aliquot receives a fraction of a haploid copy. The DNA template in each aliquot is amplified by multiple displacement amplification, converted into barcoded sequencing libraries using Nextera technology, and sequenced in multiplexed pools. To assess the performance of our method, we combined two male genomic DNA samples at equal ratios, resulting in a sample with diploid X chromosomes with known haplotypes. Pools of the multiplexed sequencing libraries were subjected to targeted pull-down of a 1-Mb contiguous region of the X-chromosome Duchenne muscular dystrophy gene. We were able to phase the Duchenne muscular dystrophy region into two contiguous haplotype blocks with a mean length of 494 kb. The haplotypes showed 99% agreement with the consensus base calls made by sequencing the individual DNAs. We subsequently used the strategy to haplotype two human genomes. Standard genomic sequencing to identify all heterozygous SNPs in the sample was combined with dilution-amplification-based sequencing data to resolve the phase of identified heterozygous SNPs. Using this procedure, we were able to phase >95% of the heterozygous SNPs from the diploid sequence data. The N50 for a Yoruba male DNA was 702 kb whereas the N50 for a European female DNA was 358 kb. Therefore, the strategy described here is suitable for haplotyping of a set of targeted regions as well as of the entire genome.


Assuntos
Técnicas Genéticas , Genoma Humano/genética , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Técnicas de Amplificação de Ácido Nucleico/métodos , Código de Barras de DNA Taxonômico/métodos , Distrofina/genética , Feminino , Biblioteca Gênica , Genótipo , Humanos , Masculino , Polimorfismo de Nucleotídeo Único/genética
4.
Bioinformatics ; 28(14): 1811-7, 2012 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-22581179

RESUMO

MOTIVATION: Whole genome and exome sequencing of matched tumor-normal sample pairs is becoming routine in cancer research. The consequent increased demand for somatic variant analysis of paired samples requires methods specialized to model this problem so as to sensitively call variants at any practical level of tumor impurity. RESULTS: We describe Strelka, a method for somatic SNV and small indel detection from sequencing data of matched tumor-normal samples. The method uses a novel Bayesian approach which represents continuous allele frequencies for both tumor and normal samples, while leveraging the expected genotype structure of the normal. This is achieved by representing the normal sample as a mixture of germline variation with noise, and representing the tumor sample as a mixture of the normal sample with somatic variation. A natural consequence of the model structure is that sensitivity can be maintained at high tumor impurity without requiring purity estimates. We demonstrate that the method has superior accuracy and sensitivity on impure samples compared with approaches based on either diploid genotype likelihoods or general allele-frequency tests. AVAILABILITY: The Strelka workflow source code is available at ftp://strelka@ftp.illumina.com/. CONTACT: csaunders@illumina.com


Assuntos
Teorema de Bayes , Biologia Computacional/métodos , Neoplasias/genética , Exoma , Frequência do Gene , Variação Genética , Genoma , Humanos , Mutação INDEL , Modelos Genéticos , Alinhamento de Sequência
5.
Nature ; 463(7283): 893-8, 2010 Feb 18.
Artigo em Inglês | MEDLINE | ID: mdl-20164919

RESUMO

The cancer genome is moulded by the dual processes of somatic mutation and selection. Homozygous deletions in cancer genomes occur over recessive cancer genes, where they can confer selective growth advantage, and over fragile sites, where they are thought to reflect an increased local rate of DNA breakage. However, most homozygous deletions in cancer genomes are unexplained. Here we identified 2,428 somatic homozygous deletions in 746 cancer cell lines. These overlie 11% of protein-coding genes that, therefore, are not mandatory for survival of human cells. We derived structural signatures that distinguish between homozygous deletions over recessive cancer genes and fragile sites. Application to clusters of unexplained homozygous deletions suggests that many are in regions of inherent fragility, whereas a small subset overlies recessive cancer genes. The results illustrate how structural signatures can be used to distinguish between the influences of mutation and selection in cancer genomes. The extensive copy number, genotyping, sequence and expression data available for this large series of publicly available cancer cell lines renders them informative reagents for future studies of cancer biology and drug discovery.


Assuntos
Sítios Frágeis do Cromossomo/genética , Deleção de Genes , Genes Neoplásicos/genética , Genes Recessivos/genética , Genoma Humano/genética , Homozigoto , Neoplasias/genética , Seleção Genética/genética , Linhagem Celular Tumoral , Cromossomos Humanos/genética , Variações do Número de Cópias de DNA/genética , Análise Mutacional de DNA , Dosagem de Genes/genética , Humanos , Modelos Genéticos , Análise de Sequência com Séries de Oligonucleotídeos , Mapeamento Físico do Cromossomo , Reprodutibilidade dos Testes
6.
Biostatistics ; 11(1): 164-75, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19837654

RESUMO

High-throughput oligonucleotide microarrays are commonly employed to investigate genetic disease, including cancer. The algorithms employed to extract genotypes and copy number variation function optimally for diploid genomes usually associated with inherited disease. However, cancer genomes are aneuploid in nature leading to systematic errors when using these techniques. We introduce a preprocessing transformation and hidden Markov model algorithm bespoke to cancer. This produces genotype classification, specification of regions of loss of heterozygosity, and absolute allelic copy number segmentation. Accurate prediction is demonstrated with a combination of independent experimental techniques. These methods are exemplified with affymetrix genome-wide SNP6.0 data from 755 cancer cell lines, enabling inference upon a number of features of biological interest. These data and the coded algorithm are freely available for download.


Assuntos
Algoritmos , Alelos , Variações do Número de Cópias de DNA/genética , Testes Genéticos , Modelos Estatísticos , Neoplasias/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Aneuploidia , Teorema de Bayes , Viés , Linhagem Celular Tumoral , Genes Supressores de Tumor , Genótipo , Humanos , Internet , Perda de Heterozigosidade/genética , Cadeias de Markov , Neoplasias/diagnóstico , Polimorfismo de Nucleotídeo Único/genética , Poliploidia , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Software
7.
Sci Signal ; 2(68): ra19, 2009 Apr 28.
Artigo em Inglês | MEDLINE | ID: mdl-19401593

RESUMO

The mammalian postsynaptic density (PSD) comprises a complex collection of approximately 1100 proteins. Despite extensive knowledge of individual proteins, the overall organization of the PSD is poorly understood. Here, we define maps of molecular circuitry within the PSD based on phosphorylation of postsynaptic proteins. Activation of a single neurotransmitter receptor, the N-methyl-D-aspartate receptor (NMDAR), changed the phosphorylation status of 127 proteins. Stimulation of ionotropic and metabotropic glutamate receptors and dopamine receptors activated overlapping networks with distinct combinatorial phosphorylation signatures. Using peptide array technology, we identified specific phosphorylation motifs and switching mechanisms responsible for the integration of neurotransmitter receptor pathways and their coordination of multiple substrates in these networks. These combinatorial networks confer high information-processing capacity and functional diversity on synapses, and their elucidation may provide new insights into disease mechanisms and new opportunities for drug discovery.


Assuntos
Hipocampo/metabolismo , Proteoma/metabolismo , Receptores de Neurotransmissores/metabolismo , Sinapses/metabolismo , Sequência de Aminoácidos , Animais , Sítios de Ligação , Cromatografia Líquida , Agonistas de Aminoácidos Excitatórios/farmacologia , Hipocampo/efeitos dos fármacos , Immunoblotting , Técnicas In Vitro , Espectrometria de Massas , Camundongos , Camundongos Endogâmicos , Dados de Sequência Molecular , N-Metilaspartato/farmacologia , Fosfoproteínas/metabolismo , Fosforilação/efeitos dos fármacos , Análise Serial de Proteínas , Ligação Proteica , Proteômica/métodos , Transdução de Sinais/efeitos dos fármacos , Transmissão Sináptica/efeitos dos fármacos
8.
Mol Cell Proteomics ; 7(5): 962-70, 2008 May.
Artigo em Inglês | MEDLINE | ID: mdl-18216375

RESUMO

It is a major challenge to develop effective sequence database search algorithms to translate molecular weight and fragment mass information obtained from tandem mass spectrometry into high quality peptide and protein assignments. We investigated the peptide identification performance of Mascot and X!Tandem for mass tolerance settings common for low and high accuracy mass spectrometry. We demonstrated that sensitivity and specificity of peptide identification can vary substantially for different mass tolerance settings, but this effect was more significant for Mascot. We present an adjusted Mascot threshold, which allows the user to freely select the best trade-off between sensitivity and specificity. The adjusted Mascot threshold was compared with the default Mascot and X!Tandem scoring thresholds and shown to be more sensitive at the same false discovery rates for both low and high accuracy mass spectrometry data.


Assuntos
Peptídeos/análise , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Algoritmos , Animais , Células Cultivadas , Camundongos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...