Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
1.
Cell Chem Biol ; 30(11): 1453-1467.e8, 2023 11 16.
Artigo em Inglês | MEDLINE | ID: mdl-37607550

RESUMO

Orphan cytotoxins are small molecules for which the mechanism of action (MoA) is either unknown or ambiguous. Unveiling the mechanism of these compounds may lead to useful tools for biological investigation and new therapeutic leads. In selected cases, the DNA mismatch repair-deficient colorectal cancer cell line, HCT116, has been used as a tool in forward genetic screens to identify compound-resistant mutations, which have ultimately led to target identification. To expand the utility of this approach, we engineered cancer cell lines with inducible mismatch repair deficits, thus providing temporal control over mutagenesis. By screening for compound resistance phenotypes in cells with low or high rates of mutagenesis, we increased both the specificity and sensitivity of identifying resistance mutations. Using this inducible mutagenesis system, we implicate targets for multiple orphan cytotoxins, including a natural product and compounds emerging from a high-throughput screen, thus providing a robust tool for future MoA studies.


Assuntos
Antineoplásicos , Neoplasias do Colo , Humanos , Reparo de Erro de Pareamento de DNA , Antineoplásicos/farmacologia , Mutagênese , Citotoxinas
2.
Nucleic Acids Res ; 34(20): 5730-9, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-17041233

RESUMO

Given a set of known binding sites for a specific transcription factor, it is possible to build a model of the transcription factor binding site, usually called a motif model, and use this model to search for other sites that bind the same transcription factor. Typically, this search is performed using a position-specific scoring matrix (PSSM), also known as a position weight matrix. In this paper we analyze a set of eukaryotic transcription factor binding sites and show that there is extensive clustering of similar k-mers in eukaryotic motifs, owing to both functional and evolutionary constraints. The apparent limitations of probabilistic models in representing complex nucleotide dependencies lead us to a graph-based representation of motifs. When deciding whether a candidate k-mer is part of a motif or not, we base our decision not on how well the k-mer conforms to a model of the motif as a whole, but how similar it is to specific, known k-mers in the motif. We elucidate the reasons why we expect graph-based methods to perform well on motif data. Our MotifScan algorithm shows greatly improved performance over the prevalent PSSM-based method for the detection of eukaryotic motifs.


Assuntos
Algoritmos , Modelos Genéticos , Elementos Reguladores de Transcrição , Análise de Sequência de DNA/métodos , Fatores de Transcrição/metabolismo , Sítios de Ligação , Expressão Gênica , Humanos , Modelos Estatísticos , Nucleotídeos/análise , Saccharomyces cerevisiae/genética
3.
Bioinformatics ; 22(14): e150-7, 2006 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-16873465

RESUMO

MOTIVATION: DNA motif finding is one of the core problems in computational biology, for which several probabilistic and discrete approaches have been developed. Most existing methods formulate motif finding as an intractable optimization problem and rely either on expectation maximization (EM) or on local heuristic searches. Another challenge is the choice of motif model: simpler models such as the position-specific scoring matrix (PSSM) impose biologically unrealistic assumptions such as independence of the motif positions, while more involved models are harder to parametrize and learn. RESULTS: We present MotifCut, a graph-theoretic approach to motif finding leading to a convex optimization problem with a polynomial time solution. We build a graph where the vertices represent all k-mers in the input sequences, and edges represent pairwise k-mer similarity. In this graph, we search for a motif as the maximum density subgraph, which is a set of k-mers that exhibit a large number of pairwise similarities. Our formulation does not make strong assumptions regarding the structure of the motif and in practice both motifs that fit well the PSSM model, and those that exhibit strong dependencies between position pairs are found as dense subgraphs. We benchmark MotifCut on both synthetic and real yeast motifs, and find that it compares favorably to existing popular methods. The ability of MotifCut to detect motifs appears to scale well with increasing input size. Moreover, the motifs we discover are different from those discovered by the other methods. AVAILABILITY: MotifCut server and other materials can be found at motifcut.stanford.edu.


Assuntos
Algoritmos , DNA/genética , Modelos Genéticos , Elementos Reguladores de Transcrição/genética , Análise de Sequência de DNA/métodos , Software , Fatores de Transcrição/genética , Sequência de Bases , Sítios de Ligação , Mapeamento Cromossômico/métodos , Simulação por Computador , Modelos Estatísticos , Dados de Sequência Molecular , Ligação Proteica , Alinhamento de Sequência/métodos
4.
PLoS One ; 6(8): e23473, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21858135

RESUMO

While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web-based questionnaires, we gathered self-reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in-depth questions to refine self-reported diagnoses. Our data suggest that online collection of self-reported data from a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations.


Assuntos
Estudos de Associação Genética/métodos , Genoma Humano/genética , Estudo de Associação Genômica Ampla/métodos , Inquéritos e Questionários , Adulto , Idoso , Estudos de Coortes , Feminino , Genótipo , Humanos , Modelos Logísticos , Masculino , Pessoa de Meia-Idade , Razão de Chances , Polimorfismo de Nucleotídeo Único/genética , Adulto Jovem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA