Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
Bioinformatics ; 34(17): 3047-3049, 2018 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-29659720

RESUMO

Summary: Exhaustive detection of multi-loci markers from genome-wide association study datasets is a computationally challenging problem. This paper presents a massively parallel algorithm for finding all significant combinations of alleles and introduces a software tool termed MP-LAMP that can be easily deployed in a cloud platform, such as Amazon Web Service, as well as in an in-house computer cluster. Multi-loci marker detection is an unbalanced tree search problem that cannot be parallelized by simple tree-splitting using generic parallel programming frameworks, such as Map-Reduce. We employ work stealing and periodic reduce-broadcast to decrease the running time almost linearly to the number of cores. Availability and implementation: MP-LAMP is available at https://github.com/tsudalab/mp-lamp. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Computação em Nuvem , Algoritmos , Humanos , Software
2.
Bioinformatics ; 32(22): 3513-3515, 2016 11 15.
Artigo em Inglês | MEDLINE | ID: mdl-27412093

RESUMO

One of the major issues in genome-wide association studies is to solve the missing heritability problem. While considering epistatic interactions among multiple SNPs may contribute to solving this problem, existing software cannot detect statistically significant high-order interactions. We propose software named LAMPLINK, which employs a cutting-edge method to enumerate statistically significant SNP combinations from genome-wide case-control data. LAMPLINK is implemented as a set of additional functions to PLINK, and hence existing procedures with PLINK can be applicable. Applied to the 1000 Genomes Project data, LAMPLINK detected a combination of five SNPs that are statistically significantly accumulated in the Japanese population. AVAILABILITY AND IMPLEMENTATION: LAMPLINK is available at http://a-terada.github.io/lamplink/ CONTACT: terada@cbms.k.u-tokyo.ac.jp or sese.jun@aist.go.jpSupplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Software , Animais , Genoma , Humanos
3.
Mol Ecol ; 26(1): 193-207, 2017 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-27352992

RESUMO

The habitats of polyploid species are generally distinct from their parental species. Stebbins described polyploids as 'general purpose genotypes', which can tolerate a wide range of environmental conditions. However, little is known about its molecular basis because of the complexity of polyploid genomes. We hypothesized that allopolyploid species might utilize the expression patterns of both parents depending on environments (polyploid plasticity hypothesis). We focused on hydrological niche segregation along fine-scale soil moisture and waterlogging gradients. Two diploid species, Cardamine amara and Cardamine hirsuta, grew best in submerged and unsubmerged conditions, respectively, consistent with their natural habitats. Interestingly, the allotetraploid Cardamine flexuosa derived from them grew similarly in fluctuating as well as submerged and unsubmerged conditions, consistent with its wide environmental tolerance. A similar pattern was found in another species trio: allotetraploid Cardamine scutata and its parents. Using the close relatedness of Cardamine and Arabidopsis, we quantified genomewide expression patterns following dry and wet treatments using an Arabidopsis microarray. Hierarchical clustering analysis revealed that the expression pattern of C. flexuosa clustered with C. hirsuta in the dry condition and with C. amara in the wet condition, supporting our hypothesis. Furthermore, the induction levels of most genes in the allopolyploid were lower than in a specialist diploid species. This reflects a disadvantage of being allopolyploid arising from fixed heterozygosity. We propose that recurrent allopolyploid speciation along soil moisture and waterlogging gradients confers niche differentiation and reproductive isolation simultaneously and serves as a model for studying the molecular basis of ecological speciation and adaptive radiation.


Assuntos
Adaptação Fisiológica/genética , Cardamine/classificação , Ecossistema , Poliploidia , Transcriptoma , Diploide , Isolamento Reprodutivo , Água
4.
Proc Natl Acad Sci U S A ; 110(32): 12996-3001, 2013 Aug 06.
Artigo em Inglês | MEDLINE | ID: mdl-23882073

RESUMO

More than three transcription factors often work together to enable cells to respond to various signals. The detection of combinatorial regulation by multiple transcription factors, however, is not only computationally nontrivial but also extremely unlikely because of multiple testing correction. The exponential growth in the number of tests forces us to set a strict limit on the maximum arity. Here, we propose an efficient branch-and-bound algorithm called the "limitless arity multiple-testing procedure" (LAMP) to count the exact number of testable combinations and calibrate the Bonferroni factor to the smallest possible value. LAMP lists significant combinations without any limit, whereas the family-wise error rate is rigorously controlled under the threshold. In the human breast cancer transcriptome, LAMP discovered statistically significant combinations of as many as eight binding motifs. This method may contribute to uncover pathways regulated in a coordinated fashion and find hidden associations in heterogeneous data.


Assuntos
Algoritmos , Biologia Computacional/métodos , Transdução de Sinais/fisiologia , Fatores de Transcrição/metabolismo , Neoplasias da Mama/genética , Neoplasias da Mama/fisiopatologia , Feminino , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes/genética , Redes Reguladoras de Genes/fisiologia , Humanos , Modelos Genéticos , Reprodutibilidade dos Testes , Transdução de Sinais/genética , Fatores de Transcrição/genética
5.
Methods Mol Biol ; 1807: 83-94, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30030805

RESUMO

Detecting combinatorial effects is important to various research areas, including biology, genomics, and medical sciences. However, this task was not only computationally nontrivial but also extremely difficult to achieve because of the necessity of a multiple testing procedure; hence few methods can comprehensively analyze high-order combinations. Recently, Limitless Arity Multiple-testing Procedure (LAMP) was introduced, allowing us to enumerate statistically significant combinations from a given dataset. This chapter provides instructions for LAMP using simple examples of combinatorial transcription factor regulation discovery and visualization of the results. This chapter also introduces LAMPLINK, which is extended software of LAMP. LAMPLINK can handle genetic dataset to detect statistically significant interactions among multiple SNPs from a genome-wide association study (GWAS) dataset.


Assuntos
Biologia Computacional/métodos , Algoritmos , Humanos , Polimorfismo de Nucleotídeo Único/genética , Fluxo de Trabalho
6.
BMC Med Genomics ; 11(Suppl 2): 31, 2018 04 20.
Artigo em Inglês | MEDLINE | ID: mdl-29697363

RESUMO

BACKGROUND: Survival analysis methods have been widely applied in different areas of health and medicine, spanning over varying events of interest and target diseases. They can be utilized to provide relationships between the survival time of individuals and factors of interest, rendering them useful in searching for biomarkers in diseases such as cancer. However, some disease progression can be very unpredictable because the conventional approaches have failed to consider multiple-marker interactions. An exponential increase in the number of candidate markers requires large correction factor in the multiple-testing correction and hide the significance. METHODS: We address the issue of testing marker combinations that affect survival by adapting the recently developed Limitless Arity Multiple-testing Procedure (LAMP), a p-value correction technique for statistical tests for combination of markers. LAMP cannot handle survival data statistics, and hence we extended LAMP for the log-rank test, making it more appropriate for clinical data, with newly introduced theoretical lower bound of the p-value. RESULTS: We applied the proposed method to gene combination detection for cancer and obtained gene interactions with statistically significant log-rank p-values. Gene combinations with orders of up to 32 genes were detected by our algorithm, and effects of some genes in these combinations are also supported by existing literature. CONCLUSION: The novel approach for detecting prognostic markers presented here can identify statistically significant markers with no limitations on the order of interaction. Furthermore, it can be applied to different types of genomic data, provided that binarization is possible.


Assuntos
Biomarcadores Tumorais/genética , Biologia Computacional/métodos , Estatística como Assunto , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Cistadenocarcinoma Seroso/genética , Cistadenocarcinoma Seroso/patologia , Feminino , Humanos , Invasividade Neoplásica , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/patologia , Análise de Sobrevida
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa