Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 38
Filtrar
1.
Sante Publique ; 36(1): 81-85, 2024 04 05.
Artigo em Francês | MEDLINE | ID: mdl-38580470

RESUMO

In a context of saturation of private dental practices and medical demography issues, responses to requests for emergency dental care are a poorly documented problem. In partnership with the Observatoire Regional de la Santé, the URPS Chirurgiens-Dentistes Nouvelle-Aquitaine, a union, conducted a survey of private dentists in May and June 2022. The objective was to estimate the volume of requests for unscheduled dental care and to describe the responses provided by professionals. More than eight out of ten professionals said they were often called upon for unscheduled care and more than four out of ten set aside specific time slots to provide it. More than a quarter of them said they provided care in 90 percent of cases, in response to requests of this type, and 40 percent provided care in at least half of the cases. For most professionals, the average waiting time for patients requesting unscheduled care was less than 24 hours. Respondents cited patient education as a general avenue for improvement, in addition to the creation of a specific pricing structure for unscheduled care. This survey provides a better understanding of the difficulties faced by professionals on a subject not yet investigated by the dental profession. It documents the acceptability of possible responses in terms of improving professional practices and institutional organizations.


Dans un contexte de saturation des cabinets dentaires libéraux et de démographie médicale tendue, l'apport de réponses aux demandes de soins dentaires non programmés constitue une réelle problématique assez peu documentée. En partenariat avec l'Observatoire régional de la santé, l'URPS Chirurgiens-dentistes Nouvelle-Aquitaine a mené en mai-juin 2022 une enquête auprès de chirurgiens-dentistes libéraux. L'objectif était d'estimer le volume des demandes de soins non programmés en soins dentaires et de décrire les réponses apportées par les professionnels. Plus de huit professionnels sur dix ont déclaré être souvent sollicités pour des soins non programmés, et plus de quatre sur dix prévoyaient des créneaux spécifiques pour les assurer. Plus d'un quart d'entre eux ont déclaré répondre à 90 % des sollicitations pour ce type de soins et 40 % répondre à moins de la moitié des demandes. Les soins non programmés étaient pris en charge dans les 24 heures en moyenne pour la majorité des professionnels. L'éducation des patients a été citée comme une piste d'amélioration générale ou institutionnelle, devant la création d'une cotation spécifique pour les soins non programmés. Cette enquête permet de mieux connaître les difficultés des professionnels sur un sujet non encore investigué auprès de la profession dentaire. Elle documente l'acceptabilité de pistes de réponses pouvant être apportées pour améliorer les pratiques professionnelles et les organisations institutionnelles.


Assuntos
Serviços Médicos de Emergência , Humanos , Inquéritos e Questionários , Atitude do Pessoal de Saúde , Prática Profissional , Assistência Odontológica , Odontólogos
2.
Mol Ecol ; 32(22): 5944-5958, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-37815414

RESUMO

Next-generation biomonitoring proposes to combine machine-learning algorithms with environmental DNA data to automate the monitoring of the Earth's major ecosystems. In the present study, we searched for molecular biomarkers of tree water status to develop next-generation biomonitoring of forest ecosystems. Because phyllosphere microbial communities respond to both tree physiology and climate change, we investigated whether environmental DNA data from tree phyllosphere could be used as molecular biomarkers of tree water status in forest ecosystems. Using an amplicon sequencing approach, we analysed phyllosphere microbial communities of four tree species (Quercus ilex, Quercus robur, Pinus pinaster and Betula pendula) in a forest experiment composed of irrigated and non-irrigated plots. We used these microbial community data to train a machine-learning algorithm (Random Forest) to classify irrigated and non-irrigated trees. The Random Forest algorithm detected tree water status from phyllosphere microbial community composition with more than 90% accuracy for oak species, and more than 75% for pine and birch. Phyllosphere fungal communities were more informative than phyllosphere bacterial communities in all tree species. Seven fungal amplicon sequence variants were identified as candidates for the development of molecular biomarkers of water status in oak trees. Altogether, our results show that microbial community data from tree phyllosphere provides information on tree water status in forest ecosystems and could be included in next-generation biomonitoring programmes that would use in situ, real-time sequencing of environmental DNA to help monitor the health of European temperate forest ecosystems.


Assuntos
DNA Ambiental , Microbiota , Pinus , Monitoramento Biológico , Betula , Microbiota/genética
3.
Cell ; 135(3): 475-85, 2008 Oct 31.
Artigo em Inglês | MEDLINE | ID: mdl-18984159

RESUMO

The organization of the Escherichia coli chromosome into insulated macrodomains influences the segregation of sister chromatids and the mobility of chromosomal DNA. Here, we report that organization of the Terminus region (Ter) into a macrodomain relies on the presence of a 13 bp motif called matS repeated 23 times in the 800-kb-long domain. matS sites are the main targets in the E. coli chromosome of a newly identified protein designated MatP. MatP accumulates in the cell as a discrete focus that colocalizes with the Ter macrodomain. The effects of MatP inactivation reveal its role as main organizer of the Ter macrodomain: in the absence of MatP, DNA is less compacted, the mobility of markers is increased, and segregation of Ter macrodomain occurs early in the cell cycle. Our results indicate that a specific organizational system is required in the Terminus region for bacterial chromosome management during the cell cycle.


Assuntos
Proteínas Cromossômicas não Histona/metabolismo , Cromossomos Bacterianos/química , Cromossomos Bacterianos/metabolismo , Proteínas de Escherichia coli/metabolismo , Escherichia coli/citologia , Escherichia coli/metabolismo , Divisão Celular , Proteínas Cromossômicas não Histona/genética , DNA Bacteriano/química , DNA Bacteriano/metabolismo , Proteínas de Escherichia coli/genética
4.
Bioinformatics ; 38(1): 141-148, 2021 12 22.
Artigo em Inglês | MEDLINE | ID: mdl-34478490

RESUMO

MOTIVATION: Combining the results of different experiments to exhibit complex patterns or to improve statistical power is a typical aim of data integration. The starting point of the statistical analysis often comes as a set of P-values resulting from previous analyses, that need to be combined flexibly to explore complex hypotheses, while guaranteeing a low proportion of false discoveries. RESULTS: We introduce the generic concept of composed hypothesis, which corresponds to an arbitrary complex combination of simple hypotheses. We rephrase the problem of testing a composed hypothesis as a classification task and show that finding items for which the composed null hypothesis is rejected boils down to fitting a mixture model and classifying the items according to their posterior probabilities. We show that inference can be efficiently performed and provide a thorough classification rule to control for type I error. The performance and the usefulness of the approach are illustrated in simulations and on two different applications. The method is scalable, does not require any parameter tuning, and provided valuable biological insight on the considered application cases. AVAILABILITY AND IMPLEMENTATION: The QCH methodology is available in the qch package hosted on CRAN. Additionally, R codes to reproduce the Einkorn example are available on the personal webpage of the first author: https://www6.inrae.fr/mia-paris/Equipes/Membres/Tristan-Mary-Huard. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Projetos de Pesquisa , Estatística como Assunto , Probabilidade
5.
Rev Infirm ; 71(281): 41-43, 2022 May.
Artigo em Francês | MEDLINE | ID: mdl-35843644

RESUMO

The epidemic of Covid-19 was characterized, from its beginning, by "emergency". A state of emergency enacted by the state authorities to fight, on one hand, against the pandemic as such and, on the other hand, to manage the influx of patients admitted in intensive care. In this unprecedented context, the suffering of the people goes beyond the emergency situation and persists in forms ranging from a pseudo-banality to the complexity of an insidious evolution.


Assuntos
COVID-19 , Ansiedade , COVID-19/epidemiologia , Humanos , Pandemias , Estresse Psicológico
6.
Ecol Lett ; 24(9): 1905-1916, 2021 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-34231296

RESUMO

The relative importance of ecological factors and species interactions for shaping species distributions is still debated. The realised niches of eight sympatric tephritid fruit flies were inferred from field abundance data using joint species distribution modelling and network inference, on the whole community and separately on three host plant groups. These estimates were then confronted the fundamental niches of seven fly species estimated through laboratory-measured fitnesses on host plants. Species abundances depended on host plants, followed by climatic factors, with a dose of competition between species sharing host plants. The relative importance of these factors mildly changed among the three host plant groups. Despite overlapping fundamental niches, specialists and generalists had almost distinct realised niches, with possible competitive exclusion of generalists by specialists on Cucurbitaceae. They had different assembly rules: Specialists were mainly influenced by their adaptation to host plants, while generalist abundances varied regardless of their fundamental host use.


Assuntos
Drosophila , Plantas , Animais
7.
Bioinformatics ; 36(4): 1275-1276, 2020 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-31504187

RESUMO

MOTIVATION: De novo comparative metagenomics is one of the most straightforward ways to analyze large sets of metagenomic data. Latest methods use the fraction of shared k-mers to estimate genomic similarity between read sets. However, those methods, while extremely efficient, are still limited by computational needs for practical usage outside of large computing facilities. RESULTS: We present SimkaMin, a quick comparative metagenomics tool with low disk and memory footprints, thanks to an efficient data subsampling scheme used to estimate Bray-Curtis and Jaccard dissimilarities. One billion metagenomic reads can be analyzed in <3 min, with tiny memory (1.09 GB) and disk (≈0.3 GB) requirements and without altering the quality of the downstream comparative analyses, making of SimkaMin a tool perfectly tailored for very large-scale metagenomic projects. AVAILABILITY AND IMPLEMENTATION: https://github.com/GATB/simka. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Metagenômica , Software , Algoritmos , Genômica , Metagenoma , Análise de Sequência de DNA
8.
Syst Biol ; 67(4): 662-680, 2018 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-29385556

RESUMO

To study the evolution of several quantitative traits, the classical phylogenetic comparative framework consists of a multivariate random process running along the branches of a phylogenetic tree. The Ornstein-Uhlenbeck (OU) process is sometimes preferred to the simple Brownian motion (BM) as it models stabilizing selection toward an optimum. The optimum for each trait is likely to be changing over the long periods of time spanned by large modern phylogenies. Our goal is to automatically detect the position of these shifts on a phylogenetic tree, while accounting for correlations between traits, which might exist because of structural or evolutionary constraints. We show that, in the presence of shifts, phylogenetic Principal Component Analysis fails to decorrelate traits efficiently, so that any method aiming at finding shifts needs to deal with correlation simultaneously. We introduce here a simplification of the full multivariate OU model, named scalar OU, which allows for noncausal correlations and is still computationally tractable. We extend the equivalence between the OU and a BM on a rescaled tree to our multivariate framework. We describe an Expectation-Maximization (EM) algorithm that allows for a maximum likelihood estimation of the shift positions, associated with a new model selection criterion, accounting for the identifiability issues for the shift localization on the tree. The method, freely available as an R-package (PhylogeneticEM) is fast, and can deal with missing values. We demonstrate its efficiency and accuracy compared to another state-of-the-art method ($\ell$1ou) on a wide range of simulated scenarios and use this new framework to reanalyze recently gathered data sets on New World Monkeys and Anolis lizards.


Assuntos
Adaptação Biológica , Evolução Biológica , Lagartos , Fenótipo , Platirrinos , Algoritmos , Animais , Filogenia
9.
Stat Appl Genet Mol Biol ; 17(5)2018 09 08.
Artigo em Inglês | MEDLINE | ID: mdl-30205662

RESUMO

Omic data are characterized by the presence of strong dependence structures that result either from data acquisition or from some underlying biological processes. Applying statistical procedures that do not adjust the variable selection step to the dependence pattern may result in a loss of power and the selection of spurious variables. The goal of this paper is to propose a variable selection procedure within the multivariate linear model framework that accounts for the dependence between the multiple responses. We shall focus on a specific type of dependence which consists in assuming that the responses of a given individual can be modelled as a time series. We propose a novel Lasso-based approach within the framework of the multivariate linear model taking into account the dependence structure by using different types of stationary processes covariance structures for the random error matrix. Our numerical experiments show that including the estimation of the covariance matrix of the random error matrix in the Lasso criterion dramatically improves the variable selection performance. Our approach is successfully applied to an untargeted LC-MS (Liquid Chromatography-Mass Spectrometry) data set made of African copals samples. Our methodology is implemented in the R package MultiVarSel which is available from the Comprehensive R Archive Network (CRAN).


Assuntos
Biomarcadores/metabolismo , Cromatografia Líquida/métodos , Interpretação Estatística de Dados , Metabolômica/métodos , Espectrometria de Massas em Tandem/métodos , Humanos , Modelos Lineares , Metabolômica/estatística & dados numéricos
10.
BMC Bioinformatics ; 18(1): 333, 2017 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-28697800

RESUMO

BACKGROUND: Detecting local correlations in expression between neighboring genes along the genome has proved to be an effective strategy to identify possible causes of transcriptional deregulation in cancer. It has been successfully used to illustrate the role of mechanisms such as copy number variation (CNV) or epigenetic alterations as factors that may significantly alter expression in large chromosomal regions (gene silencing or gene activation). RESULTS: The identification of correlated regions requires segmenting the gene expression correlation matrix into regions of homogeneously correlated genes and assessing whether the observed local correlation is significantly higher than the background chromosomal correlation. A unified statistical framework is proposed to achieve these two tasks, where optimal segmentation is efficiently performed using dynamic programming algorithm, and detection of highly correlated regions is then achieved using an exact test procedure. We also propose a simple and efficient procedure to correct the expression signal for mechanisms already known to impact expression correlation. The performance and robustness of the proposed procedure, called SegCorr, are evaluated on simulated data. The procedure is illustrated on cancer data, where the signal is corrected for correlations caused by copy number variation. It permitted the detection of regions with high correlations linked to epigenetic marks like DNA methylation. CONCLUSIONS: SegCorr is a novel method that performs correlation matrix segmentation and applies a test procedure in order to detect highly correlated regions in gene expression.


Assuntos
Regulação Neoplásica da Expressão Gênica , Genômica/métodos , Modelos Estatísticos , Algoritmos , Variações do Número de Cópias de DNA , Metilação de DNA , Epigênese Genética , Expressão Gênica , Humanos , Neoplasias/genética
11.
Biometrics ; 72(3): 804-14, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-26683201

RESUMO

Next-generation sequencing technologies now constitute a method of choice to measure gene expression. Data to analyze are read counts, commonly modeled using negative binomial distributions. A relevant issue associated with this probabilistic framework is the reliable estimation of the overdispersion parameter, reinforced by the limited number of replicates generally observable for each gene. Many strategies have been proposed to estimate this parameter, but when differential analysis is the purpose, they often result in procedures based on plug-in estimates, and we show here that this discrepancy between the estimation framework and the testing framework can lead to uncontrolled type-I errors. Instead, we propose a mixture model that allows each gene to share information with other genes that exhibit similar variability. Three consistent statistical tests are developed for differential expression analysis. We show through a wide simulation study that the proposed method improves the sensitivity of detecting differentially expressed genes with respect to the common procedures, since it reaches the nominal value for the type-I error, while keeping elevate discriminative power between differentially and not differentially expressed genes. The method is finally illustrated on prostate cancer RNA-Seq data.


Assuntos
Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Modelos Estatísticos , Humanos , Masculino , Neoplasias da Próstata/genética , Análise de Sequência de RNA
12.
EMBO J ; 30(10): 1928-38, 2011 May 18.
Artigo em Inglês | MEDLINE | ID: mdl-21487388

RESUMO

Post-translational modification of histones and DNA methylation are important components of chromatin-level control of genome activity in eukaryotes. However, principles governing the combinatorial association of chromatin marks along the genome remain poorly understood. Here, we have generated epigenomic maps for eight histone modifications (H3K4me2 and 3, H3K27me1 and 2, H3K36me3, H3K56ac, H4K20me1 and H2Bub) in the model plant Arabidopsis and we have combined these maps with others, produced under identical conditions, for H3K9me2, H3K9me3, H3K27me3 and DNA methylation. Integrative analysis indicates that these 12 chromatin marks, which collectively cover ∼90% of the genome, are present at any given position in a very limited number of combinations. Moreover, we show that the distribution of the 12 marks along the genomic sequence defines four main chromatin states, which preferentially index active genes, repressed genes, silent repeat elements and intergenic regions. Given the compact nature of the Arabidopsis genome, these four indexing states typically translate into short chromatin domains interspersed with each other. This first combinatorial view of the Arabidopsis epigenome points to simple principles of organization as in metazoans and provides a framework for further studies of chromatin-based regulatory mechanisms in plants.


Assuntos
Arabidopsis/fisiologia , Cromatina/metabolismo , Epigênese Genética , Regulação da Expressão Gênica de Plantas , Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Cromossomos/metabolismo , Metilação de DNA , Histonas/metabolismo , Processamento de Proteína Pós-Traducional
13.
J Theor Biol ; 365: 365-76, 2015 Jan 21.
Artigo em Inglês | MEDLINE | ID: mdl-25451529

RESUMO

Dynamic extinction colonisation models (also called contact processes) are widely studied in epidemiology and in metapopulation theory. Contacts are usually assumed to be possible only through a network of connected patches. This network accounts for a spatial landscape or a social organization of interactions. Thanks to social network literature, heterogeneous networks of contacts can be considered. A major issue is to assess the influence of the network in the dynamic model. Most work with this common purpose uses deterministic models or an approximation of a stochastic Extinction-Colonisation model (sEC) which are relevant only for large networks. When working with a limited size network, the induced stochasticity is essential and has to be taken into account in the conclusions. Here, a rigorous framework is proposed for limited size networks and the limitations of the deterministic approximation are exhibited. This framework allows exact computations when the number of patches is small. Otherwise, simulations are used and enhanced by adapted simulation techniques when necessary. A sensitivity analysis was conducted to compare four main topologies of networks in contrasting settings to determine the role of the network. A challenging case was studied in this context: seed exchange of crop species in the Réseau Semences Paysannes (RSP), an emergent French farmers׳ organisation. A stochastic Extinction-Colonisation model was used to characterize the consequences of substantial changes in terms of RSP׳s social organization on the ability of the system to maintain crop varieties.


Assuntos
Modelos Biológicos , Sementes/fisiologia , Evolução Biológica , Difusão , França , Dinâmica Populacional , Probabilidade , Triticum/fisiologia
15.
Stat Appl Genet Mol Biol ; 11(5)2012 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-23152425

RESUMO

The aim of this paper is to propose a test procedure for the detection of differential alternative splicing across conditions for tiling array or exon chip data. While developed in a mixed model framework, the test procedure is exact (avoiding computational burden) and applicable to a large variety of contrasts, including several previously published ones. A simulation study is presented to evaluate the robustness and performance of the method. It is found to have a good detection power of genes under differential alternative splicing, even for five biological replicates and four probes per exon. The methodology also enables the comparison of various experimental designs through exact power curves. This is illustrated with the comparison of paired and unpaired experiments. The test procedure was applied to two publicly available cancer data sets based on exon arrays, and showed promising results.


Assuntos
Algoritmos , Processamento Alternativo , Interpretação Estatística de Dados , Análise de Sequência com Séries de Oligonucleotídeos , Adenocarcinoma/genética , Adenocarcinoma/metabolismo , Carcinoma de Células Escamosas/genética , Carcinoma de Células Escamosas/metabolismo , Simulação por Computador , Éxons , Feminino , Humanos , Análise dos Mínimos Quadrados , Modelos Lineares , Modelos Genéticos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Neoplasias do Colo do Útero/genética , Neoplasias do Colo do Útero/metabolismo
16.
Biostatistics ; 12(3): 413-28, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21209153

RESUMO

The statistical analysis of array comparative genomic hybridization (CGH) data has now shifted to the joint assessment of copy number variations at the cohort level. Considering multiple profiles gives the opportunity to correct for systematic biases observed on single profiles, such as probe GC content or the so-called "wave effect." In this article, we extend the segmentation model developed in the univariate case to the joint analysis of multiple CGH profiles. Our contribution is multiple: we propose an integrated model to perform joint segmentation, normalization, and calling for multiple array CGH profiles. This model shows great flexibility, especially in the modeling of the wave effect that gives a likelihood framework to approaches proposed by others. We propose a new dynamic programming algorithm for break point positioning, as well as a model selection criterion based on a modified bayesian information criterion proposed in the univariate case. The performance of our method is assessed using simulated and real data sets. Our method is implemented in the R package cghseg.


Assuntos
Teorema de Bayes , Hibridização Genômica Comparativa/métodos , Interpretação Estatística de Dados , Modelos Genéticos , Modelos Estatísticos , Algoritmos , Simulação por Computador , Haplótipos , Humanos
17.
Stat Appl Genet Mol Biol ; 10(1)2011 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-23089815

RESUMO

Tiling arrays make possible a large-scale exploration of the genome thanks to probes which cover the whole genome with very high density, up to 2,000,000 probes. Biological questions usually addressed are either the expression difference between two conditions or the detection of transcribed regions. In this work, we propose to consider both questions simultaneously as an unsupervised classification problem by modeling the joint distribution of the two conditions. In contrast to previous methods, we account for all available information on the probes as well as biological knowledge such as annotation and spatial dependence between probes. Since probes are not biologically relevant units, we propose a classification rule for non-connected regions covered by several probes. Applications to transcriptomic and ChIP-chip data of Arabidopsis thaliana obtained with a NimbleGen tiling array highlight the importance of a precise modeling and of the region classification. The "TAHMMAnnot" package is implemented in R and C and is freely available from CRAN.


Assuntos
Imunoprecipitação da Cromatina/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Transcriptoma , Arabidopsis/genética , Cromossomos de Plantas/genética , Hibridização Genômica Comparativa , Biologia Computacional/métodos , Simulação por Computador , Sondas de DNA/genética , Éxons , Perfilação da Expressão Gênica/métodos , Genes de Plantas , Histonas/análise , Histonas/genética , Cadeias de Markov , Modelos Genéticos , Anotação de Sequência Molecular , RNA de Plantas/genética
18.
Trends Ecol Evol ; 37(6): 497-506, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35246322

RESUMO

A lot of what we know about past speciation and extinction dynamics is based on statistically fitting birth-death processes to phylogenies of extant species. Despite their wide use, the reliability of these tools is regularly questioned. It was recently demonstrated that vast 'congruent' sets of alternative diversification histories cannot be distinguished (i.e., are not identifiable) using extant phylogenies alone, reanimating the debate about the limits of phylogenetic diversification analysis. Here, we summarize what we know about the identifiability of the birth-death process and how identifiability issues can be addressed. We conclude that extant phylogenies, when combined with appropriate prior hypotheses and regularization techniques, can still tell us a lot about past diversification dynamics.


Assuntos
Extinção Biológica , Especiação Genética , Filogenia , Reprodutibilidade dos Testes
19.
Bioinformatics ; 25(18): 2341-7, 2009 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-19561016

RESUMO

MOTIVATION: High-density oligonucleotide tiling array technology holds the promise of a better description of the complexity and the dynamics of transcriptional landscapes. In organisms such as bacteria and yeasts, transcription can be measured on a genome-wide scale with a resolution >25 bp. The statistical models currently used to handle these data remain however very simple, the most popular being the piecewise constant Gaussian model with a fixed number of breakpoints. RESULTS: This article describes a new methodology based on a hidden Markov model that embeds the segmentation of a continuous-valued signal in a probabilistic setting. For a computationally affordable cost, this framework (i) alleviates the difficulty of choosing a fixed number of breakpoints, and (ii) permits retrieving more information than a unique segmentation by giving access to the whole probability distribution of the transcription profile. Importantly, the model is also enriched and accounts for subtle effects such as signal 'drift' and covariates. Relevance of this framework is demonstrated on a Bacillus subtilis dataset. AVAILABILITY: A software is distributed under the GPL.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Bacillus subtilis/genética , Genoma , Análise de Sequência de DNA/métodos , Transcrição Gênica
20.
PLoS Genet ; 3(9): 1614-21, 2007 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-17941709

RESUMO

Bacterial biodiversity at the species level, in terms of gene acquisition or loss, is so immense that it raises the question of how essential chromosomal regions are spared from uncontrolled rearrangements. Protection of the genome likely depends on specific DNA motifs that impose limits on the regions that undergo recombination. Although most such motifs remain unidentified, they are theoretically predictable based on their genomic distribution properties. We examined the distribution of the "crossover hotspot instigator," or Chi, in Escherichia coli, and found that its exceptional distribution is restricted to the core genome common to three strains. We then formulated a set of criteria that were incorporated in a statistical model to search core genomes for motifs potentially involved in genome stability in other species. Our strategy led us to identify and biologically validate two distinct heptamers that possess Chi properties, one in Staphylococcus aureus, and the other in several streptococci. This strategy paves the way for wide-scale discovery of other important functional noncoding motifs that distinguish core genomes from the strain-variable regions.


Assuntos
DNA Bacteriano/genética , Genoma Bacteriano , Modelos Genéticos , Bactérias/genética , Sequência de Bases , Troca Genética , Recombinação Genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA