Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Nature ; 606(7913): 382-388, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35614220

RESUMO

Mitochondria are epicentres of eukaryotic metabolism and bioenergetics. Pioneering efforts in recent decades have established the core protein componentry of these organelles1 and have linked their dysfunction to more than 150 distinct disorders2,3. Still, hundreds of mitochondrial proteins lack clear functions4, and the underlying genetic basis for approximately 40% of mitochondrial disorders remains unresolved5. Here, to establish a more complete functional compendium of human mitochondrial proteins, we profiled more than 200 CRISPR-mediated HAP1 cell knockout lines using mass spectrometry-based multiomics analyses. This effort generated approximately 8.3 million distinct biomolecule measurements, providing a deep survey of the cellular responses to mitochondrial perturbations and laying a foundation for mechanistic investigations into protein function. Guided by these data, we discovered that PIGY upstream open reading frame (PYURF) is an S-adenosylmethionine-dependent methyltransferase chaperone that supports both complex I assembly and coenzyme Q biosynthesis and is disrupted in a previously unresolved multisystemic mitochondrial disorder. We further linked the putative zinc transporter SLC30A9 to mitochondrial ribosomes and OxPhos integrity and established RAB5IF as the second gene harbouring pathogenic variants that cause cerebrofaciothoracic dysplasia. Our data, which can be explored through the interactive online MITOMICS.app resource, suggest biological roles for many other orphan mitochondrial proteins that still lack robust functional characterization and define a rich cell signature of mitochondrial dysfunction that can support the genetic diagnosis of mitochondrial diseases.


Assuntos
Mitocôndrias , Proteínas Mitocondriais , Proteínas de Transporte de Cátions , Proteínas de Ciclo Celular , Metabolismo Energético , Humanos , Espectrometria de Massas , Mitocôndrias/genética , Mitocôndrias/metabolismo , Doenças Mitocondriais/genética , Doenças Mitocondriais/metabolismo , Proteínas Mitocondriais/genética , Proteínas Mitocondriais/metabolismo , Fatores de Transcrição , Proteínas rab5 de Ligação ao GTP
2.
J Surg Res ; 246: 160-169, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-31586890

RESUMO

BACKGROUND: A major roadblock to reducing the mortality of colorectal cancer (CRC) is prompt detection and treatment, and a simple blood test is likely to have higher compliance than all of the current methods. The purpose of this report is to examine the utility of a mass spectrometry-based blood serum protein biomarker test for detection of CRC. MATERIALS AND METHODS: Blood was drawn from individuals (n = 213) before colonoscopy or from patients with nonmetastatic CRC (n = 50) before surgery. Proteins were isolated from the serum of patients using targeted liquid chromatography-tandem mass spectrometry. We designed a machine-learning statistical model to assess these proteins. RESULTS: When considered individually, over 70% of the selected biomarkers showed significance by Mann-Whitney testing for distinguishing cancer-bearing cases from cancer-free cases. Using machine-learning methods, peptides derived from epidermal growth factor receptor and leucine-rich alpha-2-glycoprotein 1 were consistently identified as highly predictive for detecting CRC from cancer-free cases. A five-marker panel consisting of leucine-rich alpha-2-glycoprotein 1, epidermal growth factor receptor, inter-alpha-trypsin inhibitor heavy-chain family member 4, hemopexin, and superoxide dismutase 3 performed the best with 70% specificity at over 89% sensitivity (area under the curve = 0.86) in the validation set. For distinguishing regional from localized cancers, cross-validation within the training set showed that a panel of four proteins consisting of CD44 molecule, GC-vitamin D-binding protein, C-reactive protein, and inter-alpha-trypsin inhibitor heavy-chain family member 3 yielded the highest performance (area under the curve = 0.75). CONCLUSIONS: The minimally invasive blood biomarker panels identified here could serve as screening/detection alternatives for CRC in a human population and potentially useful for staging of existing cancer.


Assuntos
Biomarcadores Tumorais/sangue , Neoplasias Colorretais/diagnóstico , Detecção Precoce de Câncer/métodos , Metástase Linfática/diagnóstico , Programas de Rastreamento/métodos , Adulto , Idoso , Idoso de 80 Anos ou mais , Colectomia , Colonoscopia , Neoplasias Colorretais/sangue , Neoplasias Colorretais/patologia , Neoplasias Colorretais/cirurgia , Estudos Transversais , Estudos de Viabilidade , Feminino , Humanos , Metástase Linfática/patologia , Masculino , Espectrometria de Massas/métodos , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Projetos Piloto , Estudos Prospectivos , Curva ROC
3.
Bull Math Biol ; 81(8): 2849-2872, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-29644519

RESUMO

We study the dynamics of flagellar growth in eukaryotes where intraflagellar transporters (IFT) play a crucial role. First we investigate a stochastic version of the original balance point model where a constant number of IFT particles move up and down the flagellum. The detailed model is a discrete event vector-valued Markov process occurring in continuous time. First the detailed stochastic model is compared and contrasted with a simple scalar ordinary differential equation (ODE) model of flagellar growth. Numerical simulations reveal that the steady-state mean value of the stochastic model is well approximated by the ODE model. Then we derive a scalar stochastic differential equation (SDE) as a first approximation and obtain a "small noise" approximation showing flagellar length to be Gaussian with mean and variance governed by simple ODEs. The accuracy of the small noise model is compared favorably with the numerical simulation results of the detailed model. Secondly, we derive a revised SDE for flagellar length following the revised balance point model proposed in 2009 in which IFT particles move in trains instead of in isolation. Small noise approximation of the revised SDE yields the same approximate Gaussian distribution for the flagellar length as the SDE corresponding to the original balance point model.


Assuntos
Células Eucarióticas/fisiologia , Células Eucarióticas/ultraestrutura , Flagelos/fisiologia , Flagelos/ultraestrutura , Modelos Biológicos , Algoritmos , Transporte Biológico Ativo/fisiologia , Proteínas de Transporte/fisiologia , Simulação por Computador , Cadeias de Markov , Conceitos Matemáticos , Distribuição Normal , Processos Estocásticos
4.
PLoS Comput Biol ; 13(6): e1005466, 2017 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-28570593

RESUMO

Various types of biological knowledge describe networks of interactions among elementary entities. For example, transcriptional regulatory networks consist of interactions among proteins and genes. Current knowledge about the exact structure of such networks is highly incomplete, and laboratory experiments that manipulate the entities involved are conducted to test hypotheses about these networks. In recent years, various automated approaches to experiment selection have been proposed. Many of these approaches can be characterized as active machine learning algorithms. Active learning is an iterative process in which a model is learned from data, hypotheses are generated from the model to propose informative experiments, and the experiments yield new data that is used to update the model. This review describes the various models, experiment selection strategies, validation techniques, and successful applications described in the literature; highlights common themes and notable distinctions among methods; and identifies likely directions of future research and open problems in the area.


Assuntos
Biologia Computacional/métodos , Aprendizado de Máquina Supervisionado , Algoritmos , Redes Reguladoras de Genes , Redes e Vias Metabólicas , Projetos de Pesquisa
5.
J Virol ; 89(14): 7214-23, 2015 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-25926637

RESUMO

UNLABELLED: Herpes simplex virus 1 (HSV-1) causes recurrent mucocutaneous ulcers and is the leading cause of infectious blindness and sporadic encephalitis in the United States. HSV-1 has been shown to be highly recombinogenic; however, to date, there has been no genome-wide analysis of recombination. To address this, we generated 40 HSV-1 recombinants derived from two parental strains, OD4 and CJ994. The 40 OD4-CJ994 HSV-1 recombinants were sequenced using the Illumina sequencing system, and recombination breakpoints were determined for each of the recombinants using the Bootscan program. Breakpoints occurring in the terminal inverted repeats were excluded from analysis to prevent double counting, resulting in a total of 272 breakpoints in the data set. By placing windows around the 272 breakpoints followed by Monte Carlo analysis comparing actual data to simulated data, we identified a recombination bias toward both high GC content and intergenic regions. A Monte Carlo analysis also suggested that recombination did not appear to be responsible for the generation of the spontaneous nucleotide mutations detected following sequencing. Additionally, kernel density estimation analysis across the genome found that the large, inverted repeats comprise a recombination hot spot. IMPORTANCE: Herpes simplex virus 1 (HSV-1) virus is the leading cause of sporadic encephalitis and blinding keratitis in developed countries. HSV-1 has been shown to be highly recombinogenic, and recombination itself appears to be a significant component of genome replication. To date, there has been no genome-wide analysis of recombination. Here we present the findings of the first genome-wide study of recombination performed by generating and sequencing 40 HSV-1 recombinants derived from the OD4 and CJ994 parental strains, followed by bioinformatics analysis. Recombination breakpoints were determined, yielding 272 breakpoints in the full data set. Kernel density analysis determined that the large inverted repeats constitute a recombination hot spot. Additionally, Monte Carlo analyses found biases toward high GC content and intergenic and repetitive regions.


Assuntos
DNA Viral/genética , Herpesvirus Humano 1/genética , Recombinação Genética , Animais , Composição de Bases , Chlorocebus aethiops , DNA Viral/química , Análise de Sequência de DNA , Células Vero
6.
BMC Med Inform Decis Mak ; 12: 22, 2012 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-22443103

RESUMO

BACKGROUND: The task of spatial cluster detection involves finding spatial regions where some property deviates from the norm or the expected value. In a probabilistic setting this task can be expressed as finding a region where some event is significantly more likely than usual. Spatial cluster detection is of interest in fields such as biosurveillance, mining of astronomical data, military surveillance, and analysis of fMRI images. In almost all such applications we are interested both in the question of whether a cluster exists in the data, and if it exists, we are interested in finding the most accurate characterization of the cluster. METHODS: We present a general dynamic programming algorithm for grid-based spatial cluster detection. The algorithm can be used for both Bayesian maximum a-posteriori (MAP) estimation of the most likely spatial distribution of clusters and Bayesian model averaging over a large space of spatial cluster distributions to compute the posterior probability of an unusual spatial clustering. The algorithm is explained and evaluated in the context of a biosurveillance application, specifically the detection and identification of Influenza outbreaks based on emergency department visits. A relatively simple underlying model is constructed for the purpose of evaluating the algorithm, and the algorithm is evaluated using the model and semi-synthetic test data. RESULTS: When compared to baseline methods, tests indicate that the new algorithm can improve MAP estimates under certain conditions: the greedy algorithm we compared our method to was found to be more sensitive to smaller outbreaks, while as the size of the outbreaks increases, in terms of area affected and proportion of individuals affected, our method overtakes the greedy algorithm in spatial precision and recall. The new algorithm performs on-par with baseline methods in the task of Bayesian model averaging. CONCLUSIONS: We conclude that the dynamic programming algorithm performs on-par with other available methods for spatial cluster detection and point to its low computational cost and extendability as advantages in favor of further research and use of the algorithm.


Assuntos
Algoritmos , Teorema de Bayes , Biovigilância/métodos , Análise por Conglomerados , Demografia , Técnicas de Apoio para a Decisão , Surtos de Doenças/prevenção & controle , Surtos de Doenças/estatística & dados numéricos , Humanos , Influenza Humana/epidemiologia , Influenza Humana/prevenção & controle , Funções Verossimilhança , Vigilância da População/métodos , Controle de Qualidade , Curva ROC , Software
7.
J Comput Biol ; 27(3): 403-417, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-32053004

RESUMO

Advances in systems biology have made clear the importance of network models for capturing knowledge about complex relationships in gene regulation, metabolism, and cellular signaling. A common approach to uncovering biological networks involves performing perturbations on elements of the network, such as gene knockdown experiments, and measuring how the perturbation affects some reporter of the process under study. In this article, we develop context-specific nested effects models (CSNEMs), an approach to inferring such networks that generalizes nested effects models (NEMs). The main contribution of this work is that CSNEMs explicitly model the participation of a gene in multiple contexts, meaning that a gene can appear in multiple places in the network. Biologically, the representation of regulators in multiple contexts may indicate that these regulators have distinct roles in different cellular compartments or cell cycle phases. We present an evaluation of the method on simulated data as well as on data from a study of the sodium chloride stress response in Saccharomyces cerevisiae.


Assuntos
Saccharomyces cerevisiae/efeitos dos fármacos , Cloreto de Sódio/farmacologia , Biologia de Sistemas/métodos , Algoritmos , Regulação Fúngica da Expressão Gênica/efeitos dos fármacos , Redes Reguladoras de Genes/efeitos dos fármacos , Modelos Genéticos , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética
8.
AMIA Jt Summits Transl Sci Proc ; 2020: 98-107, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32477628

RESUMO

Asthma is a prevalent chronic respiratory condition, and acute exacerbations represent a significant fraction of the economic and health-related costs associated with asthma. We present results from a novel study that is focused on modeling asthma exacerbations from data contained in patients' electronic health records. This work makes the following contributions: (i) we develop an algorithm for phenotyping asthma exacerbations from EHRs, (ii) we determine that models learned via supervised learning approaches can predict asthma exacerbations in the near future (AUC ≈ 0.77), and (iii) we develop an approach, based on mixtures of semi-Markov models, that is able to identify subpopula-tions of asthma patients sharing distinct temporal and seasonal patterns in their exacerbation susceptibility.

9.
Res Comput Mol Biol ; 10812: 194-210, 2018 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-30680375

RESUMO

Advances in systems biology have made clear the importance of network models for capturing knowledge about complex relationships in gene regulation, metabolism, and cellular signaling. A common approach to uncovering biological networks involves performing perturbations on elements of the network, such as gene knockdown experiments, and measuring how the perturbation affects some reporter of the process under study. In this paper, we develop context-specific nested effects models (CSNEMs), an approach to inferring such networks that generalizes nested effect models (NEMs). The main contribution of this work is that CSNEMs explicitly model the participation of a gene in multiple contexts, meaning that a gene can appear in multiple places in the network. Biologically, the representation of regulators in multiple contexts may indicate that these regulators have distinct roles in different cellular compartments or cell cycle phases. We present an evaluation of the method on simulated data as well as on data from a study of the sodium chloride stress response in Saccharomyces cerevisiae.

10.
Adv Intell Data Anal ; 9385: 275-285, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-27069983

RESUMO

We present a novel approach to the problem of detecting multivariate statistical differences across groups of data. The need to compare data in a multivariate manner arises naturally in observational studies, randomized trials, comparative effectiveness research, abnormality and anomaly detection scenarios, and other application areas. In such comparisons, it is of interest to identify statistical differences across the groups being compared. The approach we present in this paper addresses this issue by constructing statistical models that describe the groups being compared and using a decomposable Bayesian Dirichlet score of the models to identify variables that behave statistically differently between the groups. In our evaluation, the new method performed significantly better than logistic lasso regression in indentifying differences in a variety of datasets under a variety of conditions.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA