Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Nat Methods ; 20(6): 815-823, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37156842

RESUMO

Evolutionarily conserved signaling pathways are essential for early embryogenesis, and reducing or abolishing their activity leads to characteristic developmental defects. Classification of phenotypic defects can identify the underlying signaling mechanisms, but this requires expert knowledge and the classification schemes have not been standardized. Here we use a machine learning approach for automated phenotyping to train a deep convolutional neural network, EmbryoNet, to accurately identify zebrafish signaling mutants in an unbiased manner. Combined with a model of time-dependent developmental trajectories, this approach identifies and classifies with high precision phenotypic defects caused by loss of function of the seven major signaling pathways relevant for vertebrate development. Our classification algorithms have wide applications in developmental biology and robustly identify signaling defects in evolutionarily distant species. Furthermore, using automated phenotyping in high-throughput drug screens, we show that EmbryoNet can resolve the mechanism of action of pharmaceutical substances. As part of this work, we freely provide more than 2 million images that were used to train and test EmbryoNet.


Assuntos
Aprendizado Profundo , Animais , Peixe-Zebra/genética , Redes Neurais de Computação , Algoritmos , Fenótipo , Transdução de Sinais
2.
J Environ Manage ; 336: 117642, 2023 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-36907065

RESUMO

Fecal pollution is one of the most prevalent forms of pollution affecting waterbodies worldwide, threatening public health and negatively impacting aquatic environments. Microbial source tracking (MST) applies polymerase chain reaction (PCR) technology to help identify the source of fecal pollution. In this study, we combine spatial data for two watersheds with general and host-associated MST markers to target human (HF183/BacR287), bovine (CowM2), and general ruminant (Rum2Bac) sources. Concentrations of MST markers in samples were determined with droplet digital PCR (ddPCR). The three MST markers were detected at all sites (n = 25), but bovine and general ruminant markers were significantly associated with watershed characteristics. MST results, combined with watershed characteristics, suggest that streams draining areas with low-infiltration soil groups and high agricultural land use are at an increased risk for fecal contamination. Microbial source tracking has been applied in numerous studies to aid in identifying the sources of fecal contamination, but these studies usually lack information on the involvement of watershed characteristics. Our study combined watershed characteristics with MST results to provide more comprehensive insight into the factors that influence fecal contamination in order to implement the most effective best management practices.


Assuntos
Monitoramento Ambiental , Poluição da Água , Animais , Bovinos , Humanos , Poluição da Água/análise , Monitoramento Ambiental/métodos , Reação em Cadeia da Polimerase , Fezes , Microbiologia da Água , Ruminantes
3.
RNA Biol ; 19(1): 1228-1243, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-36457147

RESUMO

Endothelial cells (ECs) comprise the lumenal lining of all blood vessels and are critical for the functioning of the cardiovascular system. Their phenotypes can be modulated by alternative splicing of RNA to produce distinct protein isoforms. To characterize the RNA and protein isoform landscape within ECs, we applied a long read proteogenomics approach to analyse human umbilical vein endothelial cells (HUVECs). Transcripts delineated from PacBio sequencing serve as the basis for a sample-specific protein database used for downstream mass-spectrometry (MS) analysis to infer protein isoform expression. We detected 53,863 transcript isoforms from 10,426 genes, with 22,195 of those transcripts being novel. Furthermore, the predominant isoform in HUVECs does not correspond with the accepted "reference isoform" 25% of the time, with vascular pathway-related genes among this group. We found 2,597 protein isoforms supported through unique peptides, with an additional 2,280 isoforms nominated upon incorporation of long-read transcript evidence. We characterized a novel alternative acceptor for endothelial-related gene CDH5, suggesting potential changes in its associated signalling pathways. Finally, we identified novel protein isoforms arising from a diversity of RNA splicing mechanisms supported by uniquely mapped novel peptides. Our results represent a high-resolution atlas of known and novel isoforms of potential relevance to endothelial phenotypes and function.[Figure: see text].


Assuntos
Proteogenômica , Humanos , Células Endoteliais da Veia Umbilical Humana , Isoformas de Proteínas/genética , Processamento Alternativo , RNA
4.
Environ Manage ; 62(2): 190-209, 2018 08.
Artigo em Inglês | MEDLINE | ID: mdl-29796704

RESUMO

Climate change has far-reaching effects on human and ecological systems, requiring collaboration across sectors and disciplines to determine effective responses. To inform regional responses to climate change, decision-makers need credible and relevant information representing a wide swath of knowledge and perspectives. The southeastern U. S. State of Georgia is a valuable focal area for study because it contains multiple ecological zones that vary greatly in land use and economic activities, and it is vulnerable to diverse climate change impacts. We identified 40 important research questions that, if answered, could lay the groundwork for effective, science-based climate action in Georgia. Top research priorities were identified through a broad solicitation of candidate research questions (180 were received). A group of experts across sectors and disciplines gathered for a workshop to categorize, prioritize, and filter the candidate questions, identify missing topics, and rewrite questions. Participants then collectively chose the 40 most important questions. This cross-sectoral effort ensured the inclusion of a diversity of topics and questions (e.g., coastal hazards, agricultural production, ecosystem functioning, urban infrastructure, and human health) likely to be important to Georgia policy-makers, practitioners, and scientists. Several cross-cutting themes emerged, including the need for long-term data collection and consideration of at-risk Georgia citizens and communities. Workshop participants defined effective responses as those that take economic cost, environmental impacts, and social justice into consideration. Our research highlights the importance of collaborators across disciplines and sectors, and discussing challenges and opportunities that will require transdisciplinary solutions.


Assuntos
Pessoal Administrativo , Mudança Climática , Conservação dos Recursos Naturais/métodos , Política Ambiental , Pesquisa/organização & administração , Tomada de Decisões , Ecossistema , Georgia , Humanos
5.
bioRxiv ; 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38915658

RESUMO

Studying protein isoforms is an essential step in biomedical research; at present, the main approach for analyzing proteins is via bottom-up mass spectrometry proteomics, which return peptide identifications, that are indirectly used to infer the presence of protein isoforms. However, the detection and quantification processes are noisy; in particular, peptides may be erroneously detected, and most peptides, known as shared peptides, are associated to multiple protein isoforms. As a consequence, studying individual protein isoforms is challenging, and inferred protein results are often abstracted to the gene-level or to groups of protein isoforms. Here, we introduce IsoBayes , a novel statistical method to perform inference at the isoform level. Our method enhances the information available, by integrating mass spectrometry proteomics and transcriptomics data in a Bayesian probabilistic framework. To account for the uncertainty in the measurement process, we propose a two-layer latent variable approach: first, we sample if a peptide has been correctly detected (or, alternatively filter peptides); second, we allocate the abundance of such selected peptides across the protein(s) they are compatible with. This enables us, starting from peptide-level data, to recover protein-level data; in particular, we: i) infer the presence/absence of each protein isoform (via a posterior probability), ii) estimate its abundance (and credible interval), and iii) target isoforms where transcript and protein relative abundances significantly differ. We benchmarked our approach in simulations, and in two multi-protease real datasets: our method displays good sensitivity and specificity when detecting protein isoforms, its estimated abundances highly correlate with the ground truth, and can detect changes between protein and transcript relative abundances. IsoBayes is freely distributed as a Bioconductor R package, and is accompanied by an example usage vignette.

6.
bioRxiv ; 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38617311

RESUMO

Alternative splicing is a major contributor of transcriptomic complexity, but the extent to which transcript isoforms are translated into stable, functional protein isoforms is unclear. Furthermore, detection of relatively scarce isoform-specific peptides is challenging, with many protein isoforms remaining uncharted due to technical limitations. Recently, a family of advanced targeted MS strategies, termed internal standard parallel reaction monitoring (IS-PRM), have demonstrated multiplexed, sensitive detection of pre-defined peptides of interest. Such approaches have not yet been used to confirm existence of novel peptides. Here, we present a targeted proteogenomic approach that leverages sample-matched long-read RNA sequencing (LR RNAseq) data to predict potential protein isoforms with prior transcript evidence. Predicted tryptic isoform-specific peptides, which are specific to individual gene product isoforms, serve as "triggers" and "targets" in the IS-PRM method, Tomahto. Using the model human stem cell line WTC11, LR RNAseq data were generated and used to inform the generation of synthetic standards for 192 isoform-specific peptides (114 isoforms from 55 genes). These synthetic "trigger" peptides were labeled with super heavy tandem mass tags (TMT) and spiked into TMT-labeled WTC11 tryptic digest, predicted to contain corresponding endogenous "target" peptides. Compared to DDA mode, Tomahto increased detectability of isoforms by 3.6-fold, resulting in the identification of five previously unannotated isoforms. Our method detected protein isoform expression for 43 out of 55 genes corresponding to 54 resolved isoforms. This LR RNA seq-informed Tomahto targeted approach, called LRP-IS-PRM, is a new modality for generating protein-level evidence of alternative isoforms - a critical first step in designing functional studies and eventually clinical assays.

7.
bioRxiv ; 2024 Mar 17.
Artigo em Inglês | MEDLINE | ID: mdl-38559226

RESUMO

Long-read RNA sequencing has shed light on transcriptomic complexity, but questions remain about the functionality of downstream protein products. We introduce Biosurfer, a computational approach for comparing protein isoforms, while systematically tracking the transcriptional, splicing, and translational variations that underlie differences in the sequences of the protein products. Using Biosurfer, we analyzed the differences in 32,799 pairs of GENCODE annotated protein isoforms, finding a majority (70%) of variable N-termini are due to the alternative transcription start sites, while only 9% arise from 5' UTR alternative splicing. Biosurfer's detailed tracking of nucleotide-to-residue relationships helped reveal an uncommonly tracked source of single amino acid residue changes arising from the codon splits at junctions. For 17% of internal sequence changes, such split codon patterns lead to single residue differences, termed "ragged codons". Of variable C-termini, 72% involve splice- or intron retention-induced reading frameshifts. We found an unusual pattern of reading frame changes, in which the first frameshift is closely followed by a distinct second frameshift that restores the original frame, which we term a "snapback" frameshift. We analyzed long read RNA-seq-predicted proteome of a human cell line and found similar trends as compared to our GENCODE analysis, with the exception of a higher proportion of isoforms predicted to undergo nonsense-mediated decay. Biosurfer's comprehensive characterization of long-read RNA-seq datasets should accelerate insights of the functional role of protein isoforms, providing mechanistic explanation of the origins of the proteomic diversity driven by the alternative splicing. Biosurfer is available as a Python package at https://github.com/sheynkman-lab/biosurfer.

8.
Nat Commun ; 15(1): 5053, 2024 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-38871684

RESUMO

Childhood radioactive iodine exposure from the Chornobyl accident increased papillary thyroid carcinoma (PTC) risk. While cervical lymph node metastases (cLNM) are well-recognized in pediatric PTC, the PTC metastatic process and potential radiation association are poorly understood. Here, we analyze cLNM occurrence among 428 PTC with genomic landscape analyses and known drivers (131I-exposed = 349, unexposed = 79; mean age = 27.9 years). We show that cLNM are more frequent in PTC with fusion (55%) versus mutation (30%) drivers, although the proportion varies by specific driver gene (RET-fusion = 71%, BRAF-mutation = 38%, RAS-mutation = 5%). cLNM frequency is not associated with other characteristics, including radiation dose. cLNM molecular profiling (N = 47) demonstrates 100% driver concordance with matched primary PTCs and highly concordant mutational spectra. Transcriptome analysis reveals 17 differentially expressed genes, particularly in the HOXC cluster and BRINP3; the strongest differentially expressed microRNA also is near HOXC10. Our findings underscore the critical role of driver alterations and provide promising candidates for elucidating the biological underpinnings of PTC cLNM.


Assuntos
Acidente Nuclear de Chernobyl , Radioisótopos do Iodo , Metástase Linfática , Mutação , Câncer Papilífero da Tireoide , Neoplasias da Glândula Tireoide , Humanos , Câncer Papilífero da Tireoide/genética , Câncer Papilífero da Tireoide/patologia , Metástase Linfática/genética , Masculino , Adulto , Feminino , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia , Adolescente , Proteínas Proto-Oncogênicas B-raf/genética , Adulto Jovem , Linfonodos/patologia , Proteínas Proto-Oncogênicas c-ret/genética , Criança , Genômica , Pessoa de Meia-Idade , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Perfilação da Expressão Gênica , MicroRNAs/genética , MicroRNAs/metabolismo , Neoplasias Induzidas por Radiação/genética , Neoplasias Induzidas por Radiação/patologia , Pescoço/patologia , Regulação Neoplásica da Expressão Gênica
9.
Genome Biol ; 24(1): 91, 2023 04 24.
Artigo em Inglês | MEDLINE | ID: mdl-37095564

RESUMO

Long-read RNA sequencing (lrRNA-seq) produces detailed information about full-length transcripts, including novel and sample-specific isoforms. Furthermore, there is an opportunity to call variants directly from lrRNA-seq data. However, most state-of-the-art variant callers have been developed for genomic DNA. Here, there are two objectives: first, we perform a mini-benchmark on GATK, DeepVariant, Clair3, and NanoCaller primarily on PacBio Iso-Seq, data, but also on Nanopore and Illumina RNA-seq data; second, we propose a pipeline to process spliced-alignment files, making them suitable for variant calling with DNA-based callers. With such manipulations, high calling performance can be achieved using DeepVariant on Iso-seq data.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , RNA , Análise de Sequência de RNA , RNA-Seq , Sequenciamento do Exoma
10.
Nat Commun ; 13(1): 4283, 2022 07 25.
Artigo em Inglês | MEDLINE | ID: mdl-35879309

RESUMO

Kinase inhibitors as targeted therapies have played an important role in improving cancer outcomes. However, there are still considerable challenges, such as resistance, non-response, patient stratification, polypharmacology, and identifying combination therapy where understanding a tumor kinase activity profile could be transformative. Here, we develop a graph- and statistics-based algorithm, called KSTAR, to convert phosphoproteomic measurements of cells and tissues into a kinase activity score that is generalizable and useful for clinical pipelines, requiring no quantification of the phosphorylation sites. In this work, we demonstrate that KSTAR reliably captures expected kinase activity differences across different tissues and stimulation contexts, allows for the direct comparison of samples from independent experiments, and is robust across a wide range of dataset sizes. Finally, we apply KSTAR to clinical breast cancer phosphoproteomic data and find that there is potential for kinase activity inference from KSTAR to complement the current clinical diagnosis of HER2 status in breast cancer patients.


Assuntos
Neoplasias da Mama , Proteômica , Algoritmos , Neoplasias da Mama/patologia , Feminino , Humanos , Fosfoproteínas/metabolismo , Fosforilação , Fosfotransferases , Inibidores de Proteínas Quinases/farmacologia , Inibidores de Proteínas Quinases/uso terapêutico
11.
Genome Biol ; 23(1): 69, 2022 03 03.
Artigo em Inglês | MEDLINE | ID: mdl-35241129

RESUMO

BACKGROUND: The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms. RESULTS: We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis. CONCLUSIONS: Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.


Assuntos
Proteogenômica , Processamento Alternativo , Humanos , Isoformas de Proteínas/genética , Proteômica , Análise de Sequência de RNA/métodos , Transcriptoma
12.
Cell Rep ; 37(7): 110022, 2021 11 16.
Artigo em Inglês | MEDLINE | ID: mdl-34788620

RESUMO

Alternative splicing is a post-transcriptional regulatory mechanism producing distinct mRNA molecules from a single pre-mRNA with a prominent role in the development and function of the central nervous system. We used long-read isoform sequencing to generate full-length transcript sequences in the human and mouse cortex. We identify novel transcripts not present in existing genome annotations, including transcripts mapping to putative novel (unannotated) genes and fusion transcripts incorporating exons from multiple genes. Global patterns of transcript diversity are similar between human and mouse cortex, although certain genes are characterized by striking differences between species. We also identify developmental changes in alternative splicing, with differential transcript usage between human fetal and adult cortex. Our data confirm the importance of alternative splicing in the cortex, dramatically increasing transcriptional diversity and representing an important mechanism underpinning gene regulation in the brain. We provide transcript-level data for human and mouse cortex as a resource to the scientific community.


Assuntos
Córtex Cerebral/metabolismo , Isoformas de Proteínas/genética , Transcriptoma/genética , Processamento Alternativo/genética , Animais , Encéfalo/metabolismo , Córtex Cerebral/fisiologia , Éxons/genética , Expressão Gênica/genética , Perfilação da Expressão Gênica/métodos , Genoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Camundongos , Isoformas de Proteínas/metabolismo , Precursores de RNA/genética , Sítios de Splice de RNA/genética , RNA Mensageiro/genética , Análise de Sequência de RNA/métodos
13.
Pain Physician ; 22(5): E395-E406, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31561649

RESUMO

BACKGROUND: Total knee arthroplasty (TKA) can lead to chronic pain and prolonged postoperative opioid use. There are few evidence-based interventions to prevent these outcomes. Recently, beta-blockers have emerged as possible novel analgesics. OBJECTIVES: The objective of this study was to determine whether perioperative beta-blocker use is associated with reduced prolonged postoperative opioid use after TKA. STUDY DESIGN: This study used a retrospective cohort design. SETTING: The research took place within Department of Veterans Affairs hospitals in the United States between April 2012 and April 2016. METHODS: Patients: IRB approval was obtained to examine the records of Veterans Affairs (VA) patients undergoing TKA. Patients using opioids 60 days before surgery were excluded. INTERVENTION: The intervention being investigated was perioperative beta-blocker use, overall and by class. MEASUREMENT: Oral morphine equivalent usage through postoperative day 1 and prescription opioid refills through 30, 90, and 365 days after TKA were recorded. Adjusted models were created controlling for relevant demographic and comorbidity covariates. A secondary analysis examined the same outcomes separated by beta-blocker class. RESULTS: The cohort was 93.8% male with a mean age of 66 years. Among the 11,614 TKAs that comprised the cohort, 2,604 (22.4%) were performed on patients using beta-blockers. After adjustment, beta-blocker use was associated with reduced opioid use through 30 days after surgery (odds ratio [OR] 0.89 [95% confidence interval (CI), 0.80-0.99], P = .026). Selective beta-blockers were associated with reduced opioid use at 30 days (OR 0.88 [95% CI, 0.78-0.98], P = .021), and nonselective beta-blockers were associated with reduced oral morphine equivalent usage through postoperative day 1 (beta = -17.9 [95% CI, -29.9 to -5.8], P = .004). LIMITATIONS: Generalizability of these findings is uncertain, because this study was performed on a cohort of predominantly white, male VA patients. This study also measured opioid use, but opioid use is not a perfect surrogate for pain. Nevertheless, opioid use offers value as an objective measure of pain persistence in a national cohort for which patient-reported outcomes are otherwise unavailable. CONCLUSIONS: Perioperative beta-blocker use was associated with reduced prescription opioid use at 30 days after surgery. Both selective and nonselective beta-blockers were associated with reduced opioid use when analyzed individually. KEY WORDS: Analgesics, opioid, arthroplasty, replacement, knee, adrenergic beta-antagonists, pain management.


Assuntos
Antagonistas Adrenérgicos beta/uso terapêutico , Analgésicos Opioides/uso terapêutico , Dor Pós-Operatória/tratamento farmacológico , Idoso , Artroplastia do Joelho , Estudos de Coortes , Comorbidade , Feminino , Humanos , Articulação do Joelho , Masculino , Pessoa de Meia-Idade , Morfina/uso terapêutico , Razão de Chances , Transtornos Relacionados ao Uso de Opioides/tratamento farmacológico , Manejo da Dor , Dor Pós-Operatória/epidemiologia , Período Pós-Operatório , Estudos Retrospectivos , Estados Unidos
14.
Nat Commun ; 9(1): 1582, 2018 04 20.
Artigo em Inglês | MEDLINE | ID: mdl-29679054

RESUMO

Fluorescence Recovery After Photobleaching (FRAP) and inverse FRAP (iFRAP) assays can be used to assess the mobility of fluorescent molecules. These assays measure diffusion by monitoring the return of fluorescence in bleached regions (FRAP), or the dissipation of fluorescence from photoconverted regions (iFRAP). However, current FRAP/iFRAP analysis methods suffer from simplified assumptions about sample geometry, bleaching/photoconversion inhomogeneities, and the underlying reaction-diffusion kinetics. To address these shortcomings, we developed the software PyFRAP, which fits numerical simulations of three-dimensional models to FRAP/iFRAP data and accounts for bleaching/photoconversion inhomogeneities. Using PyFRAP we determined the diffusivities of fluorescent molecules spanning two orders of magnitude in molecular weight. We measured the tortuous effects that cell-like obstacles exert on effective diffusivity and show that reaction kinetics can be accounted for by model selection. These applications demonstrate the utility of PyFRAP, which can be widely adapted as a new extensible standard for FRAP analysis.

15.
Science ; 336(6082): 721-4, 2012 May 11.
Artigo em Inglês | MEDLINE | ID: mdl-22499809

RESUMO

Biological systems involving short-range activators and long-range inhibitors can generate complex patterns. Reaction-diffusion models postulate that differences in signaling range are caused by differential diffusivity of inhibitor and activator. Other models suggest that differential clearance underlies different signaling ranges. To test these models, we measured the biophysical properties of the Nodal/Lefty activator/inhibitor system during zebrafish embryogenesis. Analysis of Nodal and Lefty gradients revealed that Nodals have a shorter range than Lefty proteins. Pulse-labeling analysis indicated that Nodals and Leftys have similar clearance kinetics, whereas fluorescence recovery assays revealed that Leftys have a higher effective diffusion coefficient than Nodals. These results indicate that differential diffusivity is the major determinant of the differences in Nodal/Lefty range and provide biophysical support for reaction-diffusion models of activator/inhibitor-mediated patterning.


Assuntos
Blástula/metabolismo , Padronização Corporal , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Fatores de Determinação Direita-Esquerda/metabolismo , Ligantes da Sinalização Nodal/metabolismo , Proteínas de Peixe-Zebra/metabolismo , Peixe-Zebra/embriologia , Animais , Difusão , Desenvolvimento Embrionário , Recuperação de Fluorescência Após Fotodegradação , Meia-Vida , Peptídeos e Proteínas de Sinalização Intracelular/genética , Cinética , Fatores de Determinação Direita-Esquerda/genética , Modelos Biológicos , Ligantes da Sinalização Nodal/genética , Proteínas Recombinantes de Fusão/metabolismo , Peixe-Zebra/metabolismo , Proteínas de Peixe-Zebra/genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa