Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 83
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 142(6): 930-42, 2010 Sep 17.
Artículo en Inglés | MEDLINE | ID: mdl-20850014

RESUMEN

Although genome-wide hypomethylation is a hallmark of many cancers, roles for active DNA demethylation during tumorigenesis are unknown. Here, loss of the APC tumor suppressor gene causes upregulation of a DNA demethylase system and the concomitant hypomethylation of key intestinal cell fating genes. Notably, this hypomethylation maintained zebrafish intestinal cells in an undifferentiated state that was released upon knockdown of demethylase components. Mechanistically, the demethylase genes are directly activated by Pou5f1 and Cebpß and are indirectly repressed by retinoic acid, which antagonizes Pou5f1 and Cebpß. Apc mutants lack retinoic acid as a result of the transcriptional repression of retinol dehydrogenase l1 via a complex that includes Lef1, Groucho2, Ctbp1, Lsd1, and Corest. Our findings imply a model wherein APC controls intestinal cell fating through a switch in DNA methylation dynamics. Wild-type APC and retinoic acid downregulate demethylase components, thereby promoting DNA methylation of key genes and helping progenitors commit to differentiation.


Asunto(s)
Proteína de la Poliposis Adenomatosa del Colon/metabolismo , Poliposis Adenomatosa del Colon/metabolismo , Metilación de ADN , Intestinos/embriología , Pez Cebra/embriología , Poliposis Adenomatosa del Colon/patología , Oxidorreductasas de Alcohol/metabolismo , Animales , Encéfalo/citología , Proteína beta Potenciadora de Unión a CCAAT/metabolismo , Línea Celular Tumoral , Proliferación Celular , Proteínas Co-Represoras/metabolismo , Neoplasias del Colon/metabolismo , Humanos , Mucosa Intestinal/metabolismo , Intestinos/citología , Factor 3 de Transcripción de Unión a Octámeros/metabolismo , Factores de Transcripción/metabolismo , Transcripción Genética , Tretinoina/metabolismo
2.
Biostatistics ; 24(3): 635-652, 2023 Jul 14.
Artículo en Inglés | MEDLINE | ID: mdl-34893807

RESUMEN

Nonignorable technical variation is commonly observed across data from multiple experimental runs, platforms, or studies. These so-called batch effects can lead to difficulty in merging data from multiple sources, as they can severely bias the outcome of the analysis. Many groups have developed approaches for removing batch effects from data, usually by accommodating batch variables into the analysis (one-step correction) or by preprocessing the data prior to the formal or final analysis (two-step correction). One-step correction is often desirable due it its simplicity, but its flexibility is limited and it can be difficult to include batch variables uniformly when an analysis has multiple stages. Two-step correction allows for richer models of batch mean and variance. However, prior investigation has indicated that two-step correction can lead to incorrect statistical inference in downstream analysis. Generally speaking, two-step approaches introduce a correlation structure in the corrected data, which, if ignored, may lead to either exaggerated or diminished significance in downstream applications such as differential expression analysis. Here, we provide more intuitive and more formal evaluations of the impacts of two-step batch correction compared to existing literature. We demonstrate that the undesired impacts of two-step correction (exaggerated or diminished significance) depend on both the nature of the study design and the batch effects. We also provide strategies for overcoming these negative impacts in downstream analyses using the estimated correlation matrix of the corrected data. We compare the results of our proposed workflow with the results from other published one-step and two-step methods and show that our methods lead to more consistent false discovery controls and power of detection across a variety of batch effect scenarios. Software for our method is available through GitHub (https://github.com/jtleek/sva-devel) and will be available in future versions of the $\texttt{sva}$ R package in the Bioconductor project (https://bioconductor.org/packages/release/bioc/html/sva.html).


Asunto(s)
Expresión Génica , Humanos , Filogenia , Proyectos de Investigación
3.
Breast Cancer Res Treat ; 204(2): 327-340, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38127176

RESUMEN

PURPOSE: Prior studies indicate that the physiologic response to stress can affect gene expression. We evaluated differential gene expression in breast cancers collected from Black women with high versus low exposure to psychosocial stressors. METHODS: We analyzed tumor RNA sequencing data from 417 Black Women's Health Study breast cancer cases with data on early life trauma and neighborhood disadvantage. We conducted age-adjusted differential gene expression analyses and pathway analyses. We also evaluated Conserved Transcriptional Response to Adversity (CTRA) contrast scores, relative fractions of immune cell types, T cell exhaustion, and adrenergic signaling. Analyses were run separately for estrogen receptor positive (ER+; n = 299) and ER- (n = 118) cases. RESULTS: Among ER+ cases, the top differentially expressed pathways by stress exposure were related to RNA and protein metabolism. Among ER- cases, they were related to developmental biology, signal transduction, metabolism, and the immune system. Targeted analyses indicated greater immune pathway enrichment with stress exposure for ER- cases, and possible relevance of adrenergic signaling for ER+ cases. CTRA contrast scores did not differ by stress exposure, but in analyses of the CTRA components, ER- breast cancer cases with high neighborhood disadvantage had higher pro-inflammatory gene expression (p = 0.039) and higher antibody gene expression (p = 0.006) compared to those with low neighborhood disadvantage. CONCLUSION: There are multiple pathways through which psychosocial stress exposure may influence breast tumor biology. Given the present findings on inflammation and immune response in ER- tumors, further research to identify stress-induced changes in the etiology and progression of ER- breast cancer is warranted.


Asunto(s)
Neoplasias de la Mama , Femenino , Humanos , Neoplasias de la Mama/epidemiología , Neoplasias de la Mama/genética , Neoplasias de la Mama/metabolismo , Receptores de Estrógenos/metabolismo , Salud de la Mujer , Adrenérgicos , Expresión Génica
4.
BMC Infect Dis ; 24(1): 610, 2024 Jun 20.
Artículo en Inglés | MEDLINE | ID: mdl-38902649

RESUMEN

BACKGROUND: Blood-based transcriptional gene signatures for tuberculosis (TB) have been developed with potential use to diagnose disease. However, an unresolved issue is whether gene set enrichment analysis of the signature transcripts alone is sufficient for prediction and differentiation or whether it is necessary to use the original model created when the signature was derived. Intra-method comparison is complicated by the unavailability of original training data and missing details about the original trained model. To facilitate the utilization of these signatures in TB research, comparisons between gene set scoring methods cross-data validation of original model implementations are needed. METHODS: We compared the performance of 19 TB gene signatures across 24 transcriptomic datasets using both rrebuilt original models and gene set scoring methods. Existing gene set scoring methods, including ssGSEA, GSVA, PLAGE, Singscore, and Zscore, were used as alternative approaches to obtain the profile scores. The area under the ROC curve (AUC) value was computed to measure performance. Correlation analysis and Wilcoxon paired tests were used to compare the performance of enrichment methods with the original models. RESULTS: For many signatures, the predictions from gene set scoring methods were highly correlated and statistically equivalent to the results given by the original models. In some cases, PLAGE outperformed the original models when considering signatures' weighted mean AUC values and the AUC results within individual studies. CONCLUSION: Gene set enrichment scoring of existing gene sets can distinguish patients with active TB disease from other clinical conditions with equivalent or improved accuracy compared to the original methods and models. These data justify using gene set scoring methods of published TB gene signatures for predicting TB risk and treatment outcomes, especially when original models are difficult to apply or implement.


Asunto(s)
Perfilación de la Expresión Génica , Tuberculosis , Humanos , Tuberculosis/diagnóstico , Tuberculosis/genética , Tuberculosis/microbiología , Perfilación de la Expresión Génica/métodos , Mycobacterium tuberculosis/genética , Transcriptoma , Curva ROC , Reproducibilidad de los Resultados
5.
Regul Toxicol Pharmacol ; 151: 105653, 2024 May 31.
Artículo en Inglés | MEDLINE | ID: mdl-38825064

RESUMEN

Despite two decades of research on silver nanoparticle (AgNP) toxicity, a safe threshold for exposure has not yet been established, albeit being critically needed for risk assessment and regulatory decision-making. Traditionally, a point-of-departure (PoD) value is derived from dose response of apical endpoints in animal studies using either the no-observed-adverse-effect level (NOAEL) approach, or benchmark dose (BMD) modeling. To develop new approach methodologies (NAMs) to inform human risk assessment of AgNPs, we conducted a concentration response modeling of the transcriptomic changes in hepatocytes derived from human induced pluripotent stem cells (iPSCs) after being exposed to a wide range concentration (0.01-25 µg/ml) of AgNPs for 24 h. A plausible transcriptomic PoD of 0.21 µg/ml was derived for a pathway related to the mode-of-action (MOA) of AgNPs, and a more conservative PoD of 0.10 µg/ml for a gene ontology (GO) term not apparently associated with the MOA of AgNPs. A reference dose (RfD) could be calculated from either of the PoDs as a safe threshold for AgNP exposure. The current study illustrates the usefulness of in vitro transcriptomic concentration response study using human cells as a NAM for toxicity study of chemicals that lack adequate toxicity data to inform human risk assessment.

6.
PLoS Pathog ; 17(5): e1009589, 2021 05.
Artículo en Inglés | MEDLINE | ID: mdl-34003848

RESUMEN

Respiratory syncytial virus (RSV) is a major cause of respiratory disease in infants and the elderly. RSV is a non-segmented negative strand RNA virus. The viral M2-1 protein plays a key role in viral transcription, serving as an elongation factor to enable synthesis of full-length mRNAs. M2-1 contains an unusual CCCH zinc-finger motif that is conserved in the related human metapneumovirus M2-1 protein and filovirus VP30 proteins. Previous biochemical studies have suggested that RSV M2-1 might bind to specific virus RNA sequences, such as the transcription gene end signals or poly A tails, but there was no clear consensus on what RSV sequences it binds. To determine if M2-1 binds to specific RSV RNA sequences during infection, we mapped points of M2-1:RNA interactions in RSV-infected cells at 8 and 18 hours post infection using crosslinking immunoprecipitation with RNA sequencing (CLIP-Seq). This analysis revealed that M2-1 interacts specifically with positive sense RSV RNA, but not negative sense genome RNA. It also showed that M2-1 makes contacts along the length of each viral mRNA, indicating that M2-1 functions as a component of the transcriptase complex, transiently associating with nascent mRNA being extruded from the polymerase. In addition, we found that M2-1 binds specific cellular mRNAs. In contrast to the situation with RSV mRNA, M2-1 binds discrete sites within cellular mRNAs, with a preference for A/U rich sequences. These results suggest that in addition to its previously described role in transcription elongation, M2-1 might have an additional role involving cellular RNA interactions.


Asunto(s)
ARN Mensajero/metabolismo , Infecciones por Virus Sincitial Respiratorio/virología , Virus Sincitial Respiratorio Humano/genética , Proteínas Virales/metabolismo , ARN Polimerasas Dirigidas por ADN/genética , ARN Polimerasas Dirigidas por ADN/metabolismo , Humanos , ARN Mensajero/genética , Proteínas de Unión al ARN , Factores de Elongación Transcripcional/genética , Factores de Elongación Transcripcional/metabolismo , Proteínas Virales/genética , Replicación Viral
7.
J Public Health (Oxf) ; 45(2): e184-e195, 2023 Jun 14.
Artículo en Inglés | MEDLINE | ID: mdl-36038507

RESUMEN

BACKGROUND: Development of a prediction model using baseline characteristics of tuberculosis (TB) patients at the time of diagnosis will aid us in early identification of the high-risk groups and devise pertinent strategies accordingly. Hence, we did this study to develop a prognostic-scoring model for predicting the death among newly diagnosed drug sensitive pulmonary TB patients in South India. METHODS: We undertook a longitudinal analysis of cohort data under the Regional Prospective Observational Research for Tuberculosis India consortium. Multivariable cox regression using the stepwise backward elimination procedure was used to select variables for the model building and the nomogram-scoring system was developed with the final selected model. RESULTS: In total, 54 (4.6%) out of the 1181 patients had died during the 1-year follow-up period. The TB mortality rate was 0.20 per 1000 person-days. Eight variables (age, gender, functional limitation, anemia, leukopenia, thrombocytopenia, diabetes, neutrophil-lymphocyte ratio) were selected and a nomogram was built using these variables. The discriminatory power was 0.81 (95% confidence interval: 0.75-0.86) and this model was well-calibrated. Decision curve analysis showed that the model is beneficial at a threshold probability ~15-65%. CONCLUSIONS: This scoring system could help the clinicians and policy makers to devise targeted interventions and in turn reduce the TB mortality in India.


Asunto(s)
Tuberculosis Pulmonar , Tuberculosis , Humanos , Pronóstico , Nomogramas , Probabilidad , India/epidemiología , Estudios Retrospectivos
8.
Clin Infect Dis ; 75(6): 1022-1030, 2022 09 29.
Artículo en Inglés | MEDLINE | ID: mdl-35015839

RESUMEN

BACKGROUND: Blood-based biomarkers for diagnosing active tuberculosis (TB), monitoring treatment response, and predicting risk of progression to TB disease have been reported. However, validation of the biomarkers across multiple independent cohorts is scarce. A robust platform to validate TB biomarkers in different populations with clinical end points is essential to the development of a point-of-care clinical test. NanoString nCounter technology is an amplification-free digital detection platform that directly measures mRNA transcripts with high specificity. Here, we determined whether NanoString could serve as a platform for extensive validation of candidate TB biomarkers. METHODS: The NanoString platform was used for performance evaluation of existing TB gene signatures in a cohort in which signatures were previously evaluated on an RNA-seq dataset. A NanoString codeset that probes 107 genes comprising 12 TB signatures and 6 housekeeping genes (NS-TB107) was developed and applied to total RNA derived from whole blood samples of TB patients and individuals with latent TB infection (LTBI) from South India. The TBSignatureProfiler tool was used to score samples for each signature. An ensemble of machine learning algorithms was used to derive a parsimonious biomarker. RESULTS: Gene signatures present in NS-TB107 had statistically significant discriminative power for segregating TB from LTBI. Further analysis of the data yielded a NanoString 6-gene set (NANO6) that when tested on 10 published datasets was highly diagnostic for active TB. CONCLUSIONS: The NanoString nCounter system provides a robust platform for validating existing TB biomarkers and deriving a parsimonious gene signature with enhanced diagnostic performance.


Asunto(s)
Tuberculosis Latente , Mycobacterium tuberculosis , Tuberculosis , Biomarcadores , Humanos , Tuberculosis Latente/diagnóstico , Mycobacterium tuberculosis/genética , ARN Mensajero/genética , Tuberculosis/diagnóstico , Tuberculosis/genética
9.
Br J Cancer ; 126(2): 287-296, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-34718358

RESUMEN

BACKGROUND: African Americans have the highest pancreatic cancer incidence of any racial/ethnic group in the United States. The oral microbiome was associated with pancreatic cancer risk in a recent study, but no such studies have been conducted in African Americans. Poor oral health, which can be a cause or effect of microbial populations, was associated with an increased risk of pancreatic cancer in a single study of African Americans. METHODS: We prospectively investigated the oral microbiome in relation to pancreatic cancer risk among 122 African-American pancreatic cancer cases and 354 controls. DNA was extracted from oral wash samples for metagenomic shotgun sequencing. Alpha and beta diversity of the microbial profiles were calculated. Multivariable conditional logistic regression was used to estimate odds ratios (ORs) and 95% confidence intervals (CIs) for associations between microbes and pancreatic cancer risk. RESULTS: No associations were observed with alpha or beta diversity, and no individual microbial taxa were differentially abundant between cases and control, after accounting for multiple comparisons. Among never smokers, there were elevated ORs for known oral pathogens: Porphyromonas gingivalis (OR = 1.69, 95% CI: 0.80-3.56), Prevotella intermedia (OR = 1.40, 95% CI: 0.69-2.85), and Tannerella forsythia (OR = 1.36, 95% CI: 0.66-2.77). CONCLUSIONS: Previously reported associations between oral taxa and pancreatic cancer were not present in this African-American population overall.


Asunto(s)
Población Negra/genética , Microbiota , Boca/microbiología , Neoplasias Pancreáticas/patología , Estudios de Casos y Controles , Femenino , Humanos , Masculino , Persona de Mediana Edad , Neoplasias Pancreáticas/epidemiología , Neoplasias Pancreáticas/microbiología , Estudios Prospectivos , Factores de Riesgo , Estados Unidos/epidemiología
10.
Bioinformatics ; 37(11): 1521-1527, 2021 07 12.
Artículo en Inglés | MEDLINE | ID: mdl-33245114

RESUMEN

MOTIVATION: Genomic data are often produced in batches due to practical restrictions, which may lead to unwanted variation in data caused by discrepancies across batches. Such 'batch effects' often have negative impact on downstream biological analysis and need careful consideration. In practice, batch effects are usually addressed by specifically designed software, which merge the data from different batches, then estimate batch effects and remove them from the data. Here, we focus on classification and prediction problems, and propose a different strategy based on ensemble learning. We first develop prediction models within each batch, then integrate them through ensemble weighting methods. RESULTS: We provide a systematic comparison between these two strategies using studies targeting diverse populations infected with tuberculosis. In one study, we simulated increasing levels of heterogeneity across random subsets of the study, which we treat as simulated batches. We then use the two methods to develop a genomic classifier for the binary indicator of disease status. We evaluate the accuracy of prediction in another independent study targeting a different population cohort. We observed that in independent validation, while merging followed by batch adjustment provides better discrimination at low level of heterogeneity, our ensemble learning strategy achieves more robust performance, especially at high severity of batch effects. These observations provide practical guidelines for handling batch effects in the development and evaluation of genomic classifiers. AVAILABILITY AND IMPLEMENTATION: The data underlying this article are available in the article and in its online supplementary material. Processed data is available in the Github repository with implementation code, at https://github.com/zhangyuqing/bea_ensemble. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genoma , Genómica , Humanos , Aprendizaje Automático
11.
Mol Psychiatry ; 26(6): 1808-1831, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-32071385

RESUMEN

Maternal immune activation (MIA) disrupts the central innate immune system during a critical neurodevelopmental period. Microglia are primary innate immune cells in the brain although their direct influence on the MIA phenotype is largely unknown. Here we show that MIA alters microglial gene expression with upregulation of cellular protrusion/neuritogenic pathways, concurrently causing repetitive behavior, social deficits, and synaptic dysfunction to layer V intrinsically bursting pyramidal neurons in the prefrontal cortex of mice. MIA increases plastic dendritic spines of the intrinsically bursting neurons and their interaction with hyper-ramified microglia. Treating MIA offspring by colony stimulating factor 1 receptor inhibitors induces depletion and repopulation of microglia, and corrects protein expression of the newly identified MIA-associated neuritogenic molecules in microglia, which coalesces with correction of MIA-associated synaptic, neurophysiological, and behavioral abnormalities. Our study demonstrates that maternal immune insults perturb microglial phenotypes and influence neuronal functions throughout adulthood, and reveals a potent effect of colony stimulating factor 1 receptor inhibitors on the correction of MIA-associated microglial, synaptic, and neurobehavioral dysfunctions.


Asunto(s)
Microglía , Efectos Tardíos de la Exposición Prenatal , Animales , Conducta Animal , Encéfalo , Modelos Animales de Enfermedad , Femenino , Inflamación , Factor Estimulante de Colonias de Macrófagos , Ratones , Neuronas , Embarazo , Receptores de Factor Estimulante de Colonias de Granulocitos y Macrófagos
12.
BMC Infect Dis ; 21(1): 1058, 2021 Oct 12.
Artículo en Inglés | MEDLINE | ID: mdl-34641820

RESUMEN

BACKGROUND: Comorbidities such as undernutrition and parasitic infections are widespread in India and other tuberculosis (TB)-endemic countries. This study examines how these conditions as well as food supplementation and parasite treatment might alter immune responses to Mycobacterium tuberculosis (Mtb) infection and risk of progression to TB disease. METHODS: This is a 5-year prospective clinical trial at Jawaharlal Institute of Post Graduate Medical Education and Research in Puducherry, Tamil Nadu, India. We aim to enroll 760 household contacts (HHC) of adults with active TB in order to identify 120 who are followed prospectively for 2 years: Thirty QuantiFERON-TB Gold Plus (QFT-Plus) positive HHCs ≥ 18 years of age in four proposed groups: (1) undernourished (body mass index [BMI] < 18.5 kg/m2); (2) participants with a BMI ≥ 18.5 kg/m2 who have a parasitic infection (3) undernourished participants with a parasitic infection and (4) controls-participants with BMI ≥ 18.5 kg/m2 and without parasitic infection. We assess immune response at baseline and after food supplementation (for participants with BMI < 18.5 kg/m2) and parasite treatment (for participants with parasites). Detailed nutritional assessments, anthropometry, and parasite testing through polymerase chain reaction (PCR) and microscopy are performed. In addition, at serial time points, these samples will be further analyzed using flow cytometry and whole blood transcriptomics to elucidate the immune mechanisms involved in disease progression. CONCLUSIONS: This study will help determine whether undernutrition and parasite infection are associated with gene signatures that predict risk of TB and whether providing nutritional supplementation and/or treating parasitic infections improves immune response towards this infection. This study transcends individual level care and presents the opportunity to benefit the population at large by analyzing factors that affect disease progression potentially reducing the overall burden of people who progress to TB disease. Trial registration ClinicalTrials.gov; NCT03598842; Registered on July 26, 2018; https://clinicaltrials.gov/ct2/show/NCT03598842.


Asunto(s)
Mycobacterium tuberculosis , Tuberculosis , Adulto , Humanos , India/epidemiología , Estado Nutricional , Estudios Prospectivos , Tuberculosis/prevención & control
13.
BMC Infect Dis ; 21(1): 106, 2021 Jan 22.
Artículo en Inglés | MEDLINE | ID: mdl-33482742

RESUMEN

BACKGROUND: Gene expression signatures have been used as biomarkers of tuberculosis (TB) risk and outcomes. Platforms are needed to simplify access to these signatures and determine their validity in the setting of comorbidities. We developed a computational profiling platform of TB signature gene sets and characterized the diagnostic ability of existing signature gene sets to differentiate active TB from LTBI in the setting of malnutrition. METHODS: We curated 45 existing TB-related signature gene sets and developed our TBSignatureProfiler software toolkit that estimates gene set activity using multiple enrichment methods and allows visualization of single- and multi-pathway results. The TBSignatureProfiler software is available through Bioconductor and on GitHub. For evaluation in malnutrition, we used whole blood gene expression profiling from 23 severely malnourished Indian individuals with TB and 15 severely malnourished household contacts with latent TB infection (LTBI). Severe malnutrition was defined as body mass index (BMI) < 16 kg/m2 in adults and based on weight-for-height Z scores in children < 18 years. Gene expression was measured using RNA-sequencing. RESULTS: The comparison and visualization functions from the TBSignatureProfiler showed that TB gene sets performed well in malnourished individuals; 40 gene sets had statistically significant discriminative power for differentiating TB from LTBI, with area under the curve ranging from 0.662-0.989. Three gene sets were not significantly predictive. CONCLUSION: Our TBSignatureProfiler is a highly effective and user-friendly platform for applying and comparing published TB signature gene sets. Using this platform, we found that existing gene sets for TB function effectively in the setting of malnutrition, although differences in gene set applicability exist. RNA-sequencing gene sets should consider comorbidities and potential effects on diagnostic performance.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Desnutrición/genética , Programas Informáticos , Tuberculosis/genética , Adolescente , Adulto , Anciano , Área Bajo la Curva , Biomarcadores/sangre , Niño , Comorbilidad , Femenino , Humanos , Tuberculosis Latente/diagnóstico , Tuberculosis Latente/epidemiología , Tuberculosis Latente/genética , Masculino , Desnutrición/diagnóstico , Desnutrición/epidemiología , Persona de Mediana Edad , Mycobacterium tuberculosis , Transcriptoma , Tuberculosis/diagnóstico , Tuberculosis/epidemiología , Adulto Joven
14.
Clin Infect Dis ; 71(10): 2645-2654, 2020 12 17.
Artículo en Inglés | MEDLINE | ID: mdl-31761933

RESUMEN

BACKGROUND: People with advanced human immunodeficiency virus (HIV) (CD4 < 50) remain at high risk of tuberculosis (TB) or death despite the initiation of antiretroviral therapy (ART). We aimed to identify immunological profiles that were most predictive of incident TB disease and death. METHODS: The REMEMBER randomized clinical trial enrolled 850 participants with HIV (CD4 < 50 cells/µL) at ART initiation to receive either empiric TB treatment or isoniazid preventive therapy (IPT). A case-cohort study (n = 257) stratified by country and treatment arm was performed. Cases were defined as incident TB or all-cause death within 48 weeks after ART initiation. Using multiplexed immunoassay panels and ELISA, 26 biomarkers were assessed in plasma. RESULTS: In total, 52 (6.1%) of 850 participants developed TB; 47 (5.5%) died (13 of whom had antecedent TB). Biomarkers associated with incident TB overlapped with those associated with death (interleukin [IL]-1ß, IL-6). Biomarker levels declined over time in individuals with incident TB while remaining persistently elevated in those who died. Dividing the cohort into development and validation sets, the final model of 6 biomarkers (CXCL10, IL-1ß, IL-10, sCD14, tumor necrosis factor [TNF]-α, and TNF-ß) achieved a sensitivity of 0.90 (95% confidence interval [CI]: .87-.94) and a specificity of 0.71(95% CI: .68-.75) with an area under the curve (AUC) of 0.81 (95% CI: .78-.83) for incident TB. CONCLUSION: Among people with advanced HIV, a parsimonious inflammatory biomarker signature predicted those at highest risk for developing TB despite initiation of ART and TB preventive therapies. The signature may be a promising stratification tool to select patients who may benefit from increased monitoring and novel interventions. CLINICAL TRIALS REGISTRATION: NCT01380080.


Asunto(s)
Infecciones por VIH , Tuberculosis , Antituberculosos/uso terapéutico , Biomarcadores , Recuento de Linfocito CD4 , Estudios de Cohortes , VIH , Infecciones por VIH/complicaciones , Infecciones por VIH/tratamiento farmacológico , Humanos , Tuberculosis/diagnóstico , Tuberculosis/tratamiento farmacológico , Tuberculosis/epidemiología
15.
BMC Bioinformatics ; 20(1): 222, 2019 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-31046658

RESUMEN

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) enables the high-throughput quantification of transcriptional profiles in single cells. In contrast to bulk RNA-seq, additional preprocessing steps such as cell barcode identification or unique molecular identifier (UMI) deconvolution are necessary for preprocessing of data from single cell protocols. R packages that can easily preprocess data and rapidly visualize quality metrics and read alignments for individual cells across multiple samples or runs are still lacking. RESULTS: Here we present scruff, an R/Bioconductor package that preprocesses data generated from the CEL-Seq or CEL-Seq2 protocols and reports comprehensive data quality metrics and visualizations. scruff rapidly demultiplexes, aligns, and counts the reads mapped to genome features with deduplication of unique molecular identifier (UMI) tags. scruff also provides novel and extensive functions to visualize both pre- and post-alignment data quality metrics for cells from multiple experiments. Detailed read alignments with corresponding UMI information can be visualized at specific genome coordinates to display differences in isoform usage. The package also supports the visualization of quality metrics for sequence alignment files for multiple experiments generated by Cell Ranger from 10X Genomics. scruff is available as a free and open-source R/Bioconductor package. CONCLUSIONS: scruff streamlines the preprocessing of scRNA-seq data in a few simple R commands. It performs data demultiplexing, alignment, counting, quality report and visualization systematically and comprehensively, ensuring reproducible and reliable analysis of scRNA-seq data.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Genómica/métodos , Alineación de Secuencia , Análisis de la Célula Individual
16.
BMC Cancer ; 19(1): 881, 2019 Sep 05.
Artículo en Inglés | MEDLINE | ID: mdl-31488082

RESUMEN

BACKGROUND: Gene expression profiling of rare cancers has proven challenging due to limited access to patient materials and requirement of intact, non-degraded RNA for next-generation sequencing. We customized a gene expression panel compatible with degraded RNA from formalin-fixed, paraffin-embedded (FFPE) patient cancer samples and investigated its utility in pathway activity profiling in patients with metaplastic breast cancer (MpBC). METHODS: Activity of various biological pathways was profiled in samples from nineteen patients with MpBC and 8 patients with invasive ductal carcinoma with triple negative breast cancer (TNBC) phenotype using a custom gene expression-based assay of 345 genes. RESULTS: MpBC samples of mesenchymal (chondroid and/or osteoid) histology demonstrated increased SNAI1 and BCL2L11 pathway activity compared to samples with non-mesenchymal histology. Additionally, late cornified envelope and keratinization genes were downregulated in MpBC compared to TNBC, and epithelial-to-mesenchymal transition (EMT) and collagen genes were upregulated in MpBC. Patients with high activity of an invasiveness gene expression signature, as well as high expression of the mesenchymal marker and extracellular matrix glycoprotein gene SPARC, experienced worse outcomes than those with low invasiveness activity and low SPARC expression. CONCLUSIONS: This study demonstrates the utility of gene expression profiling of metaplastic breast cancer FFPE samples with a custom counts-based assay. Gene expression patterns identified by this assay suggest that, although often histologically triple negative, patients with MpBC have distinct pathway activation compared to patients with invasive ductal TNBC. Incorporation of targeted therapies may lead to improved outcome for MpBC patients, especially in those patients expressing increased activity of invasiveness pathways.


Asunto(s)
Carcinoma Ductal de Mama/genética , Receptores de Factores de Crecimiento/metabolismo , Transducción de Señal/genética , Transcriptoma/genética , Neoplasias de la Mama Triple Negativas/genética , Adulto , Anciano , Anciano de 80 o más Años , Proteína 11 Similar a Bcl2/metabolismo , Carcinoma Ductal de Mama/patología , Estudios de Cohortes , Transición Epitelial-Mesenquimal/genética , Femenino , Regulación Neoplásica de la Expresión Génica , Humanos , Persona de Mediana Edad , Osteonectina/genética , Fenotipo , Pronóstico , RNA-Seq/métodos , Factores de Transcripción de la Familia Snail/metabolismo , Neoplasias de la Mama Triple Negativas/patología
17.
BMC Bioinformatics ; 19(1): 262, 2018 07 13.
Artículo en Inglés | MEDLINE | ID: mdl-30001694

RESUMEN

BACKGROUND: Combining genomic data sets from multiple studies is advantageous to increase statistical power in studies where logistical considerations restrict sample size or require the sequential generation of data. However, significant technical heterogeneity is commonly observed across multiple batches of data that are generated from different processing or reagent batches, experimenters, protocols, or profiling platforms. These so-called batch effects often confound true biological relationships in the data, reducing the power benefits of combining multiple batches, and may even lead to spurious results in some combined studies. Therefore there is significant need for effective methods and software tools that account for batch effects in high-throughput genomic studies. RESULTS: Here we contribute multiple methods and software tools for improved combination and analysis of data from multiple batches. In particular, we provide batch effect solutions for cases where the severity of the batch effects is not extreme, and for cases where one high-quality batch can serve as a reference, such as the training set in a biomarker study. We illustrate our approaches and software in both simulated and real data scenarios. CONCLUSIONS: We demonstrate the value of these new contributions compared to currently established approaches in the specified batch correction situations.


Asunto(s)
Genómica/métodos , Teorema de Bayes , Humanos , Proyectos de Investigación
18.
PLoS Genet ; 11(12): e1005713, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26658939

RESUMEN

Psychostimulant addiction is a heritable substance use disorder; however its genetic basis is almost entirely unknown. Quantitative trait locus (QTL) mapping in mice offers a complementary approach to human genome-wide association studies and can facilitate environment control, statistical power, novel gene discovery, and neurobiological mechanisms. We used interval-specific congenic mouse lines carrying various segments of chromosome 11 from the DBA/2J strain on an isogenic C57BL/6J background to positionally clone a 206 kb QTL (50,185,512-50,391,845 bp) that was causally associated with a reduction in the locomotor stimulant response to methamphetamine (2 mg/kg, i.p.; DBA/2J < C57BL/6J)-a non-contingent, drug-induced behavior that is associated with stimulation of the dopaminergic reward circuitry. This chromosomal region contained only two protein coding genes-heterogeneous nuclear ribonucleoprotein, H1 (Hnrnph1) and RUN and FYVE domain-containing 1 (Rufy1). Transcriptome analysis via mRNA sequencing in the striatum implicated a neurobiological mechanism involving a reduction in mesolimbic innervation and striatal neurotransmission. For instance, Nr4a2 (nuclear receptor subfamily 4, group A, member 2), a transcription factor crucial for midbrain dopaminergic neuron development, exhibited a 2.1-fold decrease in expression (DBA/2J < C57BL/6J; p 4.2 x 10-15). Transcription activator-like effector nucleases (TALENs)-mediated introduction of frameshift deletions in the first coding exon of Hnrnph1, but not Rufy1, recapitulated the reduced methamphetamine behavioral response, thus identifying Hnrnph1 as a quantitative trait gene for methamphetamine sensitivity. These results define a novel contribution of Hnrnph1 to neurobehavioral dysfunction associated with dopaminergic neurotransmission. These findings could have implications for understanding the genetic basis of methamphetamine addiction in humans and the development of novel therapeutics for prevention and treatment of substance abuse and possibly other psychiatric disorders.


Asunto(s)
Conducta Animal/efectos de los fármacos , Neuronas Dopaminérgicas/metabolismo , Ribonucleoproteínas Nucleares Heterogéneas/genética , Actividad Motora/genética , Sitios de Carácter Cuantitativo/genética , Proteínas Adaptadoras Transductoras de Señales/genética , Animales , Estimulantes del Sistema Nervioso Central/administración & dosificación , Mapeo Cromosómico , Neuronas Dopaminérgicas/efectos de los fármacos , Estudio de Asociación del Genoma Completo , Ribonucleoproteínas Nucleares Heterogéneas/metabolismo , Humanos , Masculino , Metanfetamina/administración & dosificación , Ratones , Actividad Motora/efectos de los fármacos , Miembro 2 del Grupo A de la Subfamilia 4 de Receptores Nucleares/genética , ARN Mensajero/genética , Transmisión Sináptica/efectos de los fármacos , Transmisión Sináptica/genética
19.
RNA ; 21(5): 786-800, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25805852

RESUMEN

Recent studies hint that endogenous dsRNA plays an unexpected role in cellular signaling. However, a complete understanding of endogenous dsRNA signaling is hindered by an incomplete annotation of dsRNA-producing genes. To identify dsRNAs expressed in Caenorhabditis elegans, we developed a bioinformatics pipeline that identifies dsRNA by detecting clustered RNA editing sites, which are strictly limited to long dsRNA substrates of Adenosine Deaminases that act on RNA (ADAR). We compared two alignment algorithms for mapping both unique and repetitive reads and detected as many as 664 editing-enriched regions (EERs) indicative of dsRNA loci. EERs are visually enriched on the distal arms of autosomes and are predicted to possess strong internal secondary structures as well as sequence complementarity with other EERs, indicative of both intramolecular and intermolecular duplexes. Most EERs were associated with protein-coding genes, with ∼1.7% of all C. elegans mRNAs containing an EER, located primarily in very long introns and in annotated, as well as unannotated, 3' UTRs. In addition to numerous EERs associated with coding genes, we identified a population of prospective noncoding EERs that were distant from protein-coding genes and that had little or no coding potential. Finally, subsets of EERs are differentially expressed during development as well as during starvation and infection with bacterial or fungal pathogens. By combining RNA-seq with freely available bioinformatics tools, our workflow provides an easily accessible approach for the identification of dsRNAs, and more importantly, a catalog of the C. elegans dsRNAome.


Asunto(s)
Caenorhabditis elegans/genética , Perfilación de la Expresión Génica , Genoma de los Helmintos , ARN Bicatenario/genética , Transcriptoma , Regiones no Traducidas 3' , Adenosina Desaminasa/metabolismo , Animales , Secuencia de Bases , Perfilación de la Expresión Génica/métodos , Intrones , Datos de Secuencia Molecular , Edición de ARN
20.
RNA ; 21(2): 164-71, 2015 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-25519487

RESUMEN

Small RNA sequencing can be used to gain an unprecedented amount of detail into the microRNA transcriptome. The relatively high cost and low throughput of sequencing bases technologies can potentially be offset by the use of multiplexing. However, multiplexing involves a trade-off between increased number of sequenced samples and reduced number of reads per sample (i.e., lower depth of coverage). To assess the effect of different sequencing depths owing to multiplexing on microRNA differential expression and detection, we sequenced the small RNA of lung tissue samples collected in a clinical setting by multiplexing one, three, six, nine, or 12 samples per lane using the Illumina HiSeq 2000. As expected, the numbers of reads obtained per sample decreased as the number of samples in a multiplex increased. Furthermore, after normalization, replicate samples included in distinct multiplexes were highly correlated (R > 0.97). When detecting differential microRNA expression between groups of samples, microRNAs with average expression >1 reads per million (RPM) had reproducible fold change estimates (signal to noise) independent of the degree of multiplexing. The number of microRNAs detected was strongly correlated with the log2 number of reads aligning to microRNA loci (R = 0.96). However, most additional microRNAs detected in samples with greater sequencing depth were in the range of expression which had lower fold change reproducibility. These findings elucidate the trade-off between increasing the number of samples in a multiplex with decreasing sequencing depth and will aid in the design of large-scale clinical studies exploring microRNA expression and its role in disease.


Asunto(s)
MicroARNs/metabolismo , Perfilación de la Expresión Génica , Humanos , Pulmón/metabolismo , MicroARNs/genética , Análisis de Secuencia de ARN , Transcriptoma
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA