Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nat Methods ; 21(6): 954-966, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38689099

RESUMEN

Long-read sequencing has recently transformed metagenomics, enhancing strain-level pathogen characterization, enabling accurate and complete metagenome-assembled genomes, and improving microbiome taxonomic classification and profiling. These advancements are not only due to improvements in sequencing accuracy, but also happening across rapidly changing analysis methods. In this Review, we explore long-read sequencing's profound impact on metagenomics, focusing on computational pipelines for genome assembly, taxonomic characterization and variant detection, to summarize recent advancements in the field and provide an overview of available analytical methods to fully leverage long reads. We provide insights into the advantages and disadvantages of long reads over short reads and their evolution from the early days of long-read sequencing to their recent impact on metagenomics and clinical diagnostics. We further point out remaining challenges for the field such as the integration of methylation signals in sub-strain analysis and the lack of benchmarks.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Metagenoma , Metagenómica , Microbiota , Metagenómica/métodos , Metagenoma/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Microbiota/genética , Humanos , Análisis de Secuencia de ADN/métodos , Biología Computacional/métodos
2.
Nature ; 586(7831): 741-748, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33116287

RESUMEN

The African continent is regarded as the cradle of modern humans and African genomes contain more genetic variation than those from any other continent, yet only a fraction of the genetic diversity among African individuals has been surveyed1. Here we performed whole-genome sequencing analyses of 426 individuals-comprising 50 ethnolinguistic groups, including previously unsampled populations-to explore the breadth of genomic diversity across Africa. We uncovered more than 3 million previously undescribed variants, most of which were found among individuals from newly sampled ethnolinguistic groups, as well as 62 previously unreported loci that are under strong selection, which were predominantly found in genes that are involved in viral immunity, DNA repair and metabolism. We observed complex patterns of ancestral admixture and putative-damaging and novel variation, both within and between populations, alongside evidence that Zambia was a likely intermediate site along the routes of expansion of Bantu-speaking populations. Pathogenic variants in genes that are currently characterized as medically relevant were uncommon-but in other genes, variants denoted as 'likely pathogenic' in the ClinVar database were commonly observed. Collectively, these findings refine our current understanding of continental migration, identify gene flow and the response to human disease as strong drivers of genome-level population variation, and underscore the scientific imperative for a broader characterization of the genomic diversity of African individuals to understand human ancestry and improve health.


Asunto(s)
Variación Genética , Genoma Humano/genética , Genómica , Salud , Migración Humana , África/etnología , Reparación del ADN/genética , Conjuntos de Datos como Asunto , Femenino , Flujo Génico , Genética Médica , Genética de Población , Salud/historia , Historia Antigua , Migración Humana/historia , Humanos , Inmunidad/genética , Lenguaje , Masculino , Metabolismo/genética , Selección Genética , Secuenciación Completa del Genoma
3.
Hum Mol Genet ; 32(6): 1048-1060, 2023 03 06.
Artículo en Inglés | MEDLINE | ID: mdl-36444934

RESUMEN

Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11 487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10-9). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10-9). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10-8) and NPEPPS (P = 1.51 × 10-7), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD.


Asunto(s)
Diabetes Mellitus , Nefropatías Diabéticas , Insuficiencia Renal Crónica , Humanos , Aminopeptidasas , Nefropatías Diabéticas/genética , Secuenciación del Exoma , Riñón , Insuficiencia Renal Crónica/genética
4.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35504290

RESUMEN

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Pueblo Asiatico , Humanos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del Genoma
5.
Am J Hum Genet ; 109(5): 857-870, 2022 05 05.
Artículo en Inglés | MEDLINE | ID: mdl-35385699

RESUMEN

While polygenic risk scores (PRSs) enable early identification of genetic risk for chronic obstructive pulmonary disease (COPD), predictive performance is limited when the discovery and target populations are not well matched. Hypothesizing that the biological mechanisms of disease are shared across ancestry groups, we introduce a PrediXcan-derived polygenic transcriptome risk score (PTRS) to improve cross-ethnic portability of risk prediction. We constructed the PTRS using summary statistics from application of PrediXcan on large-scale GWASs of lung function (forced expiratory volume in 1 s [FEV1] and its ratio to forced vital capacity [FEV1/FVC]) in the UK Biobank. We examined prediction performance and cross-ethnic portability of PTRS through smoking-stratified analyses both on 29,381 multi-ethnic participants from TOPMed population/family-based cohorts and on 11,771 multi-ethnic participants from TOPMed COPD-enriched studies. Analyses were carried out for two dichotomous COPD traits (moderate-to-severe and severe COPD) and two quantitative lung function traits (FEV1 and FEV1/FVC). While the proposed PTRS showed weaker associations with disease than PRS for European ancestry, the PTRS showed stronger association with COPD than PRS for African Americans (e.g., odds ratio [OR] = 1.24 [95% confidence interval [CI]: 1.08-1.43] for PTRS versus 1.10 [0.96-1.26] for PRS among heavy smokers with ≥ 40 pack-years of smoking) for moderate-to-severe COPD. Cross-ethnic portability of the PTRS was significantly higher than the PRS (paired t test p < 2.2 × 10-16 with portability gains ranging from 5% to 28%) for both dichotomous COPD traits and across all smoking strata. Our study demonstrates the value of PTRS for improved cross-ethnic portability compared to PRS in predicting COPD risk.


Asunto(s)
Enfermedad Pulmonar Obstructiva Crónica , Transcriptoma , Humanos , Pulmón , National Heart, Lung, and Blood Institute (U.S.) , Enfermedad Pulmonar Obstructiva Crónica/genética , Factores de Riesgo , Estados Unidos/epidemiología
6.
Hum Mol Genet ; 31(18): 3120-3132, 2022 09 10.
Artículo en Inglés | MEDLINE | ID: mdl-35552711

RESUMEN

Plasma levels of fibrinogen, coagulation factors VII and VIII and von Willebrand factor (vWF) are four intermediate phenotypes that are heritable and have been associated with the risk of clinical thrombotic events. To identify rare and low-frequency variants associated with these hemostatic factors, we conducted whole-exome sequencing in 10 860 individuals of European ancestry (EA) and 3529 African Americans (AAs) from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and the National Heart, Lung and Blood Institute's Exome Sequencing Project. Gene-based tests demonstrated significant associations with rare variation (minor allele frequency < 5%) in fibrinogen gamma chain (FGG) (with fibrinogen, P = 9.1 × 10-13), coagulation factor VII (F7) (with factor VII, P = 1.3 × 10-72; seven novel variants) and VWF (with factor VIII and vWF; P = 3.2 × 10-14; one novel variant). These eight novel rare variant associations were independent of the known common variants at these loci and tended to have much larger effect sizes. In addition, one of the rare novel variants in F7 was significantly associated with an increased risk of venous thromboembolism in AAs (Ile200Ser; rs141219108; P = 4.2 × 10-5). After restricting gene-based analyses to only loss-of-function variants, a novel significant association was detected and replicated between factor VIII levels and a stop-gain mutation exclusive to AAs (rs3211938) in CD36 molecule (CD36). This variant has previously been linked to dyslipidemia but not with the levels of a hemostatic factor. These efforts represent the largest integration of whole-exome sequence data from two national projects to identify genetic variation associated with plasma hemostatic factors.


Asunto(s)
Factor VIII , Hemostáticos , Factor VII/genética , Factor VIII/genética , Fibrinógeno/genética , Humanos , Polimorfismo de Nucleótido Simple/genética , Secuenciación del Exoma , Factor de von Willebrand/análisis , Factor de von Willebrand/genética
7.
Nature ; 562(7728): 583-588, 2018 10.
Artículo en Inglés | MEDLINE | ID: mdl-30356187

RESUMEN

The development of the microbiome from infancy to childhood is dependent on a range of factors, with microbial-immune crosstalk during this time thought to be involved in the pathobiology of later life diseases1-9 such as persistent islet autoimmunity and type 1 diabetes10-12. However, to our knowledge, no studies have performed extensive characterization of the microbiome in early life in a large, multi-centre population. Here we analyse longitudinal stool samples from 903 children between 3 and 46 months of age by 16S rRNA gene sequencing (n = 12,005) and metagenomic sequencing (n = 10,867), as part of the The Environmental Determinants of Diabetes in the Young (TEDDY) study. We show that the developing gut microbiome undergoes three distinct phases of microbiome progression: a developmental phase (months 3-14), a transitional phase (months 15-30), and a stable phase (months 31-46). Receipt of breast milk, either exclusive or partial, was the most significant factor associated with the microbiome structure. Breastfeeding was associated with higher levels of Bifidobacterium species (B. breve and B. bifidum), and the cessation of breast milk resulted in faster maturation of the gut microbiome, as marked by the phylum Firmicutes. Birth mode was also significantly associated with the microbiome during the developmental phase, driven by higher levels of Bacteroides species (particularly B. fragilis) in infants delivered vaginally. Bacteroides was also associated with increased gut diversity and faster maturation, regardless of the birth mode. Environmental factors including geographical location and household exposures (such as siblings and furry pets) also represented important covariates. A nested case-control analysis revealed subtle associations between microbial taxonomy and the development of islet autoimmunity or type 1 diabetes. These data determine the structural and functional assembly of the microbiome in early life and provide a foundation for targeted mechanistic investigation into the consequences of microbial-immune crosstalk for long-term health.


Asunto(s)
Microbioma Gastrointestinal/inmunología , Microbioma Gastrointestinal/fisiología , Encuestas y Cuestionarios , Adolescente , Animales , Bifidobacterium/clasificación , Bifidobacterium/genética , Bifidobacterium/aislamiento & purificación , Lactancia Materna/estadística & datos numéricos , Estudios de Casos y Controles , Niño , Preescolar , Análisis por Conglomerados , Conjuntos de Datos como Asunto , Diabetes Mellitus Tipo 1/inmunología , Diabetes Mellitus Tipo 1/microbiología , Femenino , Firmicutes/clasificación , Firmicutes/genética , Firmicutes/aislamiento & purificación , Microbioma Gastrointestinal/genética , Humanos , Lactante , Masculino , Leche Humana/inmunología , Leche Humana/microbiología , Mascotas , ARN Ribosómico 16S/genética , Hermanos , Factores de Tiempo
8.
Am J Hum Genet ; 106(1): 112-120, 2020 01 02.
Artículo en Inglés | MEDLINE | ID: mdl-31883642

RESUMEN

Whole-genome sequencing (WGS) can improve assessment of low-frequency and rare variants, particularly in non-European populations that have been underrepresented in existing genomic studies. The genetic determinants of C-reactive protein (CRP), a biomarker of chronic inflammation, have been extensively studied, with existing genome-wide association studies (GWASs) conducted in >200,000 individuals of European ancestry. In order to discover novel loci associated with CRP levels, we examined a multi-ancestry population (n = 23,279) with WGS (∼38× coverage) from the Trans-Omics for Precision Medicine (TOPMed) program. We found evidence for eight distinct associations at the CRP locus, including two variants that have not been identified previously (rs11265259 and rs181704186), both of which are non-coding and more common in individuals of African ancestry (∼10% and ∼1% minor allele frequency, respectively, and rare or monomorphic in 1000 Genomes populations of East Asian, South Asian, and European ancestry). We show that the minor (G) allele of rs181704186 is associated with lower CRP levels and decreased transcriptional activity and protein binding in vitro, providing a plausible molecular mechanism for this African ancestry-specific signal. The individuals homozygous for rs181704186-G have a mean CRP level of 0.23 mg/L, in contrast to individuals heterozygous for rs181704186 with mean CRP of 2.97 mg/L and major allele homozygotes with mean CRP of 4.11 mg/L. This study demonstrates the utility of WGS in multi-ethnic populations to drive discovery of complex trait associations of large effect and to identify functional alleles in noncoding regulatory regions.


Asunto(s)
Pueblo Asiatico/genética , Población Negra/genética , Proteína C-Reactiva/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Población Blanca/genética , Secuenciación Completa del Genoma/métodos , Estudios de Cohortes , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Humanos , Desequilibrio de Ligamiento
9.
Bioinformatics ; 2021 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-33760063

RESUMEN

MOTIVATION: There are high demands for joint genotyping of structural variations with short-read sequencing, but efficient and accurate genotyping in population scale is a challenging task. RESULTS: We developed muCNV that aggregates per-sample summary pileups for joint genotyping of > 100,000 samples. Pilot results show very low Mendelian inconsistencies. Applications to large-scale projects in cloud show the computational efficiencies of muCNV genotyping pipeline. AVAILABILITY: muCNV is publicly available for download at: https://github.com/gjun/muCNV. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

11.
BMC Med ; 19(1): 255, 2021 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-34593004

RESUMEN

BACKGROUND: This study aims to identify the causative strain of SARS-CoV-2 in a cluster of vaccine breakthroughs. Vaccine breakthrough by a highly transmissible SARS-CoV-2 strain is a risk to global public health. METHODS: Nasopharyngeal swabs from suspected vaccine breakthrough cases were tested for SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) by qPCR (quantitative polymerase chain reaction) for Wuhan-Hu1 and alpha variant. Positive samples were then sequenced by Swift Normalase Amplicon Panels to determine the causal variant. GATK (genome analysis toolkit) variants were filtered with allele fraction ≥80 and min read depth 30x. RESULTS: Viral sequencing revealed an infection cluster of 6 vaccinated patients infected with the delta (B.1.617.2) SARS-CoV-2 variant. With no history of vaccine breakthrough, this suggests the delta variant may possess immune evasion in patients that received the Pfizer BNT162b2, Moderna mRNA-1273, and Covaxin BBV152. CONCLUSIONS: Delta variant may pose the highest risk out of any currently circulating SARS-CoV-2 variants, with previously described increased transmissibility over alpha variant and now, possible vaccine breakthrough. FUNDING: Parts of this work was supported by the National Institute of Allergy and Infectious Diseases (1U19AI144297) and Baylor College of Medicine internal funding.


Asunto(s)
COVID-19 , SARS-CoV-2 , Vacuna BNT162 , Vacunas contra la COVID-19 , Humanos , Evasión Inmune
12.
Genet Med ; 23(12): 2404-2414, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34363016

RESUMEN

PURPOSE: Cardiovascular disease (CVD) is the leading cause of death in adults in the United States, yet the benefits of genetic testing are not universally accepted. METHODS: We developed the "HeartCare" panel of genes associated with CVD, evaluating high-penetrance Mendelian conditions, coronary artery disease (CAD) polygenic risk, LPA gene polymorphisms, and specific pharmacogenetic (PGx) variants. We enrolled 709 individuals from cardiology clinics at Baylor College of Medicine, and samples were analyzed in a CAP/CLIA-certified laboratory. Results were returned to the ordering physician and uploaded to the electronic medical record. RESULTS: Notably, 32% of patients had a genetic finding with clinical management implications, even after excluding PGx results, including 9% who were molecularly diagnosed with a Mendelian condition. Among surveyed physicians, 84% reported medical management changes based on these results, including specialist referrals, cardiac tests, and medication changes. LPA polymorphisms and high polygenic risk of CAD were found in 20% and 9% of patients, respectively, leading to diet, lifestyle, and other changes. Warfarin and simvastatin pharmacogenetic variants were present in roughly half of the cohort. CONCLUSION: Our results support the use of genetic information in routine cardiovascular health management and provide a roadmap for accompanying research.


Asunto(s)
Cardiología , Enfermedades Cardiovasculares , Adulto , Enfermedades Cardiovasculares/diagnóstico , Enfermedades Cardiovasculares/genética , Enfermedades Cardiovasculares/terapia , Pruebas Genéticas , Humanos , Farmacogenética/métodos , Pruebas de Farmacogenómica , Estados Unidos
13.
Am J Hum Genet ; 100(2): 205-215, 2017 02 02.
Artículo en Inglés | MEDLINE | ID: mdl-28089252

RESUMEN

Whole-genome sequencing (WGS) allows for a comprehensive view of the sequence of the human genome. We present and apply integrated methodologic steps for interrogating WGS data to characterize the genetic architecture of 10 heart- and blood-related traits in a sample of 1,860 African Americans. In order to evaluate the contribution of regulatory and non-protein coding regions of the genome, we conducted aggregate tests of rare variation across the entire genomic landscape using a sliding window, complemented by an annotation-based assessment of the genome using predefined regulatory elements and within the first intron of all genes. These tests were performed treating all variants equally as well as with individual variants weighted by a measure of predicted functional consequence. Significant findings were assessed in 1,705 individuals of European ancestry. After these steps, we identified and replicated components of the genomic landscape significantly associated with heart- and blood-related traits. For two traits, lipoprotein(a) levels and neutrophil count, aggregate tests of low-frequency and rare variation were significantly associated across multiple motifs. For a third trait, cardiac troponin T, investigation of regulatory domains identified a locus on chromosome 9. These practical approaches for WGS analysis led to the identification of informative genomic regions and also showed that defined non-coding regions, such as first introns of genes and regulatory domains, are associated with important risk factor phenotypes. This study illustrates the tractable nature of WGS data and outlines an approach for characterizing the genetic architecture of complex traits.


Asunto(s)
Negro o Afroamericano/genética , Estudio de Asociación del Genoma Completo , Lipoproteína(a)/genética , Troponina T/genética , Proteína C-Reactiva/metabolismo , HDL-Colesterol/sangre , LDL-Colesterol/sangre , Cromosomas Humanos Par 9/genética , Frecuencia de los Genes , Genoma Humano , Genómica , Hemoglobinas/metabolismo , Humanos , Intrones , Recuento de Leucocitos , Lipoproteína(a)/sangre , Magnesio/sangre , Péptido Natriurético Encefálico/sangre , Péptido Natriurético Encefálico/genética , Neutrófilos/citología , Fragmentos de Péptidos/sangre , Fragmentos de Péptidos/genética , Fósforo/sangre , Recuento de Plaquetas , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Troponina T/sangre , Población Blanca/genética
14.
Hum Mol Genet ; 26(17): 3442-3450, 2017 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-28854705

RESUMEN

Oligopeptides are important markers of protein metabolism, as they are cleaved from larger polypeptides and proteins. Genetic association studies may help elucidate their origin and function. In 1,552 European Americans and 1,872 African Americans of the Atherosclerosis Risk in Communities study, we performed whole-genome and whole-exome sequencing and measured serum levels of 25 peptides. Common variants (minor allele frequency > 5%) were analysed individually. We grouped low-frequency variants (minor allele frequency ≤ 5%) by a genome-wide sliding window using region-based aggregate tests. Furthermore, low-frequency regulatory variants were grouped by gene, as were functional coding variants. All analyses were performed separately in each ancestry group and then meta-analysed. We identified 22 common variant associations with peptide levels (P-value < 4.2 × 10-10), including 16 novel gene-peptide pairs. Notably, variants in kinin-kallikrein genes KNG1, F12, KLKB1, and ACE were associated with several different peptides. Variants in KLKB1 and ACE were associated with a fragment of complement component 3f. Both common variants and low-frequency coding variants in CPN1 were associated with a fibrinogen cleavage peptide. Four sliding windows were significantly associated with peptide levels (P-value < 4.2 × 10-10). Our results highlight the importance of the kinin-kallikrein system in the regulation of serum peptide levels, strengthen the evidence for a broad link between the kinin-kallikrein and complement systems, and suggest a role of CPN1 in the conversion of fibrinogen to fibrin.


Asunto(s)
Aterosclerosis/genética , Aterosclerosis/metabolismo , Negro o Afroamericano/genética , Alelos , Aterosclerosis/sangre , Exoma/genética , Femenino , Frecuencia de los Genes , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Calicreínas/sangre , Calicreínas/genética , Masculino , Persona de Mediana Edad , Péptidos/sangre , Péptidos/genética , Polimorfismo de Nucleótido Simple/genética , Proteínas/genética , Factores de Riesgo , Población Blanca/genética , Secuenciación Completa del Genoma
15.
Am J Hum Genet ; 99(2): 481-8, 2016 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-27486782

RESUMEN

Circulating blood cell counts and indices are important indicators of hematopoietic function and a number of clinical parameters, such as blood oxygen-carrying capacity, inflammation, and hemostasis. By performing whole-exome sequence association analyses of hematologic quantitative traits in 15,459 community-dwelling individuals, followed by in silico replication in up to 52,024 independent samples, we identified two previously undescribed coding variants associated with lower platelet count: a common missense variant in CPS1 (rs1047891, MAF = 0.33, discovery + replication p = 6.38 × 10(-10)) and a rare synonymous variant in GFI1B (rs150813342, MAF = 0.009, discovery + replication p = 1.79 × 10(-27)). By performing CRISPR/Cas9 genome editing in hematopoietic cell lines and follow-up targeted knockdown experiments in primary human hematopoietic stem and progenitor cells, we demonstrate an alternative splicing mechanism by which the GFI1B rs150813342 variant suppresses formation of a GFI1B isoform that preferentially promotes megakaryocyte differentiation and platelet production. These results demonstrate how unbiased studies of natural variation in blood cell traits can provide insight into the regulation of human hematopoiesis.


Asunto(s)
Empalme Alternativo/genética , Análisis Mutacional de ADN , Exoma/genética , Sitios Genéticos/genética , Hematopoyesis/genética , Proteínas Proto-Oncogénicas/genética , Proteínas Represoras/genética , Plaquetas/citología , Sistemas CRISPR-Cas , Edición Génica , Células Madre Hematopoyéticas/citología , Humanos , Megacariocitos/citología , Recuento de Plaquetas
17.
Nature ; 455(7216): 1069-75, 2008 Oct 23.
Artículo en Inglés | MEDLINE | ID: mdl-18948947

RESUMEN

Determining the genetic basis of cancer requires comprehensive analyses of large collections of histopathologically well-classified primary tumours. Here we report the results of a collaborative study to discover somatic mutations in 188 human lung adenocarcinomas. DNA sequencing of 623 genes with known or potential relationships to cancer revealed more than 1,000 somatic mutations across the samples. Our analysis identified 26 genes that are mutated at significantly high frequencies and thus are probably involved in carcinogenesis. The frequently mutated genes include tyrosine kinases, among them the EGFR homologue ERBB4; multiple ephrin receptor genes, notably EPHA3; vascular endothelial growth factor receptor KDR; and NTRK genes. These data provide evidence of somatic mutations in primary lung adenocarcinoma for several tumour suppressor genes involved in other cancers--including NF1, APC, RB1 and ATM--and for sequence changes in PTPRD as well as the frequently deleted gene LRP1B. The observed mutational profiles correlate with clinical features, smoking status and DNA repair defects. These results are reinforced by data integration including single nucleotide polymorphism array and gene expression array. Our findings shed further light on several important signalling pathways involved in lung adenocarcinoma, and suggest new molecular targets for treatment.


Asunto(s)
Adenocarcinoma Bronquioloalveolar/genética , Neoplasias Pulmonares/genética , Mutación/genética , Femenino , Dosificación de Gen , Regulación Neoplásica de la Expresión Génica , Genes Supresores de Tumor , Humanos , Masculino , Proto-Oncogenes/genética
18.
bioRxiv ; 2024 Sep 03.
Artículo en Inglés | MEDLINE | ID: mdl-39282457

RESUMEN

Every viral infection entails an evolving population of viral genomes. High-throughput sequencing technologies can be used to characterize such populations, but to date there are few published examples of such work. In addition, mixed sequencing data are sometimes used to infer properties of infecting genomes without discriminating between genome-derived reads and reads from the much more abundant, in the case of a typical active viral infection, transcripts. Here we apply capture probe-based short read high-throughput sequencing to nasal wash samples taken from a previously described group of adult hematopoietic cell transplant (HCT) recipients naturally infected with respiratory syncytial virus (RSV). We separately analyzed reads from genomes and transcripts for the levels and distribution of genetic variation by calculating per position Shannon entropies. Our analysis reveals a low level of genetic variation within the RSV infections analyzed here, but with interesting differences between genomes and transcripts in 1) average per sample Shannon entropies; 2) the genomic distribution of variation 'hotspots'; and 3) the genomic distribution of hotspots encoding alternative amino acids. In all, our results suggest the importance of separately analyzing reads from genomes and transcripts when interpreting high-throughput sequencing data for insight into intra-host viral genome replication, expression, and evolution.

19.
bioRxiv ; 2024 Sep 05.
Artículo en Inglés | MEDLINE | ID: mdl-39282326

RESUMEN

Background: Human noroviruses are a leading cause of acute and sporadic gastroenteritis worldwide. The evolution of human noroviruses in immunocompromised persons has been evaluated in many studies. Much less is known about the evolutionary dynamics of human norovirus in healthy adults. Methods: We used sequential samples collected from a controlled human infection study with GI.1/Norwalk/US/68 virus to evaluate intra- and inter-host evolution of a human norovirus in healthy adults. Up to 12 samples from day 1 to day 56 post-challenge were sequenced using a norovirus-specific capture probe method. Results: Complete genomes were assembled, even in samples that were below the limit of detection of standard RT-qPCR assays, up to 28 days post-challenge. Analysis of 123 complete genomes showed changes in the GI.1 genome in all persons, but there were no conserved changes across all persons. Single nucleotide variants resulting in non-synonymous amino acid changes were observed in all proteins, with the capsid VP1 and nonstructural protein NS3 having the largest numbers of changes. Conclusions: These data highlight the potential of a new capture-based sequencing approach to assemble human norovirus genomes with high sensitivity and demonstrate limited conserved immune pressure-driven evolution of GI.1 virus in healthy adults.

20.
Virus Evol ; 10(1): vead086, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38361816

RESUMEN

Respiratory syncytial virus (RSV) infection in immunocompromised individuals often leads to prolonged illness, progression to severe lower respiratory tract infection, and even death. How the host immune environment of the hematopoietic stem cell transplant (HCT) adults can affect viral genetic variation during an acute infection is not understood well. In the present study, we performed whole genome sequencing of RSV/A or RSV/B from samples collected longitudinally from HCT adults with normal (<14 days) and delayed (≥14 days) RSV clearance who were enrolled in a ribavirin trial. We determined the inter-host and intra-host genetic variation of RSV and the effect of mutations on putative glycosylation sites. The inter-host variation of RSV is centered in the attachment (G) and fusion (F) glycoprotein genes followed by polymerase (L) and matrix (M) genes. Interestingly, the overall genetic variation was constant between normal and delayed clearance groups for both RSV/A and RSV/B. Intra-host variation primarily occurred in the G gene followed by non-structural protein (NS1) and L genes; however, gain or loss of stop codons and frameshift mutations appeared only in the G gene and only in the delayed viral clearance group. Potential gain or loss of O-linked glycosylation sites in the G gene occurred both in RSV/A and RSV/B isolates. For RSV F gene, loss of N-linked glycosylation site occurred in three RSV/B isolates within an antigenic epitope. Both oral and aerosolized ribavirin did not cause any mutations in the L gene. In summary, prolonged viral shedding and immune deficiency resulted in RSV variation, especially in structural mutations in the G gene, possibly associated with immune evasion. Therefore, sequencing and monitoring of RSV isolates from immunocompromised patients are crucial as they can create escape mutants that can impact the effectiveness of upcoming vaccines and treatments.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA