Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
Nat Med ; 30(5): 1384-1394, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38740997

RESUMO

How human genetic variation contributes to vaccine effectiveness in infants is unclear, and data are limited on these relationships in populations with African ancestries. We undertook genetic analyses of vaccine antibody responses in infants from Uganda (n = 1391), Burkina Faso (n = 353) and South Africa (n = 755), identifying associations between human leukocyte antigen (HLA) and antibody response for five of eight tested antigens spanning pertussis, diphtheria and hepatitis B vaccines. In addition, through HLA typing 1,702 individuals from 11 populations of African ancestry derived predominantly from the 1000 Genomes Project, we constructed an imputation resource, fine-mapping class II HLA-DR and DQ associations explaining up to 10% of antibody response variance in our infant cohorts. We observed differences in the genetic architecture of pertussis antibody response between the cohorts with African ancestries and an independent cohort with European ancestry, but found no in silico evidence of differences in HLA peptide binding affinity or breadth. Using immune cell expression quantitative trait loci datasets derived from African-ancestry samples from the 1000 Genomes Project, we found evidence of differential HLA-DRB1 expression correlating with inferred protection from pertussis following vaccination. This work suggests that HLA-DRB1 expression may play a role in vaccine response and should be considered alongside peptide selection to improve vaccine design.


Assuntos
Cadeias HLA-DRB1 , Humanos , Cadeias HLA-DRB1/genética , Cadeias HLA-DRB1/imunologia , Lactente , População Negra/genética , Vacinas contra Hepatite B/imunologia , Locos de Características Quantitativas , Masculino , Feminino , Uganda , Formação de Anticorpos/genética , Formação de Anticorpos/imunologia , Vacina contra Coqueluche/imunologia , Vacina contra Coqueluche/genética , Vacinação , Coqueluche/prevenção & controle , Coqueluche/imunologia , Coqueluche/genética
2.
Cell Syst ; 14(12): 1122-1130.e3, 2023 12 20.
Artigo em Inglês | MEDLINE | ID: mdl-38128484

RESUMO

The efficacy of epitope vaccines depends on the included epitopes as well as the probability that the selected epitopes are presented by the major histocompatibility complex (MHC) proteins of a vaccinated individual. Designing vaccines that effectively immunize a high proportion of the population is challenging because of high MHC polymorphism, diverging MHC-peptide binding affinities, and physical constraints on epitope vaccine constructs. Here, we present HOGVAX, a combinatorial optimization approach for epitope vaccine design. To optimize population coverage within the constraint of limited vaccine construct space, HOGVAX employs a hierarchical overlap graph (HOG) to identify and exploit overlaps between selected peptides and explicitly models the structure of linkage disequilibrium in the MHC. In a SARS-CoV-2 case study, we demonstrate that HOGVAX-designed vaccines contain substantially more epitopes than vaccines built from concatenated peptides and predict vaccine efficacy in over 98% of the population with high numbers of presented peptides in vaccinated individuals.


Assuntos
COVID-19 , Vacinas , Humanos , SARS-CoV-2 , COVID-19/prevenção & controle , Epitopos de Linfócito T , Peptídeos
3.
HLA ; 102(1): 28-43, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-36932816

RESUMO

Accurate and comprehensive immunogenetic reference panels are key to the successful implementation of population-scale immunogenomics. The 5Mbp Major Histocompatibility Complex (MHC) is the most polymorphic region of the human genome and associated with multiple immune-mediated diseases, transplant matching and therapy responses. Analysis of MHC genetic variation is severely complicated by complex patterns of sequence variation, linkage disequilibrium and a lack of fully resolved MHC reference haplotypes, increasing the risk of spurious findings on analyzing this medically important region. Integrating Illumina, ultra-long Nanopore, and PacBio HiFi sequencing as well as bespoke bioinformatics, we completed five of the alternative MHC reference haplotypes of the current (GRCh38/hg38) build of the human reference genome and added one other. The six assembled MHC haplotypes encompass the DR1 and DR4 haplotype structures in addition to the previously completed DR2 and DR3, as well as six distinct classes of the structurally variable C4 region. Analysis of the assembled haplotypes showed that MHC class II sequence structures, including repeat element positions, are generally conserved within the DR haplotype supergroups, and that sequence diversity peaks in three regions around HLA-A, HLA-B+C, and the HLA class II genes. Demonstrating the potential for improved short-read analysis, the number of proper read pairs recruited to the MHC was found to be increased by 0.06%-0.49% in a 1000 Genomes Project read remapping experiment with seven diverse samples. Furthermore, the assembled haplotypes can serve as references for the community and provide the basis of a structurally accurate genotyping graph of the complete MHC region.


Assuntos
Antígenos de Histocompatibilidade Classe II , Complexo Principal de Histocompatibilidade , Humanos , Haplótipos , Alelos , Antígenos de Histocompatibilidade Classe II/genética , Complexo Principal de Histocompatibilidade/genética , Antígenos HLA/genética , Antígenos HLA-C/genética
4.
Clin Infect Dis ; 76(3): 408-415, 2023 02 08.
Artigo em Inglês | MEDLINE | ID: mdl-36189631

RESUMO

BACKGROUND: Monoclonal antibodies (mAbs) that target severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are predominantly less effective against Omicron variants. Immunocompromised patients often experience prolonged viral shedding, resulting in an increased risk of viral escape. METHODS: In an observational, prospective cohort, 57 patients infected with Omicron variants who received sotrovimab alone or in combination with remdesivir were followed. The study end points were a decrease in SARS-CoV-2 RNA <106 copies/mL in nasopharyngeal swabs at day 21 and the emergence of escape mutations at days 7, 14, and 21 after sotrovimab administration. All SARS-CoV-2 samples were analyzed using whole-genome sequencing. Individual variants within the quasispecies were subsequently quantified and further characterized using a pseudovirus neutralization assay. RESULTS: The majority of patients (43 of 57, 75.4%) were immunodeficient, predominantly due to immunosuppression after organ transplantation or hematologic malignancies. Infections by Omicron/BA.1 comprised 82.5%, while 17.5% were infected by Omicron/BA.2. Twenty-one days after sotrovimab administration, 12 of 43 (27.9%) immunodeficient patients had prolonged viral shedding compared with 1 of 14 (7.1%) immunocompetent patients (P = .011). Viral spike protein mutations, some specific for Omicron (e.g., P337S and/or E340D/V), emerged in 14 of 43 (32.6%) immunodeficient patients, substantially reducing sensitivity to sotrovimab in a pseudovirus neutralization assay. Combination therapy with remdesivir significantly reduced emergence of escape variants. CONCLUSIONS: Immunocompromised patients face a considerable risk of prolonged viral shedding and emergence of escape mutations after early therapy with sotrovimab. These findings underscore the importance of careful monitoring and the need for dedicated clinical trials in this patient population.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Anticorpos Neutralizantes , Anticorpos Antivirais , Hospedeiro Imunocomprometido , Estudos Prospectivos , RNA Viral , SARS-CoV-2/genética
5.
Euro Surveill ; 27(43)2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-36305336

RESUMO

BackgroundTracking person-to-person SARS-CoV-2 transmission in the population is important to understand the epidemiology of community transmission and may contribute to the containment of SARS-CoV-2. Neither contact tracing nor genomic surveillance alone, however, are typically sufficient to achieve this objective.AimWe demonstrate the successful application of the integrated genomic surveillance (IGS) system of the German city of Düsseldorf for tracing SARS-CoV-2 transmission chains in the population as well as detecting and investigating travel-associated SARS-CoV-2 infection clusters.MethodsGenomic surveillance, phylogenetic analysis, and structured case interviews were integrated to elucidate two genetically defined clusters of SARS-CoV-2 isolates detected by IGS in Düsseldorf in July 2021.ResultsCluster 1 (n = 67 Düsseldorf cases) and Cluster 2 (n = 36) were detected in a surveillance dataset of 518 high-quality SARS-CoV-2 genomes from Düsseldorf (53% of total cases, sampled mid-June to July 2021). Cluster 1 could be traced back to a complex pattern of transmission in nightlife venues following a putative importation by a SARS-CoV-2-infected return traveller (IP) in late June; 28 SARS-CoV-2 cases could be epidemiologically directly linked to IP. Supported by viral genome data from Spain, Cluster 2 was shown to represent multiple independent introduction events of a viral strain circulating in Catalonia and other European countries, followed by diffuse community transmission in Düsseldorf.ConclusionIGS enabled high-resolution tracing of SARS-CoV-2 transmission in an internationally connected city during community transmission and provided infection chain-level evidence of the downstream propagation of travel-imported SARS-CoV-2 cases.


Assuntos
COVID-19 , Doenças Transmissíveis Importadas , Humanos , SARS-CoV-2/genética , Viagem , Doenças Transmissíveis Importadas/epidemiologia , COVID-19/epidemiologia , Filogenia , Busca de Comunicante , Alemanha/epidemiologia , Genômica
6.
Nat Genet ; 54(4): 518-525, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35410384

RESUMO

Typical genotyping workflows map reads to a reference genome before identifying genetic variants. Generating such alignments introduces reference biases and comes with substantial computational burden. Furthermore, short-read lengths limit the ability to characterize repetitive genomic regions, which are particularly challenging for fast k-mer-based genotypers. In the present study, we propose a new algorithm, PanGenie, that leverages a haplotype-resolved pangenome reference together with k-mer counts from short-read sequencing data to genotype a wide spectrum of genetic variation-a process we refer to as genome inference. Compared with mapping-based approaches, PanGenie is more than 4 times faster at 30-fold coverage and achieves better genotype concordances for almost all variant types and coverages tested. Improvements are especially pronounced for large insertions (≥50 bp) and variants in repetitive regions, enabling the inclusion of these classes of variants in genome-wide association studies. PanGenie efficiently leverages the increasing amount of haplotype-resolved assemblies to unravel the functional impact of previously inaccessible variants while being faster compared with alignment-based workflows.


Assuntos
Variação Genética , Genoma Humano , Genômica , Algoritmos , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Genômica/métodos , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de DNA
7.
Clin Infect Dis ; 74(6): 1039-1046, 2022 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-34181711

RESUMO

BACKGROUND: Tracing of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission chains is still a major challenge for public health authorities, when incidental contacts are not recalled or are not perceived as potential risk contacts. Viral sequencing can address key questions about SARS-CoV-2 evolution and may support reconstruction of viral transmission networks by integration of molecular epidemiology into classical contact tracing. METHODS: In collaboration with local public health authorities, we set up an integrated system of genomic surveillance in an urban setting, combining a) viral surveillance sequencing, b) genetically based identification of infection clusters in the population, c) integration of public health authority contact tracing data, and d) a user-friendly dashboard application as a central data analysis platform. RESULTS: Application of the integrated system from August to December 2020 enabled a characterization of viral population structure, analysis of 4 outbreaks at a maximum care hospital, and genetically based identification of 5 putative population infection clusters, all of which were confirmed by contact tracing. The system contributed to the development of improved hospital infection control and prevention measures and enabled the identification of previously unrecognized transmission chains, involving a martial arts gym and establishing a link between the hospital to the local population. CONCLUSIONS: Integrated systems of genomic surveillance could contribute to the monitoring and, potentially, improved management of SARS-CoV-2 transmission in the population.


Assuntos
COVID-19 , SARS-CoV-2 , COVID-19/epidemiologia , Busca de Comunicante , Surtos de Doenças/prevenção & controle , Genômica , Humanos , SARS-CoV-2/genética
8.
Comput Struct Biotechnol J ; 19: 3069-3076, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34136105

RESUMO

Codon degeneracy of amino acid sequences permits an additional "mRNP code" layer underlying the genetic code that is related to RNA processing. In pre-mRNA splicing, splice site usage is determined by both intrinsic strength and sequence context providing RNA binding sites for splicing regulatory proteins. In this study, we systematically examined modification of splicing regulatory properties in the neighborhood of a GT site, i.e. potential splice site, without altering the encoded amino acids. We quantified the splicing regulatory properties of the neighborhood around a potential splice site by its Splice Site HEXplorer Weight (SSHW) based on the HEXplorer score algorithm. To systematically modify GT site neighborhoods, either minimizing or maximizing their SSHW, we designed the novel stochastic optimization algorithm ModCon that applies a genetic algorithm with stochastic crossover, insertion and random mutation elements supplemented by a heuristic sliding window approach. To assess the achievable range in SSHW in human splice donors without altering the encoded amino acids, we applied ModCon to a set of 1000 randomly selected Ensembl annotated human splice donor sites, achieving substantial and accurate changes in SSHW. Using ModCon optimization, we successfully switched splice donor usage in a splice site competition reporter containing coding sequences from FANCA, FANCB or BRCA2, while retaining their amino acid coding information. The ModCon algorithm and its R package implementation can assist in reporter design by either introducing novel splice sites, silencing accidental, undesired splice sites, and by generally modifying the entire mRNP code while maintaining the genetic code.

9.
Int J Biochem Cell Biol ; 131: 105882, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33189874

RESUMO

The Major Histocompatibility Complex (MHC) on the short arm of chromosome 6 is associated with more diseases than any other region of the genome; it encodes the antigen-presenting Human Leukocyte Antigen (HLA) proteins and is one of the key immunogenetic regions of the genome. Accurate genome inference and interpretation of MHC association signals have traditionally been hampered by the region's uniquely complex features, such as high levels of polymorphism; inter-gene sequence homologies; structural variation; and long-range haplotype structures. Recent algorithmic and technological advances have, however, significantly increased the accessibility of genetic variation in the MHC; these developments include (i) accurate SNP-based HLA type imputation; (ii) genome graph approaches for variation-aware genome inference from next-generation sequencing data; (iii) long-read-based diploid de novo assembly of the MHC; (iv) cost-effective targeted MHC sequencing methods. Applied to hundreds of thousands of samples over the last years, these technologies have already enabled significant biological discoveries, for example in the field of autoimmune disease genetics. Remaining challenges concern the development of integrated methods that leverage haplotype-resolved de novo assembly of the MHC for the development of improved MHC genotyping methods for short reads and the construction of improved reference panels for SNP-based imputation. Improved genome inference in the MHC can crucially contribute to an improved genetic and functional understanding of many immune-related phenotypes and diseases.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Genoma Humano/imunologia , Antígenos HLA/genética , Teste de Histocompatibilidade/métodos , Complexo Principal de Histocompatibilidade/genética , Alelos , Sequência de Bases , Biologia Computacional/métodos , Heterogeneidade Genética , Antígenos HLA/classificação , Antígenos HLA/imunologia , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Humanos , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único
10.
Mob DNA ; 11(1): 30, 2020 Nov 07.
Artigo em Inglês | MEDLINE | ID: mdl-33292499

RESUMO

BACKGROUND: Mobile genetic elements are found in genomes throughout the microbial world, mediating genome plasticity and important prokaryotic phenotypes. Even the cell wall-less mycoplasmas, which are known to harbour a minimal set of genes, seem to accumulate mobile genetic elements. In Mycoplasma hominis, a facultative pathogen of the human urogenital tract and an inherently very heterogeneous species, four different MGE-classes had been detected until now: insertion sequence ISMhom-1, prophage MHoV-1, a tetracycline resistance mediating transposon, and ICEHo, a species-specific variant of a mycoplasma integrative and conjugative element encoding a T4SS secretion system (termed MICE). RESULTS: To characterize the prevalence of these MGEs, genomes of 23 M. hominis isolates were assembled using whole genome sequencing and bioinformatically analysed for the presence of mobile genetic elements. In addition to the previously described MGEs, a new ICEHo variant was found, which we designate ICEHo-II. Of 15 ICEHo-II genes, five are common MICE genes; eight are unique to ICEHo-II; and two represent a duplication of a gene also present in ICEHo-I. In 150 M. hominis isolates and based on a screening PCR, prevalence of ICEHo-I was 40.7%; of ICEHo-II, 28.7%; and of both elements, 15.3%. Activity of ICEHo-I and -II was demonstrated by detection of circularized extrachromosomal forms of the elements through PCR and subsequent Sanger sequencing. CONCLUSIONS: Nanopore sequencing enabled the identification of mobile genetic elements and of ICEHo-II, a novel MICE element of M. hominis, whose phenotypic impact and potential impact on pathogenicity can now be elucidated.

11.
Nat Commun ; 11(1): 4794, 2020 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-32963235

RESUMO

Most human genomes are characterized by aligning individual reads to the reference genome, but accurate long reads and linked reads now enable us to construct accurate, phased de novo assemblies. We focus on a medically important, highly variable, 5 million base-pair (bp) region where diploid assembly is particularly useful - the Major Histocompatibility Complex (MHC). Here, we develop a human genome benchmark derived from a diploid assembly for the openly-consented Genome in a Bottle sample HG002. We assemble a single contig for each haplotype, align them to the reference, call phased small and structural variants, and define a small variant benchmark for the MHC, covering 94% of the MHC and 22368 variants smaller than 50 bp, 49% more variants than a mapping-based benchmark. This benchmark reliably identifies errors in mapping-based callsets, and enables performance assessment in regions with much denser, complex variation than regions covered by previous benchmarks.


Assuntos
Diploide , Complexo Principal de Histocompatibilidade/genética , Benchmarking , Linhagem Celular , Variação Genética , Genoma Humano , Haplótipos , Humanos
12.
Nat Commun ; 11(1): 3849, 2020 07 31.
Artigo em Inglês | MEDLINE | ID: mdl-32737300

RESUMO

Kaposi's sarcoma-associated herpesvirus (KSHV) and Epstein-Barr Virus (EBV) establish life-long infections and are associated with malignancies. Striking geographic variation in incidence and the fact that virus alone is insufficient to cause disease, suggests other co-factors are involved. Here we present epidemiological analysis and genome-wide association study (GWAS) in 4365 individuals from an African population cohort, to assess the influence of host genetic and non-genetic factors on virus antibody responses. EBV/KSHV co-infection (OR = 5.71(1.58-7.12)), HIV positivity (OR = 2.22(1.32-3.73)) and living in a more rural area (OR = 1.38(1.01-1.89)) are strongly associated with immunogenicity. GWAS reveals associations with KSHV antibody response in the HLA-B/C region (p = 6.64 × 10-09). For EBV, associations are identified for VCA (rs71542439, p = 1.15 × 10-12). Human leucocyte antigen (HLA) and trans-ancestry fine-mapping substantiate that distinct variants in HLA-DQA1 (p = 5.24 × 10-44) are driving associations for EBNA-1 in Africa. This study highlights complex interactions between KSHV and EBV, in addition to distinct genetic architectures resulting in important differences in pathogenesis and transmission.


Assuntos
Anticorpos Antivirais/biossíntese , Resistência à Doença/genética , Infecções por Vírus Epstein-Barr/genética , Infecções por Henipavirus/genética , Interações Hospedeiro-Patógeno/genética , Sarcoma de Kaposi/genética , Adolescente , Adulto , Antígenos Virais/genética , Antígenos Virais/imunologia , Proteínas do Capsídeo/genética , Proteínas do Capsídeo/imunologia , Coinfecção , Infecções por Vírus Epstein-Barr/epidemiologia , Infecções por Vírus Epstein-Barr/imunologia , Infecções por Vírus Epstein-Barr/virologia , Antígenos Nucleares do Vírus Epstein-Barr/genética , Antígenos Nucleares do Vírus Epstein-Barr/imunologia , Feminino , Expressão Gênica , Estudo de Associação Genômica Ampla , HIV/genética , HIV/imunologia , HIV/patogenicidade , Cadeias alfa de HLA-DQ/genética , Cadeias alfa de HLA-DQ/imunologia , Infecções por Henipavirus/epidemiologia , Infecções por Henipavirus/imunologia , Infecções por Henipavirus/virologia , Herpesvirus Humano 4/genética , Herpesvirus Humano 4/imunologia , Herpesvirus Humano 4/patogenicidade , Herpesvirus Humano 8/genética , Herpesvirus Humano 8/imunologia , Herpesvirus Humano 8/patogenicidade , Interações Hospedeiro-Patógeno/imunologia , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , População Rural , Sarcoma de Kaposi/epidemiologia , Sarcoma de Kaposi/imunologia , Sarcoma de Kaposi/virologia , Uganda/epidemiologia , População Urbana
13.
Blood Adv ; 4(15): 3495-3506, 2020 08 11.
Artigo em Inglês | MEDLINE | ID: mdl-32750130

RESUMO

Each year, blood transfusions save millions of lives. However, under current blood-matching practices, sensitization to non-self-antigens is an unavoidable adverse side effect of transfusion. We describe a universal donor typing platform that could be adopted by blood services worldwide to facilitate a universal extended blood-matching policy and reduce sensitization rates. This DNA-based test is capable of simultaneously typing most clinically relevant red blood cell (RBC), human platelet (HPA), and human leukocyte (HLA) antigens. Validation was performed, using samples from 7927 European, 27 South Asian, 21 East Asian, and 9 African blood donors enrolled in 2 national biobanks. We illustrated the usefulness of the platform by analyzing antibody data from patients sensitized with multiple RBC alloantibodies. Genotyping results demonstrated concordance of 99.91%, 99.97%, and 99.03% with RBC, HPA, and HLA clinically validated typing results in 89 371, 3016, and 9289 comparisons, respectively. Genotyping increased the total number of antigen typing results available from 110 980 to >1 200 000. Dense donor typing allowed identification of 2 to 6 times more compatible donors to serve 3146 patients with multiple RBC alloantibodies, providing at least 1 match for 176 individuals for whom previously no blood could be found among the same donors. This genotyping technology is already being used to type thousands of donors taking part in national genotyping studies. Extraction of dense antigen-typing data from these cohorts provides blood supply organizations with the opportunity to implement a policy of genomics-based precision matching of blood.


Assuntos
Doadores de Sangue , Transfusão de Sangue , Genótipo , Humanos , Isoanticorpos , Estudos Prospectivos
14.
Euro Surveill ; 25(22)2020 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-32524946

RESUMO

We whole-genome sequenced 55 SARS-CoV-2 isolates from Germany to investigate SARS-CoV-2 outbreaks in 2020 in the Heinsberg district and Düsseldorf. While the genetic structure of the Heinsberg outbreak indicates a clonal origin, reflecting superspreading dynamics from mid-February during the carnival season, distinct viral strains were circulating in Düsseldorf in March, reflecting the city's international links. Limited detection of Heinsberg strains in the Düsseldorf area despite geographical proximity may reflect efficient containment and contact-tracing efforts.


Assuntos
Betacoronavirus/genética , Técnicas de Laboratório Clínico/métodos , Infecções por Coronavirus/diagnóstico , Genoma Viral/genética , Pandemias , Pneumonia Viral/diagnóstico , Sequenciamento Completo do Genoma/métodos , Betacoronavirus/isolamento & purificação , Betacoronavirus/patogenicidade , COVID-19 , Teste para COVID-19 , Infecções por Coronavirus/epidemiologia , Surtos de Doenças , Alemanha/epidemiologia , Humanos , Pneumonia Viral/epidemiologia , DNA Polimerase Dirigida por RNA , Reação em Cadeia da Polimerase Via Transcriptase Reversa , SARS-CoV-2
15.
Genome Biol ; 21(1): 68, 2020 03 14.
Artigo em Inglês | MEDLINE | ID: mdl-32171299

RESUMO

Hybrid genome assembly has emerged as an important technique in bacterial genomics, but cost and labor requirements limit large-scale application. We present Ultraplexing, a method to improve per-sample sequencing cost and hands-on time of Nanopore sequencing for hybrid assembly by at least 50% compared to molecular barcoding while maintaining high assembly quality. Ultraplexing requires the availability of Illumina data and uses inter-sample genetic variability to assign reads to isolates, which obviates the need for molecular barcoding. Thus, Ultraplexing can enable significant sequencing and labor cost reductions in large-scale bacterial genome projects.


Assuntos
Genoma Bacteriano , Sequenciamento por Nanoporos/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Plasmídeos/genética , Staphylococcus aureus/genética , Staphylococcus aureus/isolamento & purificação
16.
Proc Natl Acad Sci U S A ; 116(34): 16955-16960, 2019 08 20.
Artigo em Inglês | MEDLINE | ID: mdl-31375628

RESUMO

Multiple sclerosis (MS) is a chronic inflammatory, likely autoimmune disease of the central nervous system with a combination of genetic and environmental risk factors, among which Epstein-Barr virus (EBV) infection is a strong suspect. We have previously identified increased autoantibody levels toward the chloride-channel protein Anoctamin 2 (ANO2) in MS. Here, IgG antibody reactivity toward ANO2 and EBV nuclear antigen 1 (EBNA1) was measured using bead-based multiplex serology in plasma samples from 8,746 MS cases and 7,228 controls. We detected increased anti-ANO2 antibody levels in MS (P = 3.5 × 10-36) with 14.6% of cases and 7.8% of controls being ANO2 seropositive (odds ratio [OR] = 1.6; 95% confidence intervals [95%CI]: 1.5 to 1.8). The MS risk increase in ANO2-seropositive individuals was dramatic when also exposed to 3 known risk factors for MS: HLA-DRB1*15:01 carriage, absence of HLA-A*02:01, and high anti-EBNA1 antibody levels (OR = 24.9; 95%CI: 17.9 to 34.8). Reciprocal blocking experiments with ANO2 and EBNA1 peptides demonstrated antibody cross-reactivity, mapping to ANO2 [aa 140 to 149] and EBNA1 [aa 431 to 440]. HLA gene region was associated with anti-ANO2 antibody levels and HLA-DRB1*04:01 haplotype was negatively associated with ANO2 seropositivity (OR = 0.6; 95%CI: 0.5 to 0.7). Anti-ANO2 antibody levels were not increased in patients from 3 other inflammatory disease cohorts. The HLA influence and the fact that specific IgG production usually needs T cell help provides indirect evidence for a T cell ANO2 autoreactivity in MS. We propose a hypothesis where immune reactivity toward EBNA1 through molecular mimicry with ANO2 contributes to the etiopathogenesis of MS.


Assuntos
Anoctaminas , Antígenos Nucleares do Vírus Epstein-Barr , Herpesvirus Humano 4 , Modelos Imunológicos , Mimetismo Molecular , Esclerose Múltipla , Anoctaminas/genética , Anoctaminas/imunologia , Autoanticorpos/imunologia , Reações Cruzadas/genética , Antígenos Nucleares do Vírus Epstein-Barr/genética , Antígenos Nucleares do Vírus Epstein-Barr/imunologia , Feminino , Antígeno HLA-A2/imunologia , Cadeias HLA-DRB1/genética , Cadeias HLA-DRB1/imunologia , Haplótipos , Herpesvirus Humano 4/genética , Herpesvirus Humano 4/imunologia , Humanos , Imunoglobulina G/imunologia , Masculino , Esclerose Múltipla/genética , Esclerose Múltipla/imunologia , Esclerose Múltipla/patologia , Fatores de Risco
17.
Nat Commun ; 10(1): 3066, 2019 07 11.
Artigo em Inglês | MEDLINE | ID: mdl-31296857

RESUMO

Metagenomic sequence classification should be fast, accurate and information-rich. Emerging long-read sequencing technologies promise to improve the balance between these factors but most existing methods were designed for short reads. MetaMaps is a new method, specifically developed for long reads, capable of mapping a long-read metagenome to a comprehensive RefSeq database with >12,000 genomes in <16 GB or RAM on a laptop computer. Integrating approximate mapping with probabilistic scoring and EM-based estimation of sample composition, MetaMaps achieves >94% accuracy for species-level read assignment and r2 > 0.97 for the estimation of sample composition on both simulated and real data when the sample genomes or close relatives are present in the classification database. To address novel species and genera, which are comparatively harder to predict, MetaMaps outputs mapping locations and qualities for all classified reads, enabling functional studies (e.g. gene presence/absence) and detection of incongruities between sample and reference genomes.


Assuntos
Biologia Computacional/métodos , Análise de Dados , Metagenômica/métodos , Algoritmos , Conjuntos de Dados como Assunto , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Metagenoma/genética , Microbiota/genética , Análise de Sequência de DNA/métodos , Software
18.
Bioinformatics ; 35(21): 4394-4396, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-30942877

RESUMO

SUMMARY: HLA*LA implements a new graph alignment model for human leukocyte antigen (HLA) type inference, based on the projection of linear alignments onto a variation graph. It enables accurate HLA type inference from whole-genome (99% accuracy) and whole-exome (93% accuracy) Illumina data; from long-read Oxford Nanopore and Pacific Biosciences data (98% accuracy for whole-genome and targeted data) and from genome assemblies. Computational requirements for a typical sample vary between 0.7 and 14 CPU hours per sample. AVAILABILITY AND IMPLEMENTATION: HLA*LA is implemented in C++ and Perl and freely available as a bioconda package or from https://github.com/DiltheyLab/HLA-LA (GPL v3). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , Genoma , Teste de Histocompatibilidade , Humanos , Análise de Sequência de DNA
19.
Nat Biotechnol ; 2018 Oct 22.
Artigo em Inglês | MEDLINE | ID: mdl-30346939

RESUMO

Complex allelic variation hampers the assembly of haplotype-resolved sequences from diploid genomes. We developed trio binning, an approach that simplifies haplotype assembly by resolving allelic variation before assembly. In contrast with prior approaches, the effectiveness of our method improved with increasing heterozygosity. Trio binning uses short reads from two parental genomes to first partition long reads from an offspring into haplotype-specific sets. Each haplotype is then assembled independently, resulting in a complete diploid reconstruction. We used trio binning to recover both haplotypes of a diploid human genome and identified complex structural variants missed by alternative approaches. We sequenced an F1 cross between the cattle subspecies Bos taurus taurus and Bos taurus indicus and completely assembled both parental haplotypes with NG50 haplotig sizes of >20 Mb and 99.998% accuracy, surpassing the quality of current cattle reference genomes. We suggest that trio binning improves diploid genome assembly and will facilitate new studies of haplotype variation and inheritance.

20.
Hum Mol Genet ; 27(21): 3813-3824, 2018 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-30085094

RESUMO

Genetic variation within the major histocompatibility complex (MHC) contributes substantial risk for systemic lupus erythematosus, but high gene density, extreme polymorphism and extensive linkage disequilibrium (LD) have made fine mapping challenging. To address the problem, we compared two association techniques in two ancestrally diverse populations, African Americans (AAs) and Europeans (EURs). We observed a greater number of Human Leucocyte Antigen (HLA) alleles in AA consistent with the elevated level of recombination in this population. In EUR we observed 50 different A-C-B-DRB1-DQA-DQB multilocus haplotype sequences per hundred individuals; in the AA sample, these multilocus haplotypes were twice as common compared to Europeans. We also observed a strong narrow class II signal in AA as opposed to the long-range LD observed in EUR that includes class I alleles. We performed a Bayesian model choice of the classical HLA alleles and a frequentist analysis that combined both single nucleotide polymorphisms (SNPs) and classical HLA alleles. Both analyses converged on a similar subset of risk HLA alleles: in EUR HLA- B*08:01 + B*18:01 + (DRB1*15:01 frequentist only) + DQA*01:02 + DQB*02:01 + DRB3*02 and in AA HLA-C*17:01 + B*08:01 + DRB1*15:03 + (DQA*01:02 frequentist only) + DQA*02:01 + DQA*05:01+ DQA*05:05 + DQB*03:19 + DQB*02:02. We observed two additional independent SNP associations in both populations: EUR rs146903072 and rs501480; AA rs389883 and rs114118665. The DR2 serotype was best explained by DRB1*15:03 + DQA*01:02 in AA and by DRB1*15:01 + DQA*01:02 in EUR. The DR3 serotype was best explained by DQA*05:01 in AA and by DQB*02:01 in EUR. Despite some differences in underlying HLA allele risk models in EUR and AA, SNP signals across the extended MHC showed remarkable similarity and significant concordance in direction of effect for risk-associated variants.


Assuntos
Predisposição Genética para Doença , Lúpus Eritematoso Sistêmico/genética , Complexo Principal de Histocompatibilidade/genética , Polimorfismo de Nucleotídeo Único , Negro ou Afro-Americano/genética , Feminino , Estudos de Associação Genética , Haplótipos , Humanos , Masculino , Modelos Genéticos , População Branca/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA