RESUMO
Fully understanding autism spectrum disorder (ASD) genetics requires whole-genome sequencing (WGS). We present the latest release of the Autism Speaks MSSNG resource, which includes WGS data from 5,100 individuals with ASD and 6,212 non-ASD parents and siblings (total n = 11,312). Examining a wide variety of genetic variants in MSSNG and the Simons Simplex Collection (SSC; n = 9,205), we identified ASD-associated rare variants in 718/5,100 individuals with ASD from MSSNG (14.1%) and 350/2,419 from SSC (14.5%). Considering genomic architecture, 52% were nuclear sequence-level variants, 46% were nuclear structural variants (including copy-number variants, inversions, large insertions, uniparental isodisomies, and tandem repeat expansions), and 2% were mitochondrial variants. Our study provides a guidebook for exploring genotype-phenotype correlations in families who carry ASD-associated rare variants and serves as an entry point to the expanded studies required to dissect the etiology in the â¼85% of the ASD population that remain idiopathic.
Assuntos
Transtorno do Espectro Autista , Transtorno Autístico , Humanos , Transtorno do Espectro Autista/genética , Predisposição Genética para Doença , Variações do Número de Cópias de DNA/genética , GenômicaRESUMO
A remaining hurdle to whole-genome sequencing (WGS) becoming a first-tier genetic test has been accurate detection of copy-number variations (CNVs). Here, we used several datasets to empirically develop a detailed workflow for identifying germline CNVs >1 kb from short-read WGS data using read depth-based algorithms. Our workflow is comprehensive in that it addresses all stages of the CNV-detection process, including DNA library preparation, sequencing, quality control, reference mapping, and computational CNV identification. We used our workflow to detect rare, genic CNVs in individuals with autism spectrum disorder (ASD), and 120/120 such CNVs tested using orthogonal methods were successfully confirmed. We also identified 71 putative genic de novo CNVs in this cohort, which had a confirmation rate of 70%; the remainder were incorrectly identified as de novo due to false positives in the proband (7%) or parental false negatives (23%). In individuals with an ASD diagnosis in which both microarray and WGS experiments were performed, our workflow detected all clinically relevant CNVs identified by microarrays, as well as additional potentially pathogenic CNVs < 20 kb. Thus, CNVs of clinical relevance can be discovered from WGS with a detection rate exceeding microarrays, positioning WGS as a single assay for genetic variation detection.
Assuntos
Variações do Número de Cópias de DNA/genética , Sequenciamento Completo do Genoma , Fluxo de Trabalho , Algoritmos , Criança , Feminino , Haplótipos/genética , Humanos , Masculino , Reprodutibilidade dos Testes , Análise de Sequência de DNARESUMO
The advent of long-read sequencing offers a new assessment method of detecting genomic structural variation (SV) in numerous rare genetic diseases. For autism spectrum disorders (ASD) cases where pathogenic variants fail to be found in the protein-coding genic regions along chromosomes, we proposed a scalable workflow to characterize the risk factor of SVs impacting non-coding elements of the genome. We applied whole-genome sequencing on an Emirati family having three children with ASD using long and short-read sequencing technology. A series of analytical pipelines were established to identify a set of SVs with high sensitivity and specificity. At 15-fold coverage, we observed that long-read sequencing technology (987 variants) detected a significantly higher number of SVs when compared to variants detected using short-read technology (509 variants) (p-value < 1.1020 × 10-57). Further comparison showed 97.9% of long-read sequencing variants were spanning within the 1-100 kb size range (p-value < 9.080 × 10-67) and impacting over 5000 genes. Moreover, long-read variants detected 604 non-coding RNAs (p-value < 9.02 × 10-9), comprising 58% microRNA, 31.9% lncRNA, and 9.1% snoRNA. Even at low coverage, long-read sequencing has shown to be a reliable technology in detecting SVs impacting complex elements of the genome.
Assuntos
DNA Intergênico/genética , Genoma Humano , Variação Estrutural do Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Feminino , Humanos , Masculino , Sequenciamento por Nanoporos , Linhagem , Gêmeos Monozigóticos/genéticaRESUMO
BACKGROUND: The severity and frequency of drought has increased around the globe, creating challenges in ensuring food security for a growing world population. As a consequence, improving water use efficiency by crops has become an important objective for crop improvement. Some wild crop relatives have adapted to extreme osmotic stresses and can provide valuable insights into traits and genetic signatures that can guide efforts to improve crop tolerance to water deficits. Eutrema salsugineum, a close relative of many cruciferous crops, is a halophytic plant and extremophyte model for abiotic stress research. RESULTS: Using comparative transcriptomics, we show that two E. salsugineum ecotypes display significantly different transcriptional responses towards a two-stage drought treatment. Even before visibly wilting, water deficit led to the differential expression of almost 1,100 genes for an ecotype from the semi-arid, sub-arctic Yukon, Canada, but only 63 genes for an ecotype from the semi-tropical, monsoonal, Shandong, China. After recovery and a second drought treatment, about 5,000 differentially expressed genes were detected in Shandong plants versus 1,900 genes in Yukon plants. Only 13 genes displayed similar drought-responsive patterns for both ecotypes. We detected 1,007 long non-protein coding RNAs (lncRNAs), 8% were only expressed in stress-treated plants, a surprising outcome given the documented association between lncRNA expression and stress. Co-expression network analysis of the transcriptomes identified eight gene clusters where at least half of the genes in each cluster were differentially expressed. While many gene clusters were correlated to drought treatments, only a single cluster significantly correlated to drought exposure in both ecotypes. CONCLUSION: Extensive, ecotype-specific transcriptional reprogramming with drought was unexpected given that both ecotypes are adapted to saline habitats providing persistent exposure to osmotic stress. This ecotype-specific response would have escaped notice had we used a single exposure to water deficit. Finally, the apparent capacity to improve tolerance and growth after a drought episode represents an important adaptive trait for a plant that thrives under semi-arid Yukon conditions, and may be similarly advantageous for crop species experiencing stresses attributed to climate change.
Assuntos
Brassicaceae/crescimento & desenvolvimento , Perfilação da Expressão Gênica/métodos , RNA Longo não Codificante/genética , RNA Mensageiro/genética , Brassicaceae/genética , Canadá , Desidratação , Ecótipo , Regulação da Expressão Gênica de Plantas , Redes Reguladoras de Genes , Folhas de Planta/genética , Folhas de Planta/crescimento & desenvolvimento , RNA de Plantas/genética , Plantas Tolerantes a Sal/genética , Plantas Tolerantes a Sal/crescimento & desenvolvimento , Análise de Sequência de RNA , Estresse FisiológicoRESUMO
BACKGROUND: Whole blood is currently the most common DNA source for whole-genome sequencing (WGS), but for studies requiring non-invasive collection, self-collection, greater sample stability or additional tissue references, saliva or buccal samples may be preferred. However, the relative quality of sequencing data and accuracy of genetic variant detection from blood-derived, saliva-derived and buccal-derived DNA need to be thoroughly investigated. METHODS: Matched blood, saliva and buccal samples from four unrelated individuals were used to compare sequencing metrics and variant-detection accuracy among these DNA sources. RESULTS: We observed significant differences among DNA sources for sequencing quality metrics such as percentage of reads aligned and mean read depth (p<0.05). Differences were negligible in the accuracy of detecting short insertions and deletions; however, the false positive rate for single nucleotide variation detection was slightly higher in some saliva and buccal samples. The sensitivity of copy number variant (CNV) detection was up to 25% higher in blood samples, depending on CNV size and type, and appeared to be worse in saliva and buccal samples with high bacterial concentration. We also show that methylation-based enrichment for eukaryotic DNA in saliva and buccal samples increased alignment rates but also reduced read-depth uniformity, hampering CNV detection. CONCLUSION: For WGS, we recommend using DNA extracted from blood rather than saliva or buccal swabs; if saliva or buccal samples are used, we recommend against using methylation-based eukaryotic DNA enrichment. All data used in this study are available for further open-science investigation.
Assuntos
Variações do Número de Cópias de DNA/genética , DNA/genética , Sequenciamento Completo do Genoma/normas , Adulto , DNA/sangue , DNA/química , DNA/normas , Metilação de DNA/genética , Feminino , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Mucosa Bucal/química , Polimorfismo de Nucleotídeo Único/genética , Saliva/química , Análise de Sequência de DNA/normasRESUMO
PurposeGenetic testing is an integral diagnostic component of pediatric medicine. Standard of care is often a time-consuming stepwise approach involving chromosomal microarray analysis and targeted gene sequencing panels, which can be costly and inconclusive. Whole-genome sequencing (WGS) provides a comprehensive testing platform that has the potential to streamline genetic assessments, but there are limited comparative data to guide its clinical use.MethodsWe prospectively recruited 103 patients from pediatric non-genetic subspecialty clinics, each with a clinical phenotype suggestive of an underlying genetic disorder, and compared the diagnostic yield and coverage of WGS with those of conventional genetic testing.ResultsWGS identified diagnostic variants in 41% of individuals, representing a significant increase over conventional testing results (24%; P = 0.01). Genes clinically sequenced in the cohort (n = 1,226) were well covered by WGS, with a median exonic coverage of 40 × ±8 × (mean ±SD). All the molecular diagnoses made by conventional methods were captured by WGS. The 18 new diagnoses made with WGS included structural and non-exonic sequence variants not detectable with whole-exome sequencing, and confirmed recent disease associations with the genes PIGG, RNU4ATAC, TRIO, and UNC13A.ConclusionWGS as a primary clinical test provided a higher diagnostic yield than conventional genetic testing in a clinically heterogeneous cohort.
Assuntos
Estudos de Associação Genética , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/genética , Predisposição Genética para Doença , Testes Genéticos , Análise de Sequência de DNA , Sequenciamento Completo do Genoma , Biologia Computacional/métodos , Variações do Número de Cópias de DNA , Exoma , Feminino , Estudos de Associação Genética/métodos , Estudos de Associação Genética/normas , Testes Genéticos/métodos , Testes Genéticos/normas , Variação Genética , Humanos , Masculino , Anotação de Sequência Molecular , Fenótipo , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas , Sequenciamento do Exoma/métodos , Sequenciamento do Exoma/normas , Sequenciamento Completo do Genoma/métodos , Sequenciamento Completo do Genoma/normasRESUMO
BACKGROUND: The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. METHODS: Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. RESULTS: Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set (n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants - associated with cancer, cardiac or neurodegenerative phenotypes - remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. INTERPRETATION: Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care.
Assuntos
Variação Genética/genética , Genoma Humano/genética , Análise de Sequência de DNA/métodos , Sequenciamento Completo do Genoma/métodos , Canadá , Feminino , Genes Recessivos/genética , Predisposição Genética para Doença/genética , Humanos , MasculinoRESUMO
PurposeWhole-exome (WES) and whole-genome sequencing (WGS) increase the diagnostic yield in autism spectrum disorder (ASD) compared to chromosomal microarray (CMA), but there have been no comprehensive cost analyses. The objective was to perform such an assessment of CMA, WES, and WGS and compare the incremental cost per additional positive finding in hypothetical testing scenarios.MethodsFive-year patient and program costs were estimated from an institutional perspective. WES and WGS estimates were based on HiSeq 2500 with an additional WGS estimate for HiSeq X platforms. Parameter uncertainty was assessed with probabilistic and deterministic sensitivity analysis.ResultsThe cost per ASD sample was CAD$1,655 (95% CI: 1,611; 1,699) for WES, CAD$2,851 (95% CI: 2,750; 2,956) for WGS on HiSeq X, and CAD$5,519 (95% CI: 5,244; 5,785) on HiSeq 2500, compared to CAD$744 (95% CI 714, 773) for CMA. The incremental cost was over CAD$25,000 per additional positive finding if CMA was replaced by newer technology.ConclusionWhile costs for WES and WGS remain high, future reductions in material and equipment costs, and increased understanding of newly discovered variants and variants of unknown significance will lead to improved value.
Assuntos
Transtorno do Espectro Autista/genética , Sequenciamento do Exoma , Análise em Microsséries/economia , Sequenciamento Completo do Genoma/economia , Transtorno do Espectro Autista/diagnóstico , Transtorno do Espectro Autista/economia , Cromossomos Humanos , Custos e Análise de Custo , Genoma Humano , HumanosRESUMO
The discovery of antibiotics more than 70 years ago initiated a period of drug innovation and implementation in human and animal health and agriculture. These discoveries were tempered in all cases by the emergence of resistant microbes. This history has been interpreted to mean that antibiotic resistance in pathogenic bacteria is a modern phenomenon; this view is reinforced by the fact that collections of microbes that predate the antibiotic era are highly susceptible to antibiotics. Here we report targeted metagenomic analyses of rigorously authenticated ancient DNA from 30,000-year-old Beringian permafrost sediments and the identification of a highly diverse collection of genes encoding resistance to ß-lactam, tetracycline and glycopeptide antibiotics. Structure and function studies on the complete vancomycin resistance element VanA confirmed its similarity to modern variants. These results show conclusively that antibiotic resistance is a natural phenomenon that predates the modern selective pressure of clinical antibiotic use.
Assuntos
Genes Bacterianos/genética , Metagenômica , Resistência a Vancomicina/genética , Animais , Antibacterianos/farmacologia , Bactérias/classificação , Bactérias/enzimologia , Bactérias/genética , Teorema de Bayes , Cristalografia por Raios X , DNA de Cloroplastos/genética , Congelamento , Genes Mitocondriais/genética , Genes de Plantas/genética , Sedimentos Geológicos/microbiologia , História Antiga , Ligação de Hidrogênio , Modelos Moleculares , Dados de Sequência Molecular , Filogenia , Conformação Proteica , RNA Ribossômico/genética , RNA Ribossômico 16S/genética , Sibéria , Resistência a Vancomicina/efeitos dos fármacos , Vertebrados/genética , beta-Lactamases/genéticaRESUMO
Eutrema salsugineum, a halophytic relative of Arabidopsis thaliana, was subjected to varying phosphate (Pi) treatments. Arabidopsis seedlings grown on 0.05 mm Pi displayed shortened primary roots, higher lateral root density and reduced shoot biomass allocation relative to those on 0.5 mm Pi, whereas Eutrema seedlings showed no difference in lateral root density and shoot biomass allocation. While a low Fe concentration mitigated the Pi deficiency response for Arabidopsis, Eutrema root architecture was unaltered, but adding NaCl increased Eutrema lateral root density almost 2-fold. Eutrema and Arabidopsis plants grown on soil without added Pi for 4 weeks had low shoot and root Pi content. Pi-deprived, soil-grown Arabidopsis plants were stunted with senescing older leaves, whereas Eutrema plants were visually indistinguishable from 2.5 mm Pi-supplemented plants. Genes associated with Pi starvation were analysed by RT-qPCR. EsIPS2, EsPHT1;4 and EsPAP17 showed up-regulated expression in Pi-deprived Eutrema plants, while EsPHR1, EsWRKY75 and EsRNS1 showed no induction. Absolute quantification of transcripts indicated that PHR1, WRKY75 and RNS1 were expressed at higher levels in Eutrema plants relative to those in Arabidopsis regardless of external Pi. The low phenotypic plasticity Eutrema displays to Pi supply is consistent with adaptation to chronic Pi deprivation in its extreme natural habitat.
Assuntos
Aclimatação , Brassicaceae/metabolismo , Extremófilos/metabolismo , Regulação da Expressão Gênica de Plantas , Fosfatos/deficiência , Brassicaceae/genética , Brassicaceae/crescimento & desenvolvimento , Genes de Plantas , Ferro/metabolismo , Fenótipo , Salinidade , Plântula/crescimento & desenvolvimentoRESUMO
Eutrema salsugineum is an extremophile related to Arabidopsis. Accessions from Yukon, Canada and Shandong, China, were evaluated for their tolerance to water deficits. Plants were exposed to two periods of water deficit separated by an interval of re-watering and recovery. All plants took the same time to wilt during the first drought exposure but Yukon plants took 1 day longer than Shandong plants following the second drought treatment. Following re-watering and turgor recovery, solute potentials of Shandong leaves returned to predrought values while those of Yukon leaves were lower than predrought levels consistent with having undergone osmotic adjustment. Polar metabolites profiled in re-watered plants showed that different metabolites are accumulated by Yukon and Shandong plants recovering from a water deficit with glucose more abundant in Yukon and fructose in Shandong leaves. The drought-responsive expression of dehydrin genes RAB18, ERD1, RD29A and RD22 showed greater changes in transcript abundance in Yukon relative to Shandong leaves during both water deficits and recovery with the greatest difference in expression appearing during the second drought. We propose that the initial exposure of Yukon plants to drought renders them more resilient to water loss during a subsequent water deficit leading to delayed wilting. Yukon plants also established a high leaf water content and increased specific leaf area during the second deficit. Shandong plants undergoing the same treatment regime do not show the same beneficial drought tolerance responses and likely use drought avoidance to cope with water deficits.
Assuntos
Brassicaceae/fisiologia , Secas , Adaptação Fisiológica , Brassicaceae/metabolismo , China , Regulação da Expressão Gênica de Plantas , Proteínas de Plantas/genética , Estresse Fisiológico , Água , YukonRESUMO
IMPORTANCE: The use of genome-wide tests to provide molecular diagnosis for individuals with autism spectrum disorder (ASD) requires more study. OBJECTIVE: To perform chromosomal microarray analysis (CMA) and whole-exome sequencing (WES) in a heterogeneous group of children with ASD to determine the molecular diagnostic yield of these tests in a sample typical of a developmental pediatric clinic. DESIGN, SETTING, AND PARTICIPANTS: The sample consisted of 258 consecutively ascertained unrelated children with ASD who underwent detailed assessments to define morphology scores based on the presence of major congenital abnormalities and minor physical anomalies. The children were recruited between 2008 and 2013 in Newfoundland and Labrador, Canada. The probands were stratified into 3 groups of increasing morphological severity: essential, equivocal, and complex (scores of 0-3, 4-5, and ≥6). EXPOSURES: All probands underwent CMA, with WES performed for 95 proband-parent trios. MAIN OUTCOMES AND MEASURES: The overall molecular diagnostic yield for CMA and WES in a population-based ASD sample stratified in 3 phenotypic groups. RESULTS: Of 258 probands, 24 (9.3%, 95%CI, 6.1%-13.5%) received a molecular diagnosis from CMA and 8 of 95 (8.4%, 95%CI, 3.7%-15.9%) from WES. The yields were statistically different between the morphological groups. Among the children who underwent both CMA and WES testing, the estimated proportion with an identifiable genetic etiology was 15.8% (95%CI, 9.1%-24.7%; 15/95 children). This included 2 children who received molecular diagnoses from both tests. The combined yield was significantly higher in the complex group when compared with the essential group (pairwise comparison, P = .002). [table: see text]. CONCLUSIONS AND RELEVANCE: Among a heterogeneous sample of children with ASD, the molecular diagnostic yields of CMA and WES were comparable, and the combined molecular diagnostic yield was higher in children with more complex morphological phenotypes in comparison with the children in the essential category. If replicated in additional populations, these findings may inform appropriate selection of molecular diagnostic testing for children affected by ASD.
Assuntos
Transtornos Globais do Desenvolvimento Infantil/genética , Exoma , Análise em Microsséries/métodos , Técnicas de Diagnóstico Molecular/métodos , Síndrome de Asperger/diagnóstico , Síndrome de Asperger/genética , Transtorno Autístico/diagnóstico , Transtorno Autístico/genética , Criança , Transtornos Globais do Desenvolvimento Infantil/diagnóstico , Transtornos Globais do Desenvolvimento Infantil/patologia , Pré-Escolar , Feminino , Humanos , Masculino , Análise em Microsséries/estatística & dados numéricos , Técnicas de Diagnóstico Molecular/estatística & dados numéricos , Mutação , Análise de Sequência com Séries de Oligonucleotídeos/estatística & dados numéricos , Fenótipo , Análise de Sequência de DNA/métodos , Análise de Sequência de Proteína/métodosRESUMO
We performed whole-genome sequencing (WGS) in 327 children with cerebral palsy (CP) and their biological parents. We classified 37 of 327 (11.3%) children as having pathogenic/likely pathogenic (P/LP) variants and 58 of 327 (17.7%) as having variants of uncertain significance. Multiple classes of P/LP variants included single-nucleotide variants (SNVs)/indels (6.7%), copy number variations (3.4%) and mitochondrial mutations (1.5%). The COL4A1 gene had the most P/LP SNVs. We also analyzed two pediatric control cohorts (n = 203 trios and n = 89 sib-pair families) to provide a baseline for de novo mutation rates and genetic burden analyses, the latter of which demonstrated associations between de novo deleterious variants and genes related to the nervous system. An enrichment analysis revealed previously undescribed plausible candidate CP genes (SMOC1, KDM5B, BCL11A and CYP51A1). A multifactorial CP risk profile and substantial presence of P/LP variants combine to support WGS in the diagnostic work-up across all CP and related phenotypes.
Assuntos
Paralisia Cerebral , Variações do Número de Cópias de DNA , Humanos , Criança , Variações do Número de Cópias de DNA/genética , Paralisia Cerebral/genética , Mutação , Sequenciamento Completo do Genoma , GenômicaRESUMO
Importance: Nirmatrelvir-ritonavir is an oral antiviral medication that improves outcomes in SARS-CoV-2 infections. However, there is concern that antiviral resistance will develop and that these viruses could be selected for after treatment. Objective: To determine the prevalence of low-frequency SARS-CoV-2 variants in patient samples that could be selected for by nirmatrelvir-ritonavir. Design, Setting, and Participants: This retrospective cohort study was conducted at 4 laboratories that serve community hospitals, academic tertiary care centers, and COVID-19 assessment centers in Ontario, Canada. Participants included symptomatic or asymptomatic patients who tested positive for SARS-CoV-2 virus and submitted virus samples for diagnostic testing between March 2020 and January 2023. Exposure: SARS-CoV-2 infection. Main Outcomes and Measures: Samples with sufficient viral load underwent next-generation genome sequencing to identify low-frequency antiviral resistance variants that could not be identified through conventional sequencing. Results: This study included 78â¯866 clinical samples with next-generation whole-genome sequencing data for SARS-CoV-2. Low-frequency variants in the viral nsp5 gene were identified in 128 isolates (0.16%), and no single variant associated with antiviral resistance was predominate. Conclusions and Relevance: This cohort study of low-frequency variants resistant to nirmatrelvir-ritonavir found that these variants were very rare in samples from patients with SARS-CoV-2, suggesting that selection of these variants by nirmatrelvir-ritonavir following the initiation of treatment may also be rare. Surveillance efforts that involve sequencing of viral isolates should continue to monitor for novel resistance variants as nirmatrelvir-ritonavir is used more broadly.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Ontário/epidemiologia , SARS-CoV-2/genética , Ritonavir/uso terapêutico , Prevalência , Estudos de Coortes , Estudos Retrospectivos , COVID-19/epidemiologia , Antivirais/farmacologia , Antivirais/uso terapêutico , Tratamento Farmacológico da COVID-19RESUMO
Phasing of heterozygous alleles is critical for interpretation of cis-effects of disease-relevant variation. We sequenced 477 individuals with cystic fibrosis (CF) using linked-read sequencing, which display an average phase block N50 of 4.39 Mb. We use these samples to construct a graph representation of CFTR haplotypes, demonstrating its utility for understanding complex CF alleles. These are visualized in a Web app, CFTbaRcodes, that enables interactive exploration of CFTR haplotypes present in this cohort. We perform fine-mapping and phasing of the chr7q35 trypsinogen locus associated with CF meconium ileus, an intestinal obstruction at birth associated with more severe CF outcomes and pancreatic disease. A 20-kb deletion polymorphism and a PRSS2 missense variant p.Thr8Ile (rs62473563) are shown to independently contribute to meconium ileus risk (p = 0.0028, p = 0.011, respectively) and are PRSS2 pancreas eQTLs (p = 9.5 × 10-7 and p = 1.4 × 10-4, respectively), suggesting the mechanism by which these polymorphisms contribute to CF. The phase information from linked reads provides a putative causal explanation for variation at a CF-relevant locus, which also has implications for the genetic basis of non-CF pancreatitis, to which this locus has been reported to contribute.
Assuntos
Fibrose Cística , Obstrução Intestinal , Íleo Meconial , Recém-Nascido , Humanos , Fibrose Cística/genética , Regulador de Condutância Transmembrana em Fibrose Cística/genética , Íleo Meconial/complicações , Mecônio , Obstrução Intestinal/complicações , Tripsina , Tripsinogênio/genéticaRESUMO
Defining different genetic subtypes of autism spectrum disorder (ASD) can enable the prediction of developmental outcomes. Based on minor physical and major congenital anomalies, we categorize 325 Canadian children with ASD into dysmorphic and nondysmorphic subgroups. We develop a method for calculating a patient-level, genome-wide rare variant score (GRVS) from whole-genome sequencing (WGS) data. GRVS is a sum of the number of variants in morphology-associated coding and non-coding regions, weighted by their effect sizes. Probands with dysmorphic ASD have a significantly higher GRVS compared to those with nondysmorphic ASD (P = 0.03). Using the polygenic transmission disequilibrium test, we observe an over-transmission of ASD-associated common variants in nondysmorphic ASD probands (P = 2.9 × 10-3). These findings replicate using WGS data from 442 ASD probands with accompanying morphology data from the Simons Simplex Collection. Our results provide support for an alternative genomic classification of ASD subgroups using morphology data, which may inform intervention protocols.
Assuntos
Transtorno do Espectro Autista , Criança , Humanos , Transtorno do Espectro Autista/genética , Canadá/epidemiologia , Genoma , Herança Multifatorial/genética , Sequenciamento Completo do Genoma , Predisposição Genética para DoençaRESUMO
Cardiomyopathy (CMP) is a heritable disorder. Over 50% of cases are gene-elusive on clinical gene panel testing. The contribution of variants in non-coding DNA elements that result in cryptic splicing and regulate gene expression has not been explored. We analyzed whole-genome sequencing (WGS) data in a discovery cohort of 209 pediatric CMP patients and 1953 independent replication genomes and exomes. We searched for protein-coding variants, and non-coding variants predicted to affect the function or expression of genes. Thirty-nine percent of cases harbored pathogenic coding variants in known CMP genes, and 5% harbored high-risk loss-of-function (LoF) variants in additional candidate CMP genes. Fifteen percent harbored high-risk regulatory variants in promoters and enhancers of CMP genes (odds ratio 2.25, p = 6.70 × 10-7 versus controls). Genes involved in α-dystroglycan glycosylation (FKTN, DTNA) and desmosomal signaling (DSC2, DSG2) were most highly enriched for regulatory variants (odds ratio 6.7-58.1). Functional effects were confirmed in patient myocardium and reporter assays in human cardiomyocytes, and in zebrafish CRISPR knockouts. We provide strong evidence for the genomic contribution of functionally active variants in new genes and in regulatory elements of known CMP genes to early onset CMP.
RESUMO
BACKGROUND: Tetralogy of Fallot (TOF)-the most common cyanotic heart defect in newborns-has evidence of multiple genetic contributing factors. Identifying variants that are clinically relevant is essential to understand patient-specific disease susceptibility and outcomes and could contribute to delineating pathomechanisms. METHODS: Using a clinically driven strategy, we reanalyzed exome sequencing data from 811 probands with TOF, to identify rare loss-of-function and other likely pathogenic variants in genes associated with congenital heart disease. RESULTS: We confirmed a major contribution of likely pathogenic variants in FLT4 (VEGFR3 [vascular endothelial growth factor receptor 3]; n=14) and NOTCH1 (n=10) and identified 1 to 3 variants in each of 21 other genes, including ATRX, DLL4, EP300, GATA6, JAG1, NF1, PIK3CA, RAF1, RASA1, SMAD2, and TBX1. In addition, multiple loss-of-function variants provided support for 3 emerging congenital heart disease/TOF candidate genes: KDR (n=4), IQGAP1 (n=3), and GDF1 (n=8). In total, these variants were identified in 63 probands (7.8%). Using the 26 composite genes in a STRING protein interaction enrichment analysis revealed a biologically relevant network (P=3.3×10-16), with VEGFR2 (vascular endothelial growth factor receptor 2; KDR) and NOTCH1 (neurogenic locus notch homolog protein 1) representing central nodes. Variants associated with arrhythmias/sudden death and heart failure indicated factors that could influence long-term outcomes. CONCLUSIONS: The results are relevant to precision medicine for TOF. They suggest considerable clinical yield from genome-wide sequencing, with further evidence for KDR (VEGFR2) as a congenital heart disease/TOF gene and for VEGF (vascular endothelial growth factor) and Notch signaling as mechanisms in human disease. Harnessing the genetic heterogeneity of single gene defects could inform etiopathogenesis and help prioritize novel candidate genes for TOF.
Assuntos
Predisposição Genética para Doença , Mapas de Interação de Proteínas , Tetralogia de Fallot/genética , Feminino , Estudo de Associação Genômica Ampla , Humanos , Recém-Nascido , Masculino , Sequenciamento do ExomaRESUMO
Importance: Children with medical complexity (CMC) represent a growing population in the pediatric health care system, with high resource use and associated health care costs. A genetic diagnosis can inform prognosis, anticipatory care, management, and reproductive planning. Conventional genetic testing strategies for CMC are often costly, time consuming, and ultimately unsuccessful. Objective: To evaluate the analytical and clinical validity of genome sequencing as a comprehensive diagnostic genetic test for CMC. Design, Setting, and Participants: In this cohort study of the prospective use of genome sequencing and comparison with standard-of-care genetic testing, CMC were recruited from May 1, 2017, to November 30, 2018, from a structured complex care program based at a tertiary care pediatric hospital in Toronto, Canada. Recruited CMC had at least 1 chronic condition, technology dependence (child is dependent at least part of each day on mechanical ventilators, and/or child requires prolonged intravenous administration of nutritional substances or drugs, and/or child is expected to have prolonged dependence on other device-based support), multiple subspecialist involvement, and substantial health care use. Review of the care plans for 545 CMC identified 143 suspected of having an undiagnosed genetic condition. Fifty-four families met inclusion criteria and were interested in participating, and 49 completed the study. Probands, similarly affected siblings, and biological parents were eligible for genome sequencing. Exposures: Genome sequencing was performed using blood-derived DNA from probands and family members using established methods and a bioinformatics pipeline for clinical genome annotation. Main Outcomes and Measures: The primary study outcome was the diagnostic yield of genome sequencing (proportion of CMC for whom the test result yielded a new diagnosis). Results: Genome sequencing was performed for 138 individuals from 49 families of CMC (29 male and 20 female probands; mean [SD] age, 7.0 [4.5] years). Genome sequencing detected all genomic variation previously identified by conventional genetic testing. A total of 15 probands (30.6%; 95% CI 19.5%-44.6%) received a new primary molecular genetic diagnosis after genome sequencing. Three individuals had novel diseases and an additional 9 had either ultrarare genetic conditions or rare genetic conditions with atypical features. At least 11 families received diagnostic information that had clinical management implications beyond genetic and reproductive counseling. Conclusions and Relevance: This study suggests that genome sequencing has high analytical and clinical validity and can result in new diagnoses in CMC even in the setting of extensive prior investigations. This clinical population may be enriched for ultrarare and novel genetic disorders. Genome sequencing is a potentially first-tier genetic test for CMC.
Assuntos
Testes Genéticos/estatística & dados numéricos , Transtornos Somatoformes/diagnóstico , Sequenciamento Completo do Genoma/estatística & dados numéricos , Canadá , Criança , Pré-Escolar , Feminino , Humanos , Masculino , Valor Preditivo dos Testes , Estudos Prospectivos , Reprodutibilidade dos TestesRESUMO
BACKGROUND: Plant growth-promoting bacteria can alleviate the inhibitory effects of various heavy metals on plant growth, via decreasing levels of stress-induced ethylene. However, little has been done to detect any mechanisms specific for heavy metal resistance of this kind of bacteria. Here, we investigate the response of the wild-type plant growth-promoting bacterium Pseudomonas putida UW4 to nickel stress using proteomic approaches. The mutant strain P. putida UW4/AcdS-, lacking a functional 1-aminocyclopropane-1-carboxylic acid deaminase gene, was also assessed for its response to nickel stress. RESULTS: Two dimensional difference in-gel electrophoresis (DIGE) was used to detect significantly up- or down- regulated proteins (p < 0.05, | ratio | > 1.5) in P. putida in response to the presence of 2 mM Ni. Out of a total number of 1,702 proteins detected on the analytical gels for P. putida UW4, the expression levels of 82 (4.82%) proteins increased significantly while the expression of 81 (4.76%) proteins decreased significantly. Of 1,575 proteins detected on the analytical gels for P. putida UW4/AcdS-, the expression levels of 74 (4.70%) proteins increased and 51 (3.24%) proteins decreased significantly. Thirty-five proteins whose expression was altered were successfully identified by mass spectrometry and sequence comparisons with related species. Nineteen of the identified proteins were detected as differentially expressed in both wild-type and mutant expression profiles. CONCLUSION: Functional assessment of proteins with significantly altered expression levels revealed several mechanisms thought to be involved in bacterial heavy metal detoxification, including general stress adaptation, anti-oxidative stress and heavy metal efflux proteins. This information may contribute to the development of plant growth-promoting bacteria mediated phytoremediation processes.