RESUMO
MOTIVATION: We introduce a novel framework BEATRICE to identify putative causal variants from GWAS statistics. Identifying causal variants is challenging due to their sparsity and high correlation in the nearby regions. To account for these challenges, we rely on a hierarchical Bayesian model that imposes a binary concrete prior on the set of causal variants. We derive a variational algorithm for this fine-mapping problem by minimizing the KL divergence between an approximate density and the posterior probability distribution of the causal configurations. Correspondingly, we use a deep neural network as an inference machine to estimate the parameters of our proposal distribution. Our stochastic optimization procedure allows us to sample from the space of causal configurations, which we use to compute the posterior inclusion probabilities and determine credible sets for each causal variant. We conduct a detailed simulation study to quantify the performance of our framework against two state-of-the-art baseline methods across different numbers of causal variants and noise paradigms, as defined by the relative genetic contributions of causal and noncausal variants. RESULTS: We demonstrate that BEATRICE achieves uniformly better coverage with comparable power and set sizes, and that the performance gain increases with the number of causal variants. We also show the efficacy BEATRICE in finding causal variants from the GWAS study of Alzheimer's disease. In comparison to the baselines, only BEATRICE can successfully find the APOE ϵ2 allele, a commonly associated variant of Alzheimer's. AVAILABILITY AND IMPLEMENTATION: BEATRICE is available for download at https://github.com/sayangsep/Beatrice-Finemapping.
Assuntos
Algoritmos , Teorema de Bayes , Estudo de Associação Genômica Ampla , Estudo de Associação Genômica Ampla/métodos , Humanos , Doença de Alzheimer/genéticaRESUMO
Fewer than half of individuals with a suspected Mendelian or monogenic condition receive a precise molecular diagnosis after comprehensive clinical genetic testing. Improvements in data quality and costs have heightened interest in using long-read sequencing (LRS) to streamline clinical genomic testing, but the absence of control data sets for variant filtering and prioritization has made tertiary analysis of LRS data challenging. To address this, the 1000 Genomes Project (1KGP) Oxford Nanopore Technologies Sequencing Consortium aims to generate LRS data from at least 800 of the 1KGP samples. Our goal is to use LRS to identify a broader spectrum of variation so we may improve our understanding of normal patterns of human variation. Here, we present data from analysis of the first 100 samples, representing all 5 superpopulations and 19 subpopulations. These samples, sequenced to an average depth of coverage of 37× and sequence read N50 of 54 kbp, have high concordance with previous studies for identifying single nucleotide and indel variants outside of homopolymer regions. Using multiple structural variant (SV) callers, we identify an average of 24,543 high-confidence SVs per genome, including shared and private SVs likely to disrupt gene function as well as pathogenic expansions within disease-associated repeats that were not detected using short reads. Evaluation of methylation signatures revealed expected patterns at known imprinted loci, samples with skewed X-inactivation patterns, and novel differentially methylated regions. All raw sequencing data, processed data, and summary statistics are publicly available, providing a valuable resource for the clinical genetics community to discover pathogenic SVs.
RESUMO
Much of the profound interspecific variation in genome content has been attributed to transposable elements (TEs). To explore the extent of TE variation within species, we developed an optimized open-source algorithm, panEDTA, to de novo annotate TEs in a pangenome context. We then generated a unified TE annotation for a maize pangenome derived from 26 reference-quality genomes, which reveals an excess of 35.1 Mb of TE sequences per genome in tropical maize relative to temperate maize. A small number (n = 216) of TE families, mainly LTR retrotransposons, drive these differences. Evidence from the methylome, transcriptome, LTR age distribution, and LTR insertional polymorphisms reveals that 64.7% of the variability is contributed by LTR families that are young, less methylated, and more expressed in tropical maize, whereas 18.5% is driven by LTR families with removal or loss in temperate maize. Additionally, we find enrichment for Young LTR families adjacent to nucleotide-binding and leucine-rich repeat (NLR) clusters of varying copy number across lines, suggesting TE activity may be associated with disease resistance in maize.
Assuntos
Elementos de DNA Transponíveis , Genoma de Planta , Retroelementos , Sequências Repetidas Terminais , Zea mays , Zea mays/genética , Retroelementos/genética , Variação Genética , Anotação de Sequência Molecular , Clima Tropical , Metilação de DNARESUMO
A comprehensive definition of health includes the assessment of patient experiences of a disease and its treatment. These patient experiences are best captured by standardized patient-reported outcome (PRO) instruments. A PRO is reported directly by the patient (or caregiver) and provides the patient's perspective into how a disease and its treatment impact their lives. PRO instruments are typically standardized, validated questionnaires with items that are scaled and can be combined to represent an underlying health-related construct such as physical, social, and role functioning, psychological well-being, symptoms, pain, and quality of life. Over the past few decades, PROs have become increasingly used in clinical trials as endpoints to better understand treatment benefits from the patient's perspective and in clinical practice to identify unmet needs of patients, health risk surveillance, and monitor outcomes of care. In this paper, we describe the process for developing standardized PRO instruments, from conceptual model development through instrument validation.
Assuntos
Medidas de Resultados Relatados pelo Paciente , Qualidade de Vida , Humanos , Inquéritos e Questionários , Reprodutibilidade dos TestesRESUMO
An enduring question in evolutionary biology concerns the degree to which episodes of convergent trait evolution depend on the same genetic programs, particularly over long timescales. In this work, we genetically dissected repeated origins and losses of prickles-sharp epidermal projections-that convergently evolved in numerous plant lineages. Mutations in a cytokinin hormone biosynthetic gene caused at least 16 independent losses of prickles in eggplants and wild relatives in the genus Solanum. Homologs underlie prickle formation across angiosperms that collectively diverged more than 150 million years ago, including rice and roses. By developing new Solanum genetic systems, we leveraged this discovery to eliminate prickles in a wild species and an indigenously foraged berry. Our findings implicate a shared hormone activation genetic program underlying evolutionarily widespread and recurrent instances of plant morphological innovation.
Assuntos
Evolução Biológica , Citocininas , Genes de Plantas , Epiderme Vegetal , Solanum , Citocininas/biossíntese , Citocininas/genética , Evolução Molecular , Mutação , Oryza/genética , Filogenia , Epiderme Vegetal/anatomia & histologia , Epiderme Vegetal/genética , Solanum/anatomia & histologia , Solanum/genéticaRESUMO
MOTIVATION: A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps requires high computational overhead and can suffer from decreasing accuracy. RESULTS: In this work, we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mb genome of Arabidopsis thaliana in under 5 min on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes. AVAILABILITY AND IMPLEMENTATION: ModDotPlot is available at https://github.com/marbl/ModDotPlot.
Assuntos
Arabidopsis , Software , Sequências de Repetição em Tandem , Arabidopsis/genética , Sequências de Repetição em Tandem/genética , Genoma de Planta , Interface Usuário-Computador , Genômica/métodosRESUMO
BACKGROUND: Social determinants of health have a significant impact on asthma outcomes, and factors such as income level and neighborhood environment have crucial roles. OBJECTIVE: This study aimed to assess the impact of the Neighborhood Deprivation Index (NDI) and Total Crime Index (TCI) on acute asthma exacerbation (AAE) and asthma-related emergency department and urgent care (ED/UC) visits in adults with mild asthma. METHODS: This retrospective cohort study used administrative data from Kaiser Permanente Southern California among 198,873 adult patients with mild asthma between January 1, 2013 and December 31, 2018. We employed robust Poisson regression models, adjusted for age and sex, to investigate the associations of NDI and TCI with AAE and asthma-related ED/UC visits. Data analysis included subgroup assessments by race and ethnicity and body mass index categories to explore potential disparities in asthma outcomes. RESULTS: Among the cohort, 12,906 patients (6.5%) experienced AAE in 1 year, and Black patients had the highest AAE percentage (7.1%). Higher NDI quintiles were associated with increased AAE risk (adjusted risk ratio = 1.11-1.27), with similar trends across body mass index categories and race or ethnicity, except for Black patients. The TCI showed weaker associations with AAE. Regarding ED/UC visits, 5.0% had such visits within 1 year. Higher NDI quintiles were associated with higher ED/UC visit risk (adjusted risk ratio = 1.23-1.75) whereas TCI associations were weaker. CONCLUSION: Addressing socioeconomic disparities, as indicated by NDI, may be crucial in mitigating asthma exacerbations and reducing health care use, highlighting the importance of incorporating social determinants into asthma management strategies even in patients with mild asthma.
RESUMO
[This corrects the article DOI: 10.1021/acs.jpcc.3c03563.].
RESUMO
Knowledge of locations and activities of cis-regulatory elements (CREs) is needed to decipher basic mechanisms of gene regulation and to understand the impact of genetic variants on complex traits. Previous studies identified candidate CREs (cCREs) using epigenetic features in one species, making comparisons difficult between species. In contrast, we conducted an interspecies study defining epigenetic states and identifying cCREs in blood cell types to generate regulatory maps that are comparable between species, using integrative modeling of eight epigenetic features jointly in human and mouse in our Validated Systematic Integration (VISION) Project. The resulting catalogs of cCREs are useful resources for further studies of gene regulation in blood cells, indicated by high overlap with known functional elements and strong enrichment for human genetic variants associated with blood cell phenotypes. The contribution of each epigenetic state in cCREs to gene regulation, inferred from a multivariate regression, was used to estimate epigenetic state regulatory potential (esRP) scores for each cCRE in each cell type, which were used to categorize dynamic changes in cCREs. Groups of cCREs displaying similar patterns of regulatory activity in human and mouse cell types, obtained by joint clustering on esRP scores, harbor distinctive transcription factor binding motifs that are similar between species. An interspecies comparison of cCREs revealed both conserved and species-specific patterns of epigenetic evolution. Finally, we show that comparisons of the epigenetic landscape between species can reveal elements with similar roles in regulation, even in the absence of genomic sequence alignment.
Assuntos
Epigênese Genética , Epigenoma , Especificidade da Espécie , Animais , Camundongos , Humanos , Células Sanguíneas/metabolismo , Sequências Reguladoras de Ácido Nucleico , Regulação da Expressão Gênica , Epigenômica/métodosRESUMO
Pangenomes are growing in number and size, thanks to the prevalence of high-quality long-read assemblies. However, current methods for studying sequence composition and conservation within pangenomes have limitations. Methods based on graph pangenomes require a computationally expensive multiple-alignment step, which can leave out some variation. Indexes based on k-mers and de Bruijn graphs are limited to answering questions at a specific substring length k. We present Maximal Exact Match Ordered (MEMO), a pangenome indexing method based on maximal exact matches (MEMs) between sequences. A single MEMO index can handle arbitrary-length queries over pangenomic windows. MEMO enables both queries that test k-mer presence/absence (membership queries) and that count the number of genomes containing k-mers in a window (conservation queries). MEMO's index for a pangenome of 89 human autosomal haplotypes fits in 2.04 GB, 8.8× smaller than a comparable KMC3 index and 11.4× smaller than a PanKmer index. MEMO indexes can be made smaller by sacrificing some counting resolution, with our decile-resolution HPRC index reaching 0.67 GB. MEMO can conduct a conservation query for 31-mers over the human leukocyte antigen locus in 13.89 seconds, 2.5x faster than other approaches. MEMO's small index size, lack of k-mer length dependence, and efficient queries make it a flexible tool for studying and visualizing substring conservation in pangenomes.
RESUMO
Background: Sublingual immunotherapy (SLIT) with 12 SQ house dust mile SLIT-tablet (HDM SLIT-tablet) for dust mite-induced perennial allergic rhinitis is reported as effective and safe. Although serious allergic reactions (SARs) and eosinophilic esophagitis (EoE) have infrequently occurred under trial conditions, the safety of HDM SLIT-tablet challenge under real-world conditions is unknown. Objective: Our aim was to estimate the incidence of SARs and EoE due to HDM SLIT-tablet challenge. Methods: Through use of administrative data from Kaiser Permanente Southern California, this prospective observational study identified patients newly administered HDM SLIT-tablet with follow-up until SLIT discontinuation or end of study. Suspected cases of SARs and EoE were detected by using International Classification of Diseases, 10th Revision, diagnosis and Current Procedural Terminology procedure codes and medication dispensing records. A 3-member clinical review committee of allergists adjudicated suspected reactions. The incidence rate of confirmed SARs and EoE per 1000 person years of exposure were determined. Results: A total of 521 patients (93.9% adult and 6.1% pediatric) were exposed to HDM SLIT-tablet challenge from January 2018 through May 2023, for 440.4 person years of exposure. The patients' average age (SD) was 39.3 (14.1) years, 58.7% were female, 44.3% were non-Hispanic White, 40.3% had asthma, and 15.0% had gastroesophageal reflux disease. A SAR occurred in 1 adult patient, and during initial HDM SLIT-tablet challenge, SARs occurred in 2 pediatric adolescents, for an overall incidence of 6.8 SARs per 1000 patient years (95% CI = 2.2-21.1). EoE occurred in 1 adult patient, for an overall incidence of 2.3 cases of EoE per 1000 patient years (95% CI = 0.3-16.1). Conclusions: This real-world study demonstrated that SARs and EoE were infrequent events with HDM SLIT-tablet use, supporting the safety of HDM SLIT-tablets and need for physician supervision with initial challenge.
RESUMO
BACKGROUND: Although individuals with mild asthma account for 30% to 40% of acute asthma exacerbations (AAEs), relatively little attention has been paid to risk factors for AAEs in this population. OBJECTIVE: To identify risk factors associated with AAEs in patients with mild asthma. METHODS: This was a retrospective cohort study. We used administrative data from a large managed care organization to identify 199,010 adults aged 18 to 85 years who met study criteria for mild asthma between 2013 and 2018. An asthma-coded qualifying visit (index visit) was identified for each patient. We then used information at the index visit or from the year before the index visit to measure potential risk factors for AAEs in the subsequent year. An AAE was defined as either an asthma-coded hospitalization or emergency department visit, or an asthma-related systemic corticosteroid administration (intramuscular or intravenous) or oral corticosteroid dispensing. Poisson regression models with robust SEs were used to estimate the adjusted risk ratios for future AAEs. RESULTS: In the study cohort, mean age was 44 years and 64% were female; 6.5% had AAEs within 1 year after the index visit. In multivariate models, age, sex, race, ethnicity, smoking status, body mass index, prior acute asthma care, and a variety of comorbidities and other clinical characteristics were significant predictors for future AAE risk. CONCLUSION: Population-based disease management strategies for asthma should be expanded to include people with mild asthma in addition to those with moderate to severe disease.
Assuntos
Asma , Humanos , Asma/epidemiologia , Asma/tratamento farmacológico , Feminino , Adulto , Masculino , Pessoa de Meia-Idade , Fatores de Risco , Estudos Retrospectivos , Idoso , Adulto Jovem , Adolescente , Idoso de 80 Anos ou mais , Progressão da Doença , Hospitalização/estatística & dados numéricos , Corticosteroides/uso terapêutico , Doença Aguda , Índice de Gravidade de Doença , Serviço Hospitalar de Emergência/estatística & dados numéricosRESUMO
Motivation: A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps requires high computational overhead and can suffer from decreasing accuracy. Results: In this work we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mbp genome of Arabidopsis thaliana in under 5 minutes on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes. Availability and Implementation: ModDotPlot is available at https://github.com/marbl/ModDotPlot.
RESUMO
Rare structural variants (SVs) - insertions, deletions, and complex rearrangements - can cause Mendelian disease, yet they remain difficult to accurately detect and interpret. We sequenced and analyzed Oxford Nanopore long-read genomes of 68 individuals from the Undiagnosed Disease Network (UDN) with no previously identified diagnostic mutations from short-read sequencing. Using our optimized SV detection pipelines and 571 control long-read genomes, we detected 716 long-read rare (MAF < 0.01) SV alleles per genome on average, achieving a 2.4x increase from short-reads. To characterize the functional effects of rare SVs, we assessed their relationship with gene expression from blood or fibroblasts from the same individuals, and found that rare SVs overlapping enhancers were enriched (LOR = 0.46) near expression outliers. We also evaluated tandem repeat expansions (TREs) and found 14 rare TREs per genome; notably these TREs were also enriched near overexpression outliers. To prioritize candidate functional SVs, we developed Watershed-SV, a probabilistic model that integrates expression data with SV-specific genomic annotations, which significantly outperforms baseline models that don't incorporate expression data. Watershed-SV identified a median of eight high-confidence functional SVs per UDN genome. Notably, this included compound heterozygous deletions in FAM177A1 shared by two siblings, which were likely causal for a rare neurodevelopmental disorder. Our observations demonstrate the promise of integrating long-read sequencing with gene expression towards improving the prioritization of functional SVs and TREs in rare disease patients.
RESUMO
The Human Genome Project was an enormous accomplishment, providing a foundation for countless explorations into the genetics and genomics of the human species. Yet for many years, the human genome reference sequence remained incomplete and lacked representation of human genetic diversity. Recently, two major advances have emerged to address these shortcomings: complete gap-free human genome sequences, such as the one developed by the Telomere-to-Telomere Consortium, and high-quality pangenomes, such as the one developed by the Human Pangenome Reference Consortium. Facilitated by advances in long-read DNA sequencing and genome assembly algorithms, complete human genome sequences resolve regions that have been historically difficult to sequence, including centromeres, telomeres, and segmental duplications. In parallel, pangenomes capture the extensive genetic diversity across populations worldwide. Together, these advances usher in a new era of genomics research, enhancing the accuracy of genomic analysis, paving the path for precision medicine, and contributing to deeper insights into human biology.
Assuntos
Genoma Humano , Projeto Genoma Humano , Humanos , Variação Genética , Genômica/métodos , Análise de Sequência de DNA/métodos , Telômero/genéticaRESUMO
The combination of ultra-long Oxford Nanopore (ONT) sequencing reads with long, accurate PacBio HiFi reads has enabled the completion of a human genome and spurred similar efforts to complete the genomes of many other species. However, this approach for complete, "telomere-to-telomere" genome assembly relies on multiple sequencing platforms, limiting its accessibility. ONT "Duplex" sequencing reads, where both strands of the DNA are read to improve quality, promise high per-base accuracy. To evaluate this new data type, we generated ONT Duplex data for three widely-studied genomes: human HG002, Solanum lycopersicum Heinz 1706 (tomato), and Zea mays B73 (maize). For the diploid, heterozygous HG002 genome, we also used "Pore-C" chromatin contact mapping to completely phase the haplotypes. We found the accuracy of Duplex data to be similar to HiFi sequencing, but with read lengths tens of kilobases longer, and the Pore-C data to be compatible with existing diploid assembly algorithms. This combination of read length and accuracy enables the construction of a high-quality initial assembly, which can then be further resolved using the ultra-long reads, and finally phased into chromosome-scale haplotypes with Pore-C. The resulting assemblies have a base accuracy exceeding 99.999% (Q50) and near-perfect continuity, with most chromosomes assembled as single contigs. We conclude that ONT sequencing is a viable alternative to HiFi sequencing for de novo genome assembly, and has the potential to provide a single-instrument solution for the reconstruction of complete genomes.
RESUMO
Nanopore signal analysis enables detection of nucleotide modifications from native DNA and RNA sequencing, providing both accurate genetic/transcriptomic and epigenetic information without additional library preparation. Presently, only a limited set of modifications can be directly basecalled (e.g. 5-methylcytosine), while most others require exploratory methods that often begin with alignment of nanopore signal to a nucleotide reference. We present Uncalled4, a toolkit for nanopore signal alignment, analysis, and visualization. Uncalled4 features an efficient banded signal alignment algorithm, BAM signal alignment file format, statistics for comparing signal alignment methods, and a reproducible de novo training method for k-mer-based pore models, revealing potential errors in ONT's state-of-the-art DNA model. We apply Uncalled4 to RNA 6-methyladenine (m6A) detection in seven human cell lines, identifying 26% more modifications than Nanopolish using m6Anet, including in several genes where m6A has known implications in cancer. Uncalled4 is available open-source at github.com/skovaka/uncalled4.
RESUMO
Less than half of individuals with a suspected Mendelian condition receive a precise molecular diagnosis after comprehensive clinical genetic testing. Improvements in data quality and costs have heightened interest in using long-read sequencing (LRS) to streamline clinical genomic testing, but the absence of control datasets for variant filtering and prioritization has made tertiary analysis of LRS data challenging. To address this, the 1000 Genomes Project ONT Sequencing Consortium aims to generate LRS data from at least 800 of the 1000 Genomes Project samples. Our goal is to use LRS to identify a broader spectrum of variation so we may improve our understanding of normal patterns of human variation. Here, we present data from analysis of the first 100 samples, representing all 5 superpopulations and 19 subpopulations. These samples, sequenced to an average depth of coverage of 37x and sequence read N50 of 54 kbp, have high concordance with previous studies for identifying single nucleotide and indel variants outside of homopolymer regions. Using multiple structural variant (SV) callers, we identify an average of 24,543 high-confidence SVs per genome, including shared and private SVs likely to disrupt gene function as well as pathogenic expansions within disease-associated repeats that were not detected using short reads. Evaluation of methylation signatures revealed expected patterns at known imprinted loci, samples with skewed X-inactivation patterns, and novel differentially methylated regions. All raw sequencing data, processed data, and summary statistics are publicly available, providing a valuable resource for the clinical genetics community to discover pathogenic SVs.
RESUMO
BACKGROUND: A systematic review and meta-analysis from 2013 reported increased risks of congenital malformations, neonatal death and neonatal hospitalization amongst infants born to women with asthma compared to infants born to mothers without asthma. OBJECTIVE: Our objective was to update the evidence on the associations between maternal asthma and adverse neonatal outcomes. SEARCH STRATEGY: We performed an English-language MEDLINE, Embase, CINAHL, and COCHRANE search with the terms (asthma or wheeze) and (pregnan* or perinat* or obstet*). SELECTION CRITERIA: Studies published from March 2012 until September 2023 reporting at least one outcome of interest (congenital malformations, stillbirth, neonatal death, perinatal mortality, neonatal hospitalization, transient tachypnea of the newborn, respiratory distress syndrome and neonatal sepsis) in a population of women with and without asthma. DATA COLLECTION AND ANALYSIS: The study was reported following the 2020 Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) and the Meta-Analysis of Observational Studies in Epidemiology (MOOSE) guidelines. Quality of individual studies was assessed by two reviewers independently using the Newcastle-Ottawa Scale. Random effects models (≥3 studies) or fixed effect models (≤2 studies) were used with restricted maximum likelihood to calculate relative risk (RR) from prevalence data and the inverse generic variance method where adjusted odds ratios (aORs) from individual studies were combined. MAIN RESULTS: A total of 18 new studies were included, along with the 22 studies from the 2013 review. Previously observed increased risks remained for perinatal mortality (relative risk [RR] 1.14, 95% confidence interval [CI]: 1.05, 1.23 n = 16 studies; aOR 1.07, 95% CI: 0.98-1.17 n = 6), congenital malformations (RR 1.36, 95% CI: 1.32-1.40 n = 17; aOR 1.42, 95% CI: 1.38-1.47 n = 6), and neonatal hospitalization (RR 1.27, 95% CI: 1.25-1.30 n = 12; aOR 1.1, 95% CI: 1.07-1.16 n = 3) amongst infants born to mothers with asthma, while the risk for neonatal death was no longer significant (RR 1.33, 95% CI: 0.95-1.84 n = 8). Previously reported non-significant risks for major congenital malformations (RR1.18, 95% CI: 1.15-1.21; aOR 1.20, 95% CI: 1.15-1.26 n = 3) and respiratory distress syndrome (RR 1.25, 95% CI: 1.17-1.34 n = 4; aOR 1.09, 95% CI: 1.01-1.18 n = 2) reached statistical significance. CONCLUSIONS: Healthcare professionals should remain aware of the increased risks to neonates being born to mothers with asthma.