RESUMEN
With progressive digitalization of healthcare systems worldwide, large-scale collection of electronic health records (EHRs) has become commonplace. However, an extensible framework for comprehensive exploratory analysis that accounts for data heterogeneity is missing. Here we introduce ehrapy, a modular open-source Python framework designed for exploratory analysis of heterogeneous epidemiology and EHR data. ehrapy incorporates a series of analytical steps, from data extraction and quality control to the generation of low-dimensional representations. Complemented by rich statistical modules, ehrapy facilitates associating patients with disease states, differential comparison between patient clusters, survival analysis, trajectory inference, causal inference and more. Leveraging ontologies, ehrapy further enables data sharing and training EHR deep learning models, paving the way for foundational models in biomedical research. We demonstrate ehrapy's features in six distinct examples. We applied ehrapy to stratify patients affected by unspecified pneumonia into finer-grained phenotypes. Furthermore, we reveal biomarkers for significant differences in survival among these groups. Additionally, we quantify medication-class effects of pneumonia medications on length of stay. We further leveraged ehrapy to analyze cardiovascular risks across different data modalities. We reconstructed disease state trajectories in patients with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) based on imaging data. Finally, we conducted a case study to demonstrate how ehrapy can detect and mitigate biases in EHR data. ehrapy, thus, provides a framework that we envision will standardize analysis pipelines on EHR data and serve as a cornerstone for the community.
RESUMEN
Idiopathic pulmonary fibrosis (IPF) is a lethal chronic lung disease characterized by aberrant intercellular communication, extracellular matrix deposition, and destruction of functional lung tissue. While extracellular vesicles (EVs) accumulate in the IPF lung, their cargo and biological effects remain unclear. We interrogated the proteome of EV and non-EV fractions during pulmonary fibrosis and characterized their contribution to fibrosis. EVs accumulated 14 days after bleomycin challenge, correlating with decreased lung function and initiated fibrogenesis in healthy precision-cut lung slices. Label-free proteomics of bronchoalveolar lavage fluid EVs (BALF-EVs) collected from mice challenged with bleomycin or control identified 107 proteins enriched in fibrotic vesicles. Multiomic analysis revealed fibroblasts as a major cellular source of BALF-EV cargo, which was enriched in secreted frizzled related protein 1 (SFRP1). Sfrp1 deficiency inhibited the activity of fibroblast-derived EVs to potentiate lung fibrosis in vivo. SFRP1 led to increased transitional cell markers, such as keratin 8, and WNT/ß-catenin signaling in primary alveolar type 2 cells. SFRP1 was expressed within the IPF lung and localized at the surface of EVs from patient-derived fibroblasts and BALF. Our work reveals altered EV protein cargo in fibrotic EVs promoting fibrogenesis and identifies fibroblast-derived vesicular SFRP1 as a fibrotic mediator and potential therapeutic target for IPF.
Asunto(s)
Bleomicina , Líquido del Lavado Bronquioalveolar , Vesículas Extracelulares , Fibroblastos , Fibrosis Pulmonar Idiopática , Animales , Vesículas Extracelulares/metabolismo , Fibroblastos/metabolismo , Fibroblastos/patología , Ratones , Fibrosis Pulmonar Idiopática/metabolismo , Fibrosis Pulmonar Idiopática/patología , Humanos , Masculino , Pulmón/patología , Pulmón/metabolismo , Proteínas de la Membrana/metabolismo , Proteínas de la Membrana/genética , Péptidos y Proteínas de Señalización Intercelular/metabolismo , Péptidos y Proteínas de Señalización Intercelular/genética , Proteómica/métodos , Modelos Animales de Enfermedad , Ratones Endogámicos C57BL , Vía de Señalización Wnt , FemeninoRESUMEN
Emphysema, the progressive destruction of gas exchange surfaces in the lungs, is a hallmark of chronic obstructive pulmonary disease (COPD) that is presently incurable. This therapeutic gap is largely due to a poor understanding of potential drivers of impaired tissue regeneration, such as abnormal lung epithelial progenitor cells, including alveolar type II (ATII) and airway club cells. We discovered an emphysema-specific sub-population of ATII cells located in enlarged distal alveolar sacs, termed asATII cells. Single cell RNA-seq and in situ localisation revealed that asATII cells co-express the alveolar marker surfactant protein C (SPC) and the club cell marker secretaglobin-3A2 (SCGB3A2). A similar ATII sub-population derived from club cells was also identified in mouse COPD models using lineage labeling. Human and mouse ATII sub-populations formed 80-90% fewer alveolar organoids than healthy controls, indicating reduced progenitor function. Targeting asATII cells or their progenitor club cells could reveal novel COPD treatment strategies.
RESUMEN
BACKGROUND: Tetraspanin CD151 is highly expressed in endothelia and reinforces cell adhesion, but its role in vascular inflammation remains largely unknown. METHODS: In vitro molecular and cellular biological analyses on genetically modified endothelial cells, in vivo vascular biological analyses on genetically engineered mouse models, and in silico systems biology and bioinformatics analyses on CD151-related events. RESULTS: Endothelial ablation of Cd151 leads to pulmonary and cardiac inflammation, severe sepsis, and perilous COVID-19, and endothelial CD151 becomes downregulated in inflammation. Mechanistically, CD151 restrains endothelial release of proinflammatory molecules for less leukocyte infiltration. At the subcellular level, CD151 determines the integrity of multivesicular bodies/lysosomes and confines the production of exosomes that carry cytokines such as ANGPT2 (angiopoietin-2) and proteases such as cathepsin-D. At the molecular level, CD151 docks VCP (valosin-containing protein)/p97, which controls protein quality via mediating deubiquitination for proteolytic degradation, onto endolysosomes to facilitate VCP/p97 function. At the endolysosome membrane, CD151 links VCP/p97 to (1) IFITM3 (interferon-induced transmembrane protein 3), which regulates multivesicular body functions, to restrain IFITM3-mediated exosomal sorting, and (2) V-ATPase, which dictates endolysosome pH, to support functional assembly of V-ATPase. CONCLUSIONS: Distinct from its canonical function in strengthening cell adhesion at cell surface, CD151 maintains endolysosome function by sustaining VCP/p97-mediated protein unfolding and turnover. By supporting protein quality control and protein degradation, CD151 prevents proteins from (1) buildup in endolysosomes and (2) discharge through exosomes, to limit vascular inflammation. Also, our study conceptualizes that balance between degradation and discharge of proteins in endothelial cells determines vascular information. Thus, the IFITM3/V-ATPase-tetraspanin-VCP/p97 complexes on endolysosome, as a protein quality control and inflammation-inhibitory machinery, could be beneficial for therapeutic intervention against vascular inflammation.
Asunto(s)
COVID-19 , Endosomas , Lisosomas , Tetraspanina 24 , Animales , Lisosomas/metabolismo , Tetraspanina 24/metabolismo , Tetraspanina 24/genética , Humanos , Ratones , COVID-19/metabolismo , COVID-19/inmunología , COVID-19/patología , Endosomas/metabolismo , Ratones Noqueados , Vasculitis/metabolismo , Ratones Endogámicos C57BL , SARS-CoV-2 , Inflamación/metabolismo , Inflamación/patología , Sepsis/metabolismoRESUMEN
Single-cell multiplexing techniques (cell hashing and genetic multiplexing) combine multiple samples, optimizing sample processing and reducing costs. Cell hashing conjugates antibody-tags or chemical-oligonucleotides to cell membranes, while genetic multiplexing allows to mix genetically diverse samples and relies on aggregation of RNA reads at known genomic coordinates. We develop hadge (hashing deconvolution combined with genotype information), a Nextflow pipeline that combines 12 methods to perform both hashing- and genotype-based deconvolution. We propose a joint deconvolution strategy combining best-performing methods and demonstrate how this approach leads to the recovery of previously discarded cells in a nuclei hashing of fresh-frozen brain tissue.
Asunto(s)
Análisis de la Célula Individual , Análisis de la Célula Individual/métodos , Humanos , Encéfalo/metabolismo , Encéfalo/citología , Programas Informáticos , GenotipoRESUMEN
INTRODUCTION: Environmental pollutants injure the mucociliary elevator, thereby provoking disease progression in chronic obstructive pulmonary disease (COPD). Epithelial resilience mechanisms to environmental nanoparticles in health and disease are poorly characterised. METHODS: We delineated the impact of prevalent pollutants such as carbon and zinc oxide nanoparticles, on cellular function and progeny in primary human bronchial epithelial cells (pHBECs) from end-stage COPD (COPD-IV, n=4), early disease (COPD-II, n=3) and pulmonary healthy individuals (n=4). After nanoparticle exposure of pHBECs at air-liquid interface, cell cultures were characterised by functional assays, transcriptome and protein analysis, complemented by single-cell analysis in serial samples of pHBEC cultures focusing on basal cell differentiation. RESULTS: COPD-IV was characterised by a prosecretory phenotype (twofold increase in MUC5AC+) at the expense of the multiciliated epithelium (threefold reduction in Ac-Tub+), resulting in an increased resilience towards particle-induced cell damage (fivefold reduction in transepithelial electrical resistance), as exemplified by environmentally abundant doses of zinc oxide nanoparticles. Exposure of COPD-II cultures to cigarette smoke extract provoked the COPD-IV characteristic, prosecretory phenotype. Time-resolved single-cell transcriptomics revealed an underlying COPD-IV unique basal cell state characterised by a twofold increase in KRT5+ (P=0.018) and LAMB3+ (P=0.050) expression, as well as a significant activation of Wnt-specific (P=0.014) and Notch-specific (P=0.021) genes, especially in precursors of suprabasal and secretory cells. CONCLUSION: We identified COPD stage-specific gene alterations in basal cells that affect the cellular composition of the bronchial elevator and may control disease-specific epithelial resilience mechanisms in response to environmental nanoparticles. The identified phenomena likely inform treatment and prevention strategies.
Asunto(s)
Células Epiteliales , Enfermedad Pulmonar Obstructiva Crónica , Humanos , Enfermedad Pulmonar Obstructiva Crónica/etiología , Células Epiteliales/metabolismo , Masculino , Persona de Mediana Edad , Células Cultivadas , Bronquios/patología , Femenino , Anciano , Óxido de Zinc , Mucosa Respiratoria/metabolismo , Mucosa Respiratoria/patología , Cilios , Nanopartículas , Diferenciación CelularRESUMEN
BACKGROUND: Fibroblast-to-myofibroblast conversion is a major driver of tissue remodelling in organ fibrosis. Distinct lineages of fibroblasts support homeostatic tissue niche functions, yet their specific activation states and phenotypic trajectories during injury and repair have remained unclear. METHODS: We combined spatial transcriptomics, multiplexed immunostainings, longitudinal single-cell RNA-sequencing and genetic lineage tracing to study fibroblast fates during mouse lung regeneration. Our findings were validated in idiopathic pulmonary fibrosis patient tissues in situ as well as in cell differentiation and invasion assays using patient lung fibroblasts. Cell differentiation and invasion assays established a function of SFRP1 in regulating human lung fibroblast invasion in response to transforming growth factor (TGF)ß1. MEASUREMENTS AND MAIN RESULTS: We discovered a transitional fibroblast state characterised by high Sfrp1 expression, derived from both Tcf21-Cre lineage positive and negative cells. Sfrp1 + cells appeared early after injury in peribronchiolar, adventitial and alveolar locations and preceded the emergence of myofibroblasts. We identified lineage-specific paracrine signals and inferred converging transcriptional trajectories towards Sfrp1 + transitional fibroblasts and Cthrc1 + myofibroblasts. TGFß1 downregulated SFRP1 in noninvasive transitional cells and induced their switch to an invasive CTHRC1+ myofibroblast identity. Finally, using loss-of-function studies we showed that SFRP1 modulates TGFß1-induced fibroblast invasion and RHOA pathway activity. CONCLUSIONS: Our study reveals the convergence of spatially and transcriptionally distinct fibroblast lineages into transcriptionally uniform myofibroblasts and identifies SFRP1 as a modulator of TGFß1-driven fibroblast phenotypes in fibrogenesis. These findings are relevant in the context of therapeutic interventions that aim at limiting or reversing fibroblast foci formation.
Asunto(s)
Fibrosis Pulmonar Idiopática , Miofibroblastos , Ratones , Animales , Humanos , Miofibroblastos/metabolismo , Fibroblastos/metabolismo , Pulmón/metabolismo , Fibrosis Pulmonar Idiopática/metabolismo , Diferenciación Celular , Factor de Crecimiento Transformador beta1/metabolismo , Proteínas de la Matriz Extracelular/metabolismo , Proteínas de la Membrana/genética , Proteínas de la Membrana/metabolismoRESUMEN
Pulmonary fibrosis develops as a consequence of failed regeneration after injury. Analyzing mechanisms of regeneration and fibrogenesis directly in human tissue has been hampered by the lack of organotypic models and analytical techniques. In this work, we coupled ex vivo cytokine and drug perturbations of human precision-cut lung slices (hPCLS) with single-cell RNA sequencing and induced a multilineage circuit of fibrogenic cell states in hPCLS. We showed that these cell states were highly similar to the in vivo cell circuit in a multicohort lung cell atlas from patients with pulmonary fibrosis. Using micro-CT-staged patient tissues, we characterized the appearance and interaction of myofibroblasts, an ectopic endothelial cell state, and basaloid epithelial cells in the thickened alveolar septum of early-stage lung fibrosis. Induction of these states in the hPCLS model provided evidence that the basaloid cell state was derived from alveolar type 2 cells, whereas the ectopic endothelial cell state emerged from capillary cell plasticity. Cell-cell communication routes in patients were largely conserved in hPCLS, and antifibrotic drug treatments showed highly cell type-specific effects. Our work provides an experimental framework for perturbational single-cell genomics directly in human lung tissue that enables analysis of tissue homeostasis, regeneration, and pathology. We further demonstrate that hPCLS offer an avenue for scalable, high-resolution drug testing to accelerate antifibrotic drug development and translation.
Asunto(s)
Fibrosis Pulmonar , Humanos , Fibrosis Pulmonar/genética , Fibrosis Pulmonar/patología , Análisis de Expresión Génica de una Sola Célula , Pulmón/patología , Células Epiteliales Alveolares , Células Epiteliales/metabolismoRESUMEN
In this study, we interrogate molecular mechanisms underlying the specification of lung progenitors from human pluripotent stem cells (hPSCs). We employ single-cell RNA-sequencing with high temporal precision, alongside an optimized differentiation protocol, to elucidate the transcriptional hierarchy of lung specification to chart the associated single-cell trajectories. Our findings indicate that Sonic hedgehog, TGF-ß, and Notch activation are essential within an ISL1/NKX2-1 trajectory, leading to the emergence of lung progenitors during the foregut endoderm phase. Additionally, the induction of HHEX delineates an alternate trajectory at the early definitive endoderm stage, preceding the lung pathway and giving rise to a significant hepatoblast population. Intriguingly, neither KDR+ nor mesendoderm progenitors manifest as intermediate stages in the lung and hepatic lineage development. Our multistep model offers insights into lung organogenesis and provides a foundation for in-depth study of early human lung development and modeling using hPSCs.
RESUMEN
Autoimmunity plays a role in certain types of lung fibrosis, notably connective tissue disease-associated interstitial lung disease (CTD-ILD). In idiopathic pulmonary fibrosis (IPF), an incurable and fatal lung disease, diagnosis typically requires clinical exclusion of autoimmunity. However, autoantibodies of unknown significance have been detected in IPF patients. We conducted computational analysis of B cell transcriptomes in published transcriptomics datasets and developed a proteomic Differential Antigen Capture (DAC) assay that captures plasma antibodies followed by affinity purification of lung proteins coupled to mass spectrometry. We analyzed antibody capture in two independent cohorts of IPF and CTL-ILD patients over two disease progression time points. Our findings revealed significant upregulation of specific immunoglobulins with V-segment bias in IPF across multiple cohorts. We identified a predictive autoimmune signature linked to reduced transplant-free survival in IPF, persisting over time. Notably, autoantibodies against thrombospondin-1 were associated with decreased survival, suggesting their potential as predictive biomarkers.
RESUMEN
Regenerating the lungs' architecture after injury requires rebuilding its fibroelastic extracellular matrix scaffold. Konkimalla et al. establish that regenerative cell states (RCSs) of both epithelial and mesenchymal origin are functionally linked and indispensable for this process. Experimental ablation of RCSs causes organ degeneration, whereas their induction causes organ fibrosis.
Asunto(s)
Matriz Extracelular , Humanos , FibrosisRESUMEN
Optimal tissue recovery and organismal survival are achieved by spatiotemporal tuning of tissue inflammation, contraction and scar formation1. Here we identify a multipotent fibroblast progenitor marked by CD201 expression in the fascia, the deepest connective tissue layer of the skin. Using skin injury models in mice, single-cell transcriptomics and genetic lineage tracing, ablation and gene deletion models, we demonstrate that CD201+ progenitors control the pace of wound healing by generating multiple specialized cell types, from proinflammatory fibroblasts to myofibroblasts, in a spatiotemporally tuned sequence. We identified retinoic acid and hypoxia signalling as the entry checkpoints into proinflammatory and myofibroblast states. Modulating CD201+ progenitor differentiation impaired the spatiotemporal appearances of fibroblasts and chronically delayed wound healing. The discovery of proinflammatory and myofibroblast progenitors and their differentiation pathways provide a new roadmap to understand and clinically treat impaired wound healing.
Asunto(s)
Receptor de Proteína C Endotelial , Fascia , Cicatrización de Heridas , Animales , Ratones , Diferenciación Celular , Hipoxia de la Célula , Linaje de la Célula , Modelos Animales de Enfermedad , Receptor de Proteína C Endotelial/metabolismo , Fascia/citología , Fascia/lesiones , Fascia/metabolismo , Fibroblastos/citología , Fibroblastos/metabolismo , Perfilación de la Expresión Génica , Inflamación/metabolismo , Inflamación/patología , Miofibroblastos/citología , Miofibroblastos/metabolismo , Transducción de Señal , Análisis de Expresión Génica de una Sola Célula , Piel/citología , Piel/lesiones , Piel/metabolismo , Tretinoina/metabolismoRESUMEN
Single-cell proteomics by mass spectrometry is emerging as a powerful and unbiased method for the characterization of biological heterogeneity. So far, it has been limited to cultured cells, whereas an expansion of the method to complex tissues would greatly enhance biological insights. Here we describe single-cell Deep Visual Proteomics (scDVP), a technology that integrates high-content imaging, laser microdissection and multiplexed mass spectrometry. scDVP resolves the context-dependent, spatial proteome of murine hepatocytes at a current depth of 1,700 proteins from a cell slice. Half of the proteome was differentially regulated in a spatial manner, with protein levels changing dramatically in proximity to the central vein. We applied machine learning to proteome classes and images, which subsequently inferred the spatial proteome from imaging data alone. scDVP is applicable to healthy and diseased tissues and complements other spatial proteomics and spatial omics technologies.
Asunto(s)
Proteoma , Proteómica , Animales , Ratones , Proteoma/análisis , Espectrometría de Masas/métodos , Proteómica/métodos , Captura por Microdisección con Láser/métodosRESUMEN
The origins of wound myofibroblasts and scar tissue remains unclear, but it is assumed to involve conversion of adipocytes into myofibroblasts. Here, we directly explore the potential plasticity of adipocytes and fibroblasts after skin injury. Using genetic lineage tracing and live imaging in explants and in wounded animals, we observe that injury induces a transient migratory state in adipocytes with vastly distinct cell migration patterns and behaviours from fibroblasts. Furthermore, migratory adipocytes, do not contribute to scar formation and remain non-fibrogenic in vitro, in vivo and upon transplantation into wounds in animals. Using single-cell and bulk transcriptomics we confirm that wound adipocytes do not convert into fibrogenic myofibroblasts. In summary, the injury-induced migratory adipocytes remain lineage-restricted and do not converge or reprogram into a fibrosing phenotype. These findings broadly impact basic and translational strategies in the regenerative medicine field, including clinical interventions for wound repair, diabetes, and fibrotic pathologies.
Asunto(s)
Cicatriz , Piel , Animales , Cicatriz/patología , Piel/patología , Miofibroblastos/patología , Adipocitos/patología , Cicatrización de Heridas , Fibroblastos/patología , FibrosisRESUMEN
Systemic inflammation is established as part of late-stage severe lung disease, but molecular, functional, and phenotypic changes in peripheral immune cells in early disease stages remain ill defined. Chronic obstructive pulmonary disease (COPD) is a major respiratory disease characterized by small-airway inflammation, emphysema, and severe breathing difficulties. Using single-cell analyses we demonstrate that blood neutrophils are already increased in early-stage COPD, and changes in molecular and functional neutrophil states correlate with lung function decline. Assessing neutrophils and their bone marrow precursors in a murine cigarette smoke exposure model identified similar molecular changes in blood neutrophils and precursor populations that also occur in the blood and lung. Our study shows that systemic molecular alterations in neutrophils and their precursors are part of early-stage COPD, a finding to be further explored for potential therapeutic targets and biomarkers for early diagnosis and patient stratification.
Asunto(s)
Enfermedad Pulmonar Obstructiva Crónica , Enfisema Pulmonar , Humanos , Animales , Ratones , Neutrófilos , Enfermedad Pulmonar Obstructiva Crónica/tratamiento farmacológico , Pulmón , InflamaciónRESUMEN
Recent advances in single-cell technologies have enabled high-throughput molecular profiling of cells across modalities and locations. Single-cell transcriptomics data can now be complemented by chromatin accessibility, surface protein expression, adaptive immune receptor repertoire profiling and spatial information. The increasing availability of single-cell data across modalities has motivated the development of novel computational methods to help analysts derive biological insights. As the field grows, it becomes increasingly difficult to navigate the vast landscape of tools and analysis steps. Here, we summarize independent benchmarking studies of unimodal and multimodal single-cell analysis across modalities to suggest comprehensive best-practice workflows for the most common analysis steps. Where independent benchmarks are not available, we review and contrast popular methods. Our article serves as an entry point for novices in the field of single-cell (multi-)omic analysis and guides advanced users to the most recent best practices.
Asunto(s)
Perfilación de la Expresión Génica , Proteómica , Perfilación de la Expresión Génica/métodos , Análisis de la Célula Individual/métodosRESUMEN
BACKGROUND: Receptor-interacting protein kinase 1 (RIPK1) is a key mediator of regulated cell death (including apoptosis and necroptosis) and inflammation, both drivers of COPD pathogenesis. We aimed to define the contribution of RIPK1 kinase-dependent cell death and inflammation in the pathogenesis of COPD. METHODS: We assessed RIPK1 expression in single-cell RNA sequencing (RNA-seq) data from human and mouse lungs, and validated RIPK1 levels in lung tissue of COPD patients via immunohistochemistry. Next, we assessed the consequences of genetic and pharmacological inhibition of RIPK1 kinase activity in experimental COPD, using Ripk1 S25D/S25D kinase-deficient mice and the RIPK1 kinase inhibitor GSK'547. RESULTS: RIPK1 expression increased in alveolar type 1 (AT1), AT2, ciliated and neuroendocrine cells in human COPD. RIPK1 protein levels were significantly increased in airway epithelium of COPD patients compared with never-smokers and smokers without airflow limitation. In mice, exposure to cigarette smoke (CS) increased Ripk1 expression similarly in AT2 cells, and further in alveolar macrophages and T-cells. Genetic and/or pharmacological inhibition of RIPK1 kinase activity significantly attenuated airway inflammation upon acute and subacute CS exposure, as well as airway remodelling, emphysema, and apoptotic and necroptotic cell death upon chronic CS exposure. Similarly, pharmacological RIPK1 kinase inhibition significantly attenuated elastase-induced emphysema and lung function decline. Finally, RNA-seq on lung tissue of CS-exposed mice revealed downregulation of cell death and inflammatory pathways upon pharmacological RIPK1 kinase inhibition. CONCLUSIONS: RIPK1 kinase inhibition is protective in experimental models of COPD and may represent a novel promising therapeutic approach.
Asunto(s)
Enfisema , Enfermedad Pulmonar Obstructiva Crónica , Enfisema Pulmonar , Humanos , Ratones , Animales , Pulmón , Muerte Celular , Inflamación/metabolismo , Ratones Endogámicos C57BL , Proteína Serina-Treonina Quinasas de Interacción con Receptores/genética , Proteína Serina-Treonina Quinasas de Interacción con Receptores/metabolismoRESUMEN
The specification, characterization, and fate of alveolar type 1 and type 2 (AT1 and AT2) progenitors during embryonic lung development are poorly defined. Current models of distal epithelial lineage formation fail to capture the heterogeneity and dynamic contribution of progenitor pools present during early development. Furthermore, few studies explore the pathways involved in alveolar progenitor specification and fate. In this paper, we build upon our previously published work on the regulation of airway epithelial progenitors by fibroblast growth factor receptor 2b (FGFR2b) signalling during early (E12.5) and mid (E14.5) pseudoglandular stage lung development. Our results suggest that a significant proportion of AT2 and AT1 progenitors are lineage-flexible during late pseudoglandular stage development, and that lineage commitment is regulated in part by FGFR2b signalling. We have characterized a set of direct FGFR2b targets at E16.5 which are likely involved in alveolar lineage formation. These signature genes converge on a subpopulation of AT2 cells later in development and are downregulated in AT2 cells transitioning to the AT1 lineage during repair after injury in adults. Our findings highlight the extensive heterogeneity of pneumocytes by elucidating the role of FGFR2b signalling in these cells during early airway epithelial lineage formation, as well as during repair after injury.