RESUMO
Unsupervised learning, particularly clustering, plays a pivotal role in disease subtyping and patient stratification, especially with the abundance of large-scale multi-omics data. Deep learning models, such as variational autoencoders (VAEs), can enhance clustering algorithms by leveraging inter-individual heterogeneity. However, the impact of confounders-external factors unrelated to the condition, e.g. batch effect or age-on clustering is often overlooked, introducing bias and spurious biological conclusions. In this work, we introduce four novel VAE-based deconfounding frameworks tailored for clustering multi-omics data. These frameworks effectively mitigate confounding effects while preserving genuine biological patterns. The deconfounding strategies employed include (i) removal of latent features correlated with confounders, (ii) a conditional VAE, (iii) adversarial training, and (iv) adding a regularization term to the loss function. Using real-life multi-omics data from The Cancer Genome Atlas, we simulated various confounding effects (linear, nonlinear, categorical, mixed) and assessed model performance across 50 repetitions based on reconstruction error, clustering stability, and deconfounding efficacy. Our results demonstrate that our novel models, particularly the conditional multi-omics VAE (cXVAE), successfully handle simulated confounding effects and recover biologically driven clustering structures. cXVAE accurately identifies patient labels and unveils meaningful pathological associations among cancer types, validating deconfounded representations. Furthermore, our study suggests that some of the proposed strategies, such as adversarial training, prove insufficient in confounder removal. In summary, our study contributes by proposing innovative frameworks for simultaneous multi-omics data integration, dimensionality reduction, and deconfounding in clustering. Benchmarking on open-access data offers guidance to end-users, facilitating meaningful patient stratification for optimized precision medicine.
Assuntos
Algoritmos , Humanos , Análise por Conglomerados , Neoplasias/genética , Neoplasias/classificação , Aprendizado Profundo , Genômica/métodos , Biologia Computacional/métodos , Aprendizado de Máquina não Supervisionado , MultiômicaRESUMO
The COVID-19 pandemic demonstrated the need for respiratory protection against airborne pathogens. Respirator options for children are limited, and existing designs do not consider differences in facial shape or size. We created a dataset of children's facial images from three cohorts, then used geometric morphometric analyses of dense and sparse facial landmark representations to quantify age, sex and ancestry-related variation in shape. We found facial shape and size in children vary significantly with age from ages 2 to 18, particularly in dimensions relevant to respirator design. Sex differences are small throughout most of the age range of our sample. Ancestry is associated with significant facial shape variation in dimensions that may affect respirator fit. We offer guidance on how to our results can be used for the appropriate design of devices such as respirators for pediatric populations. We also highlight the need to consider ancestry-related variation in facial morphology to promote equitable, inclusive products.
RESUMO
Evidence that breastfeeding impacts the facial features of children is conflicting. Most studies to date have focused on dental and skeletal malocclusion. It currently remains unclear whether such effects are of sufficient magnitude to be detectable on outward facial appearance. Here, we evaluate the extent to which maternally reported breastfeeding is associated with 3D facial shape in a large adolescent cohort. After extracting 3D facial surfaces from MR scans in 2275 9- and 10-year-old children and aligning the surfaces in dense correspondence, we analyzed the effect of breastfeeding on shape as a dichotomous (no/yes) and semi-quantitative (to assess duration in months) variable using partial least squares regression. Our results showed no effect (p = 0.532) when breastfeeding was dichotomized. However, when treated as a semi-quantitative variable, breastfeeding duration was associated with statistically significant changes in shape (p = 3.61x 10-4). The most prominent facial changes included relative retrusion of the central midface, zygomatic arches, and orbital regions along with relative protrusion of forehead, cheek, and mandible. The net effect was that as breastfeeding duration increased, the facial profile in children became flatter (less convex). The observed effects on the face, however, were subtle and likely not conspicuous enough to be noticed by most observers. This was true even when comparing the faces of children breastfed for 19-24 months to children with no reported breastfeeding. Thus, breastfeeding does appear to have detectable effect on outward facial appearance in adolescent children, but its practical impact appears to be minimal.
Assuntos
Aleitamento Materno , Face , Humanos , Criança , Feminino , Face/anatomia & histologia , Masculino , Adolescente , Imageamento TridimensionalRESUMO
Age estimation in forensic odontology is mainly based on the development of permanent teeth. To register the developmental status of an examined tooth, staging techniques were developed. However, due to inappropriate calibration, uncertainties during stage allocation, and lack of experience, non-uniformity in stage allocation exists between expert observers. As a consequence, related age estimation results are inconsistent. An automated staging technique applicable to all tooth types can overcome this drawback.This study aimed to establish an integrated automated technique to stage the development of all mandibular tooth types and to compare their staging performances.Calibrated observers staged FDI teeth 31, 33, 34, 37 and 38 according to a ten-stage modified Demirjian staging technique. According to a standardised bounding box around each examined tooth, the retrospectively collected panoramic radiographs were cropped using Photoshop CC 2021® software (Adobe®, version 23.0). A gold standard set of 1639 radiographs were selected (n31 = 259, n33 = 282, n34 = 308, n37 = 390, n38 = 400) and input into a convolutional neural network (CNN) trained for optimal staging accuracy. The performance evaluation of the network was conducted in a five-fold cross-validation scheme. In each fold, the entire dataset was split into a training and a test set in a non-overlapping fashion between the folds (i.e., 80% and 20% of the dataset, respectively). Staging performances were calculated per tooth type and overall (accuracy, mean absolute difference, linearly weighted Cohen's Kappa and intra-class correlation coefficient). Overall, these metrics equalled 0.53, 0.71, 0.71, and 0.89, respectively. All staging performance indices were best for 37 and worst for 31. The highest number of misclassified stages were associated to adjacent stages. Most misclassifications were observed in all available stages of 31.Our findings suggest that the developmental status of mandibular molars can be taken into account in an automated approach for age estimation, while taking incisors into account may hinder age estimation.
Assuntos
Determinação da Idade pelos Dentes , Mandíbula , Radiografia Panorâmica , Humanos , Determinação da Idade pelos Dentes/métodos , Mandíbula/diagnóstico por imagem , Adolescente , Masculino , Feminino , Redes Neurais de Computação , Estudos Retrospectivos , Adulto , Odontologia Legal/métodos , Inteligência Artificial , Adulto Jovem , Pessoa de Meia-IdadeRESUMO
Recognizing Mendelian causes is crucial in molecular diagnostics and counseling for patients with autism spectrum disorder (ASD). We explored facial dysmorphism and facial asymmetry in relation to genetic causes in ASD patients and studied the potential of objective facial phenotyping in discriminating between Mendelian and multifactorial ASD. In a cohort of 152 ASD patients, 3D facial images were used to calculate three metrics: a computational dysmorphism score, a computational asymmetry score, and an expert dysmorphism score. High scores for each of the three metrics were associated with Mendelian causes of ASD. The computational dysmorphism score showed a significant correlation with the average expert dysmorphism score. However, in some patients, different dysmorphism aspects were captured making the metrics potentially complementary. The computational dysmorphism and asymmetry scores both enhanced the individual expert dysmorphism scores in differentiating Mendelian from non-Mendelian cases. Furthermore, the computational asymmetry score enhanced the average expert opinion in predicting a Mendelian cause. By design, our study does not allow to draw conclusions on the actual point-of-care use of 3D facial analysis. Nevertheless, 3D morphometric analysis is promising for developing clinical dysmorphology applications in diagnostics and training.
Assuntos
Transtorno do Espectro Autista , Face , Imageamento Tridimensional , Humanos , Transtorno do Espectro Autista/genética , Transtorno do Espectro Autista/diagnóstico , Masculino , Feminino , Criança , Face/anormalidades , Face/patologia , Fenótipo , Pré-Escolar , Adolescente , Assimetria Facial/genética , Assimetria Facial/diagnósticoRESUMO
Bloodstain pattern analysis plays a crucial role in forensic investigations. Projected patterns can offer valuable insights into the dynamics of crime scenes. In this paper, we propose and validate a novel approach that extends existing software, HemoVision, to analyze impact patterns that are distributed across multiple arbitrarily oriented surfaces. The proposed method integrates HemoVision's marker-based system with structure from motion (SfM) techniques to reconstruct the three-dimensional geometry of impact patterns using only two-dimensional photographs. Controlled experiments were used to validate the proposed approach, demonstrating robustness in reconstruction accuracy with median translation errors below 3â¯mm and median angular errors below 0.2°, irrespective of imaging device or image resolution. Comparing the estimated areas origin to their known ground truth, the proposed method achieved an average total error of 8.12â¯cm, with the primary source of error being the vertical dimension. Despite this, the overall error remains well within the ranges of error reported in prior work. This study demonstrates that HemoVision can be used to analyze complex impact patterns using only two-dimensional photographs, providing forensic experts with an efficient and accessible tool for investigating intricate crime scenes involving multi-surface impact patterns.
RESUMO
Human facial shape, while strongly heritable, involves both genetic and structural complexity, necessitating precise phenotyping for accurate assessment. Common phenotyping strategies include simplifying 3D facial features into univariate traits such as anthropometric measurements (e.g., inter-landmark distances), unsupervised dimensionality reductions (e.g., principal component analysis (PCA) and auto-encoder (AE) approaches), and assessing resemblance to particular facial gestalts (e.g., syndromic facial archetypes). This study provides a comparative assessment of these strategies in genome-wide association studies (GWASs) of 3D facial shape. Specifically, we investigated inter-landmark distances, PCA and AE-derived latent dimensions, and facial resemblance to random, extreme, and syndromic gestalts within a GWAS of 8,426 individuals of recent European ancestry. Inter-landmark distances exhibit the highest SNP-based heritability as estimated via LD score regression, followed by AE dimensions. Conversely, resemblance scores to extreme and syndromic facial gestalts display the lowest heritability, in line with expectations. Notably, the aggregation of multiple GWASs on facial resemblance to random gestalts reveals the highest number of independent genetic loci. This novel, easy-to-implement phenotyping approach holds significant promise for capturing genetically relevant morphological traits derived from complex biomedical imaging datasets, and its applications extend beyond faces. Nevertheless, these different phenotyping strategies capture different genetic influences on craniofacial shape. Thus, it remains valuable to explore these strategies individually and in combination to gain a more comprehensive understanding of the genetic factors underlying craniofacial shape and related traits. Author Summary: Advancements linking variation in the human genome to phenotypes have rapidly evolved in recent decades and have revealed that most human traits are influenced by genetic variants to at least some degree. While many traits, such as stature, are straightforward to acquire and investigate, the multivariate and multipartite nature of facial shape makes quantification more challenging. In this study, we compared the impact of different facial phenotyping approaches on gene mapping outcomes. Our findings suggest that the choice of facial phenotyping method has an impact on apparent trait heritability and the ability to detect genetic association signals. These results offer valuable insights into the importance of phenotyping in genetic investigations, especially when dealing with highly complex morphological traits.
RESUMO
PURPOSE: The conventional method to reconstruct the bone level for orbital defects, which is based on mirroring and manual adaptation, is time-consuming and the accuracy highly depends on the expertise of the clinical engineer. The aim of this study is to propose and evaluate an automated reconstruction method utilizing a Gaussian process morphable model (GPMM). METHODS: Sixty-five Computed Tomography (CT) scans of healthy midfaces were used to create a GPMM that can model shape variations of the orbital region. Parameter optimization was performed by evaluating several quantitative metrics inspired on the shape modeling literature, e.g. generalization and specificity. The reconstruction error was estimated by reconstructing artificial defects created in orbits from fifteen CT scans that were not included in the GPMM. The developed algorithms utilize the existing framework of Gaussian process morphable models, as implemented in the Scalismo software. RESULTS: By evaluating the proposed quality metrics, adequate parameters are chosen for non-rigid registration and reconstruction. The resulting median reconstruction error using the GPMM was lower (0.35 ± 0.16 mm) compared to the mirroring method (0.52 ± 0.18 mm). In addition, the GPMM-based reconstruction is automated and can be applied to large bilateral defects with a median reconstruction error of 0.39 ± 0.11 mm. CONCLUSION: The GPMM-based reconstruction proves to be less time-consuming and more accurate than reconstruction by mirroring. Further validation through clinical studies on patients with orbital defects is warranted. Nevertheless, the results underscore the potential of GPMM-based reconstruction as a promising alternative for designing patient-specific implants.
Assuntos
Algoritmos , Órbita , Tomografia Computadorizada por Raios X , Humanos , Tomografia Computadorizada por Raios X/métodos , Órbita/diagnóstico por imagem , Órbita/cirurgia , Distribuição Normal , Imageamento Tridimensional/métodosRESUMO
Automatic dense 3D surface registration is a powerful technique for comprehensive 3D shape analysis that has found a successful application in human craniofacial morphology research, particularly within the mandibular and cranial vault regions. However, a notable gap exists when exploring the frontal aspect of the human skull, largely due to the intricate and unique nature of its cranial anatomy. To better examine this region, this study introduces a simplified single-surface craniofacial bone mask comprising of 6707 quasi-landmarks, which can aid in the classification and quantification of variation over human facial bone surfaces. Automatic craniofacial bone phenotyping was conducted on a dataset of 31 skull scans obtained through cone-beam computed tomography (CBCT) imaging. The MeshMonk framework facilitated the non-rigid alignment of the constructed craniofacial bone mask with each individual target mesh. To gauge the accuracy and reliability of this automated process, 20 anatomical facial landmarks were manually placed three times by three independent observers on the same set of images. Intra- and inter-observer error assessments were performed using root mean square (RMS) distances, revealing consistently low scores. Subsequently, the corresponding automatic landmarks were computed and juxtaposed with the manually placed landmarks. The average Euclidean distance between these two landmark sets was 1.5 mm, while centroid sizes exhibited noteworthy similarity. Intraclass coefficients (ICC) demonstrated a high level of concordance (> 0.988), with automatic landmarking showing significantly lower errors and variation. These results underscore the utility of this newly developed single-surface craniofacial bone mask, in conjunction with the MeshMonk framework, as a highly accurate and reliable method for automated phenotyping of the facial region of human skulls from CBCT and CT imagery. This craniofacial template bone mask expansion of the MeshMonk toolbox not only enhances our capacity to study craniofacial bone variation but also holds significant potential for shedding light on the genetic, developmental, and evolutionary underpinnings of the overall human craniofacial structure.
Assuntos
Tomografia Computadorizada de Feixe Cônico , Imageamento Tridimensional , Crânio , Humanos , Crânio/anatomia & histologia , Crânio/diagnóstico por imagem , Imageamento Tridimensional/métodos , Tomografia Computadorizada de Feixe Cônico/métodos , Ossos Faciais/diagnóstico por imagem , Ossos Faciais/anatomia & histologia , Pontos de Referência Anatômicos/diagnóstico por imagem , Masculino , Feminino , Reprodutibilidade dos TestesRESUMO
Craniosynostosis (CS) is a major birth defect resulting from premature fusion of cranial sutures. Nonsyndromic CS occurs more frequently than syndromic CS, with sagittal nonsyndromic craniosynostosis (sNCS) presenting as the most common CS phenotype. Previous genome-wide association and targeted sequencing analyses of sNCS have identified multiple associated loci, with the strongest association on chromosome 20. Herein, we report the first whole-genome sequencing study of sNCS using 63 proband-parent trios. Sequencing data for these trios were analyzed using the transmission disequilibrium test (TDT) and rare variant TDT (rvTDT) to identify high-risk rare gene variants. Sequencing data were also examined for copy number variants (CNVs) and de novo variants. TDT analysis identified a highly significant locus at 20p12.3, localized to the intergenic region between BMP2 and the noncoding RNA gene LINC01428. Three variants (rs6054763, rs6054764, rs932517) were identified as potential causal variants due to their probability of being transcription factor binding sites, deleterious combined annotation dependent depletion scores, and high minor allele enrichment in probands. Morphometric analysis of cranial vault shape in an unaffected cohort validated the effect of these three single nucleotide variants (SNVs) on dolichocephaly. No genome-wide significant rare variants, de novo loci, or CNVs were identified. Future efforts to identify risk variants for sNCS should include sequencing of larger and more diverse population samples and increased omics analyses, such as RNA-seq and ATAC-seq.
Assuntos
Craniossinostoses , Estudo de Associação Genômica Ampla , Humanos , Alelos , Proteína Morfogenética Óssea 2/genética , Craniossinostoses/genética , DNA Intergênico/genética , Sequenciamento Completo do Genoma , RNA Longo não CodificanteRESUMO
Genome-wide association studies (GWAS) identified thousands of genetic variants linked to phenotypic traits and disease risk. However, mechanistic understanding of how GWAS variants influence complex morphological traits and can, in certain cases, simultaneously confer normal-range phenotypic variation and disease predisposition, is still largely lacking. Here, we focus on rs6740960, a single nucleotide polymorphism (SNP) at the 2p21 locus, which in GWAS studies has been associated both with normal-range variation in jaw shape and with an increased risk of non-syndromic orofacial clefting. Using in vitro derived embryonic cell types relevant for human facial morphogenesis, we show that this SNP resides in an enhancer that regulates chondrocytic expression of PKDCC - a gene encoding a tyrosine kinase involved in chondrogenesis and skeletal development. In agreement, we demonstrate that the rs6740960 SNP is sufficient to confer chondrocyte-specific differences in PKDCC expression. By deploying dense landmark morphometric analysis of skull elements in mice, we show that changes in Pkdcc dosage are associated with quantitative changes in the maxilla, mandible, and palatine bone shape that are concordant with the facial phenotypes and disease predisposition seen in humans. We further demonstrate that the frequency of the rs6740960 variant strongly deviated among different human populations, and that the activity of its cognate enhancer diverged in hominids. Our study provides a mechanistic explanation of how a common SNP can mediate normal-range and disease-associated morphological variation, with implications for the evolution of human facial features.
Assuntos
Condrogênese , Estudo de Associação Genômica Ampla , Animais , Humanos , Camundongos , Condrogênese/genética , Face , Cabeça , CrânioRESUMO
Automatic dense 3D surface registration is a powerful technique for comprehensive 3D shape analysis that has found a successful application in human craniofacial morphology research, particularly within the mandibular and cranial vault regions. However, a notable gap exists when exploring the frontal aspect of the human skull, largely due to the intricate and unique nature of its cranial anatomy. To better examine this region, this study introduces a simplified single-surface craniofacial bone mask comprising 9,999 quasi-landmarks, which can aid in the classification and quantification of variation over human facial bone surfaces. Automatic craniofacial bone phenotyping was conducted on a dataset of 31 skull scans obtained through cone-beam computed tomography (CBCT) imaging. The MeshMonk framework facilitated the non-rigid alignment of the constructed craniofacial bone mask with each individual target mesh. To gauge the accuracy and reliability of this automated process, 20 anatomical facial landmarks were manually placed three times by three independent observers on the same set of images. Intra- and inter-observer error assessments were performed using root mean square (RMS) distances, revealing consistently low scores. Subsequently, the corresponding automatic landmarks were computed and juxtaposed with the manually placed landmarks. The average Euclidean distance between these two landmark sets was 1.5mm, while centroid sizes exhibited noteworthy similarity. Intraclass coefficients (ICC) demonstrated a high level of concordance (>0.988), and automatic landmarking showing significantly lower errors and variation. These results underscore the utility of this newly developed single-surface craniofacial bone mask, in conjunction with the MeshMonk framework, as a highly accurate and reliable method for automated phenotyping of the facial region of human skulls from CBCT and CT imagery. This craniofacial template bone mask expansion of the MeshMonk toolbox not only enhances our capacity to study craniofacial bone variation but also holds significant potential for shedding light on the genetic, developmental, and evolutionary underpinnings of the overall human craniofacial structure.
RESUMO
Craniofacial phenotyping is critical for both syndrome delineation and diagnosis because craniofacial abnormalities occur in 30% of characterized genetic syndromes. Clinical reports, textbooks, and available software tools typically provide two-dimensional, static images and illustrations of the characteristic phenotypes of genetic syndromes. In this work, we provide an interactive web application that provides three-dimensional, dynamic visualizations for the characteristic craniofacial effects of 95 syndromes. Users can visualize syndrome facial appearance estimates quantified from data and easily compare craniofacial phenotypes of different syndromes. Our application also provides a map of morphological similarity between a target syndrome and other syndromes. Finally, users can upload 3D facial scans of individuals and compare them to our syndrome atlas estimates. In summary, we provide an interactive reference for the craniofacial phenotypes of syndromes that allows for precise, individual-specific comparisons of dysmorphology.
Assuntos
Face , Software , Humanos , Fácies , Fenótipo , SíndromeRESUMO
Dental age estimation, a cornerstone in forensic age assessment, has been extensively tried and tested, yet manual methods are impeded by tedium and interobserver variability. Automated approaches using deep transfer learning encounter challenges like data scarcity, suboptimal training, and fine-tuning complexities, necessitating robust training methods. This study explores the impact of convolutional neural network hyperparameters, model complexity, training batch size, and sample quantity on age estimation. EfficientNet-B4, DenseNet-201, and MobileNet V3 models underwent cross-validation on a dataset of 3896 orthopantomograms (OPGs) with batch sizes escalating from 10 to 160 in a doubling progression, as well as random subsets of this training dataset. Results demonstrate the EfficientNet-B4 model, trained on the complete dataset with a batch size of 160, as the top performer with a mean absolute error of 0.562 years on the test set, notably surpassing the MAE of 1.01 at a batch size of 10. Increasing batch size consistently improved performance for EfficientNet-B4 and DenseNet-201, whereas MobileNet V3 performance peaked at batch size 40. Similar trends emerged in training with reduced sample sizes, though they were outperformed by the complete models. This underscores the critical role of hyperparameter optimization in adopting deep learning for age estimation from complete OPGs. The findings not only highlight the nuanced interplay of hyperparameters and performance but also underscore the potential for accurate age estimation models through optimization. This study contributes to advancing the application of deep learning in forensic age estimation, emphasizing the significance of tailored training methodologies for optimal outcomes.
Assuntos
Determinação da Idade pelos Dentes , Aprendizado Profundo , Redes Neurais de Computação , Radiografia Panorâmica , Humanos , Determinação da Idade pelos Dentes/métodos , Adolescente , Adulto , Feminino , Masculino , Adulto Jovem , Pessoa de Meia-Idade , Odontologia Legal/métodos , Conjuntos de Dados como Assunto , IdosoRESUMO
Transcription factors (TFs) can define distinct cellular identities despite nearly identical DNA-binding specificities. One mechanism for achieving regulatory specificity is DNA-guided TF cooperativity. Although in vitro studies suggest that it may be common, examples of such cooperativity remain scarce in cellular contexts. Here, we demonstrate how "Coordinator," a long DNA motif composed of common motifs bound by many basic helix-loop-helix (bHLH) and homeodomain (HD) TFs, uniquely defines the regulatory regions of embryonic face and limb mesenchyme. Coordinator guides cooperative and selective binding between the bHLH family mesenchymal regulator TWIST1 and a collective of HD factors associated with regional identities in the face and limb. TWIST1 is required for HD binding and open chromatin at Coordinator sites, whereas HD factors stabilize TWIST1 occupancy at Coordinator and titrate it away from HD-independent sites. This cooperativity results in the shared regulation of genes involved in cell-type and positional identities and ultimately shapes facial morphology and evolution.
Assuntos
Proteínas de Ligação a DNA , Desenvolvimento Embrionário , Fatores de Transcrição , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Fatores de Transcrição Hélice-Alça-Hélice Básicos/metabolismo , Sítios de Ligação , DNA/metabolismo , Proteínas de Ligação a DNA/metabolismo , Regulação da Expressão Gênica , Mesoderma/metabolismo , Fatores de Transcrição/metabolismo , Humanos , Animais , Camundongos , Extremidades/crescimento & desenvolvimentoRESUMO
Human craniofacial shape is highly variable yet highly heritable with genetic variants interacting through multiple layers of development. Here, we hypothesize that Mendelian phenotypes represent the extremes of a phenotypic spectrum and, using achondroplasia as an example, we introduce a syndrome-informed phenotyping approach to identify genomic loci associated with achondroplasia-like facial variation in the normal population. We compared three-dimensional facial scans from 43 individuals with achondroplasia and 8246 controls to calculate achondroplasia-like facial scores. Multivariate GWAS of the control scores revealed a polygenic basis for normal facial variation along an achondroplasia-specific shape axis, identifying genes primarily involved in skeletal development. Jointly modeling these genes in two independent control samples showed craniofacial effects approximating the characteristic achondroplasia phenotype. These findings suggest that both complex and Mendelian genetic variation act on the same developmentally determined axes of facial variation, providing new insights into the genetic intersection of complex traits and Mendelian disorders.
RESUMO
Introduction: Multi-view data offer advantages over single-view data for characterizing individuals, which is crucial in precision medicine toward personalized prevention, diagnosis, or treatment follow-up. Methods: Here, we develop a network-guided multi-view clustering framework named netMUG to identify actionable subgroups of individuals. This pipeline first adopts sparse multiple canonical correlation analysis to select multi-view features possibly informed by extraneous data, which are then used to construct individual-specific networks (ISNs). Finally, the individual subtypes are automatically derived by hierarchical clustering on these network representations. Results: We applied netMUG to a dataset containing genomic data and facial images to obtain BMI-informed multi-view strata and showed how it could be used for a refined obesity characterization. Benchmark analysis of netMUG on synthetic data with known strata of individuals indicated its superior performance compared with both baseline and benchmark methods for multi-view clustering. The clustering derived from netMUG achieved an adjusted Rand index of 1 with respect to the synthesized true labels. In addition, the real-data analysis revealed subgroups strongly linked to BMI and genetic and facial determinants of these subgroups. Discussion: netMUG provides a powerful strategy, exploiting individual-specific networks to identify meaningful and actionable strata. Moreover, the implementation is easy to generalize to accommodate heterogeneous data sources or highlight data structures.
RESUMO
The cranial vault in humans is highly variable, clinically relevant, and heritable, yet its genetic architecture remains poorly understood. Here, we conduct a joint multi-ancestry and admixed multivariate genome-wide association study on 3D cranial vault shape extracted from magnetic resonance images of 6772 children from the ABCD study cohort yielding 30 genome-wide significant loci. Follow-up analyses indicate that these loci overlap with genomic risk loci for sagittal craniosynostosis, show elevated activity cranial neural crest cells, are enriched for processes related to skeletal development, and are shared with the face and brain. We present supporting evidence of regional localization for several of the identified genes based on expression patterns in the cranial vault bones of E15.5 mice. Overall, our study provides a comprehensive overview of the genetics underlying normal-range cranial vault shape and its relevance for understanding modern human craniofacial diversity and the etiology of congenital malformations.
Assuntos
Craniossinostoses , Estudo de Associação Genômica Ampla , Criança , Humanos , Animais , Camundongos , Crânio/diagnóstico por imagem , Craniossinostoses/genética , Ossos Faciais , Encéfalo/diagnóstico por imagemRESUMO
PURPOSE: Facial disfigurement may affect the quality of life of many patients. Facial prostheses are often used as an adjuvant to surgical intervention and may sometimes be the only viable treatment option. Traditional methods for designing soft-tissue facial prostheses are time-consuming and subjective, while existing digital techniques are based on mirroring of contralateral features of the patient, or the use of existing feature templates/models that may not be readily available. We aim to support the objective and semi-automated design of facial prostheses with primary application to midline or bilateral defect restoration where no contralateral features are present. Specifically, we developed and validated a statistical shape model (SSM) for estimating the shape of missing facial soft tissue segments, from any intact parts of the face. MATERIALS AND METHODS: An SSM of 3D facial variations was built from meshes extracted from computed tomography and cone beam computed tomography images of a black South African sample (n = 235) without facial disfigurement. Various types of facial defects were simulated, and the missing parts were estimated automatically by a weighted fit of each mesh to the SSM. The estimated regions were compared to the original regions using color maps and root-mean-square (RMS) distances. RESULTS: Root mean square errors (RMSE) for defect estimations of one orbit, partial nose, cheek, and lip were all below 1.71 mm. Errors for the full nose, bi-orbital defects, as well as small and large composite defects were between 2.10 and 2.58 mm. Statistically significant associations of age and type of defect with RMSE were observed, but not with sex or imaging modality. CONCLUSION: This method can support the objective and semi-automated design of facial prostheses, specifically for defects in the midline, crossing the midline or bilateral defects, by facilitating time-consuming and skill-dependent aspects of prosthesis design.
RESUMO
A genome-wide association study (GWAS) of a complex, multi-dimensional morphological trait, such as the human face, typically relies on predefined and simplified phenotypic measurements, such as inter-landmark distances and angles. These measures are predominantly designed by human experts based on perceived biological or clinical knowledge. To avoid use handcrafted phenotypes (i.e., a priori expert-identified phenotypes), alternative automatically extracted phenotypic descriptors, such as features derived from dimension reduction techniques (e.g., principal component analysis), are employed. While the features generated by such computational algorithms capture the geometric variations of the biological shape, they are not necessarily genetically relevant. Therefore, genetically informed data-driven phenotyping is desirable. Here, we propose an approach where phenotyping is done through a data-driven optimization of trait heritability, defined as the degree of variation in a phenotypic trait in a population that is due to genetic variation. The resulting phenotyping process consists of two steps: 1) constructing a feature space that models shape variations using dimension reduction techniques, and 2) searching for directions in the feature space exhibiting high trait heritability using a genetic search algorithm (i.e., heuristic inspired by natural selection). We show that the phenotypes resulting from the proposed trait heritability-optimized training differ from those of principal components in the following aspects: 1) higher trait heritability, 2) higher SNP heritability, and 3) identification of the same number of independent genetic loci with a smaller number of effective traits. Our results demonstrate that data-driven trait heritability-based optimization enables the automatic extraction of genetically relevant phenotypes, as shown by their increased power in genome-wide association scans.