RESUMO
OBJECTIVE: Torus Palatinus (TP) is a common trait with an unclear aetiology. Although prior studies suggest a hereditary component, the genetic factors that influence TP risk remain unknown. The purpose of this study is to identify genetic variants associated with TP. MATERIALS AND METHODS: We assessed the TP status of 829 individuals from various ancestral backgrounds using 3D palate scans. We then carried out a genome-wide association study (GWAS) to identify common variants associated with TP. We also performed gene-based tests across the exome to investigate the role of low-frequency coding variants. RESULTS: Our GWAS did not identify any genome-wide significant signals but identified suggestive associations including hits on chromosomes 2, 5 and 17 with p-values less than 5 × 10-6. Candidate genes at these suggestive loci have been implicated in normal-range craniofacial features, syndromes with facial and oral malformations, and bone density. We did not find evidence that low-frequency coding variants influence TP risk. In addition, we failed to replicate associations identified in prior genetic studies of TP. CONCLUSION: These findings suggest that multiple genes likely influence the development of TP. Independent replication will be required to confirm our suggestive associations.
RESUMO
BACKGROUND: Processing raw genomic data for downstream applications such as imputation, association studies, and modeling requires numerous third-party bioinformatics software tools. It is highly time-consuming and resource-intensive with computational demands and storage limitations that pose significant challenges that increase cost. The use of software tools independent of one another, in a disjointed stepwise fashion, increases the difficulty and sets forth higher error rates because of fragmented job executions in alignment, variant calling, and/or build conversion complications. As sequencing data availability grows, the ability for biologists to process it using stable, automated, and reproducible workflows is paramount as it significantly reduces the time to generate clean and reliable data. RESULTS: The Iliad suite of genomic data workflows was developed to provide users with seamless file transitions from raw genomic data to a quality-controlled variant call format (VCF) file for downstream applications. Iliad benefits from the efficiency of the Snakemake best practices framework coupled with Singularity and Docker containers for repeatability, portability, and ease of installation. This feat is accomplished from the onset with download acquisitions of any raw data type (FASTQ, CRAM, IDAT) straight through to the generation of a clean merged data file that can combine any user-preferred datasets using robust programs such as BWA, Samtools, and BCFtools. Users can customize and direct their workflow with one straightforward configuration file. Iliad is compatible with Linux, MacOS, and Windows platforms and scalable from a local machine to a high-performance computing cluster. CONCLUSION: Iliad offers automated workflows with optimized time and resource management that are comparable to other workflows available but generates analysis-ready VCF files from the most common datatypes using a single command. The storage footprint challenge of genomic data is overcome by utilizing temporary intermediate files before the final VCF is generated. This file is ready for use in imputation, genome-wide association study (GWAS) pipelines, high-throughput population genetics studies, select gene candidate studies, and more. Iliad was developed to be portable, compatible, scalable, robust, and repeatable with a simplistic setup, so biologists that are less familiar with programming can manage their own big data with this open-source suite of workflows.
Assuntos
Estudo de Associação Genômica Ampla , Genômica , Fluxo de Trabalho , Biologia Computacional , SoftwareRESUMO
Automatic dense 3D surface registration is a powerful technique for comprehensive 3D shape analysis that has found a successful application in human craniofacial morphology research, particularly within the mandibular and cranial vault regions. However, a notable gap exists when exploring the frontal aspect of the human skull, largely due to the intricate and unique nature of its cranial anatomy. To better examine this region, this study introduces a simplified single-surface craniofacial bone mask comprising 9,999 quasi-landmarks, which can aid in the classification and quantification of variation over human facial bone surfaces. Automatic craniofacial bone phenotyping was conducted on a dataset of 31 skull scans obtained through cone-beam computed tomography (CBCT) imaging. The MeshMonk framework facilitated the non-rigid alignment of the constructed craniofacial bone mask with each individual target mesh. To gauge the accuracy and reliability of this automated process, 20 anatomical facial landmarks were manually placed three times by three independent observers on the same set of images. Intra- and inter-observer error assessments were performed using root mean square (RMS) distances, revealing consistently low scores. Subsequently, the corresponding automatic landmarks were computed and juxtaposed with the manually placed landmarks. The average Euclidean distance between these two landmark sets was 1.5mm, while centroid sizes exhibited noteworthy similarity. Intraclass coefficients (ICC) demonstrated a high level of concordance (>0.988), and automatic landmarking showing significantly lower errors and variation. These results underscore the utility of this newly developed single-surface craniofacial bone mask, in conjunction with the MeshMonk framework, as a highly accurate and reliable method for automated phenotyping of the facial region of human skulls from CBCT and CT imagery. This craniofacial template bone mask expansion of the MeshMonk toolbox not only enhances our capacity to study craniofacial bone variation but also holds significant potential for shedding light on the genetic, developmental, and evolutionary underpinnings of the overall human craniofacial structure.
RESUMO
Automatic dense 3D surface registration is a powerful technique for comprehensive 3D shape analysis that has found a successful application in human craniofacial morphology research, particularly within the mandibular and cranial vault regions. However, a notable gap exists when exploring the frontal aspect of the human skull, largely due to the intricate and unique nature of its cranial anatomy. To better examine this region, this study introduces a simplified single-surface craniofacial bone mask comprising of 6707 quasi-landmarks, which can aid in the classification and quantification of variation over human facial bone surfaces. Automatic craniofacial bone phenotyping was conducted on a dataset of 31 skull scans obtained through cone-beam computed tomography (CBCT) imaging. The MeshMonk framework facilitated the non-rigid alignment of the constructed craniofacial bone mask with each individual target mesh. To gauge the accuracy and reliability of this automated process, 20 anatomical facial landmarks were manually placed three times by three independent observers on the same set of images. Intra- and inter-observer error assessments were performed using root mean square (RMS) distances, revealing consistently low scores. Subsequently, the corresponding automatic landmarks were computed and juxtaposed with the manually placed landmarks. The average Euclidean distance between these two landmark sets was 1.5 mm, while centroid sizes exhibited noteworthy similarity. Intraclass coefficients (ICC) demonstrated a high level of concordance (> 0.988), with automatic landmarking showing significantly lower errors and variation. These results underscore the utility of this newly developed single-surface craniofacial bone mask, in conjunction with the MeshMonk framework, as a highly accurate and reliable method for automated phenotyping of the facial region of human skulls from CBCT and CT imagery. This craniofacial template bone mask expansion of the MeshMonk toolbox not only enhances our capacity to study craniofacial bone variation but also holds significant potential for shedding light on the genetic, developmental, and evolutionary underpinnings of the overall human craniofacial structure.
Assuntos
Tomografia Computadorizada de Feixe Cônico , Imageamento Tridimensional , Crânio , Humanos , Crânio/anatomia & histologia , Crânio/diagnóstico por imagem , Imageamento Tridimensional/métodos , Tomografia Computadorizada de Feixe Cônico/métodos , Ossos Faciais/diagnóstico por imagem , Ossos Faciais/anatomia & histologia , Pontos de Referência Anatômicos/diagnóstico por imagem , Masculino , Feminino , Reprodutibilidade dos TestesRESUMO
Analysis of population structure and genomic ancestry remains an important topic in human genetics and bioinformatics. Commonly used methods require high-quality genotype data to ensure accurate inference. However, in practice, laboratory artifacts and outliers are often present in the data. Moreover, existing methods are typically affected by the presence of related individuals in the dataset. In this work, we propose a novel hybrid method, called SAE-IBS, which combines the strengths of traditional matrix decomposition-based (e.g., principal component analysis) and more recent neural network-based (e.g., autoencoders) solutions. Namely, it yields an orthogonal latent space enhancing dimensionality selection while learning non-linear transformations. The proposed approach achieves higher accuracy than existing methods for projecting poor quality target samples (genotyping errors and missing data) onto a reference ancestry space and generates a robust ancestry space in the presence of relatedness. We introduce a new approach and an accompanying open-source program for robust ancestry inference in the presence of missing data, genotyping errors, and relatedness. The obtained ancestry space allows for non-linear projections and exhibits orthogonality with clearly separable population groups.
Assuntos
Genética Populacional , Redes Neurais de Computação , Humanos , Genótipo , Análise de Componente PrincipalRESUMO
Facial ancestry can be described as variation that exists in facial features that are shared amongst members of a population due to environmental and genetic effects. Even within Europe, faces vary among subregions and may lead to confounding in genetic association studies if unaccounted for. Genetic studies use genetic principal components (PCs) to describe facial ancestry to circumvent this issue. Yet the phenotypic effect of these genetic PCs on the face has yet to be described, and phenotype-based alternatives compared. In anthropological studies, consensus faces are utilized as they depict a phenotypic, not genetic, ancestry effect. In this study, we explored the effects of regional differences on facial ancestry in 744 Europeans using genetic and anthropological approaches. Both showed similar ancestry effects between subgroups, localized mainly to the forehead, nose, and chin. Consensus faces explained the variation seen in only the first three genetic PCs, differing more in magnitude than shape change. Here we show only minor differences between the two methods and discuss a combined approach as a possible alternative for facial scan correction that is less cohort dependent, more replicable, non-linear, and can be made open access for use across research groups, enhancing future studies in this field.
Assuntos
Antropologia , Testa , Queixo , Consenso , Europa (Continente)RESUMO
The cranial vault in humans is highly variable, clinically relevant, and heritable, yet its genetic architecture remains poorly understood. Here, we conduct a joint multi-ancestry and admixed multivariate genome-wide association study on 3D cranial vault shape extracted from magnetic resonance images of 6772 children from the ABCD study cohort yielding 30 genome-wide significant loci. Follow-up analyses indicate that these loci overlap with genomic risk loci for sagittal craniosynostosis, show elevated activity cranial neural crest cells, are enriched for processes related to skeletal development, and are shared with the face and brain. We present supporting evidence of regional localization for several of the identified genes based on expression patterns in the cranial vault bones of E15.5 mice. Overall, our study provides a comprehensive overview of the genetics underlying normal-range cranial vault shape and its relevance for understanding modern human craniofacial diversity and the etiology of congenital malformations.
Assuntos
Craniossinostoses , Estudo de Associação Genômica Ampla , Criança , Humanos , Animais , Camundongos , Crânio/diagnóstico por imagem , Craniossinostoses/genética , Ossos Faciais , Encéfalo/diagnóstico por imagemRESUMO
Estimates of individual-level genomic ancestry are routinely used in human genetics, and related fields. The analysis of population structure and genomic ancestry can yield insights in terms of modern and ancient populations, allowing us to address questions regarding admixture, and the numbers and identities of the parental source populations. Unrecognized population structure is also an important confounder to correct for in genome-wide association studies. However, it remains challenging to work with heterogeneous datasets from multiple studies collected by different laboratories with diverse genotyping and imputation protocols. This work presents a new approach and an accompanying open-source toolbox that facilitates a robust integrative analysis for population structure and genomic ancestry estimates for heterogeneous datasets. We show robustness against individual outliers and different protocols for the projection of new samples into a reference ancestry space, and the ability to reveal and adjust for population structure in a simulated case-control admixed population. Given that visually evident and easily recognizable patterns of human facial characteristics co-vary with genomic ancestry, and based on the integration of three different sources of genome data, we generate average 3D faces to illustrate genomic ancestry variations within the 1,000 Genome project and for eight ancient-DNA profiles, respectively.
Assuntos
Identificação Biométrica/métodos , Face/anatomia & histologia , Genoma Humano , Genética Humana/métodos , Padrões de Herança , Modelos Estatísticos , Conjuntos de Dados como Assunto , Reconhecimento Facial/fisiologia , Feminino , Genética Populacional/métodos , Estudo de Associação Genômica Ampla , História do Século XXI , História Antiga , Humanos , Processamento de Imagem Assistida por Computador , Masculino , Grupos Raciais/históriaRESUMO
Forensic DNA Phenotyping (FDP) provides the ability to predict externally visible characteristics from minute amounts of crime scene DNA, which can help find unknown perpetrators who are typically unidentifiable via conventional forensic DNA profiling. Fundamental human genetics research has led to a better understanding of the specific DNA variants responsible for physical appearance characteristics, particularly eye, hair, and skin color. Recently, we introduced the HIrisPlex-S system for the simultaneous prediction of eye, hair, and skin color based on 41 DNA variants generated from two forensically validated SNaPshot multiplex assays using capillary electrophoresis (CE). Here we introduce massively parallel sequencing (MPS) solutions for the HIrisPlex-S (HPS) system on two MPS platforms commonly used in forensics, Ion Torrent and MiSeq, that cover all 41 DNA variants in a single assay, respectively. Additionally, we present the forensic developmental validation of the two HPS-MPS assays. The Ion Torrent MPS assay, based on Ion AmpliSeq technology, illustrated the successful generation of full HIrisPlex-S genotypic profiles from 100â¯pg of input control DNA, while the MiSeq MPS assay based on an in-house design yielded complete profiles from 250â¯pg of input DNA. Assessing simulated forensic casework samples such as saliva, hair (bulb), blood, semen, and low quantity touch DNA, as well as artificially damaged DNA samples, concordance testing, and samples from numerous species, all illustrated the ability of both versions of the HIrisPlex-S MPS assay to produce results that motivate forensic applications. By also providing an integrated bioinformatics analysis pipeline, MPS data can now be analyzed and a file generated for upload to the publically accessible HIrisPlex online webtool (https://hirisplex.erasmusmc.nl). In addition, we updated the website to accept VCF input data for those with genome sequence data. We thus provide a user-friendly and semi-automated MPS workflow from DNA sample to individual eye, hair, and skin color prediction probabilities. Furthermore, we present a 2-person mixture separation tool that not only assesses genotype reliability with regards genotyping confidence but also provides the most fitting mixture scenario for both minor and major contributors, including profile separation. We envision this MPS implementation of the HIrisPlex-S system for eye, hair, and skin color prediction from DNA as a starting point for further expanding MPS-based forensic DNA phenotyping. This may include the future addition of SNPs predictive for more externally visible characteristics, as well as SNPs for bio-geographic ancestry inference, provided the statistical framework for DNA prediction of these traits is in place.