RESUMEN
Rare genetic diseases affect 5-8% of the population but are often undiagnosed or misdiagnosed. Electronic health records (EHR) contain large amounts of data, which provide opportunities for analysing and mining. Data mining, in the form of cluster analysis and visualisation, was performed on a database containing deidentified health records of 1.28 million patients across 3 major hospitals in Singapore, in a bid to improve the diagnostic process for patients who are living with an undiagnosed rare disease, specifically focusing on Fabry Disease and Familial Hypercholesterolaemia (FH). On a baseline of 4 patients, we identified 2 additional patients with potential diagnosis of Fabry disease, suggesting a potential 50% increase in diagnosis. Similarly, we identified > 12,000 individuals who fulfil the clinical and laboratory criteria for FH but had not been diagnosed previously. This proof-of-concept study showed that it is possible to perform mining on EHR data albeit with some challenges and limitations.
Asunto(s)
Enfermedad de Fabry , Hiperlipoproteinemia Tipo II , Enfermedades no Diagnosticadas , Humanos , Enfermedades Raras/diagnóstico , Enfermedades Raras/epidemiología , Enfermedades Raras/genética , Registros Electrónicos de Salud , Hiperlipoproteinemia Tipo II/genética , Análisis por ConglomeradosRESUMEN
Neovascular age-related macular degeneration (nAMD), along with its clinical subtype known as polypoidal choroidal vasculopathy (PCV), are among the leading causes of vision loss in elderly Asians. In a genome-wide association study (GWAS) comprising 3,128 nAMD (1,555 PCV and 1,573 typical nAMD), and 5,493 controls of East Asian ancestry, we identify twelve loci, of which four are novel ([Formula: see text]). Substantial genetic sharing between PCV and typical nAMD is noted (rg = 0.666), whereas collagen extracellular matrix and fibrosis-related pathways are more pronounced for PCV. Whole-exome sequencing in 259 PCV patients revealed functional rare variants burden in collagen type I alpha 1 chain gene (COL1A1; [Formula: see text]) and potential enrichment of functional rare mutations at AMD-associated loci. At the GATA binding protein 5 (GATA5) locus, the most significant GWAS novel loci, the expressions of genes including laminin subunit alpha 5 (Lama5), mitochondrial ribosome associated GTPase 2 (Mtg2), and collagen type IX alpha 3 chain (Col9A3), are significantly induced during retinal angiogenesis and subretinal fibrosis in murine models. Furthermore, retinoic acid increased the expression of LAMA5 and MTG2 in vitro. Taken together, our data provide insights into the genetic basis of AMD pathogenesis in the Asian population.
Asunto(s)
Degeneración Macular , Vasculopatía Coroidea Polipoidea , Anciano , Animales , Humanos , Ratones , Asiático , Pueblos del Este de Asia , Matriz Extracelular/genética , Estudio de Asociación del Genoma Completo , Degeneración Macular/genética , Vasculopatía Coroidea Polipoidea/genética , Modelos Animales de EnfermedadRESUMEN
Genomic researchers increasingly utilize commercial cloud service providers (CSPs) to manage data and analytics needs. CSPs allow researchers to grow Information Technology (IT) infrastructure on demand to overcome bottlenecks when combining large datasets. However, without adequate security controls, the risk of unauthorized access may be higher for data stored on the cloud. Additionally, regulators are mandating data access patterns and specific security protocols for the storage and use of genomic data. While CSP provides tools for security and regulatory compliance, building the necessary controls required for cloud solutions is not trivial. Research Assets Provisioning and Tracking Online Repository (RAPTOR) by the Genome Institute of Singapore is a cloud-native genomics data repository and analytics platform that implements a "five-safes" framework to provide security and governance controls to data contributors and users, leveraging CSP for sharing and analysis of genomic datasets without the risk of security breaches or running afoul of regulations.
RESUMEN
Precision medicine promises to transform healthcare for groups and individuals through early disease detection, refining diagnoses and tailoring treatments. Analysis of large-scale genomic-phenotypic databases is a critical enabler of precision medicine. Although Asia is home to 60% of the world's population, many Asian ancestries are under-represented in existing databases, leading to missed opportunities for new discoveries, particularly for diseases most relevant for these populations. The Singapore National Precision Medicine initiative is a whole-of-government 10-year initiative aiming to generate precision medicine data of up to one million individuals, integrating genomic, lifestyle, health, social and environmental data. Beyond technologies, routine adoption of precision medicine in clinical practice requires social, ethical, legal and regulatory barriers to be addressed. Identifying driver use cases in which precision medicine results in standardized changes to clinical workflows or improvements in population health, coupled with health economic analysis to demonstrate value-based healthcare, is a vital prerequisite for responsible health system adoption.
Asunto(s)
Atención a la Salud , Medicina de Precisión , Humanos , Singapur , Medicina de Precisión/métodos , AsiaRESUMEN
Asian populations are under-represented in human genomics research. Here, we characterize clinically significant genetic variation in 9051 genomes representing East Asian, South Asian, and severely under-represented Austronesian-speaking Southeast Asian ancestries. We observe disparate genetic risk burden attributable to ancestry-specific recurrent variants and identify individuals with variants specific to ancestries discordant to their self-reported ethnicity, mostly due to cryptic admixture. About 27% of severe recessive disorder genes with appreciable carrier frequencies in Asians are missed by carrier screening panels, and we estimate 0.5% Asian couples at-risk of having an affected child. Prevalence of medically-actionable variant carriers is 3.4% and a further 1.6% harbour variants with potential for pathogenic classification upon additional clinical/experimental evidence. We profile 23 pharmacogenes with high-confidence gene-drug associations and find 22.4% of Asians at-risk of Centers for Disease Control and Prevention Tier 1 genetic conditions concurrently harbour pharmacogenetic variants with actionable phenotypes, highlighting the benefits of pre-emptive pharmacogenomics. Our findings illuminate the diversity in genetic disease epidemiology and opportunities for precision medicine for a large, diverse Asian population.
Asunto(s)
Pueblo Asiatico , Genoma Humano , Niño , Humanos , Pueblo Asiatico/genética , Genoma Humano/genética , Etnicidad , Farmacogenética , FenotipoRESUMEN
Neisseria meningitidis protects itself from complement-mediated killing by binding complement factor H (FH). Previous studies associated susceptibility to meningococcal disease (MD) with variation in CFH, but the causal variants and underlying mechanism remained unknown. Here we attempted to define the association more accurately by sequencing the CFH-CFHR locus and imputing missing genotypes in previously obtained GWAS datasets of MD-affected individuals of European ancestry and matched controls. We identified a CFHR3 SNP that provides protection from MD (rs75703017, p value = 1.1 × 10-16) by decreasing the concentration of FH in the blood (p value = 1.4 × 10-11). We subsequently used dual-luciferase studies and CRISPR gene editing to establish that deletion of rs75703017 increased FH expression in hepatocyte by preventing promotor inhibition. Our data suggest that reduced concentrations of FH in the blood confer protection from MD; with reduced access to FH, N. meningitidis is less able to shield itself from complement-mediated killing.
Asunto(s)
Factor H de Complemento , Infecciones Meningocócicas , Proteínas Sanguíneas/genética , Factor H de Complemento/genética , Proteínas del Sistema Complemento/genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Infecciones Meningocócicas/genéticaRESUMEN
BACKGROUND: Consumer-grade wearable devices enable detailed recordings of heart rate and step counts in free-living conditions. Recent studies have shown that summary statistics from these wearable recordings have potential uses for longitudinal monitoring of health and disease states. However, the relationship between higher resolution physiological dynamics from wearables and known markers of health and disease remains largely uncharacterized. OBJECTIVE: We aimed to derive high-resolution digital phenotypes from observational wearable recordings and to examine their associations with modifiable and inherent markers of cardiometabolic disease risk. METHODS: We introduced a principled framework to extract interpretable high-resolution phenotypes from wearable data recorded in free-living conditions. The proposed framework standardizes the handling of data irregularities; encodes contextual information regarding the underlying physiological state at any given time; and generates a set of 66 minimally redundant features across active, sedentary, and sleep states. We applied our approach to a multimodal data set, from the SingHEART study (NCT02791152), which comprises heart rate and step count time series from wearables, clinical screening profiles, and whole genome sequences from 692 healthy volunteers. We used machine learning to model nonlinear relationships between the high-resolution phenotypes on the one hand and clinical or genomic risk markers for blood pressure, lipid, weight and sugar abnormalities on the other. For each risk type, we performed model comparisons based on Brier scores to assess the predictive value of high-resolution features over and beyond typical baselines. We also qualitatively characterized the wearable phenotypes for participants who had actualized clinical events. RESULTS: We found that the high-resolution features have higher predictive value than typical baselines for clinical markers of cardiometabolic disease risk: the best models based on high-resolution features had 17.9% and 7.36% improvement in Brier score over baselines based on age and gender and resting heart rate, respectively (P<.001 in each case). Furthermore, heart rate dynamics from different activity states contain distinct information (maximum absolute correlation coefficient of 0.15). Heart rate dynamics in sedentary states are most predictive of lipid abnormalities and obesity, whereas patterns in active states are most predictive of blood pressure abnormalities (P<.001). Moreover, in comparison with standard measures, higher resolution patterns in wearable heart rate recordings are better able to represent subtle physiological dynamics related to genomic risk for cardiometabolic disease (improvement of 11.9%-22.0% in Brier scores; P<.001). Finally, illustrative case studies reveal connections between these high-resolution phenotypes and actualized clinical events, even for borderline profiles lacking apparent cardiometabolic risk markers. CONCLUSIONS: High-resolution digital phenotypes recorded by consumer wearables in free-living states have the potential to enhance the prediction of cardiometabolic disease risk and could enable more proactive and personalized health management.
Asunto(s)
Enfermedades Cardiovasculares , Dispositivos Electrónicos Vestibles , Enfermedades Cardiovasculares/diagnóstico , Estudios Clínicos como Asunto , Estudios de Cohortes , Humanos , Lípidos , Aprendizaje Automático , FenotipoRESUMEN
Charcot-Marie-Tooth type 1A (CMT1A) is typically characterised as a childhood-onset, symmetrical, length-dependent polyneuropathy with a gradual progressive clinical course. Acute to subacute neurological deterioration in CMT1A is rare, and has been reported secondary to overlap pathologies including inflammatory neuropathy. We identified two patients with CMT1A who presented with acute to subacute, atraumatic, entrapment neuropathies as an initial symptom. A superimposed inflammatory neuropathy was excluded. Both patients had a diffuse demyelinating polyneuropathy, with markedly low motor nerve conduction velocities (<20 m/s). In both patients, we demonstrated symptomatic and asymptomatic partial conduction blocks at multiple entrapment sites. Nerve ultrasound findings in our patients demonstrated marked diffuse nerve enlargement, more pronounced at non-entrapment sites compared to entrapment sites. We discuss ways to distinguish this condition from its other differentials. We propose pathophysiological mechanisms underlying this condition. We propose that CMT1A with acute to subacute, atraumatic, entrapment neuropathies to be a distinct phenotypic variant of CMT1A.
RESUMEN
Multinational studies have reported monogenic etiologies in 25%-30% of children with steroid-resistant nephrotic syndrome. Such large studies are lacking in Asia. We established Deciphering Diversities: Renal Asian Genetics Network (DRAGoN) and aimed to describe the genetic and clinical spectrums in Asians. We prospectively studied a cohort of 183 probands with suspected genetic glomerulopathies from South and Southeast Asia, of whom 17% had positive family history. Using multi-gene panel sequencing, we detected pathogenic variants in 26 (14%) probands, of whom one-third had COL4A4 or COL4A5 variants (n = 9, 5%). Of those with COL4A5 defects, only 25% had features suggestive of Alport syndrome. Besides traditional predictors for genetic disease (positive family history and extrarenal malformations), we identified novel predictors, namely older age (6.2 vs. 2.4 years; p = 0.001), hematuria (OR 5.6; 95% CI 2.1-14.8; p < 0.001), and proteinuria in the absence of nephrotic syndrome (OR 4.6; 95% CI 1.8-11.8; p = 0.001) at first manifestation. Among patients who first presented with proteinuria without nephrotic syndrome, the genetic diagnostic rates were >60% when a second risk factor (positive family history or extrarenal manifestation) co-existed. The genetic spectrum of glomerulopathies appears different in Asia. Collagen IV genes may be included in sequencing panels even when suggestive clinical features are absent.
Asunto(s)
Nefritis Hereditaria , Síndrome Nefrótico , Pueblo Asiatico/genética , Niño , Colágeno Tipo IV/genética , Femenino , Humanos , Masculino , Mutación , Nefritis Hereditaria/diagnóstico , Síndrome Nefrótico/genética , ProteinuriaRESUMEN
The role of low frequency variants associated with telomere length homeostasis in chronic diseases and mortalities is relatively understudied in the East-Asian population. Here we evaluated low frequency variants, including 1,915,154 Asian specific variants, for leukocyte telomere length (LTL) associations among 25,533 Singapore Chinese samples. Three East Asian specific variants in/near POT1, TERF1 and STN1 genes are associated with LTL (Meta-analysis P 2.49×10-14-6.94×10-10). Rs79314063, a missense variant (p.Asp410His) at POT1, shows effect 5.3 fold higher and independent of a previous common index SNP. TERF1 (rs79617270) and STN1 (rs139620151) are linked to LTL-associated common index SNPs at these loci. Rs79617270 is associated with cancer mortality [HR95%CI = 1.544 (1.173, 2.032), PAdj = 0.018] and 4.76% of the association between the rs79617270 and colon cancer is mediated through LTL. Overall, genetically determined LTL is particularly associated with lung adenocarcinoma [HR95%CI = 1.123 (1.051, 1.201), Padj = 0.007]. Ethnicity-specific low frequency variants may affect LTL homeostasis and associate with certain cancers.
Asunto(s)
Leucocitos/patología , Neoplasias/epidemiología , Polimorfismo de Nucleótido Simple , Homeostasis del Telómero , Proteínas de Unión a Telómeros/genética , Adulto , Anciano , Anciano de 80 o más Años , Pueblo Asiatico/genética , Enfermedad Crónica , Estudios Transversales , Femenino , Humanos , Leucocitos/metabolismo , Masculino , Persona de Mediana Edad , Neoplasias/genética , Neoplasias/patología , Estudios Prospectivos , Complejo Shelterina , Singapur/epidemiología , Adulto JovenRESUMEN
BACKGROUND: Family history has traditionally been an essential part of clinical care to assess health risks. However, declining sequencing costs have precipitated a shift towards genomics-first approaches in population screening programs rendering the value of family history unknown. We evaluated the utility of incorporating family history information for genomic sequencing selection. METHODS: To ascertain the relationship between family histories on such population-level initiatives, we analysed whole genome sequences of 1750 research participants with no known pre-existing conditions, of which half received comprehensive family history assessment of up to four generations, focusing on 95 cancer genes. RESULTS: Amongst the 1750 participants, 866 (49.5%) had high-quality standardised family history available. Within this group, 73 (8.4%) participants had an increased family history risk of cancer (increased FH risk cohort) and 1 in 7 participants (n = 10/73) carried a clinically actionable variant inferring a sixfold increase compared with 1 in 47 participants (n = 17/793) assessed at average family history cancer risk (average FH risk cohort) (p = 0.00001) and a sevenfold increase compared to 1 in 52 participants (n = 17/884) where family history was not available (FH not available cohort) (p = 0.00001). The enrichment was further pronounced (up to 18-fold) when assessing only the 25 cancer genes in the American College of Medical Genetics (ACMG) Secondary Findings (SF) genes. Furthermore, 63 (7.3%) participants had an increased family history cancer risk in the absence of an apparent clinically actionable variant. CONCLUSIONS: These findings demonstrate that the collection and analysis of comprehensive family history and genomic data are complementary and in combination can prioritise individuals for genomic analysis. Thus, family history remains a critical component of health risk assessment, providing important actionable data when implementing genomics screening programs. TRIAL REGISTRATION: ClinicalTrials.gov NCT02791152 . Retrospectively registered on May 31, 2016.
Asunto(s)
Atención a la Salud , Genómica , Anamnesis , Medicina de Precisión , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Femenino , Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Persona de Mediana Edad , Factores de Riesgo , Adulto JovenRESUMEN
Information technology applications for patient-collection of family health history (FHH) increase identification of elevated-risk individuals compared to usual care. It is unknown if the method of collection impacts data collected or if simply going directly to the patient is what makes the difference. The objective of this study was to examine differences in data detail and risk identification rates between FHH collection directly from individuals using paper-based forms and an interactive web-based platform. This is a non-randomized epidemiologic study in Singaporean population from 2016 to 2018. Intervention was paper-based versus web-based interactive platform for FHH collection. Participant demographics, FHH detail, and risk assessment results were analyzed. 882 participants enrolled in the study, 481 in the paper-based group and 401 in the web-based group with mean (SD) age of 45.4 (12.98) years and 47.5% male. Web-based FHH collection participants had an increased number of conditions per relative (p-value <0.001), greater frequency of reporting age of onset (p-value <0.001), and greater odds of receiving ≥1 risk recommendation both overall (OR: 3.99 (2.41, 6.59)) and within subcategories of genetic counselling for hereditary cancer syndromes (p-value = 0.041) and screening and prevention for breast (p-value = 0.002) and colon cancer (p-value = 0.005). This has significant implications for clinical care and research efforts where FHH is being assessed. Using interactive information technology platforms to collect FHH can improve the completeness of the data collected and result in increased rates of risk identification. Methods of data collection to maximize benefit should be taken into account in future studies and clinical care.
RESUMEN
BACKGROUND: Cardiovascular disease (CVD) imposes much mortality and morbidity worldwide. The use of "deep learning", advancements in genomics, metabolomics, proteomics and devices like wearables have the potential to unearth new insights in the field of cardiology. Currently, in Asia, there are no studies that combine the use of conventional clinical information with these advanced technologies. We aim to harness these new technologies to understand the development of cardiovascular disease in Asia. METHODS: Singapore is a multi-ethnic country in Asia with well-represented diverse ethnicities including Chinese, Malays and Indians. The SingHEART study is the first technology driven multi-ethnic prospective population-based study of healthy Asians. Healthy male and female subjects aged 21-69 years old without any prior cardiovascular disease or diabetes mellitus will be recruited from the general population. All subjects are consented to undergo a detailed on-line questionnaire, basic blood investigations, resting and continuous electrocardiogram and blood pressure monitoring, activity and sleep tracking, calcium score, cardiac magnetic resonance imaging, whole genome sequencing and lipidomic analysis. Outcomes studied will include mortality and cause of mortality, myocardial infarction, stroke, malignancy, heart failure, and the development of co-morbidities. DISCUSSION: An initial target of 2500 patients has been set. From October 2015 to May 2017, an initial 683 subjects have been recruited and have completed the initial work-up the SingHEART project is the first contemporary population-based study in Asia that will include whole genome sequencing and deep phenotyping: including advanced imaging and wearable data, to better understand the development of cardiovascular disease across different ethnic groups in Asia.
Asunto(s)
Pueblo Asiatico/genética , Enfermedades Cardiovasculares/genética , Adulto , Anciano , Enfermedades Cardiovasculares/diagnóstico , Enfermedades Cardiovasculares/etnología , Enfermedades Cardiovasculares/prevención & control , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Estado de Salud , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Persona de Mediana Edad , Fenotipo , Valor Predictivo de las Pruebas , Estudios Prospectivos , Medición de Riesgo , Factores de Riesgo , Singapur/epidemiología , Secuenciación Completa del Genoma , Adulto JovenRESUMEN
Sleep is associated with various health outcomes. Despite their growing adoption, the potential for consumer wearables to contribute sleep metrics to sleep-related biomedical research remains largely uncharacterized. Here we analyzed sleep tracking data, along with questionnaire responses and multi-modal phenotypic data generated from 482 normal volunteers. First, we compared wearable-derived and self-reported sleep metrics, particularly total sleep time (TST) and sleep efficiency (SE). We then identified demographic, socioeconomic and lifestyle factors associated with wearable-derived TST; they included age, gender, occupation and alcohol consumption. Multi-modal phenotypic data analysis showed that wearable-derived TST and SE were associated with cardiovascular disease risk markers such as body mass index and waist circumference, whereas self-reported measures were not. Using wearable-derived TST, we showed that insufficient sleep was associated with premature telomere attrition. Our study highlights the potential for sleep metrics from consumer wearables to provide novel insights into data generated from population cohort studies.
Asunto(s)
Envejecimiento , Enfermedades Cardiovasculares/epidemiología , Sueño , Adulto , Anciano , Índice de Masa Corporal , Estudios de Cohortes , Femenino , Humanos , Masculino , Persona de Mediana Edad , Factores de Riesgo , Autoinforme , Telómero , Circunferencia de la Cintura , Dispositivos Electrónicos Vestibles , Adulto JovenRESUMEN
Underrepresentation of Asian genomes has hindered population and medical genetics research on Asians, leading to population disparities in precision medicine. By whole-genome sequencing of 4,810 Singapore Chinese, Malays, and Indians, we found 98.3 million SNPs and small insertions or deletions, over half of which are novel. Population structure analysis demonstrated great representation of Asian genetic diversity by three ethnicities in Singapore and revealed a Malay-related novel ancestry component. Furthermore, demographic inference suggested that Malays split from Chinese â¼24,800 years ago and experienced significant admixture with East Asians â¼1,700 years ago, coinciding with the Austronesian expansion. Additionally, we identified 20 candidate loci for natural selection, 14 of which harbored robust associations with complex traits and diseases. Finally, we show that our data can substantially improve genotype imputation in diverse Asian and Oceanian populations. These results highlight the value of our data as a resource to empower human genetics discovery across broad geographic regions.
Asunto(s)
Genética de Población , Genoma Humano/genética , Selección Genética , Secuenciación Completa del Genoma , Pueblo Asiatico/genética , Femenino , Genotipo , Humanos , Malasia/epidemiología , Masculino , Polimorfismo de Nucleótido Simple/genética , Singapur/epidemiologíaRESUMEN
Whilst the underlying principles of precision medicine are comparable across the globe, genomic references, health practices, costs and discrimination policies differ in Asian settings compared to the reported initiatives involving European-derived populations. We have addressed these variables by developing an evolving reference base of genomic and phenotypic data and a framework to return medically significant variants to consenting research participants applicable for the Asian context. Targeting 10,000 participants, over 2000 Singaporeans, with no known pre-existing health conditions, have consented to an extensive clinical health screen, family health history collection, genome sequencing and ongoing follow-up. Genomic variants in a subset of genes associated with Mendelian disorders and drug responses are analysed using an in-house bioinformatics pipeline. A multidisciplinary team reviews the classification of variants and a research report is generated. Medically significant variants are returned to consenting participants through a bespoke return-of-result genomics clinic. Variant validation and subsequent clinical referral are advised as appropriate. The design and implementation of this flexible learning framework enables a cohort of detailed phenotyping and genotyping of healthy Singaporeans to be established and the frequency of disease-causing variants in this population to be determined. Our findings will contribute to international precision medicine initiatives, bridging gaps with ethnic-specific data and insights from this understudied population.
RESUMEN
Non-coding genetic variants play an important role in driving susceptibility to complex diseases but their characterization remains challenging. Here, we employed a novel approach to interrogate the genetic risk of such polymorphisms in a more systematic way by targeting specific regulatory regions relevant for the phenotype studied. We applied this method to meningococcal disease susceptibility, using the DNA binding pattern of RELA - a NF-kB subunit, master regulator of the response to infection - under bacterial stimuli in nasopharyngeal epithelial cells. We designed a custom panel to cover these RELA binding sites and used it for targeted sequencing in cases and controls. Variant calling and association analysis were performed followed by validation of candidate polymorphisms by genotyping in three independent cohorts. We identified two new polymorphisms, rs4823231 and rs11913168, showing signs of association with meningococcal disease susceptibility. In addition, using our genomic data as well as publicly available resources, we found evidences for these SNPs to have potential regulatory effects on ATXN10 and LIF genes respectively. The variants and related candidate genes are relevant for infectious diseases and may have important contribution for meningococcal disease pathology. Finally, we described a novel genetic association approach that could be applied to other phenotypes.