Your browser doesn't support javascript.
loading
Functional filter for whole-genome sequencing data identifies HHT and stress-associated non-coding SMAD4 polyadenylation site variants >5 kb from coding DNA.
Xiao, Sihao; Kai, Zhentian; Murphy, Daniel; Li, Dongyang; Patel, Dilip; Bielowka, Adrianna M; Bernabeu-Herrero, Maria E; Abdulmogith, Awatif; Mumford, Andrew D; Westbury, Sarah K; Aldred, Micheala A; Vargesson, Neil; Caulfield, Mark J; Shovlin, Claire L.
Afiliação
  • Xiao S; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK. Electronic address: sihao.xiao@bnc.ox.ac.uk.
  • Kai Z; Topgen Biopharm Technology Co. Ltd., Shanghai 201203, China.
  • Murphy D; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK; Women's, Children's & Clinical Support (Pharmacy), Imperial College Healthcare NHS Trust, W2 1NY London, UK.
  • Li D; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK.
  • Patel D; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK.
  • Bielowka AM; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK.
  • Bernabeu-Herrero ME; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK.
  • Abdulmogith A; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK.
  • Mumford AD; School of Cellular and Molecular Medicine, University of Bristol, BS8 1QU Bristol, UK.
  • Westbury SK; School of Cellular and Molecular Medicine, University of Bristol, BS8 1QU Bristol, UK.
  • Aldred MA; Division of Pulmonary, Critical Care, Sleep & Occupational Medicine, Indiana University School of Medicine, Indianapolis, IN 46202, USA.
  • Vargesson N; School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, AB25 2ZD Aberdeen, UK.
  • Caulfield MJ; William Harvey Research Institute, Queen Mary University of London, E1 4NS London, UK.
  • Shovlin CL; National Heart and Lung Institute, Imperial College London, W12 ONN London, UK; National Institute for Health Research (NIHR) Imperial Biomedical Research Centre, W2 1NY London, UK; Specialist Medicine, Imperial College Healthcare NHS Trust, W12 OHS London, UK. Electronic address: c.shovlin@imperial
Am J Hum Genet ; 110(11): 1903-1918, 2023 11 02.
Article em En | MEDLINE | ID: mdl-37816352
ABSTRACT
Despite whole-genome sequencing (WGS), many cases of single-gene disorders remain unsolved, impeding diagnosis and preventative care for people whose disease-causing variants escape detection. Since early WGS data analytic steps prioritize protein-coding sequences, to simultaneously prioritize variants in non-coding regions rich in transcribed and critical regulatory sequences, we developed GROFFFY, an analytic tool that integrates coordinates for regions with experimental evidence of functionality. Applied to WGS data from solved and unsolved hereditary hemorrhagic telangiectasia (HHT) recruits to the 100,000 Genomes Project, GROFFFY-based filtration reduced the mean number of variants/DNA from 4,867,167 to 21,486, without deleting disease-causal variants. In three unsolved cases (two related), GROFFFY identified ultra-rare deletions within the 3' untranslated region (UTR) of the tumor suppressor SMAD4, where germline loss-of-function alleles cause combined HHT and colonic polyposis (MIM 175050). Sited >5.4 kb distal to coding DNA, the deletions did not modify or generate microRNA binding sites, but instead disrupted the sequence context of the final cleavage and polyadenylation site necessary for protein production By iFoldRNA, an AAUAAA-adjacent 16-nucleotide deletion brought the cleavage site into inaccessible neighboring secondary structures, while a 4-nucleotide deletion unfolded the downstream RNA polymerase II roadblock. SMAD4 RNA expression differed to control-derived RNA from resting and cycloheximide-stressed peripheral blood mononuclear cells. Patterns predicted the mutational site for an unrelated HHT/polyposis-affected individual, where a complex insertion was subsequently identified. In conclusion, we describe a functional rare variant type that impacts regulatory systems based on RNA polyadenylation. Extension of coding sequence-focused gene panels is required to capture these variants.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Telangiectasia Hemorrágica Hereditária / Proteína Smad4 Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Am J Hum Genet Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Telangiectasia Hemorrágica Hereditária / Proteína Smad4 Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Am J Hum Genet Ano de publicação: 2023 Tipo de documento: Article