Towards a reference genome that captures global genetic diversity.
Wong, Karen H Y; Ma, Walfred; Wei, Chun-Yu; Yeh, Erh-Chan; Lin, Wan-Jia; Wang, Elin H F; Su, Jen-Ping; Hsieh, Feng-Jen; Kao, Hsiao-Jung; Chen, Hsiao-Huei; Chow, Stephen K; Young, Eleanor; Chu, Catherine; Poon, Annie; Yang, Chi-Fan; Lin, Dar-Shong; Hu, Yu-Feng; Wu, Jer-Yuarn; Lee, Ni-Chung; Hwu, Wuh-Liang; Boffelli, Dario; Martin, David; Xiao, Ming; Kwok, Pui-Yan.
  • Wong KHY; Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA, 94158, USA.
  • Ma W; Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA, 94158, USA.
  • Wei CY; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Yeh EC; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Lin WJ; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Wang EHF; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Su JP; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Hsieh FJ; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Kao HJ; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Chen HH; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Chow SK; Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA, 94158, USA.
  • Young E; School of Biomedical Engineering, Drexel University, Philadelphia, PA, 19104, USA.
  • Chu C; Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, 94143, USA.
  • Poon A; Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, 94143, USA.
  • Yang CF; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Lin DS; Department of Pediatrics, Mackay Memorial Hospital, Taipei, Taiwan.
  • Hu YF; Department of Medicine, Mackay Medical College, New Taipei, Taiwan.
  • Wu JY; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Lee NC; Department of Internal Medicine, Taipei Veterans General Hospital, Taipei, Taiwan.
  • Hwu WL; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
  • Boffelli D; Departments of Pediatrics and Medical Genetics, National Taiwan University Hospital, Taipei, Taiwan.
  • Martin D; Departments of Pediatrics and Medical Genetics, National Taiwan University Hospital, Taipei, Taiwan.
  • Xiao M; Children's Hospital Oakland Research Institute, Oakland, CA, 94609, USA.
  • Kwok PY; Children's Hospital Oakland Research Institute, Oakland, CA, 94609, USA.
Nat Commun; 11(1): 5482, 2020 Oct 30.
Article in English | MEDLINE | ID: mdl-33127893
The current human reference genome is predominantly derived from a single individual and it does not adequately reflect human genetic diversity. Here, we analyze 338 high-quality human assemblies of genetically divergent human populations to identify missing sequences in the human reference genome with breakpoint resolution. We identify 127,727 recurrent non-reference unique insertions spanning 18,048,877 bp, some of which disrupt exons and known regulatory elements. To improve genome annotations, we linearly integrate these sequences into the chromosomal assemblies and construct a Human Diversity Reference. Leveraging this reference, an average of 402,573 previously unmapped reads can be recovered for a given genome sequenced to ~40X coverage. Transcriptomic diversity among these non-reference sequences can also be directly assessed. We successfully map tens of thousands of previously discarded RNA-Seq reads to this reference and identify transcription evidence in 4781 gene loci, underlining the importance of these non-reference sequences in functional genomics. Our extensive datasets are important advances toward a comprehensive reference representation of global human genetic diversity.

Full text: 1 Database: MEDLINE Main subject: Population / Genetic Variation / Human Genome Limits: Humans Language: English Year: 2020 Document type: Article