An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.

Wang, Li-Ju; Zhang, Catherine W; Su, Sophia C; Chen, Hung-I H; Chiu, Yu-Chiao; Lai, Zhao; Bouamar, Hakim; Ramirez, Amelie G; Cigarroa, Francisco G; Sun, Lu-Zhe; Chen, Yidong

Wang, Li-Ju; Zhang, Catherine W; Su, Sophia C; Chen, Hung-I H; Chiu, Yu-Chiao; Lai, Zhao; Bouamar, Hakim; Ramirez, Amelie G; Cigarroa, Francisco G; Sun, Lu-Zhe; Chen, Yidong.

Afiliação

Wang LJ; Greehey Children's Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Zhang CW; Greehey Children's Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Su SC; Greehey Children's Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Chen HH; Greehey Children's Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Chiu YC; Greehey Children's Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Lai Z; Greehey Children's Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Bouamar H; Department of Molecular Medicine, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Ramirez AG; Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Cigarroa FG; Department of Population Health Sciences, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Sun LZ; Institute for Health Promotion Research, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.
Chen Y; Department of Surgery, University of Texas Health San Antonio, San Antonio, TX, 78229, USA.

BMC Genomics ; 20(Suppl 12): 1007, 2019 Dec 30.

Article em En | MEDLINE | ID: mdl-31888480

RESUMO

BACKGROUND: Europeans and American Indians were major genetic ancestry of Hispanics in the U.S. These ancestral groups have markedly different incidence rates and outcomes in many types of cancers. Therefore, the genetic admixture may cause biased genetic association study with cancer susceptibility variants specifically in Hispanics. For example, the incidence rate of liver cancer has been shown with substantial disparity between Hispanic, Asian and non-Hispanic white populations. Currently, ancestry informative marker (AIM) panels have been widely utilized with up to a few hundred ancestry-informative single nucleotide polymorphisms (SNPs) to infer ancestry admixture. Notably, current available AIMs are predominantly located in intron and intergenic regions, while the whole exome sequencing (WES) protocols commonly used in translational research and clinical practice do not cover these markers. Thus, it remains challenging to accurately determine a patient's admixture proportion without additional DNA testing. RESULTS: In this study we designed an unique AIM panel that infers 3-way genetic admixture from three distinct and selective continental populations (African (AFR), European (EUR), and East Asian (EAS)) within evolutionarily conserved exonic regions. Initially, about 1 million exonic SNPs from selective three populations in the 1000 Genomes Project were trimmed by their linkage disequilibrium (LD), restricted to biallelic variants, and finally we optimized to an AIM panel with 250 SNP markers, or the UT-AIM250 panel, using their ancestral informativeness statistics. Comparing to published AIM panels, UT-AIM250 performed better accuracy when we tested with three ancestral populations (accuracy: 0.995 ± 0.012 for AFR, 0.997 ± 0.007 for EUR, and 0.994 ± 0.012 for EAS). We further demonstrated the performance of the UT-AIM250 panel to admixed American (AMR) samples of the 1000 Genomes Project and obtained similar results (AFR, 0.085 ± 0.098; EUR, 0.665 ± 0.182; and EAS, 0.250 ± 0.205) to previously published AIM panels (Phillips-AIM34: AFR, 0.096 ± 0.127, EUR, 0.575 ± 0.290, and EAS, 0.330 ± 0.315; Wei-AIM278: AFR, 0.070 ± 0.096, EUR, 0.537 ± 0.267, and EAS, 0.393 ± 0.300). Subsequently, we applied the UT-AIM250 panel to a clinical dataset of 26 self-reported Hispanic patients in South Texas with hepatocellular carcinoma (HCC). We estimated the admixture proportions using WES data of adjacent non-cancer liver tissues (AFR, 0.065 ± 0.043; EUR, 0.594 ± 0.150; and EAS, 0.341 ± 0.160). Similar admixture proportions were identified from corresponding tumor tissues. In addition, we estimated admixture proportions of The Cancer Genome Atlas (TCGA) collection of hepatocellular carcinoma (TCGA-LIHC) samples (376 patients) using the UT-AIM250 panel. The panel obtained consistent admixture proportions from tumor and matched normal tissues, identified 3 possible incorrectly reported race/ethnicity, and/or provided race/ethnicity determination if necessary. CONCLUSIONS: Here we demonstrated the feasibility of using evolutionarily conserved exonic regions to infer admixture proportions and provided a robust and reliable control for sample collection or patient stratification for genetic analysis. R implementation of UT-AIM250 is available at https://github.com/chenlabgccri/UT-AIM250.

Assuntos

Genoma Humano/genética; Estudo de Associação Genômica Ampla/métodos; Hispânico ou Latino/genética; Carcinoma Hepatocelular/etnologia; Carcinoma Hepatocelular/genética; Etnicidade/genética; Éxons/genética; Frequência do Gene; Testes Genéticos; Genética Populacional; Genótipo; Humanos; Neoplasias Hepáticas/etnologia; Neoplasias Hepáticas/genética; Polimorfismo de Nucleotídeo Único; Software

Palavras-chave

Admixture; Ancestry Informative Markers (AIMs); Hepatocellular carcinoma; Hispanics population; STRUCTURE; Whole exome sequencing

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Hispânico ou Latino / Genoma Humano / Estudo de Associação Genômica Ampla Tipo de estudo: Guideline / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google