RESUMO
The article presents the analysis of whole genome sequence of a Gujarati Indian individual (IHGP01) that was sequenced at 23.05× coverage with a total of 74.93â¯Gb of sequence data generated using Illumina HiSeq 2000 platform. Variant analysis revealed over 3.9 million single nucleotide variants (SNVs) and about 393,000 small insertions and deletions (InDels) including novel variants. The known variants were analyzed for their health and disease relevance and pharmacogenomic profile. Mitochondrial and Y-chromosome haplogroup analysis clearly indicated arrival on the continent not more than 20,000-25,000â¯years ago, following the route out of Africa to central Europe, then into Asian continent and subsequent migration to West part of the Indian subcontinent. The current research has added 141,000 novel genetic variations to the human DNA database. Functional analysis and validation of these novel variations and revelation of their role in health and disease will add a newer dimension to understand people of this subcontinent.
Assuntos
Genoma Humano , Polimorfismo Genético , Sequenciamento Completo do Genoma , Cromossomos Humanos Y/genética , DNA Mitocondrial/genética , Haplótipos , Migração Humana , Humanos , Índia , MasculinoRESUMO
BACKGROUND: Pakistan covers a key geographic area in human history, being both part of the Indus River region that acted as one of the cradles of civilization and as a link between Western Eurasia and Eastern Asia. This region is inhabited by a number of distinct ethnic groups, the largest being the Punjabi, Pathan (Pakhtuns), Sindhi, and Baloch. RESULTS: We analyzed the first ethnic male Pathan genome by sequencing it to 29.7-fold coverage using the Illumina HiSeq2000 platform. A total of 3.8 million single nucleotide variations (SNVs) and 0.5 million small indels were identified by comparing with the human reference genome. Among the SNVs, 129,441 were novel, and 10,315 nonsynonymous SNVs were found in 5,344 genes. SNVs were annotated for health consequences and high risk diseases, as well as possible influences on drug efficacy. We confirmed that the Pathan genome presented here is representative of this ethnic group by comparing it to a panel of Central Asians from the HGDP-CEPH panels typed for ~650 k SNPs. The mtDNA (H2) and Y haplogroup (L1) of this individual were also typical of his geographic region of origin. Finally, we reconstruct the demographic history by PSMC, which highlights a recent increase in effective population size compatible with admixture between European and Asian lineages expected in this geographic region. CONCLUSIONS: We present a whole-genome sequence and analyses of an ethnic Pathan from the north-west province of Pakistan. It is a useful resource to understand genetic variation and human migration across the whole Asian continent.
Assuntos
Variação Genética , Genoma Humano , Cromossomos Humanos Y , DNA Mitocondrial/química , Demografia , Humanos , Masculino , Paquistão/etnologia , Análise de Sequência de DNARESUMO
BACKGROUND: The horse (Equus ferus caballus) is one of the earliest domesticated species and has played an important role in the development of human societies over the past 5,000 years. In this study, we characterized the genome of the Marwari horse, a rare breed with unique phenotypic characteristics, including inwardly turned ear tips. It is thought to have originated from the crossbreeding of local Indian ponies with Arabian horses beginning in the 12th century. RESULTS: We generated 101 Gb (~30 × coverage) of whole genome sequences from a Marwari horse using the Illumina HiSeq2000 sequencer. The sequences were mapped to the horse reference genome at a mapping rate of ~98% and with ~95% of the genome having at least 10 × coverage. A total of 5.9 million single nucleotide variations, 0.6 million small insertions or deletions, and 2,569 copy number variation blocks were identified. We confirmed a strong Arabian and Mongolian component in the Marwari genome. Novel variants from the Marwari sequences were annotated, and were found to be enriched in olfactory functions. Additionally, we suggest a potential functional genetic variant in the TSHZ1 gene (p.Ala344>Val) associated with the inward-turning ear tip shape of the Marwari horses. CONCLUSIONS: Here, we present an analysis of the Marwari horse genome. This is the first genomic data for an Asian breed, and is an invaluable resource for future studies of genetic variation associated with phenotypes and diseases in horses.
Assuntos
Genoma/genética , Genômica , Cavalos/genética , Análise de Sequência de DNA , Sequência de Aminoácidos , Animais , Evolução Molecular , Variação Genética , Genótipo , Humanos , Hibridização Genética , Masculino , Dados de Sequência Molecular , Fenótipo , Seleção Genética , Especificidade da EspécieRESUMO
The red-crowned crane (Grus japonensis) is an endangered, large-bodied crane native to East Asia. It is a traditional symbol of longevity and its long lifespan has been confirmed both in captivity and in the wild. Lifespan in birds is known to be positively correlated with body size and negatively correlated with metabolic rate, though the genetic mechanisms for the red-crowned crane's long lifespan have not previously been investigated. Using whole genome sequencing and comparative evolutionary analyses against the grey-crowned crane and other avian genomes, including the long-lived common ostrich, we identified redcrowned crane candidate genes with known associations with longevity. Among these are positively selected genes in metabolism and immunity pathways (NDUFA5, NDUFA8, NUDT12, SOD3, CTH , RPA1, PHAX, HNMT , HS2ST1 , PPCDC , PSTK CD8B, GP9, IL-9R, and PTPRC). Our analyses provide genetic evidence for low metabolic rate and longevity, accompanied by possible convergent adaptation signatures among distantly related large and long-lived birds. Finally, we identified low genetic diversity in the red-crowned crane, consistent with its listing as an endangered species, and this genome should provide a useful genetic resource for future conservation studies of this rare and iconic species.
Assuntos
Proteínas Aviárias/genética , Aves/fisiologia , Animais , Espécies em Perigo de Extinção , Imunidade/genética , Longevidade/genética , Polimorfismo Genético , Especificidade da Espécie , Transcriptoma , Sequenciamento Completo do GenomaRESUMO
The greater wax moth, Galleria mellonella, degrades wax and plastic molecules. Despite much interest, the genetic basis of these hallmark traits remains poorly understood. Herein, we assembled high-quality genome and transcriptome data from G. mellonella to investigate long-chain hydrocarbon wax metabolism strategies. Specific carboxylesterase and lipase and fatty-acid-metabolism-related enzymes in the G. mellonella genome are transcriptionally regulated during feeding on beeswax. Strikingly, G. mellonella lacking intestinal microbiota successfully decomposes long-chain fatty acids following wax metabolism, although the intestinal microbiome performs a supplementary role in short-chain fatty acid degradation. Notably, final wax derivatives were detected by gas chromatography even in the absence of gut microbiota. Our findings provide insight into wax moth adaptation and may assist in the development of unique wax-degradation strategies with a similar metabolic approach for a plastic molecule polyethylene biodegradation using organisms without intestinal microbiota.
Assuntos
Microbioma Gastrointestinal , Mariposas/metabolismo , Ceras/metabolismo , Animais , Evolução Molecular , Ácidos Graxos/metabolismo , Ácidos Graxos Voláteis/metabolismo , Genoma de Inseto , Larva/metabolismo , Larva/microbiologia , Mariposas/crescimento & desenvolvimento , Mariposas/microbiologia , Família Multigênica , TranscriptomaRESUMO
We report two Egyptian male genomes (EGP1 and EGP2) sequenced at ~ 30× sequencing depths. EGP1 had 4.7 million variants, where 198,877 were novel variants while EGP2 had 209,109 novel variants out of 4.8 million variants. The mitochondrial haplogroup of the two individuals were identified to be H7b1 and L2a1c, respectively. We also identified the Y haplogroup of EGP1 (R1b) and EGP2 (J1a2a1a2â¯>â¯P58â¯>â¯FGC11). EGP1 had a mutation in the NADH gene of the mitochondrial genome ND4 (m.11778â¯Gâ¯>â¯A) that causes Leber's hereditary optic neuropathy. Some SNPs shared by the two genomes were associated with an increased level of cholesterol and triglycerides, probably related with Egyptians obesity. Comparison of these genomes with African and Western-Asian genomes can provide insights on Egyptian ancestry and genetic history. This resource can be used to further understand genomic diversity and functional classification of variants as well as human migration and evolution across Africa and Western-Asia.
Assuntos
Variação Genética , Genoma Humano , Cromossomos Humanos Y , DNA Mitocondrial/química , Egito , Genômica , Humanos , Masculino , Mutação , Filogenia , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNARESUMO
High-coverage whole-genome sequencing data of a single ethnicity can provide a useful catalogue of population-specific genetic variations, and provides a critical resource that can be used to more accurately identify pathogenic genetic variants. We report a comprehensive analysis of the Korean population, and present the Korean National Standard Reference Variome (KoVariome). As a part of the Korean Personal Genome Project (KPGP), we constructed the KoVariome database using 5.5 terabases of whole genome sequence data from 50 healthy Korean individuals in order to characterize the benign ethnicity-relevant genetic variation present in the Korean population. In total, KoVariome includes 12.7M single-nucleotide variants (SNVs), 1.7M short insertions and deletions (indels), 4K structural variations (SVs), and 3.6K copy number variations (CNVs). Among them, 2.4M (19%) SNVs and 0.4M (24%) indels were identified as novel. We also discovered selective enrichment of 3.8M SNVs and 0.5M indels in Korean individuals, which were used to filter out 1,271 coding-SNVs not originally removed from the 1,000 Genomes Project when prioritizing disease-causing variants. KoVariome health records were used to identify novel disease-causing variants in the Korean population, demonstrating the value of high-quality ethnic variation databases for the accurate interpretation of individual genomes and the precise characterization of genetic variations.
Assuntos
Variações do Número de Cópias de DNA , Doença/genética , Genética Populacional , Genoma Humano , Mutação INDEL , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma/métodos , Bases de Dados Genéticas , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Mutação , Padrões de Referência , República da Coreia , Análise de Sequência de DNARESUMO
Myotis rufoniger is a vesper bat in the genus Myotis. Here we report the whole genome sequence and analyses of the M. rufoniger. We generated 124 Gb of short-read DNA sequences with an estimated genome size of 1.88 Gb at a sequencing depth of 66× fold. The sequences were aligned to M. brandtii bat reference genome at a mapping rate of 96.50% covering 95.71% coding sequence region at 10× coverage. The divergence time of Myotis bat family is estimated to be 11.5 million years, and the divergence time between M. rufoniger and its closest species M. davidii is estimated to be 10.4 million years. We found 1,239 function-altering M. rufoniger specific amino acid sequences from 929 genes compared to other Myotis bat and mammalian genomes. The functional enrichment test of the 929 genes detected amino acid changes in melanin associated DCT, SLC45A2, TYRP1, and OCA2 genes possibly responsible for the M. rufoniger's red fur color and a general coloration in Myotis. N6AMT1 gene, associated with arsenic resistance, showed a high degree of function alteration in M. rufoniger. We further confirmed that the M. rufoniger also has bat-specific sequences within FSHB, GHR, IGF1R, TP53, MDM2, SLC45A2, RGS7BP, RHO, OPN1SW, and CNGB3 genes that have already been published to be related to bat's reproduction, lifespan, flight, low vision, and echolocation. Additionally, our demographic history analysis found that the effective population size of Myotis clade has been consistently decreasing since ~30k years ago. M. rufoniger's effective population size was the lowest in Myotis bats, confirming its relatively low genetic diversity.
Assuntos
Quirópteros/genética , Genoma , Substituição de Aminoácidos , Animais , Quirópteros/classificação , Variação Genética , Mutação , FilogeniaRESUMO
Cetacean body structure and physiology exhibit dramatic adaptations to their aquatic environment. Fibroblast growth factors (FGFs) are a family of essential factors that regulate animal development and physiology; however, their role in cetacean evolution is not clearly understood. Here, we sequenced the fin whale genome and analysed FGFs from 8 cetaceans. FGF22, a hair follicle-enriched gene, exhibited pseudogenization, indicating that the function of this gene is no longer necessary in cetaceans that have lost most of their body hair. An evolutionary analysis revealed signatures of positive selection for FGF3 and FGF11, genes related to ear and tooth development and hypoxia, respectively. We found a D203G substitution in cetacean FGF9, which was predicted to affect FGF9 homodimerization, suggesting that this gene plays a role in the acquisition of rigid flippers for efficient manoeuvring. Cetaceans utilize low bone density as a buoyancy control mechanism, but the underlying genes are not known. We found that the expression of FGF23, a gene associated with reduced bone density, is greatly increased in the cetacean liver under hypoxic conditions, thus implicating FGF23 in low bone density in cetaceans. Altogether, our results provide novel insights into the roles of FGFs in cetacean adaptation to the aquatic environment.
Assuntos
Adaptação Fisiológica , Evolução Molecular , Fatores de Crescimento de Fibroblastos/genética , Baleia Comum/genética , Baleia Comum/fisiologia , Animais , Genoma , Filogenia , Seleção GenéticaRESUMO
This corrects the article DOI: 10.1038/ncomms13637.
RESUMO
Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity.
Assuntos
Povo Asiático/genética , Genoma Humano/genética , Mapeamento Cromossômico , Consenso , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , República da Coreia , Análise de Sequência de DNARESUMO
BACKGROUND: There are three main dietary groups in mammals: carnivores, omnivores, and herbivores. Currently, there is limited comparative genomics insight into the evolution of dietary specializations in mammals. Due to recent advances in sequencing technologies, we were able to perform in-depth whole genome analyses of representatives of these three dietary groups. RESULTS: We investigated the evolution of carnivory by comparing 18 representative genomes from across Mammalia with carnivorous, omnivorous, and herbivorous dietary specializations, focusing on Felidae (domestic cat, tiger, lion, cheetah, and leopard), Hominidae, and Bovidae genomes. We generated a new high-quality leopard genome assembly, as well as two wild Amur leopard whole genomes. In addition to a clear contraction in gene families for starch and sucrose metabolism, the carnivore genomes showed evidence of shared evolutionary adaptations in genes associated with diet, muscle strength, agility, and other traits responsible for successful hunting and meat consumption. Additionally, an analysis of highly conserved regions at the family level revealed molecular signatures of dietary adaptation in each of Felidae, Hominidae, and Bovidae. However, unlike carnivores, omnivores and herbivores showed fewer shared adaptive signatures, indicating that carnivores are under strong selective pressure related to diet. Finally, felids showed recent reductions in genetic diversity associated with decreased population sizes, which may be due to the inflexible nature of their strict diet, highlighting their vulnerability and critical conservation status. CONCLUSIONS: Our study provides a large-scale family level comparative genomic analysis to address genomic changes associated with dietary specialization. Our genomic analyses also provide useful resources for diet-related genetic and health research.
Assuntos
Variação Genética , Genoma , Panthera/genética , Análise de Sequência de DNA , Adaptação Fisiológica/genética , Animais , Evolução Biológica , Gatos , Herbivoria/genética , Mamíferos/genética , Anotação de Sequência Molecular , FilogeniaRESUMO
BACKGROUND: The cinereous vulture, Aegypius monachus, is the largest bird of prey and plays a key role in the ecosystem by removing carcasses, thus preventing the spread of diseases. Its feeding habits force it to cope with constant exposure to pathogens, making this species an interesting target for discovering functionally selected genetic variants. Furthermore, the presence of two independently evolved vulture groups, Old World and New World vultures, provides a natural experiment in which to investigate convergent evolution due to obligate scavenging. RESULTS: We sequenced the genome of a cinereous vulture, and mapped it to the bald eagle reference genome, a close relative with a divergence time of 18 million years. By comparing the cinereous vulture to other avian genomes, we find positively selected genetic variations in this species associated with respiration, likely linked to their ability of immune defense responses and gastric acid secretion, consistent with their ability to digest carcasses. Comparisons between the Old World and New World vulture groups suggest convergent gene evolution. We assemble the cinereous vulture blood transcriptome from a second individual, and annotate genes. Finally, we infer the demographic history of the cinereous vulture which shows marked fluctuations in effective population size during the late Pleistocene. CONCLUSIONS: We present the first genome and transcriptome analyses of the cinereous vulture compared to other avian genomes and transcriptomes, revealing genetic signatures of dietary and environmental adaptations accompanied by possible convergent evolution between the Old World and New World vultures.