RESUMEN
BACKGROUND: The honey bee reference genome, HAv3.1, was produced from a commercial line sample that was thought to have a largely dominant Apis mellifera ligustica genetic background. Apis mellifera mellifera, often referred to as the black bee, has a separate evolutionary history and is the original type in western and northern Europe. Growing interest in this subspecies for conservation and non-professional apicultural practices, together with the necessity of deciphering genome backgrounds in hybrids, triggered the necessity for a specific genome assembly. Moreover, having several high-quality genomes is becoming key for taking structural variations into account in pangenome analyses. RESULTS: Pacific Bioscience technology long reads were produced from a single haploid black bee drone. Scaffolding contigs into chromosomes was done using a high-density genetic map. This allowed for re-estimation of the recombination rate, which was over-estimated in some previous studies due to mis-assemblies, which resulted in spurious inversions in the older reference genomes. The sequence continuity obtained was very high and the only limit towards continuous chromosome-wide sequences seemed to be due to tandem repeat arrays that were usually longer than 10 kb and that belonged to two main families, the 371 and 91 bp repeats, causing problems in the assembly process due to high internal sequence similarity. Our assembly was used together with the reference genome to genotype two structural variants by a pangenome graph approach with Graphtyper2. Genotypes obtained were either correct or missing, when compared to an approach based on sequencing depth analysis, and genotyping rates were 89 and 76% for the two variants. CONCLUSIONS: Our new assembly for the Apis mellifera mellifera honey bee subspecies demonstrates the utility of multiple high-quality genomes for the genotyping of structural variants, with a test case on two insertions and deletions. It will therefore be an invaluable resource for future studies, for instance by including structural variants in GWAS. Having used a single haploid drone for sequencing allowed a refined analysis of very large tandem repeat arrays, raising the question of their function in the genome. High quality genome assemblies for multiple subspecies such as presented here, are crucial for emerging projects using pangenomes.
Asunto(s)
Genoma de los Insectos , Abejas/genética , AnimalesRESUMEN
BACKGROUND: The Algerian honey bee population is composed of two described subspecies A. m. intermissa and A. m. sahariensis, of which little is known regarding population genomics, both in terms of genetic differentiation and of possible contamination by exogenous stock. Moreover, the phenotypic differences between the two subspecies are expected to translate into genetic differences and possible adaptation to heat and drought in A. m. sahariensis. To shed light on the structure of this population and to integrate these two subspecies in the growing dataset of available haploid drone sequences, we performed whole-genome sequencing of 151 haploid drones. RESULTS: Integrated analysis of our drone sequences with a similar dataset of European reference populations did not detect any significant admixture in the Algerian honey bees. Interestingly, most of the genetic variation was not found between the A. m. intermissa and A. m. sahariensis subspecies; instead, two main genetic clusters were found along an East-West axis. We found that the correlation between genetic and geographic distances was higher in the Western cluster and that close-family relationships were mostly detected in the Eastern cluster, sometimes at long distances. In addition, we selected a panel of 96 ancestry-informative markers to decide whether a sampled bee is Algerian or not, and tested this panel in simulated cases of admixture. CONCLUSIONS: The differences between the two main genetic clusters suggest differential breeding management between eastern and western Algeria, with greater exchange of genetic material over long distances in the east. The lack of detected admixture events suggests that, unlike what is seen in many places worldwide, imports of queens from foreign countries do not seem to have occurred on a large scale in Algeria, a finding that is relevant for conservation purposes. In addition, the proposed panel of 96 markers was found effective to distinguish Algerian from European honey bees. Therefore, we conclude that applying this approach to other taxa is promising, in particular when genetic differentiation is difficult to capture.
Asunto(s)
Cruzamiento , Flujo Genético , Humanos , Abejas/genética , Animales , Secuenciación Completa del Genoma/veterinaria , Polimorfismo de Nucleótido Simple , Estructuras GenéticasRESUMEN
The practice of beekeeping in Algeria is of great cultural, social, and economic importance. However, the importation of non-local subspecies reported by beekeepers has disrupted the natural geographical distribution area and the genetic diversity of the native honey bees. To assess the genetic diversity of A. m. intermissa and A. m. sahariensis, and their relationships with African and European subspecies, the COI-COII intergenic region was analyzed in 335 individuals, 68 sampled in Algeria, 71 in Europe, Madagascar, and the South West Indian Ocean archipelagos, and 196 sequences recovered from GenBank. The results show the presence of the A lineage exclusively in Algerian samples with the identification of 24 haplotypes of which 16 are described for the first time. These haplotypes were found to be shared by both subspecies, with A74 being the most common haplotype in the population studied. The sequence comparison indicates the existence of three polymorphisms of the COI-COII marker: P0Q, P0QQ, and P0QQQ. One new haplotype was identified in the M lineage in samples from France. No evidence of genetic introgression within the Algerian honey bee population was detected. These data enhance our knowledge of the genetic diversity and emphasize the importance of protecting these local subspecies.
RESUMEN
Eusocial insects are crucial to many ecosystems, and particularly the honeybee (Apis mellifera). One approach to facilitate their study in molecular genetics, is to consider whole-colony genotyping by combining DNA of multiple individuals in a single pool sequencing experiment. Cheap and fast, this technique comes with the drawback of producing data requiring dedicated methods to be fully exploited. Despite this limitation, pool sequencing data have been shown to be informative and cost-effective when working on random mating populations. Here, we present new statistical methods for exploiting pool sequencing of eusocial colonies in order to reconstruct the genotypes of the queen of such colony. This leverages the possibility to monitor genetic diversity, perform genomic-based studies or implement selective breeding. Using simulations and honeybee real data, we show that the new methods allow for a fast and accurate estimation of the queen's genetic ancestry, with correlations of about 0.9 to that obtained from individual genotyping. Also, it allows for an accurate reconstruction of the queen genotypes, with about 2% genotyping error. We further validate these inferences using experimental data on colonies with both pool sequencing and individual genotyping of drones. In brief, in this study we present statistical models to accurately estimate the genetic ancestry and reconstruct the genotypes of the queen from pool sequencing data from workers of an eusocial colony. Such information allows to exploit pool sequencing for traditional population genetics analyses, association studies and for selective breeding. While validated in Apis mellifera, these methods are applicable to other eusocial hymenopterans.
Asunto(s)
Ecosistema , Reproducción , Animales , Abejas/genética , ADN/genética , Genotipo , Humanos , Insectos/genéticaRESUMEN
Honey bee subspecies originate from specific geographical areas in Africa, Europe and the Middle East, and beekeepers interested in specific phenotypes have imported genetic material to regions outside of the bees' original range for use either in pure lines or controlled crosses. Moreover, imported drones are present in the environment and mate naturally with queens from the local subspecies. The resulting admixture complicates population genetics analyses, and population stratification can be a major problem for association studies. To better understand Western European honey bee populations, we produced a whole genome sequence and single nucleotide polymorphism (SNP) genotype data set from 870 haploid drones and demonstrate its utility for the identification of nine genetic backgrounds and various degrees of admixture in a subset of 629 samples. Five backgrounds identified correspond to subspecies, two to isolated populations on islands and two to managed populations. We also highlight several large haplotype blocks, some of which coincide with the position of centromeres. The largest is 3.6 Mb long and represents 21% of chromosome 11, with two major haplotypes corresponding to the two dominant genetic backgrounds identified. This large naturally phased data set is available as a single vcf file that can now serve as a reference for subsequent populations genomics studies in the honey bee, such as (i) selecting individuals of verified homogeneous genetic backgrounds as references, (ii) imputing genotypes from a lower-density data set generated by an SNP-chip or by low-pass sequencing, or (iii) selecting SNPs compatible with the requirements of genotyping chips.
Asunto(s)
Endogamia , Dispositivos Aéreos No Tripulados , Animales , Abejas/genética , Genotipo , Haploidia , HaplotiposRESUMEN
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.
RESUMEN
The honeybee population of the tropical Reunion Island is a genetic admixture of the Apis mellifera unicolor subspecies, originally described in Madagascar, and of European subspecies, mainly A. m. carnica and A. m. ligustica, regularly imported to the island since the late 19th century. We took advantage of this population to study genetic admixing of the tropical-adapted indigenous and temperate-adapted European genetic backgrounds. Whole genome sequencing of 30 workers and 6 males from Reunion, compared with samples from Europe, Madagascar, Mauritius, Rodrigues, and the Seychelles, revealed the Reunion honeybee population to be composed on an average of 53.2 ± 5.9% A. m. unicolor nuclear genomic background, the rest being mainly composed of A. m. carnica and to a lesser extent A. m. ligustica. In striking contrast to this, only 1 out of the 36 honeybees from Reunion had a mitochondrial genome of European origin, suggesting selection has favored the A. m. unicolor mitotype, which is possibly better adapted to the island's bioclimate. Local ancestry was determined along the chromosomes for all Reunion samples, and a test for preferential selection for the A. m. unicolor or European background revealed 15 regions significantly associated with the A. m. unicolor lineage and 9 regions with the European lineage. Our results provide insights into the long-term consequences of introducing exotic specimen on the nuclear and mitochondrial genomes of locally adapted populations.
Asunto(s)
Abejas/genética , Mitocondrias/genética , Aclimatación , Adaptación Fisiológica , Animales , Abejas/fisiología , ADN Mitocondrial/genética , Femenino , Genoma de los Insectos , Genoma Mitocondrial , Masculino , Mitocondrias/metabolismo , ReuniónRESUMEN
This corrects the article DOI: 10.1038/srep27059.
RESUMEN
The quantity of milk and milk fat and proteins are particularly important traits in dairy livestock. However, little is known about the regions of the genome that influence these traits in goats. We conducted a genome wide association study in French goats and identified 109 regions associated with dairy traits. For a major region on chromosome 14 closely associated with fat content, the Diacylglycerol O-Acyltransferase 1 (DGAT1) gene turned out to be a functional and positional candidate gene. The caprine reference sequence of this gene was completed and 29 polymorphisms were found in the gene sequence, including two novel exonic mutations: R251L and R396W, leading to substitutions in the protein sequence. The R251L mutation was found in the Saanen breed at a frequency of 3.5% and the R396W mutation both in the Saanen and Alpine breeds at a frequencies of 13% and 7% respectively. The R396W mutation explained 46% of the genetic variance of the trait, and the R251L mutation 6%. Both mutations were associated with a notable decrease in milk fat content. Their causality was then demonstrated by a functional test. These results provide new knowledge on the genetic basis of milk synthesis and will help improve the management of the French dairy goat breeding program.
RESUMEN
The pig is an emerging animal model, complementary to rodents for basic research and for biomedical and agronomical purposes. However despite the progress made on mouse and rat models to produce genuine pluripotent cells, it remains impossible to produce porcine pluripotent cell lines with germline transmission. Reprogramming of pig somatic cells using conventional integrative strategies remains also unsatisfactory. In the present study, we compared the outcome of both integrative and non-integrative reprogramming strategies on pluripotency and chromosome stability during pig somatic cell reprogramming. The porcine cell lines produced with integrative strategies express several pluripotency genes but they do not silence the integrated exogenes and present a high genomic instability upon passaging. In contrast, pig induced pluripotent-like stem cells produced with non-integrative reprogramming system (NI-iPSLCs) exhibit a normal karyotype after more than 12 months in culture and reactivate endogenous pluripotency markers. Despite the persistent expression of exogenous OCT4 and MYC, these cells can differentiate into derivatives expressing markers of the three embryonic germ layers and we propose that these NI-iPSLCs can be used as a model to bring new insights into the molecular factors controlling and maintaining pluripotency in the pig and other non-rodent mammalians.