Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Mol Biol Evol ; 41(3)2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38376487

RESUMO

The blue whale, Balaenoptera musculus, is the largest animal known to have ever existed, making it an important case study in longevity and resistance to cancer. To further this and other blue whale-related research, we report a reference-quality, long-read-based genome assembly of this fascinating species. We assembled the genome from PacBio long reads and utilized Illumina/10×, optical maps, and Hi-C data for scaffolding, polishing, and manual curation. We also provided long read RNA-seq data to facilitate the annotation of the assembly by NCBI and Ensembl. Additionally, we annotated both haplotypes using TOGA and measured the genome size by flow cytometry. We then compared the blue whale genome with other cetaceans and artiodactyls, including vaquita (Phocoena sinus), the world's smallest cetacean, to investigate blue whale's unique biological traits. We found a dramatic amplification of several genes in the blue whale genome resulting from a recent burst in segmental duplications, though the possible connection between this amplification and giant body size requires further study. We also discovered sites in the insulin-like growth factor-1 gene correlated with body size in cetaceans. Finally, using our assembly to examine the heterozygosity and historical demography of Pacific and Atlantic blue whale populations, we found that the genomes of both populations are highly heterozygous and that their genetic isolation dates to the last interglacial period. Taken together, these results indicate how a high-quality, annotated blue whale genome will serve as an important resource for biology, evolution, and conservation research.


Assuntos
Balaenoptera , Neoplasias , Animais , Balaenoptera/genética , Duplicações Segmentares Genômicas , Genoma , Demografia , Neoplasias/genética
2.
Proc Natl Acad Sci U S A ; 118(52)2021 12 28.
Artigo em Inglês | MEDLINE | ID: mdl-34862323

RESUMO

In less than 25 y, the field of animal genome science has transformed from a discipline seeking its first glimpses into genome sequences across the Tree of Life to a global enterprise with ambitions to sequence genomes for all of Earth's eukaryotic diversity [H. A. Lewin et al., Proc. Natl. Acad. Sci. U.S.A. 115, 4325-4333 (2018)]. As the field rapidly moves forward, it is important to take stock of the progress that has been made to best inform the discipline's future. In this Perspective, we provide a contemporary, quantitative overview of animal genome sequencing. We identified the best available genome assemblies in GenBank, the world's most extensive genetic database, for 3,278 unique animal species across 24 phyla. We assessed taxonomic representation, assembly quality, and annotation status for major clades. We show that while tremendous taxonomic progress has occurred, stark disparities in genomic representation exist, highlighted by a systemic overrepresentation of vertebrates and underrepresentation of arthropods. In terms of assembly quality, long-read sequencing has dramatically improved contiguity, whereas gene annotations are available for just 34.3% of taxa. Furthermore, we show that animal genome science has diversified in recent years with an ever-expanding pool of researchers participating. However, the field still appears to be dominated by institutions in the Global North, which have been listed as the submitting institution for 77% of all assemblies. We conclude by offering recommendations for improving genomic resource availability and research value while also broadening global representation.


Assuntos
Artrópodes/genética , Bases de Dados Genéticas , Genoma/genética , Genômica , Vertebrados/genética , Animais , Cordados/genética , Sequenciamento de Nucleotídeos em Larga Escala , Invertebrados/genética , Análise de Sequência de DNA
3.
Genomics ; 114(1): 202-214, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34923089

RESUMO

Long non-coding RNAs (lncRNAs) hold gene regulatory potential, but require substantial further functional annotation in livestock. Applying two metabogenomic approaches by combining transcriptomic and metabolomic analyses, we aimed to identify lncRNAs with potential regulatory function for divergent nutrient partitioning of lactating crossbred cows and to establish metabogenomic interaction networks comprising metabolites, genes and lncRNAs. Through correlation analysis of lncRNA expression with transcriptomic and metabolomic data, we unraveled lncRNAs that have a putative regulatory role in energy and lipid metabolism, the urea and tricarboxylic acid cycles, and gluconeogenesis. Especially FGF21, which correlated with a plentitude of differentially expressed genes, differentially abundant metabolites, as well as lncRNAs, suggested itself as a key metabolic regulator. Notably, lncRNAs in close physical proximity to coding-genes as well as lncRNAs with natural antisense transcripts appear to perform a fine-tuning function in gene expression involved in metabolic pathways associated with different nutrient partitioning phenotypes.


Assuntos
RNA Longo não Codificante , Animais , Bovinos , Feminino , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Lactação , Fígado/metabolismo , Nutrientes , Fenótipo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo
4.
BMC Genomics ; 22(1): 164, 2021 Mar 09.
Artigo em Inglês | MEDLINE | ID: mdl-33750298

RESUMO

BACKGROUND: Only 1.5% of the human genome encodes proteins, while large part of the remaining encodes noncoding RNAs (ncRNA). Many ncRNAs form structures and perform many important functions. Accurately identifying structured ncRNAs in the human genome and discovering their biological functions remain a major challenge. RESULTS: Here, we have established a pipeline (CM-line) with the following features for analyzing the large genomes of humans and other animals. First, we selected species with larger genetic distances to facilitate the discovery of covariations and compatible mutations. Second, we used CMfinder, which can generate useful alignments even with low sequence conservation. Third, we removed repetitive sequences and known structured ncRNAs to reduce the workload of CMfinder. Fourth, we used Infernal to find more representatives and refine the structure. We reported 11 classes of structured ncRNA candidates with significant covariations in humans. Functional analysis showed that these ncRNAs may have variable functions. Some may regulate circadian clock genes through poly (A) signals (PAS); some may regulate the elongation factor (EEF1A) and the T-cell receptor signaling pathway by cooperating with RNA binding proteins. CONCLUSIONS: By searching for important features of RNA structure from large genomes, the CM-line has revealed the existence of a variety of novel structured ncRNAs. Functional analysis suggests that some newly discovered ncRNA motifs may have biological functions. The pipeline we have established for the discovery of structured ncRNAs and the identification of their functions can also be applied to analyze other large genomes.


Assuntos
Genômica , RNA não Traduzido , Animais , Genoma , Humanos , Motivos de Nucleotídeos , RNA , RNA não Traduzido/genética
5.
BMC Genomics ; 21(1): 848, 2020 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-33256610

RESUMO

BACKGROUND: Distichiasis, an ocular disorder in which aberrant cilia (eyelashes) grow from the opening of the Meibomian glands of the eyelid, has been reported in Friesian horses. These misplaced cilia can cause discomfort, chronic keratitis, and corneal ulceration, potentially impacting vision due to corneal fibrosis, or, if secondary infection occurs, may lead to loss of the eye. Friesian horses represent the vast majority of reported cases of equine distichiasis, and as the breed is known to be affected with inherited monogenic disorders, this condition was hypothesized to be a simply inherited Mendelian trait. RESULTS: A genome wide association study (GWAS) was performed using the Axiom 670 k Equine Genotyping array (MNEc670k) utilizing 14 cases and 38 controls phenotyped for distichiasis. An additive single locus mixed linear model (EMMAX) approach identified a 1.83 Mb locus on ECA5 and a 1.34 Mb locus on ECA13 that reached genome-wide significance (pcorrected = 0.016 and 0.032, respectively). Only the locus on ECA13 withstood replication testing (p = 1.6 × 10- 5, cases: n = 5 and controls: n = 37). A 371 kb run of homozygosity (ROH) on ECA13 was found in 13 of the 14 cases, providing evidence for a recessive mode of inheritance. Haplotype analysis (hapQTL) narrowed the region of association on ECA13 to 163 kb. Whole-genome sequencing data from 3 cases and 2 controls identified a 16 kb deletion within the ECA13 associated haplotype (ECA13:g.178714_195130del). Functional annotation data supports a tissue-specific regulatory role of this locus. This deletion was associated with distichiasis, as 18 of the 19 cases were homozygous (p = 4.8 × 10- 13). Genotyping the deletion in 955 horses from 54 different breeds identified the deletion in only 11 non-Friesians, all of which were carriers, suggesting that this could be causal for this Friesian disorder. CONCLUSIONS: This study identified a 16 kb deletion on ECA13 in an intergenic region that was associated with distichiasis in Friesian horses. Further functional analysis in relevant tissues from cases and controls will help to clarify the precise role of this deletion in normal and abnormal eyelash development and investigate the hypothesis of incomplete penetrance.


Assuntos
Doenças Palpebrais/veterinária , Pálpebras/patologia , Estudo de Associação Genômica Ampla , Doenças dos Cavalos/genética , Animais , Doenças Palpebrais/genética , Haplótipos , Cavalos , Fenótipo , Sequenciamento Completo do Genoma
6.
Int J Mol Sci ; 21(9)2020 May 06.
Artigo em Inglês | MEDLINE | ID: mdl-32384694

RESUMO

Long non-coding RNAs (lncRNAs) can influence transcriptional and translational processes in mammalian cells and are associated with various developmental, physiological and phenotypic conditions. However, they remain poorly understood and annotated in livestock species. We combined phenotypic, metabolomics and liver transcriptomic data of bulls divergent for residual feed intake (RFI) and fat accretion. Based on a project-specific transcriptome annotation for the bovine reference genome ARS-UCD.1.2 and multiple-tissue total RNA sequencing data, we predicted 3590 loci to be lncRNAs. To identify lncRNAs with potential regulatory influence on phenotype and gene expression, we applied the regulatory impact factor algorithm on a functionally prioritized set of loci (n = 4666). Applying the algorithm of partial correlation and information theory, significant and independent pairwise correlations were calculated and co-expression networks were established, including plasma metabolites correlated with lncRNAs. The network hub lncRNAs were assessed for potential cis-actions and subjected to biological pathway enrichment analyses. Our results reveal a prevalence of antisense lncRNAs positively correlated with adjacent protein-coding genes and suggest their participation in mitochondrial function, acute phase response signalling, TCA-cycle, fatty acid ß-oxidation and presumably gluconeogenesis. These antisense lncRNAs indicate a stabilizing function for their cis-correlated genes and a putative regulatory role in gene expression.


Assuntos
Fenômenos Fisiológicos da Nutrição Animal/genética , Bovinos/genética , RNA Antissenso/genética , RNA Longo não Codificante/genética , Animais , Bovinos/fisiologia , Redes Reguladoras de Genes , Gluconeogênese , Fígado/metabolismo , Masculino , Mitocôndrias Hepáticas/metabolismo , Característica Quantitativa Herdável , RNA Antissenso/metabolismo , RNA Longo não Codificante/metabolismo
7.
BMC Genomics ; 17(1): 761, 2016 Sep 27.
Artigo em Inglês | MEDLINE | ID: mdl-27678198

RESUMO

BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in the NGS field, also confronts great challenges in data analysis. RESULTS: To address the challenges in RNA-seq data analysis, we developed a web portal that offers three integrated workflows that can perform end-to-end compute and analysis, including sequence quality control, read-mapping, transcriptome assembly, reconstruction and quantification, and differential analysis. The first workflow utilizes Tuxedo (Tophat, Cufflink, Cuffmerge and Cuffdiff suite of tools). The second workflow deploys Trinity for de novo assembly and uses RSEM for transcript quantification and EdgeR for differential analysis. The third combines STAR, RSEM, and EdgeR for data analysis. All these workflows support multiple samples and multiple groups of samples and perform differential analysis between groups in a single workflow job submission. The calculated results are available for download and post-analysis. The supported animal species include chicken, cow, duck, goat, pig, horse, rabbit, sheep, turkey, as well as several other model organisms including yeast, C. elegans, Drosophila, and human, with genomic sequences and annotations obtained from ENSEMBL. The RNA-seq portal is freely available from http://weizhongli-lab.org/RNA-seq . CONCLUSIONS: The web portal offers not only bioinformatics software, workflows, computation and reference data, but also an integrated environment for complex RNA-seq data analysis for agricultural animal species. In this project, our aim is not to develop new RNA-seq tools, but to build web workflows for using popular existing RNA-seq methods and make these tools more accessible to the communities.

8.
Pflugers Arch ; 467(10): 2043-53, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25823501

RESUMO

Transient receptor potential (TRP) channels are unusual among cation channels because of their diverse cation selectivities and activation mechanisms. TRP channels thus play major roles in various sensory perceptions by functioning as multimodal signal integrators. Some TRP subfamily members are also implicated in acute and chronic pain and inflammation. So far, most TRP channel studies have been targeted to human and model organisms within a limited evolutionary context. Classification of TRP channels in various animal genomes has revealed extensive gene gain and loss events across animal species. Furthermore, the chemical activation profiles of some orthologous TRP channels were different between species such as human and mouse. Amino acid substitutions must underlie such differences, and the crucial amino acid residues have been identified in some cases. These changes represent the evolution of TRP channels at the amino acid sequence level. There is also evidence that TRP channels have obtained species-diversity through alternative splicing and possibly cis-regulatory element mutations. All of the above demonstrate the dynamic and plastic evolutionary history of metazoan TRP channels at multiple levels, possibly in conjunction with the specific habitats and life histories of individual species.


Assuntos
Evolução Molecular , Canais de Potencial de Receptor Transitório/genética , Animais , Humanos , Splicing de RNA , Canais de Potencial de Receptor Transitório/química , Canais de Potencial de Receptor Transitório/metabolismo
9.
Front Genet ; 10: 1130, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31824560

RESUMO

Background: Genomic regions associated with divergent livestock feed efficiency have been found predominantly outside protein coding sequences. Long non-coding RNAs (lncRNA) can modulate chromatin accessibility, gene expression and act as important metabolic regulators in mammals. By integrating phenotypic, transcriptomic, and metabolomic data with quantitative trait locus data in prioritizing co-expression network analyses, we aimed to identify and functionally characterize lncRNAs with a potential key regulatory role in metabolic efficiency in cattle. Materials and Methods: Crossbred animals (n = 48) of a Charolais x Holstein F2-population were allocated to groups of high or low metabolic efficiency based on residual feed intake in bulls, energy corrected milk in cows and intramuscular fat content in both genders. Tissue samples from jejunum, liver, skeletal muscle and rumen were subjected to global transcriptomic analysis via stranded total RNA sequencing (RNAseq) and blood plasma samples were used for profiling of 640 metabolites. To identify lncRNAs within the indicated tissues, a project-specific transcriptome annotation was established. Subsequently, novel transcripts were categorized for potential lncRNA status, yielding a total of 7,646 predicted lncRNA transcripts belonging to 3,287 loci. A regulatory impact factor approach highlighted 92, 55, 35, and 73 lncRNAs in jejunum, liver, muscle, and rumen, respectively. Their ensuing high regulatory impact factor scores indicated a potential regulatory key function in a gene set comprising loci displaying differential expression, tissue specificity and loci overlapping with quantitative trait locus regions for residual feed intake or milk production. These were subjected to a partial correlation and information theory analysis with the prioritized gene set. Results and Conclusions: Independent, significant and group-specific correlations (|r| > 0.8) were used to build a network for the high and the low metabolic efficiency group resulting in 1,522 and 1,732 nodes, respectively. Eight lncRNAs displayed a particularly high connectivity (>100 nodes). Metabolites and genes from the partial correlation and information theory networks, which each correlated significantly with the respective lncRNA, were included in an enrichment analysis indicating distinct affected pathways for the eight lncRNAs. LncRNAs associated with metabolic efficiency were classified to be functionally involved in hepatic amino acid metabolism and protein synthesis and in calcium signaling and neuronal nitric oxide synthase signaling in skeletal muscle cells.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa