Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 3.413
Filter
1.
Article in English | MEDLINE | ID: mdl-39209796

ABSTRACT

Increasing the accuracy of the nucleotide sequence alignment is an essential issue in genomics research. Although classic dynamic programming (DP) algorithms (e.g., Smith-Waterman and Needleman-Wunsch) guarantee to produce the optimal result, their time complexity hinders the application of large-scale sequence alignment. Many optimization efforts that aim to accelerate the alignment process generally come from three perspectives: redesigning data structures [e.g., diagonal or striped Single Instruction Multiple Data (SIMD) implementations], increasing the number of parallelisms in SIMD operations (e.g., difference recurrence relation), or reducing search space (e.g., banded DP). However, no methods combine all these three aspects to build an ultra-fast algorithm. In this study, we developed a Banded Striped Aligner (BSAlign) library that delivers accurate alignment results at an ultra-fast speed by knitting a series of novel methods together to take advantage of all of the aforementioned three perspectives with highlights such as active F-loop in striped vectorization and striped move in banded DP. We applied our new acceleration design on both regular and edit distance pairwise alignment. BSAlign achieved 2-fold speed-up than other SIMD-based implementations for regular pairwise alignment, and 1.5-fold to 4-fold speed-up in edit distance-based implementations for long reads. BSAlign is implemented in C programing language and is available at https://github.com/ruanjue/bsalign.


Subject(s)
Algorithms , Sequence Alignment , Software , Sequence Alignment/methods , Sequence Alignment/statistics & numerical data , Sequence Analysis, DNA/methods , Gene Library , Computational Biology/methods , Base Sequence/genetics
2.
J Comput Biol ; 31(5): 416-428, 2024 05.
Article in English | MEDLINE | ID: mdl-38687334

ABSTRACT

A Coding DNA Sequence (CDS) is a fraction of DNA whose nucleotides are grouped into consecutive triplets called codons, each one encoding an amino acid. Because most amino acids can be encoded by more than one codon, the same amino acid chain can be obtained by a very large number of different CDSs. These synonymous CDSs show different features that, also depending on the organism the transcript is expressed in, could affect translational efficiency and yield. The identification of optimal CDSs with respect to given transcript indicators is in general a challenging task, but it has been observed in recent literature that integer linear programming (ILP) can be a very flexible and efficient way to achieve it. In this article, we add evidence to this observation by proposing a new ILP model that simultaneously optimizes different well-grounded indicators. With this model, we efficiently find solutions that dominate those returned by six existing codon optimization heuristics.


Subject(s)
Algorithms , Codon , Models, Genetic , Programming, Linear , Codon/genetics , Base Sequence/genetics , DNA/genetics , Computational Biology/methods
3.
Biotechniques ; 76(5): 203-215, 2024 May.
Article in English | MEDLINE | ID: mdl-38573592

ABSTRACT

In the absence of a DNA template, the ab initio production of long double-stranded DNA molecules of predefined sequences is particularly challenging. The DNA synthesis step remains a bottleneck for many applications such as functional assessment of ancestral genes, analysis of alternative splicing or DNA-based data storage. In this report we propose a fully in vitro protocol to generate very long double-stranded DNA molecules starting from commercially available short DNA blocks in less than 3 days using Golden Gate assembly. This innovative application allowed us to streamline the process to produce a 24 kb-long DNA molecule storing part of the Declaration of the Rights of Man and of the Citizen of 1789 . The DNA molecule produced can be readily cloned into a suitable host/vector system for amplification and selection.


Subject(s)
DNA , DNA/genetics , DNA/chemistry , Information Storage and Retrieval/methods , Humans , Base Sequence/genetics , Cloning, Molecular/methods
4.
HGG Adv ; 5(3): 100284, 2024 Jul 18.
Article in English | MEDLINE | ID: mdl-38509709

ABSTRACT

Systematic determination of novel variant pathogenicity remains a major challenge, even when there is an established association between a gene and phenotype. Here we present Power Window (PW), a sliding window technique that identifies the impactful regions of a gene using population-scale clinico-genomic datasets. By sizing analysis windows on the number of variant carriers, rather than the number of variants or nucleotides, statistical power is held constant, enabling the localization of clinical phenotypes and removal of unassociated gene regions. The windows can be built by sliding across either the nucleotide sequence of the gene (through 1D space) or the positions of the amino acids in the folded protein (through 3D space). Using a training set of 350k exomes from the UK Biobank (UKB), we developed PW models for well-established gene-disease associations and tested their accuracy in two independent cohorts (117k UKB exomes and 65k exomes sequenced at Helix in the Healthy Nevada Project, myGenetics, or In Our DNA SC studies). The significant models retained a median of 49% of the qualifying variant carriers in each gene (range 2%-98%), with quantitative traits showing a median effect size improvement of 66% compared with aggregating variants across the entire gene, and binary traits' odds ratios improving by a median of 2.2-fold. PW showcases that electronic health record-based statistical analyses can accurately distinguish between novel coding variants in established genes that will have high phenotypic penetrance and those that will not, unlocking new potential for human genomics research, drug development, variant interpretation, and precision medicine.


Subject(s)
Genetic Variation , Humans , Genetic Variation/genetics , Protein Folding , Phenotype , Base Sequence/genetics , Genetic Predisposition to Disease/genetics , Exome/genetics
5.
Int J Biol Macromol ; 204: 356-363, 2022 Apr 15.
Article in English | MEDLINE | ID: mdl-35149094

ABSTRACT

Infections caused by SARS-CoV-2 have brought great harm to human health. After transmission for over two years, SARS-CoV-2 has diverged greatly and formed dozens of different lineages. Understanding the trend of its genome evolution could help foresee difficulties in controlling transmission of the virus. In this study, we conducted an extensive monthly survey and in-depth analysis on variations of nucleotide, amino acid and codon numbers in 311,260 virus samples collected till January 2022. The results demonstrate that the evolution of SARS-CoV-2 is toward increasing U-content and reducing genome-size. C, G and A to U mutations have all contributed to this U-content increase. Mutations of C, G and A at codon position 1, 2 or 3 have no significant difference in most SARS-CoV-2 lineages. Current viruses are more cryptic and more efficient in replication, and are thus less virulent yet more infectious. Delta and Omicron variants have high mutability over other lineages, bringing new threat to human health. This trend of genome evolution may provide a clue for tracing the origin of SARS-CoV-2, because ancestral viruses should have lower U-content and probably bigger genome-size.


Subject(s)
Base Composition/genetics , COVID-19/genetics , SARS-CoV-2/genetics , Base Sequence/genetics , COVID-19/transmission , China , Codon/genetics , Evolution, Molecular , Genome/genetics , Genome Size/genetics , Genome, Viral/genetics , Humans , Mutation/genetics , Phylogeny , SARS-CoV-2/pathogenicity , Uracil/metabolism
6.
Sci Rep ; 12(1): 2840, 2022 02 18.
Article in English | MEDLINE | ID: mdl-35181703

ABSTRACT

Streptomyces coelicolor A3(2) is a model microorganism for the study of Streptomycetes, antibiotic production, and secondary metabolism in general. Even though S. coelicolor has an outstanding variety of regulators among bacteria, little effort to globally study its transcription has been made. We manually curated 29 years of literature and databases to assemble a meta-curated experimentally-validated gene regulatory network (GRN) with 5386 genes and 9707 regulatory interactions (~ 41% of the total expected interactions). This provides the most extensive and up-to-date reconstruction available for the regulatory circuitry of this organism. Only ~ 6% (534/9707) are supported by experiments confirming the binding of the transcription factor to the upstream region of the target gene, the so-called "strong" evidence. While for the remaining interactions there is no confirmation of direct binding. To tackle network incompleteness, we performed network inference using several methods (including two proposed here) for motif identification in DNA sequences and GRN inference from transcriptomics. Further, we contrasted the structural properties and functional architecture of the networks to assess the reliability of the predictions, finding the inference from DNA sequence data to be the most trustworthy approach. Finally, we show two applications of the inferred and the curated networks. The inference allowed us to propose novel transcription factors for the key Streptomyces antibiotic regulatory proteins (SARPs). The curated network allowed us to study the conservation of the system-level components between S. coelicolor and Corynebacterium glutamicum. There we identified the basal machinery as the common signature between the two organisms. The curated networks were deposited in Abasy Atlas ( https://abasy.ccg.unam.mx/ ) while the inferences are available as Supplementary Material.


Subject(s)
Bacterial Infections/genetics , Gene Regulatory Networks/genetics , Streptomyces coelicolor/genetics , Transcription Factors/genetics , Anti-Bacterial Agents/biosynthesis , Anti-Bacterial Agents/therapeutic use , Bacterial Infections/drug therapy , Bacterial Infections/microbiology , Bacterial Proteins/antagonists & inhibitors , Bacterial Proteins/genetics , Base Sequence/genetics , Gene Expression Regulation, Bacterial/genetics , Humans , Secondary Metabolism/genetics , Streptomyces coelicolor/metabolism
7.
Nat Commun ; 13(1): 474, 2022 01 25.
Article in English | MEDLINE | ID: mdl-35078987

ABSTRACT

The specificity of CRISPR/Cas9 genome editing is largely determined by the sequences of guide RNA (gRNA) and the targeted DNA, yet the sequence-dependent rules underlying off-target effects are not fully understood. To systematically explore the sequence determinants governing CRISPR/Cas9 specificity, here we describe a dual-target system to measure the relative cleavage rate between off- and on-target sequences (off-on ratios) of 1902 gRNAs on 13,314 synthetic target sequences, and reveal a set of sequence rules involving 2 factors in off-targeting: 1) a guide-intrinsic mismatch tolerance (GMT) independent of the mismatch context; 2) an "epistasis-like" combinatorial effect of multiple mismatches, which are associated with the free-energy landscape in R-loop formation and are explainable by a multi-state kinetic model. These sequence rules lead to the development of MOFF, a model-based predictor of Cas9-mediated off-target effects. Moreover, the "epistasis-like" combinatorial effect suggests a strategy of allele-specific genome editing using mismatched guides. With the aid of MOFF prediction, this strategy significantly improves the selectivity and expands the application domain of Cas9-based allele-specific editing, as tested in a high-throughput allele-editing screen on 18 cancer hotspot mutations.


Subject(s)
Base Sequence/genetics , CRISPR-Cas Systems , Gene Editing/methods , Mutation , Neoplasms/therapy , RNA, Guide, Kinetoplastida/chemistry , Cell Line , Humans , Neoplasms/genetics , Neoplasms/pathology , RNA, Guide, Kinetoplastida/genetics
8.
PLoS One ; 17(1): e0260222, 2022.
Article in English | MEDLINE | ID: mdl-35085240

ABSTRACT

BACKGROUND: Sepsis is associated with high platelet turnover and elevated levels of immature platelets. Changes in the platelet transcriptome and the specific impact of immature platelets on the platelet transcriptome remain unclear. Thus, this study sought to address whether and how elevated levels of immature platelets affect the platelet transcriptome in patients with sepsis. METHODS: Blood samples were obtained from patients with sepsis requiring vasopressor therapy (n = 8) and from a control group of patients with stable coronary artery disease and otherwise similar demographic characteristics (n = 8). Immature platelet fraction (IPF) was determined on a Sysmex XE 2100 analyser and platelet function was tested by impedance aggregometry. RNA from leukocyte-depleted platelets was used for transcriptome analysis by Next Generation Sequencing integrating the use of unique molecular identifiers. RESULTS: IPF (median [interquartile range]) was significantly elevated in sepsis patients (6.4 [5.3-8.7] % vs. 3.6 [2.6-4.6] %, p = 0.005). Platelet function testing revealed no differences in adenosine diphosphate- or thrombin receptor activating peptide-induced platelet aggregation between control and sepsis patients. Putative circular RNA transcripts were decreased in platelets from septic patients. Leukocyte contamination defined by CD45 abundance levels in RNA-sequencing was absent in both groups. Principal component analysis of transcripts showed only partial overlap of clustering with IPF levels. RNA sequencing showed up-regulation of 524 and down-regulation of 118 genes in platelets from sepsis patients compared to controls. Upregulated genes were mostly related to catabolic processes and protein translation. Comparison to published platelet transcriptomes showed a large overlap of changes observed in sepsis and COVID-19 but not with reticulated platelets from healthy donors. CONCLUSIONS: Patients with sepsis appear to have a less degraded platelet transcriptome as indicated by increased levels of immature platelets and decreased levels of putative circular RNA transcripts. The present data suggests that increased protein translation is a characteristic mechanism of systemic inflammation.


Subject(s)
Blood Platelets/metabolism , Sepsis/genetics , Transcriptome/genetics , Aged , Base Sequence/genetics , Blood Platelets/pathology , Cell Fractionation/methods , Gene Expression/genetics , Gene Expression Profiling/methods , Humans , Male , Platelet Activation/genetics , Platelet Aggregation/drug effects , Platelet Aggregation Inhibitors/pharmacology , Platelet Count , Platelet Function Tests , RNA, Circular/analysis , RNA, Circular/genetics , Sepsis/blood , Sequence Analysis, RNA/methods
10.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Article in English | MEDLINE | ID: mdl-35042801

ABSTRACT

Life on Earth has evolved from initial simplicity to the astounding complexity we experience today. Bacteria and archaea have largely excelled in metabolic diversification, but eukaryotes additionally display abundant morphological innovation. How have these innovations come about and what constraints are there on the origins of novelty and the continuing maintenance of biodiversity on Earth? The history of life and the code for the working parts of cells and systems are written in the genome. The Earth BioGenome Project has proposed that the genomes of all extant, named eukaryotes-about 2 million species-should be sequenced to high quality to produce a digital library of life on Earth, beginning with strategic phylogenetic, ecological, and high-impact priorities. Here we discuss why we should sequence all eukaryotic species, not just a representative few scattered across the many branches of the tree of life. We suggest that many questions of evolutionary and ecological significance will only be addressable when whole-genome data representing divergences at all of the branchings in the tree of life or all species in natural ecosystems are available. We envisage that a genomic tree of life will foster understanding of the ongoing processes of speciation, adaptation, and organismal dependencies within entire ecosystems. These explorations will resolve long-standing problems in phylogenetics, evolution, ecology, conservation, agriculture, bioindustry, and medicine.


Subject(s)
Base Sequence/genetics , Eukaryota/genetics , Genomics/ethics , Animals , Biodiversity , Biological Evolution , Ecology , Ecosystem , Genome , Genomics/methods , Humans , Phylogeny
11.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Article in English | MEDLINE | ID: mdl-35042802

ABSTRACT

A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.


Subject(s)
Base Sequence/genetics , Eukaryota/genetics , Genomics/standards , Animals , Biodiversity , Genomics/methods , Humans , Reference Standards , Reference Values , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/standards
12.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Article in English | MEDLINE | ID: mdl-35042803

ABSTRACT

Green plants play a fundamental role in ecosystems, human health, and agriculture. As de novo genomes are being generated for all known eukaryotic species as advocated by the Earth BioGenome Project, increasing genomic information on green land plants is essential. However, setting standards for the generation and storage of the complex set of genomes that characterize the green lineage of life is a major challenge for plant scientists. Such standards will need to accommodate the immense variation in green plant genome size, transposable element content, and structural complexity while enabling research into the molecular and evolutionary processes that have resulted in this enormous genomic variation. Here we provide an overview and assessment of the current state of knowledge of green plant genomes. To date fewer than 300 complete chromosome-scale genome assemblies representing fewer than 900 species have been generated across the estimated 450,000 to 500,000 species in the green plant clade. These genomes range in size from 12 Mb to 27.6 Gb and are biased toward agricultural crops with large branches of the green tree of life untouched by genomic-scale sequencing. Locating suitable tissue samples of most species of plants, especially those taxa from extreme environments, remains one of the biggest hurdles to increasing our genomic inventory. Furthermore, the annotation of plant genomes is at present undergoing intensive improvement. It is our hope that this fresh overview will help in the development of genomic quality standards for a cohesive and meaningful synthesis of green plant genomes as we scale up for the future.


Subject(s)
Base Sequence/genetics , Genomics/trends , Viridiplantae/genetics , Biodiversity , Biological Evolution , DNA Transposable Elements/genetics , Ecology , Ecosystem , Embryophyta/genetics , Evolution, Molecular , Genome , Genome, Plant/genetics , Genomics/methods , Information Dissemination/methods , Information Storage and Retrieval/methods , Phylogeny , Plants/genetics
13.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Article in English | MEDLINE | ID: mdl-35042816

ABSTRACT

HIV-1 infection produces a long-lived reservoir of latently infected CD4+ T cells that represents the major barrier to HIV-1 cure. The reservoir contains both intact and defective proviruses, but only the proviruses that are intact can reinitiate infection upon cessation of antiretroviral therapy (ART). Here we combine four-color quantitative PCR and next-generation sequencing (Q4PCR) to distinguish intact and defective proviruses and measure reservoir content longitudinally in 12 infected individuals. Q4PCR differs from other PCR-based methods in that the amplified proviruses are sequence verified as intact or defective. Samples were collected systematically over the course of up to 10 y beginning shortly after the initiation of ART. The size of the defective reservoir was relatively stable with minimal decay during the 10-y observation period. In contrast, the intact proviral reservoir decayed with an estimated half-life of 4.9 y. Nevertheless, both intact and defective proviral reservoirs are dynamic. As a result, the fraction of intact proviruses found in expanded clones of CD4+ T cells increases over time with a concomitant decrease in overall reservoir complexity. Thus, reservoir decay measurements by Q4PCR are quantitatively similar to viral outgrowth assay (VOA) and intact proviral DNA PCR assay (IPDA) with the addition of sequence information that distinguishes intact and defective proviruses and informs reservoir dynamics. The data are consistent with the notion that intact and defective proviruses are under distinct selective pressure, and that the intact proviral reservoir is progressively enriched in expanded clones of CD4+ T cells resulting in diminishing complexity over time.


Subject(s)
HIV-1/pathogenicity , Virus Latency/genetics , Adult , Aged , Anti-HIV Agents/therapeutic use , Anti-Retroviral Agents/therapeutic use , Base Sequence/genetics , CD4-Positive T-Lymphocytes/virology , DNA, Viral/genetics , HIV Infections/virology , HIV Seropositivity , HIV-1/genetics , HIV-1/metabolism , Humans , Male , Middle Aged , Polymerase Chain Reaction/methods , Proviruses/genetics , Sequence Analysis, DNA/methods , Viral Load , Virus Latency/physiology
14.
Proc Natl Acad Sci U S A ; 119(4)2022 01 25.
Article in English | MEDLINE | ID: mdl-35042820

ABSTRACT

RNA is a ubiquitous biomolecule that can serve as both catalyst and information carrier. Understanding how RNA bioactivity is controlled is crucial for elucidating its physiological roles and potential applications in synthetic biology. Here, we show that lipid membranes can act as RNA organization platforms, introducing a mechanism for riboregulation. The activity of R3C ribozyme can be modified by the presence of lipid membranes, with direct RNA-lipid interactions dependent on RNA nucleotide content, base pairing, and length. In particular, the presence of guanine in short RNAs is crucial for RNA-lipid interactions, and G-quadruplex formation further promotes lipid binding. Lastly, by artificially modifying the R3C substrate sequence to enhance membrane binding, we generated a lipid-sensitive ribozyme reaction with riboswitch-like behavior. These findings introduce RNA-lipid interactions as a tool for developing synthetic riboswitches and RNA-based lipid biosensors and bear significant implications for RNA world scenarios for the origin of life.


Subject(s)
Lipids/physiology , Membrane Lipids/metabolism , RNA/metabolism , Base Pairing/genetics , Base Sequence/genetics , Binding Sites/genetics , Cell Membrane/metabolism , Cell Membrane/physiology , Genetic Engineering/methods , Lipid Metabolism/physiology , Lipids/genetics , Membrane Lipids/physiology , Nucleic Acid Conformation , RNA/chemistry , RNA, Catalytic/chemistry , RNA, Catalytic/metabolism , Riboswitch/genetics
15.
Life Sci Alliance ; 5(4)2022 04.
Article in English | MEDLINE | ID: mdl-35022248

ABSTRACT

Nucleotide sequence reagents underpin molecular techniques that have been applied across hundreds of thousands of publications. We have previously reported wrongly identified nucleotide sequence reagents in human research publications and described a semi-automated screening tool Seek & Blastn to fact-check their claimed status. We applied Seek & Blastn to screen >11,700 publications across five literature corpora, including all original publications in Gene from 2007 to 2018 and all original open-access publications in Oncology Reports from 2014 to 2018. After manually checking Seek & Blastn outputs for >3,400 human research articles, we identified 712 articles across 78 journals that described at least one wrongly identified nucleotide sequence. Verifying the claimed identities of >13,700 sequences highlighted 1,535 wrongly identified sequences, most of which were claimed targeting reagents for the analysis of 365 human protein-coding genes and 120 non-coding RNAs. The 712 problematic articles have received >17,000 citations, including citations by human clinical trials. Given our estimate that approximately one-quarter of problematic articles may misinform the future development of human therapies, urgent measures are required to address unreliable gene research articles.


Subject(s)
Base Sequence/genetics , Genetic Research , Genome, Human/genetics , Publications/statistics & numerical data , Scientific Experimental Error/statistics & numerical data , Human Genetics/standards , Humans , Proteins/genetics
16.
Dev Biol ; 482: 34-43, 2022 02.
Article in English | MEDLINE | ID: mdl-34902310

ABSTRACT

The DAN gene family (DAN, Differential screening-selected gene Aberrant in Neuroblastoma) is a group of genes that is expressed during development and plays fundamental roles in limb bud formation and digitation, kidney formation and morphogenesis and left-right axis specification. During adulthood the expression of these genes are associated with diseases, including cancer. Although most of the attention to this group of genes has been dedicated to understanding its role in physiology and development, its evolutionary history remains poorly understood. Thus, the goal of this study is to investigate the evolutionary history of the DAN gene family in vertebrates, with the objective of complementing the already abundant physiological information with an evolutionary context. Our results recovered the monophyly of all DAN gene family members and divide them into five main groups. In addition to the well-known DAN genes, our phylogenetic results revealed the presence of two new DAN gene lineages; one is only retained in cephalochordates, whereas the other one (GREM3) was only identified in cartilaginous fish, holostean fish, and coelacanth. According to the phyletic distribution of the genes, the ancestor of gnathostomes possessed a repertoire of eight DAN genes, and during the radiation of the group GREM1, GREM2, SOST, SOSTDC1, and NBL1 were retained in all major groups, whereas, GREM3, CER1, and DAND5 were differentially lost.


Subject(s)
Base Sequence/genetics , Cell Cycle Proteins/genetics , Conserved Sequence/genetics , Embryonic Development/genetics , Adaptor Proteins, Signal Transducing/genetics , Amphibians , Animals , Birds , Body Patterning/genetics , Cytokines/genetics , Evolution, Molecular , Fishes , Intercellular Signaling Peptides and Proteins/genetics , Limb Buds/growth & development , Mammals , Morphogenesis/genetics , Reptiles
17.
Gene ; 807: 145960, 2022 Jan 10.
Article in English | MEDLINE | ID: mdl-34509581

ABSTRACT

Opsin is a fellow of the G protein-coupled receptors (GPCRs) superfamily. It can be divided into visual and non-visual opsin according to whether it is directly involved in visual imaging. Opsin plays an important role in visual image formation and the regulation of non-image forming functions such as circadian entrainment in the growth, development and evolution of fish. Crimson snapper belongs to Perciforme mainly found in the Indo-West Pacific and the South China Sea. It is one of the most influential economic fishes in the South China Sea. In order to study the existence and expression of opsin gene in Crimson snapper, we sequenced the genome and tissue sample transcriptome of Crimson snapper. In this study, 32 opsin genes were identified from the genome of Crimson snapper. The length of these genes ranged from 1061 bp to 86203 bp and were distributed on 15 different chromosomes. The analysis of opsin gene family of Crimson snapper showed that the sws2 had two extra copies as compared with that of Zebrafish. Domain and motif analysis revealed that all the 32 opsin genes have seven-(pass)-transmembrane domain receptors (7TM receptors) each, and the opsin family contained 10 common motifs. The expression level of opsin gene, confirmed by RT-qPCR, was analyzed by using nine tissues transcriptome databases of Crimson snapper. The results showed that almost all opsin genes were highly expressed in the retina and brain, except opn7a and opn7b which were expressed in intestine and red skin, and almost no expression in other tissues. Our results provide a comprehensive basic knowledge for the opsin gene family of Crimson snapper, which has significance for the study of the function of opsin in Lutjanidaes.


Subject(s)
Opsins/genetics , Perciformes/genetics , Animals , Base Sequence/genetics , China , Cloning, Molecular/methods , Fish Diseases/genetics , Gene Expression/genetics , Opsins/metabolism , Perciformes/metabolism , Receptors, G-Protein-Coupled/genetics , Transcriptome/genetics
18.
Mol Biol Rep ; 49(2): 1513-1517, 2022 Feb.
Article in English | MEDLINE | ID: mdl-34797491

ABSTRACT

Due to the drawback of traditional and current diagnostic methods including serological and molecular assays, the development of the rapid and free-PCR techniques can be an alternative technique for the human T-cell lymphotropic virus (HTLV-1) DNA detection sequences. On the other hand, early detection of HTLV-1 prevents two dangerous diseases including Adult T-cell leukemia/lymphoma and HTLV-1 Associated Myelopathy/Tropical Spastic Paraparesis. The biosensor-based methods are sensitive techniques that can provide new opportunities to detect infectious diseases, particularly in the early stage. This study provides a comparative view among recently designed biosensors for the detection of HTLV-1.


Subject(s)
Biosensing Techniques/methods , HTLV-I Infections/diagnosis , Human T-lymphotropic virus 1/genetics , Base Sequence/genetics , HTLV-I Infections/genetics , Human T-lymphotropic virus 1/pathogenicity , Humans , Leukemia-Lymphoma, Adult T-Cell/genetics , Leukemia-Lymphoma, Adult T-Cell/pathology , Leukemia-Lymphoma, Adult T-Cell/virology , Paraparesis, Tropical Spastic/virology , Polymerase Chain Reaction/methods
19.
Gene ; 808: 145974, 2022 Jan 15.
Article in English | MEDLINE | ID: mdl-34592348

ABSTRACT

The mitochondrial genome (mitogenome) has been regarded as significant source of data to better understand the phylogenetic relationships within the Euphasmatodea, but no mitogenome in Aschiphasmatoidea has been sequenced to date. In this study, two mitogenomes of Orthomeria smaragdinum and Nanhuaphasma hamicercum of Aschiphasmatidae were sequenced and annotated for the first time. The same mitochondrial gene rearrangement structure was present in the two mitogenomes sequenced, showing as the translocation of tRNA-Arg and tRNA-Asn, which conformed to the tandem duplication-random loss and could be used as a possible synapomorphy for Aschiphasmatidae. The phylogenetic results based on the maximum likelihood (ML) and bayesian inference (BI) methods both showed that Aschiphasmatidae and Neophasmatodea in Euphasmatodea are sister taxa. Although the monophyly of Oriophasmata, Occidophasmata, Diapheromeridae, Phasmatidae, Lonchodidae and Bacilloidea has not been solved, the monophyly of Neophasmatodea and Phyllioidea was well supported.


Subject(s)
Genome, Mitochondrial/genetics , Mitochondria/genetics , Nematoda/genetics , Animals , Base Sequence/genetics , Gene Order/genetics , Gene Rearrangement/genetics , Genomics/methods , Neoptera/genetics , Phylogeny
20.
Mol Biol Rep ; 49(2): 1587-1591, 2022 Feb.
Article in English | MEDLINE | ID: mdl-34773549

ABSTRACT

BACKGROUND: The highfin blenny, Lupinoblennius nicholsi, is a marine fish species reported in reef and rocky inshore habitats with a disjunct distribution in the southern Gulf of Mexico. Overall, there are very few studies on this species and there is a scarcity of molecular resources for genetic comparisons. We set out to report the first mitochondrial genome for L. nicholsi and report a range expansion for the species. METHODS AND RESULTS: An individual of L. nicholsi was collected from the coast of Dauphin Island, Alabama. The mitochondrial genome was sequenced, assembled, and annotated. The fragment corresponding to cytochrome oxidase I (COI) was used to compare this sample to other cryptobenthic species of the Atlantic. Finding a mature individual in the coast of Alabama implies this species has a continuous distribution throughout the northern Gulf of Mexico. The mitochondrial genome of L. nicholsi is 16,416 bp in length and comprised of 13 protein coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding D-loop. Comparisons using COI support the species is L. nicholsi and separate it from other cryptobenthic fishes found in the area. CONCLUSIONS: This study represents the first mitochondrial genome for this L. nicholsi, serving as a reference for future comparative studies with marine fishes. By reporting the range expansion of this species, this study provides insights on the fish diversity of the Gulf of Mexico.


Subject(s)
Genome, Mitochondrial/genetics , Perciformes/genetics , Animals , Base Sequence/genetics , DNA, Mitochondrial/genetics , Ecosystem , Fishes/genetics , Phylogeny , RNA, Transfer/genetics
SELECTION OF CITATIONS
SEARCH DETAIL