Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
PLoS Comput Biol ; 20(2): e1011870, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-38335225

RESUMEN

Chloroplasts are photosynthetic organelles in algal and plant cells that contain their own genome. Chloroplast genomes are commonly used in evolutionary studies and taxonomic identification and are increasingly becoming a target for crop improvement studies. As DNA sequencing becomes more affordable, researchers are collecting vast swathes of high-quality whole-genome sequence data from laboratory and field settings alike. Whole tissue read libraries sequenced with the primary goal of understanding the nuclear genome will inadvertently contain many reads derived from the chloroplast genome. These whole-genome, whole-tissue read libraries can additionally be used to assemble chloroplast genomes with little to no extra cost. While several tools exist that make use of short-read second generation and third-generation long-read sequencing data for chloroplast genome assembly, these tools may have complex installation steps, inadequate error reporting, poor expandability, and/or lack scalability. Here, we present CLAW (Chloroplast Long-read Assembly Workflow), an easy to install, customise, and use Snakemake tool to assemble chloroplast genomes from chloroplast long-reads found in whole-genome read libraries (https://github.com/aaronphillips7493/CLAW). Using 19 publicly available reference chloroplast genome assemblies and long-read libraries from algal, monocot and eudicot species, we show that CLAW can rapidly produce chloroplast genome assemblies with high similarity to the reference assemblies. CLAW was designed such that users have complete control over parameterisation, allowing individuals to optimise CLAW to their specific use cases. We expect that CLAW will provide researchers (with varying levels of bioinformatics expertise) with an additional resource useful for contributing to the growing number of publicly available chloroplast genome assemblies.


Asunto(s)
Genoma del Cloroplasto , Humanos , Genoma del Cloroplasto/genética , Flujo de Trabajo , Análisis de Secuencia de ADN , Biología Computacional , Cloroplastos/genética , Secuenciación de Nucleótidos de Alto Rendimiento
2.
Artículo en Inglés | MEDLINE | ID: mdl-36795096

RESUMEN

Six strains, KI11_D11T, KI4_B1, KI11_C11T, KI16_H9T, KI4_A6T and KI3_B9T, were isolated from insects and flowers on Kangaroo Island, South Australia. On the basis of 16S rRNA gene phylogeny, strains KI11_D11T, KI4_B1, KI11_C11T, KI16_H9T, KI4_A6T were found to be closely related to Fructilactobacillus ixorae Ru20-1T. Due to the lack of a whole genome sequence for this species, whole genome sequencing of Fructilactobacillus ixorae Ru20-1T was undertaken. KI3_B9T was found to be closely related to Fructobacillus tropaeoli F214-1T. Utilizing core gene phylogenetics and whole genome analyses, such as determination of AAI, ANI and dDDH, we propose that these six isolates represent five novel species with the names Fructilactobacillus cliffordii (KI11_D11T= LMG 32130T = NBRC 114988T), Fructilactobacillus hinvesii (KI11_C11T = LMG 32129T = NBRC 114987T), Fructilactobacillus myrtifloralis (KI16_H9T= LMG 32131T = NBRC 114989T) Fructilactobacillus carniphilus (KI4_A6T = LMG 32127T = NBRC 114985T) and Fructobacillus americanaquae (KI3_B9T = LMG 32124T = NBRC 114983T). Chemotaxonomic analyses detected no fructophilic characters for these strains of member of the genus Fructilactobacillus. KI3_B9T was found to be obligately fructophilic, similarly to its phylogenetic neighbours in the genus Fructobacillus. This study represents the first isolation, to our knowledge, of novel species in the family Lactobacillaceae from the Australian wild.


Asunto(s)
Lactobacillales , Animales , Lactobacillales/genética , Filogenia , ARN Ribosómico 16S/genética , Australia del Sur , Análisis de Secuencia de ADN , ADN Bacteriano/genética , Composición de Base , Ácidos Grasos/química , Australia , Técnicas de Tipificación Bacteriana , Lactobacillus , Insectos , Flores/microbiología
3.
Artículo en Inglés | MEDLINE | ID: mdl-36094463

RESUMEN

Four strains, SG5_A10T, SGEP1_A5T, SG4_D2T, and SG4_A1T, were isolated from the honey or homogenate of Australian stingless bee species Tetragonula carbonaria and Austroplebeia australis. Based on 16S rRNA gene phylogeny, core gene phylogenetics, whole genome analyses such as determination of amino acid identity (AAI), cAAI of conserved genes, average nucleotide identity (ANI), and digital DNA-DNA hybridization (dDDH), chemotaxonomic analyses, and the novel isolation sources and unique geography, we propose three new species and one genus with the names Apilactobacillus apisilvae sp. nov. (SG5_A10T = LMG 32133T = NBRC 114991T), Bombilactobacillus thymidiniphilus sp. nov. (SG4_A1T = LMG 32125T = NBRC 114984T), Bombilactobacillus folatiphilus sp. nov. (SG4_D2T = LMG 32126T = NBRC 115004T) and Nicolia spurrieriana sp. nov. (SGEP1_A5T = LMG 32134T = NBRC 114992T). Three out of the four strains were found to be fructophilic, where SG5_A10T and SGEP1_A5T belong to obligately fructophilic lactic acid bacteria, and SG4_D2T representing a new type denoted here as kinetically fructophilic. This study represents the first published lactic acid bacterial species associated with the unique niche of Australian stingless bees.


Asunto(s)
Lactobacillales , Animales , Australia , Técnicas de Tipificación Bacteriana , Composición de Base , Abejas , ADN Bacteriano/genética , Ácidos Grasos/química , Ácido Láctico , Lactobacillales/genética , Filogenia , ARN Ribosómico 16S/genética , Análisis de Secuencia de ADN
4.
Mol Biol Evol ; 36(12): 2922-2924, 2019 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-31411700

RESUMEN

Comparing newly obtained and previously known nucleotide and amino-acid sequences underpins modern biological research. BLAST is a well-established tool for such comparisons but is challenging to use on new data sets. We combined a user-centric design philosophy with sustainable software development approaches to create Sequenceserver, a tool for running BLAST and visually inspecting BLAST results for biological interpretation. Sequenceserver uses simple algorithms to prevent potential analysis errors and provides flexible text-based and visual outputs to support researcher productivity. Our software can be rapidly installed for use by individuals or on shared servers.


Asunto(s)
Biología Computacional/métodos , Técnicas Genéticas , Programas Informáticos
5.
BMC Biotechnol ; 19(1): 71, 2019 10 30.
Artículo en Inglés | MEDLINE | ID: mdl-31684940

RESUMEN

BACKGROUND: The CRISPR-Cas9 system is a powerful and versatile tool for crop genome editing. However, achieving highly efficient and specific editing in polyploid species can be a challenge. The efficiency and specificity of the CRISPR-Cas9 system depends critically on the gRNA used. Here, we assessed the activities and specificities of seven gRNAs targeting 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) in hexaploid wheat protoplasts. EPSPS is the biological target of the widely used herbicide glyphosate. RESULTS: The seven gRNAs differed substantially in their on-target activities, with mean indel frequencies ranging from 0% to approximately 20%. There was no obvious correlation between experimentally determined and in silico predicted on-target gRNA activity. The presence of a single mismatch within the seed region of the guide sequence greatly reduced but did not abolish gRNA activity, whereas the presence of an additional mismatch, or the absence of a PAM, all but abolished gRNA activity. Large insertions (≥20 bp) of DNA vector-derived sequence were detected at frequencies up to 8.5% of total indels. One of the gRNAs exhibited several properties that make it potentially suitable for the development of non-transgenic glyphosate resistant wheat. CONCLUSIONS: We have established a rapid and reliable method for gRNA validation in hexaploid wheat protoplasts. The method can be used to identify gRNAs that have favourable properties. Our approach is particularly suited to polyploid species, but should be applicable to any plant species amenable to protoplast transformation.


Asunto(s)
Sistemas CRISPR-Cas/genética , Edición Génica/métodos , Genoma de Planta/genética , ARN Guía de Kinetoplastida/genética , Triticum/genética , Protoplastos/metabolismo
6.
Plant Biotechnol J ; 17(10): 1905-1913, 2019 10.
Artículo en Inglés | MEDLINE | ID: mdl-30839150

RESUMEN

The development and adoption of hybrid seed technology have led to dramatic increases in agricultural productivity. However, it has been a challenge to develop a commercially viable platform for the production of hybrid wheat (Triticum aestivum) seed due to wheat's strong inbreeding habit. Recently, a novel platform for commercial hybrid seed production was described. This hybridization platform utilizes nuclear male sterility to force outcrossing and has been applied to maize and rice. With the recent molecular identification of the wheat male fertility gene Ms1, it is now possible to extend the use of this novel hybridization platform to wheat. In this report, we used the CRISPR/Cas9 system to generate heritable, targeted mutations in Ms1. The introduction of biallelic frameshift mutations into Ms1 resulted in complete male sterility in wheat cultivars Fielder and Gladius, and several of the selected male-sterile lines were potentially non-transgenic. Our study demonstrates the utility of the CRISPR/Cas9 system for the rapid generation of male sterility in commercial wheat cultivars. This represents an important step towards capturing heterosis to improve wheat yields, through the production and use of hybrid seed on an industrial scale.


Asunto(s)
Sistemas CRISPR-Cas , Infertilidad Vegetal , Semillas , Triticum/genética , Mutación del Sistema de Lectura , Técnicas de Inactivación de Genes , Genes de Plantas , Poliploidía
7.
Brief Bioinform ; 18(3): 537-544, 2017 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-27084333

RESUMEN

The Bioinformatics Training Platform (BTP) has been developed to provide access to the computational infrastructure required to deliver sophisticated hands-on bioinformatics training courses. The BTP is a cloud-based solution that is in active use for delivering next-generation sequencing training to Australian researchers at geographically dispersed locations. The BTP was built to provide an easy, accessible, consistent and cost-effective approach to delivering workshops at host universities and organizations with a high demand for bioinformatics training but lacking the dedicated bioinformatics training suites required. To support broad uptake of the BTP, the platform has been made compatible with multiple cloud infrastructures. The BTP is an open-source and open-access resource. To date, 20 training workshops have been delivered to over 700 trainees at over 10 venues across Australia using the BTP.


Asunto(s)
Biología Computacional , Australia , Secuenciación de Nucleótidos de Alto Rendimiento , Universidades
8.
Brief Bioinform ; 18(2): 348-355, 2017 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-26984618

RESUMEN

There is a clear demand for hands-on bioinformatics training. The development of bioinformatics workshop content is both time-consuming and expensive. Therefore, enabling trainers to develop bioinformatics workshops in a way that facilitates reuse is becoming increasingly important. The most widespread practice for sharing workshop content is through making PDF, PowerPoint and Word documents available online. While this effort is to be commended, such content is usually not so easy to reuse or repurpose and does not capture all the information required for a third party to rerun a workshop. We present an open, collaborative framework for developing and maintaining, reusable and shareable hands-on training workshop content.


Asunto(s)
Biología Computacional , Conducta Cooperativa , Humanos
9.
Theor Appl Genet ; 132(7): 1965-1979, 2019 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-30899967

RESUMEN

KEY MESSAGE: Elite wheat pollinators are critical for successful hybrid breeding. We identified Rht-B1 and Ppd-D1 loci affecting multiple pollinator traits and therefore represent major targets for improving hybrid seed production. Hybrid breeding has a great potential to significantly boost wheat yields. Ideal male pollinators would be taller in stature, contain many spikelets well-spaced along the spike and exhibit high extrusion of large anthers. Most importantly, flowering time would match with that of the female parent. Available genetic resources for developing an elite wheat pollinator are limited, and the genetic basis for many of these traits is largely unknown. Here, we report on the genetic analysis of pollinator traits using biparental mapping populations. We identified two anther extrusion QTLs of medium effect, one on chromosome 1BL and the other on 4BS coinciding with the semi-dwarfing Rht-B1 locus. The effect of Rht-B1 alleles on anther extrusion is genotype dependent, while tall plant Rht-B1a allele is consistently associated with large anthers. Multiple QTLs were identified at the Ppd-D1 locus for anther length, spikelet number and spike length, with the photoperiod-sensitive Ppd-D1b allele associated with favourable pollinator traits in the populations studied. We also demonstrated that homeoloci, Rht-D1 and Ppd-B1, influence anther length among other traits. These results suggest that combinations of Rht-B1 and Ppd-D1 alleles control multiple pollinator traits and should be major targets of hybrid wheat breeding programs.


Asunto(s)
Flores/genética , Polinización/genética , Sitios de Carácter Cuantitativo , Triticum/genética , Alelos , Mapeo Cromosómico , Genes de Plantas , Genotipo , Fenotipo , Fotoperiodo
10.
BMC Genomics ; 19(1): 941, 2018 Dec 17.
Artículo en Inglés | MEDLINE | ID: mdl-30558550

RESUMEN

BACKGROUND: Democratising the growing body of whole genome sequencing data available for Triticum aestivum (bread wheat) has been impeded by the lack of a genome reference and the large computational requirements for analysing these data sets. RESULTS: DAWN (Diversity Among Wheat geNomes) integrates data from the T. aestivum Chinese Spring (CS) IWGSC RefSeq v1.0 genome with public WGS and exome data from 17 and 62 accessions respectively, enabling researchers and breeders alike to investigate genotypic differences between wheat accessions at the level of whole chromosomes down to individual genes. CONCLUSIONS: Using DAWN we show that it is possible to visualise small and large chromosomal deletions, identify haplotypes at a glance and spot the consequences of selective breeding. DAWN allows us to detect the break points of alien introgression segments brought into an accession when transferring desired genes. Furthermore, we can find possible explanations for reduced recombination in parts of a chromosome, we can predict regions with linkage drag, and also look at diversity in centromeric regions.


Asunto(s)
Bases de Datos Genéticas , Genoma de Planta , Triticum/genética , Centrómero/genética , Genotipo , Haplotipos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Secuenciación del Exoma
12.
Mol Phylogenet Evol ; 100: 135-147, 2016 07.
Artículo en Inglés | MEDLINE | ID: mdl-27044608

RESUMEN

Calcineurin B-like protein interacting protein kinases (CIPKs) are key regulators of pre-transcriptional and post-translational responses to abiotic stress. Arabidopsis thaliana CIPK16 (AtCIPK16) was identified from a forward genetic screen as a gene that mediates lower shoot salt accumulation and improved salinity tolerance in Arabidopsis and transgenic barley. Here, we aimed to gain an understanding of the evolution of AtCIPK16, and orthologues of CIPK16 in other plant species including barley, by conducting a phylogenetic analysis of terrestrial plant species. The resulting protein sequence based phylogenetic trees revealed a single clade that included AtCIPK16 along with two segmentally duplicated CIPKs, AtCIPK5 and AtCIPK25. No monocots had proteins that fell into this clade; instead the most closely related monocot proteins formed a group basal to the entire CIPK16, 5 and 25 clade. We also found that AtCIPK16 contains a core Brassicales specific indel and a putative nuclear localisation signal, which are synapomorphic characters of CIPK16 genes. In addition, we present a model that proposes the evolution of CIPK16, 5 and 25 clade.


Asunto(s)
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Evolución Molecular , Proteínas Serina-Treonina Quinasas/genética , Secuencia de Aminoácidos , Arabidopsis/metabolismo , Proteínas de Arabidopsis/clasificación , Proteínas de Arabidopsis/metabolismo , Exones , Hordeum/genética , Intrones , Filogenia , Proteínas Serina-Treonina Quinasas/clasificación , Proteínas Serina-Treonina Quinasas/metabolismo , Tolerancia a la Sal/genética , Alineación de Secuencia
13.
Plant J ; 80(3): 516-26, 2014 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-25158883

RESUMEN

Bread wheat (Triticum aestivum L.) has a major salt tolerance locus, Kna1, responsible for the maintenance of a high cytosolic K(+) /Na(+) ratio in the leaves of salt stressed plants. The Kna1 locus encompasses a large DNA fragment, the distal 14% of chromosome 4DL. Limited recombination has been observed at this locus making it difficult to map genetically and identify the causal gene. Here, we decipher the function of TaHKT1;5-D, a candidate gene underlying the Kna1 locus. Transport studies using the heterologous expression systems Saccharomyces cerevisiae and Xenopus laevis oocytes indicated that TaHKT1;5-D is a Na(+) -selective transporter. Transient expression in Arabidopsis thaliana mesophyll protoplasts and in situ polymerase chain reaction indicated that TaHKT1;5-D is localised on the plasma membrane in the wheat root stele. RNA interference-induced silencing decreased the expression of TaHKT1;5-D in transgenic bread wheat lines which led to an increase in the Na(+) concentration in the leaves. This indicates that TaHKT1;5-D retrieves Na(+) from the xylem vessels in the root and has an important role in restricting the transport of Na(+) from the root to the leaves in bread wheat. Thus, TaHKT1;5-D confers the essential salinity tolerance mechanism in bread wheat associated with the Kna1 locus via shoot Na(+) exclusion and is critical in maintaining a high K(+) /Na(+) ratio in the leaves. These findings show there is potential to increase the salinity tolerance of bread wheat by manipulation of HKT1;5 genes.


Asunto(s)
Proteínas de Transporte de Catión/genética , Regulación de la Expresión Génica de las Plantas , Proteínas de Plantas/genética , Sodio/metabolismo , Simportadores/genética , Triticum/genética , Animales , Arabidopsis/genética , Arabidopsis/metabolismo , Secuencia de Bases , Proteínas de Transporte de Catión/metabolismo , Expresión Génica , Datos de Secuencia Molecular , Oocitos , Hojas de la Planta/genética , Hojas de la Planta/metabolismo , Proteínas de Plantas/metabolismo , Raíces de Plantas/genética , Raíces de Plantas/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Tolerancia a la Sal , Análisis de Secuencia de ADN , Simportadores/metabolismo , Transgenes , Triticum/citología , Triticum/metabolismo , Xenopus laevis , Xilema/metabolismo
14.
Brief Bioinform ; 14(5): 563-74, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23543352

RESUMEN

The widespread adoption of high-throughput next-generation sequencing (NGS) technology among the Australian life science research community is highlighting an urgent need to up-skill biologists in tools required for handling and analysing their NGS data. There is currently a shortage of cutting-edge bioinformatics training courses in Australia as a consequence of a scarcity of skilled trainers with time and funding to develop and deliver training courses. To address this, a consortium of Australian research organizations, including Bioplatforms Australia, the Commonwealth Scientific and Industrial Research Organisation and the Australian Bioinformatics Network, have been collaborating with EMBL-EBI training team. A group of Australian bioinformaticians attended the train-the-trainer workshop to improve training skills in developing and delivering bioinformatics workshop curriculum. A 2-day NGS workshop was jointly developed to provide hands-on knowledge and understanding of typical NGS data analysis workflows. The road show-style workshop was successfully delivered at five geographically distant venues in Australia using the newly established Australian NeCTAR Research Cloud. We highlight the challenges we had to overcome at different stages from design to delivery, including the establishment of an Australian bioinformatics training network and the computing infrastructure and resource development. A virtual machine image, workshop materials and scripts for configuring a machine with workshop contents have all been made available under a Creative Commons Attribution 3.0 Unported License. This means participants continue to have convenient access to an environment they had become familiar and bioinformatics trainers are able to access and reuse these resources.


Asunto(s)
Biología Computacional/educación , Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos , Australia , Instrucción por Computador/métodos , Conducta Cooperativa , Curriculum , Enseñanza
15.
Nucleic Acids Res ; 40(Web Server issue): W340-7, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22693213

RESUMEN

Analyses of multiple sequence alignments generally focus on well-defined conserved sequence blocks, while the rest of the alignment is largely ignored or discarded. This is especially true in phylogenomics, where large multigene datasets are produced through automated pipelines. However, some of the most powerful phylogenetic markers have been found in the variable length regions of multiple alignments, particularly insertions/deletions (indels) in protein sequences. We have developed Sequence Feature and Indel Region Extractor (SeqFIRE) to enable the automated identification and extraction of indels from protein sequence alignments. The program can also extract conserved blocks and identify fast evolving sites using a combination of conservation and entropy. All major variables can be adjusted by the user, allowing them to identify the sets of variables most suited to a particular analysis or dataset. Thus, all major tasks in preparing an alignment for further analysis are combined in a single flexible and user-friendly program. The output includes a numbered list of indels, alignments in NEXUS format with indels annotated or removed and indel-only matrices. SeqFIRE is a user-friendly web application, freely available online at www.seqfire.org/.


Asunto(s)
Mutación INDEL , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína , Programas Informáticos , Algoritmos , Internet , Proteínas/genética
16.
BMC Genomics ; 14: 13, 2013 Jan 16.
Artículo en Inglés | MEDLINE | ID: mdl-23324119

RESUMEN

BACKGROUND: The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. RESULTS: Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. CONCLUSIONS: We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.


Asunto(s)
Pollos/genética , Perfilación de la Expresión Génica , Genómica , Animales , Bases de Datos Genéticas , Regulación de la Expresión Génica , Redes Reguladoras de Genes/genética , Familia de Multigenes/genética , Motivos de Nucleótidos/genética , Programas Informáticos
17.
Sci Rep ; 13(1): 1528, 2023 01 27.
Artículo en Inglés | MEDLINE | ID: mdl-36707685

RESUMEN

Plantago ovata is cultivated for production of its seed husk (psyllium). When wet, the husk transforms into a mucilage with properties suitable for pharmaceutical industries, utilised in supplements for controlling blood cholesterol levels, and food industries for making gluten-free products. There has been limited success in improving husk quantity and quality through breeding approaches, partly due to the lack of a reference genome. Here we constructed the first chromosome-scale reference assembly of P. ovata using a combination of 5.98 million PacBio and 636.5 million Hi-C reads. We also used corrected PacBio reads to estimate genome size and transcripts to generate gene models. The final assembly covers ~ 500 Mb with 99.3% gene set completeness. A total of 97% of the sequences are anchored to four chromosomes with an N50 of ~ 128.87 Mb. The P. ovata genome contains 61.90% repeats, where 40.04% are long terminal repeats. We identified 41,820 protein-coding genes, 411 non-coding RNAs, 108 ribosomal RNAs, and 1295 transfer RNAs. This genome will provide a resource for plant breeding programs to, for example, reduce agronomic constraints such as seed shattering, increase psyllium yield and quality, and overcome crop disease susceptibility.


Asunto(s)
Plantago , Psyllium , Plantago/genética , Fitomejoramiento , Cromosomas , Genoma
18.
Sci Rep ; 12(1): 10823, 2022 06 25.
Artículo en Inglés | MEDLINE | ID: mdl-35752642

RESUMEN

Oryza australiensis is a wild rice native to monsoonal northern Australia. The International Oryza Map Alignment Project emphasises its significance as the sole representative of the EE genome clade. Assembly of the O. australiensis genome has previously been challenging due to its high Long Terminal Repeat (LTR) retrotransposon (RT) content. Oxford Nanopore long reads were combined with Illumina short reads to generate a high-quality ~ 858 Mbp genome assembly within 850 contigs with 46× long read coverage. Reference-guided scaffolding increased genome contiguity, placing 88.2% of contigs into 12 pseudomolecules. After alignment to the Oryza sativa cv. Nipponbare genome, we observed several structural variations. PacBio Iso-Seq data were generated for five distinct tissues to improve the functional annotation of 34,587 protein-coding genes and 42,329 transcripts. We also report SNV numbers for three additional O. australiensis genotypes based on Illumina re-sequencing. Although genetic similarity reflected geographical separation, the density of SNVs also correlated with our previous report on variations in salinity tolerance. This genome re-confirms the genetic remoteness of the O. australiensis lineage within the O. officinalis genome complex. Assembly of a high-quality genome for O. australiensis provides an important resource for the discovery of critical genes involved in development and stress tolerance.


Asunto(s)
Oryza , Genoma , Secuenciación de Nucleótidos de Alto Rendimiento , Oryza/genética , Retroelementos/genética , Análisis de Secuencia de ADN
19.
BMC Genomics ; 12: 607, 2011 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-22171619

RESUMEN

BACKGROUND: In livestock populations the genetic contribution to muscling is intensively monitored in the progeny of industry sires and used as a tool in selective breeding programs. The genes and pathways conferring this genetic merit are largely undefined. Genetic variation within a population has potential, amongst other mechanisms, to alter gene expression via cis- or trans-acting mechanisms in a manner that impacts the functional activities of specific pathways that contribute to muscling traits. By integrating sire-based genetic merit information for a muscling trait with progeny-based gene expression data we directly tested the hypothesis that there is genetic structure in the gene expression program in ovine skeletal muscle. RESULTS: The genetic performance of six sires for a well defined muscling trait, longissimus lumborum muscle depth, was measured using extensive progeny testing and expressed as an Estimated Breeding Value by comparison with contemporary sires. Microarray gene expression data were obtained for longissimus lumborum samples taken from forty progeny of the six sires (4-8 progeny/sire). Initial unsupervised hierarchical clustering analysis revealed strong genetic architecture to the gene expression data, which also discriminated the sire-based Estimated Breeding Value for the trait. An integrated systems biology approach was then used to identify the major functional pathways contributing to the genetics of enhanced muscling by using both Estimated Breeding Value weighted gene co-expression network analysis and a differential gene co-expression network analysis. The modules of genes revealed by these analyses were enriched for a number of functional terms summarised as muscle sarcomere organisation and development, protein catabolism (proteosome), RNA processing, mitochondrial function and transcriptional regulation. CONCLUSIONS: This study has revealed strong genetic structure in the gene expression program within ovine longissimus lumborum muscle. The balance between muscle protein synthesis, at the levels of both transcription and translation control, and protein catabolism mediated by regulated proteolysis is likely to be the primary determinant of the genetic merit for the muscling trait in this sheep population. There is also evidence that high genetic merit for muscling is associated with a fibre type shift toward fast glycolytic fibres. This study provides insight into mechanisms, presumably subject to strong artificial selection, that underpin enhanced muscling in sheep populations.


Asunto(s)
Perfilación de la Expresión Génica , Músculo Esquelético/metabolismo , Animales , Análisis de Secuencia por Matrices de Oligonucleótidos , Ovinos
20.
Bioinformatics ; 26(3): 411-3, 2010 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-20007253

RESUMEN

SUMMARY: We make the PCIT algorithm, used for detecting meaningful gene-gene associations in co-expression networks, available as an R package. Automatic detection of a suitable parallel environment is used such that scripts are portable between parallel and non-parallel environments with no modification of the script. AVAILABILITY AND IMPLEMENTATION: Source code and binaries freely available (under GPL-3) for download via CRAN at http://cran.r-project.org/package=PCIT, implemented in R and supported on Linux and MS Windows.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Redes Reguladoras de Genes , Programas Informáticos , Bases de Datos Genéticas , Perfilación de la Expresión Génica/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA