Pesquisa | Biblioteca Virtual em Saúde

CyVerse: Cyberinfrastructure for open science.

Swetnam, Tyson L; Antin, Parker B; Bartelme, Ryan; Bucksch, Alexander; Camhy, David; Chism, Greg; Choi, Illyoung; Cooksey, Amanda M; Cosi, Michele; Cowen, Cindy; Culshaw-Maurer, Michael; Davey, Robert; Davey, Sean; Devisetty, Upendra; Edgin, Tony; Edmonds, Andy; Fedorov, Dmitry; Frady, Jeremy; Fonner, John; Gillan, Jeffrey K; Hossain, Iqbal; Joyce, Blake; Lang, Konrad; Lee, Tina; Littin, Shelley; McEwen, Ian; Merchant, Nirav; Micklos, David; Nelson, Andrew; Ramsey, Ashley; Roberts, Sarah; Sarando, Paul; Skidmore, Edwin; Song, Jawon; Sprinkle, Mary Margaret; Srinivasan, Sriram; Stanzione, Dan; Strootman, Jonathan D; Stryeck, Sarah; Tuteja, Reetu; Vaughn, Matthew; Wali, Mojib; Wall, Mariah; Walls, Ramona; Wang, Liya; Wickizer, Todd; Williams, Jason; Wregglesworth, John; Lyons, Eric.

PLoS Comput Biol ; 20(2): e1011270, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-38324613

RESUMO

CyVerse, the largest publicly-funded open-source research cyberinfrastructure for life sciences, has played a crucial role in advancing data-driven research since the 2010s. As the technology landscape evolved with the emergence of cloud computing platforms, machine learning and artificial intelligence (AI) applications, CyVerse has enabled access by providing interfaces, Software as a Service (SaaS), and cloud-native Infrastructure as Code (IaC) to leverage new technologies. CyVerse services enable researchers to integrate institutional and private computational resources, custom software, perform analyses, and publish data in accordance with open science principles. Over the past 13 years, CyVerse has registered more than 124,000 verified accounts from 160 countries and was used for over 1,600 peer-reviewed publications. Since 2011, 45,000 students and researchers have been trained to use CyVerse. The platform has been replicated and deployed in three countries outside the US, with additional private deployments on commercial clouds for US government agencies and multinational corporations. In this manuscript, we present a strategic blueprint for creating and managing SaaS cyberinfrastructure and IaC as free and open-source software.

Assuntos

Inteligência Artificial , Software , Humanos , Computação em Nuvem , Editoração

PhytoOracle: Scalable, modular phenomics data processing pipelines.

Gonzalez, Emmanuel M; Zarei, Ariyan; Hendler, Nathanial; Simmons, Travis; Zarei, Arman; Demieville, Jeffrey; Strand, Robert; Rozzi, Bruno; Calleja, Sebastian; Ellingson, Holly; Cosi, Michele; Davey, Sean; Lavelle, Dean O; Truco, Maria José; Swetnam, Tyson L; Merchant, Nirav; Michelmore, Richard W; Lyons, Eric; Pauli, Duke.

Front Plant Sci ; 14: 1112973, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-36950362

RESUMO

As phenomics data volume and dimensionality increase due to advancements in sensor technology, there is an urgent need to develop and implement scalable data processing pipelines. Current phenomics data processing pipelines lack modularity, extensibility, and processing distribution across sensor modalities and phenotyping platforms. To address these challenges, we developed PhytoOracle (PO), a suite of modular, scalable pipelines for processing large volumes of field phenomics RGB, thermal, PSII chlorophyll fluorescence 2D images, and 3D point clouds. PhytoOracle aims to (i) improve data processing efficiency; (ii) provide an extensible, reproducible computing framework; and (iii) enable data fusion of multi-modal phenomics data. PhytoOracle integrates open-source distributed computing frameworks for parallel processing on high-performance computing, cloud, and local computing environments. Each pipeline component is available as a standalone container, providing transferability, extensibility, and reproducibility. The PO pipeline extracts and associates individual plant traits across sensor modalities and collection time points, representing a unique multi-system approach to addressing the genotype-phenotype gap. To date, PO supports lettuce and sorghum phenotypic trait extraction, with a goal of widening the range of supported species in the future. At the maximum number of cores tested in this study (1,024 cores), PO processing times were: 235 minutes for 9,270 RGB images (140.7 GB), 235 minutes for 9,270 thermal images (5.4 GB), and 13 minutes for 39,678 PSII images (86.2 GB). These processing times represent end-to-end processing, from raw data to fully processed numerical phenotypic trait data. Repeatability values of 0.39-0.95 (bounding area), 0.81-0.95 (axis-aligned bounding volume), 0.79-0.94 (oriented bounding volume), 0.83-0.95 (plant height), and 0.81-0.95 (number of points) were observed in Field Scanalyzer data. We also show the ability of PO to process drone data with a repeatability of 0.55-0.95 (bounding area).

Assembling the genome of the African wild rice Oryza longistaminata by exploiting synteny in closely related Oryza species.

Reuscher, Stefan; Furuta, Tomoyuki; Bessho-Uehara, Kanako; Cosi, Michele; Jena, Kshirod K; Toyoda, Atsushi; Fujiyama, Asao; Kurata, Nori; Ashikari, Motoyuki.

Commun Biol ; 1: 162, 2018.

Artigo em Inglês | MEDLINE | ID: mdl-30320230

RESUMO

The African wild rice species Oryza longistaminata has several beneficial traits compared to cultivated rice species, such as resistance to biotic stresses, clonal propagation via rhizomes, and increased biomass production. To facilitate breeding efforts and functional genomics studies, we de-novo assembled a high-quality, haploid-phased genome. Here, we present our assembly, with a total length of 351 Mb, of which 92.2% was anchored onto 12 chromosomes. We detected 34,389 genes and 38.1% of the genome consisted of repetitive content. We validated our assembly by a comparative linkage analysis and by examining well-characterized gene families. This genome assembly will be a useful resource to exploit beneficial alleles found in O. longistaminata. Our results also show that it is possible to generate a high-quality, functionally complete rice genome assembly from moderate SMRT read coverage by exploiting synteny in a closely related Oryza species.

RESUMO

Assuntos

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA