RESUMO
MOTIVATION: Computational biologists face many challenges related to data size, and they need to manage complicated analyses often including multiple stages and multiple tools, all of which must be deployed to modern infrastructures. To address these challenges and maintain reproducibility of results, researchers need (i) a reliable way to run processing stages in any computational environment, (ii) a well-defined way to orchestrate those processing stages and (iii) a data management layer that tracks data as it moves through the processing pipeline. RESULTS: Pachyderm is an open-source workflow system and data management framework that fulfils these needs by creating a data pipelining and data versioning layer on top of projects from the container ecosystem, having Kubernetes as the backbone for container orchestration. We adapted Pachyderm and demonstrated its attractive properties in bioinformatics. A Helm Chart was created so that researchers can use Pachyderm in multiple scenarios. The Pachyderm File System was extended to support block storage. A wrapper for initiating Pachyderm on cloud-agnostic virtual infrastructures was created. The benefits of Pachyderm are illustrated via a large metabolomics workflow, demonstrating that Pachyderm enables efficient and sustainable data science workflows while maintaining reproducibility and scalability. AVAILABILITY AND IMPLEMENTATION: Pachyderm is available from https://github.com/pachyderm/pachyderm. The Pachyderm Helm Chart is available from https://github.com/kubernetes/charts/tree/master/stable/pachyderm. Pachyderm is available out-of-the-box from the PhenoMeNal VRE (https://github.com/phnmnl/KubeNow-plugin) and general Kubernetes environments instantiated via KubeNow. The code of the workflow used for the analysis is available on GitHub (https://github.com/pharmbio/LC-MS-Pachyderm). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Biologia Computacional , Software , Ecossistema , Reprodutibilidade dos Testes , Fluxo de TrabalhoRESUMO
Aspects of density functional resonance theory (DFRT) [D. L. Whitenack and A. Wasserman, Phys. Rev. Lett. 107, 163002 (2011)], a recently developed complex-scaled version of ground-state density functional theory (DFT), are studied in detail. The asymptotic behavior of the complex density function is related to the complex resonance energy and system's threshold energy, and the function's local oscillatory behavior is connected with preferential directions of electron decay. Practical considerations for implementation of the theory are addressed including sensitivity to the complex-scaling parameter, θ. In Kohn-Sham DFRT, it is shown that almost all θ-dependence in the calculated energies and lifetimes can be extinguished via use of a proper basis set or fine grid. The highest occupied Kohn-Sham orbital energy and lifetime are related to physical affinity and width, and the threshold energy of the Kohn-Sham system is shown to be equal to the threshold energy of the interacting system shifted by a well-defined functional. Finally, various complex-scaling conditions are derived which relate the functionals of ground-state DFT to those of DFRT via proper scaling factors and a non-Hermitian coupling-constant system.
RESUMO
Density functional resonance theory (DFRT) is a complex-scaled version of ground-state density functional theory (DFT) that allows one to calculate the in-principle exact resonance energies and lifetimes of metastable anions. In this formalism, the energy and lifetime of the lowest-energy resonance of unbound systems is encoded into a complex "density" that can be obtained via complex-coordinate scaling. This complex density is used as the primary variable in a DFRT calculation, just as the ground-state density would be used as the primary variable in DFT. As in DFT, there exists a mapping of the N-electron interacting system to a Kohn-Sham system of N noninteracting particles. This mapping facilitates self-consistent calculations with an initial guess for the complex density, as illustrated with an exactly solvable model system.