Your browser doesn't support javascript.
loading
The Essential Toolbox of Data Science: Python, R, Git, and Docker.
Pittard, W Stephen; Li, Shuzhao.
Afiliação
  • Pittard WS; Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, USA.
  • Li S; Department of Medicine, Emory University School of Medicine, Atlanta, GA, USA. shuzhao.li@gmail.com.
Methods Mol Biol ; 2104: 265-311, 2020.
Article em En | MEDLINE | ID: mdl-31953823
ABSTRACT
The daily work in data science involves a set of essential tools the programming languages Python and R, the version control tool Git and the virtualization tool Docker. Proficiency in at least one programming language is required for data science. R is tied to a computing environment that focuses on statistics, in which many new algorithms in genomics and biomedicine are first published. Python has a root in system administration, and is a superb language for general programming. Version control is critical to managing complex projects, even if software development is not involved. Docker container is becoming a key tool for deployment, portability, and reproducibility. This chapter provides a self-contained practical guide of these topics so that readers can use it as a reference and to plan their training.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Biologia Computacional / Ciência de Dados Idioma: En Revista: Methods Mol Biol Assunto da revista: BIOLOGIA MOLECULAR Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Biologia Computacional / Ciência de Dados Idioma: En Revista: Methods Mol Biol Assunto da revista: BIOLOGIA MOLECULAR Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos