Your browser doesn't support javascript.
loading
ukbtools: An R package to manage and query UK Biobank data.
Hanscombe, Ken B; Coleman, Jonathan R I; Traylor, Matthew; Lewis, Cathryn M.
Affiliation
  • Hanscombe KB; Department of Medical & Molecular Genetics, King's College London, London, United Kingdom.
  • Coleman JRI; Social, Genetic and Developmental Psychiatry Centre, King's College London, London, United Kingdom.
  • Traylor M; Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom.
  • Lewis CM; Department of Medical & Molecular Genetics, King's College London, London, United Kingdom.
PLoS One ; 14(5): e0214311, 2019.
Article in En | MEDLINE | ID: mdl-31150407
ABSTRACT

INTRODUCTION:

The UK Biobank (UKB) is a resource that includes detailed health-related data on about 500,000 individuals and is available to the research community. However, several obstacles limit immediate analysis of the data data files vary in format, may be very large, and have numerical codes for column names.

RESULTS:

ukbtools removes all the upfront data wrangling required to get a single dataset for statistical analysis. All associated data files are merged into a single dataset with descriptive column names. The package also provides tools to assist in quality control by exploring the primary demographics of subsets of participants; query of disease diagnoses for one or more individuals, and estimating disease frequency relative to a reference variable; and to retrieve genetic metadata.

CONCLUSION:

Having a dataset with meaningful variable names, a set of UKB-specific exploratory data analysis tools, disease query functions, and a set of helper functions to explore and write genetic metadata to file, will rapidly enable UKB users to undertake their research.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Database Management Systems / Information Storage and Retrieval / Electronic Health Records / Datasets as Topic Limits: Humans Country/Region as subject: Europa Language: En Journal: PLoS One Journal subject: CIENCIA / MEDICINA Year: 2019 Type: Article Affiliation country: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Database Management Systems / Information Storage and Retrieval / Electronic Health Records / Datasets as Topic Limits: Humans Country/Region as subject: Europa Language: En Journal: PLoS One Journal subject: CIENCIA / MEDICINA Year: 2019 Type: Article Affiliation country: United kingdom