Your browser doesn't support javascript.
loading
Principles of Experimental Design for Big Data Analysis.
Drovandi, Christopher C; Holmes, Christopher; McGree, James M; Mengersen, Kerrie; Richardson, Sylvia; Ryan, Elizabeth G.
Affiliation
  • Drovandi CC; School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia, 4000.
  • Holmes C; Department of Statistics, University of Oxford, Oxford, UK, OX1 3TG.
  • McGree JM; School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia, 4000.
  • Mengersen K; School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia, 4000.
  • Richardson S; MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge, UK, CB2 0SR.
  • Ryan EG; Biostatistics Department, King's College London, UK, SE5 8AF.
Stat Sci ; 32(3): 385-404, 2017 Aug.
Article in En | MEDLINE | ID: mdl-28883686
ABSTRACT
Big Datasets are endemic, but are often notoriously difficult to analyse because of their size, heterogeneity and quality. The purpose of this paper is to open a discourse on the potential for modern decision theoretic optimal experimental design methods, which by their very nature have traditionally been applied prospectively, to improve the analysis of Big Data through retrospective designed sampling in order to answer particular questions of interest. By appealing to a range of examples, it is suggested that this perspective on Big Data modelling and analysis has the potential for wide generality and advantageous inferential and computational properties. We highlight current hurdles and open research questions surrounding efficient computational optimisation in using retrospective designs, and in part this paper is a call to the optimisation and experimental design communities to work together in the field of Big Data analysis.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies Language: En Journal: Stat Sci Year: 2017 Document type: Article

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies Language: En Journal: Stat Sci Year: 2017 Document type: Article