Your browser doesn't support javascript.
loading
Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations.
Lauterbur, M Elise; Cavassim, Maria Izabel A; Gladstein, Ariella L; Gower, Graham; Pope, Nathaniel S; Tsambos, Georgia; Adrion, Jeffrey; Belsare, Saurabh; Biddanda, Arjun; Caudill, Victoria; Cury, Jean; Echevarria, Ignacio; Haller, Benjamin C; Hasan, Ahmed R; Huang, Xin; Iasi, Leonardo Nicola Martin; Noskova, Ekaterina; Obsteter, Jana; Pavinato, Vitor Antonio Correa; Pearson, Alice; Peede, David; Perez, Manolo F; Rodrigues, Murillo F; Smith, Chris C R; Spence, Jeffrey P; Teterina, Anastasia; Tittes, Silas; Unneberg, Per; Vazquez, Juan Manuel; Waples, Ryan K; Wohns, Anthony Wilder; Wong, Yan; Baumdicker, Franz; Cartwright, Reed A; Gorjanc, Gregor; Gutenkunst, Ryan N; Kelleher, Jerome; Kern, Andrew D; Ragsdale, Aaron P; Ralph, Peter L; Schrider, Daniel R; Gronau, Ilan.
Afiliação
  • Lauterbur ME; Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States.
  • Cavassim MIA; Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, United States.
  • Gladstein AL; Embark Veterinary, Inc, Boston, United States.
  • Gower G; Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
  • Pope NS; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Tsambos G; School of Mathematics and Statistics, University of Melbourne, Melbourne, Australia.
  • Adrion J; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Belsare S; Ancestry DNA, San Francisco, United States.
  • Biddanda A; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Caudill V; 54Gene, Inc, Washington, United States.
  • Cury J; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Echevarria I; Universite Paris-Saclay, CNRS, INRIA, Laboratoire Interdisciplinaire des Sciences du Numerique, Orsay, France.
  • Haller BC; School of Life Sciences, University of Glasgow, Glasgow, United Kingdom.
  • Hasan AR; Department of Computational Biology, Cornell University, Ithaca, United States.
  • Huang X; Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.
  • Iasi LNM; Department of Biology, University of Toronto Mississauga, Mississauga, Canada.
  • Noskova E; Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria.
  • Obsteter J; Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria.
  • Pavinato VAC; Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.
  • Pearson A; Computer Technologies Laboratory, ITMO University, St Petersburg, Russian Federation.
  • Peede D; Agricultural Institute of Slovenia, Department of Animal Science, Ljubljana, Slovenia.
  • Perez MF; Entomology Department, The Ohio State University, Wooster, United States.
  • Rodrigues MF; Department of Genetics, University of Cambridge, Cambridge, United Kingdom.
  • Smith CCR; Department of Zoology, University of Cambridge, Cambridge, United Kingdom.
  • Spence JP; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, United States.
  • Teterina A; Center for Computational Molecular Biology, Brown University, Providence, United States.
  • Tittes S; Department of Genetics and Evolution, Federal University of Sao Carlos, Sao Carlos, Brazil.
  • Unneberg P; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Vazquez JM; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Waples RK; Department of Genetics, Stanford University School of Medicine, Stanford, United States.
  • Wohns AW; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Wong Y; Institute of Ecology and Evolution, University of Oregon, Eugene, United States.
  • Baumdicker F; Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.
  • Cartwright RA; Department of Integrative Biology, University of California, Berkeley, Berkeley, United States.
  • Gorjanc G; Department of Biostatistics, University of Washington, Seattle, United States.
  • Gutenkunst RN; Broad Institute of MIT and Harvard, Cambridge, United States.
  • Kelleher J; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom.
  • Kern AD; Cluster of Excellence - Controlling Microbes to Fight Infections, Eberhard Karls Universit¨at Tubingen, Tubingen, Germany.
  • Ragsdale AP; School of Life Sciences and The Biodesign Institute, Arizona State University, Tempe, United States.
  • Ralph PL; The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom.
  • Schrider DR; Department of Molecular and Cellular Biology, University of Arizona, Tucson, United States.
  • Gronau I; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom.
Elife ; 122023 06 21.
Article em En | MEDLINE | ID: mdl-37342968
ABSTRACT
Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge. These challenges are especially pronounced for simulating genomes for species that are not well-studied, since it is not always clear what information is required to produce simulations with a level of realism sufficient to confidently answer a given question. The community-developed framework stdpopsim seeks to lower this barrier by facilitating the simulation of complex population genetic models using up-to-date information. The initial version of stdpopsim focused on establishing this framework using six well-characterized model species (Adrion et al., 2020). Here, we report on major improvements made in the new release of stdpopsim (version 0.2), which includes a significant expansion of the species catalog and substantial additions to simulation capabilities. Features added to improve the realism of the simulated genomes include non-crossover recombination and provision of species-specific genomic annotations. Through community-driven efforts, we expanded the number of species in the catalog more than threefold and broadened coverage across the tree of life. During the process of expanding the catalog, we have identified common sticking points and developed the best practices for setting up genome-scale simulations. We describe the input data required for generating a realistic simulation, suggest good practices for obtaining the relevant information from the literature, and discuss common pitfalls and major considerations. These improvements to stdpopsim aim to further promote the use of realistic whole-genome population genetic simulations, especially in non-model organisms, making them available, transparent, and accessible to everyone.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Genoma Tipo de estudo: Guideline Idioma: En Revista: Elife Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Genoma Tipo de estudo: Guideline Idioma: En Revista: Elife Ano de publicação: 2023 Tipo de documento: Article