RESUMO
Our goal is to analyze improvement of scientific performance in a multidimensional outcome space, with a focus on US-based biomedical research. With the growing diversity of research databases, limiting assessment of scientific productivity to bibliometric measures such as number of publications, impact factor of journals and number of citations, is increasingly challenged. Using a wider range of outcomes, from publications through practice improvements to entrepreneurial outcomes, overcomes many current limitations in the study of research growth. However, combining such heterogeneous datasets raise three challenges: 1. gathering in one common place a variety of data shared as csv, xml or xls files, 2. merging and linking this data, that sometimes overlap, 3. assessing the impact of research production and inclusive practices in a multidimensional space, that are often missing from the datasets. We would like to present our solution for the first of those challenges, and discuss our leads for the second and third challenges.