RESUMEN
Brazil is one of the countries with the worst response against the pandemic scenario of coronavírus. At the beginning we were on average with 4000 deaths in a 24 hours period. In the course of this situation, large amounts of health and medicine datasets were being generated in real time, requiring effective ways to extract information and discover patterns that can help in the fight against this disease. And even more important is to monitor the progress of prophylactic measures and whether they are being effective in reducing the spread of the virus. Thus, the aim of this study is to analyze how the coronavirus has different ways to evolve in each Brazilian state with the influences of the vaccination process. To achieve this goal, the time series Clustering Technique based on a K-Means variation was applied, with the similarity metric Dynamic Time Warping (DTW). We produced this study using the data reported by the Ministry of Health in Brazil, referring to deaths per 100k inhabitants and all vaccination data available. Our results indicate an unevenly occurring vaccination and the need to identify other associated patterns with human development indices and other socio-economic indicators, being this the first analysis developed in the country, under the goals above.
RESUMEN
Extracting information and discovering patterns from a massive dataset is a hard task. In an epidemic scenario, this data has to be integrated providing organization, agility, transparency and, above all, it has to be free of any type of censorship or bias. The aim of this paper is to analyze how coronavirus contamination has evolved in Brazil applying unsupervised analysis algorithms to extract information and find characteristics between them. To achieve this goal we describe an implementation that uses data about Covid-19 spread in Brazilian states (26 states and the federal district), applying a Time Series Clustering technique based on a K-Means variation, using Dynamic Time Warping as a similarity metric. We used data reported by the Ministry of Health in Brazil, referring to deaths per 100k inhabitants, during 452 days from the first reported death in each state. Two analyzes were performed, one considering 3 clusters and the other with 6 clusters. Through these analysis, 3 patterns of responses to the pandemic can be observed, ranging from one of greater to lesser control of the pandemic, although in recent months all clusters showed a highly increase in the number of deaths. The identification of these patterns is important to highlight possible actions and events, as well as other characteristics that determine the correct or incorrect public decision-making in combating the Covid-19 pandemic.