An information theoretic clustering approach for unveiling authorship affinities in Shakespearean era plays and poems.
PLoS One; 9(10): e111445, 2014.
Article in English | MEDLINE | ID: mdl-25347727
ABSTRACT
In this paper we analyse the word frequency profiles of a set of works from the Shakespearean era to uncover patterns of relationship between them, highlighting the connections within authorial canons. We used a text corpus comprising 256 plays and poems from the 16th and 17th centuries, with 17 works of uncertain authorship. Our clustering approach is based on the Jensen-Shannon divergence and a graph partitioning algorithm, and our results show that authors' characteristic styles are very powerful factors in explaining the variation of word use, frequently transcending cross-cutting factors like the differences between tragedy and comedy, early and late works, and plays and poems. Our method also provides an empirical guide to the authorship of plays and poems where this is unknown or disputed.
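The abstract names the Jensen-Shannon divergence as the measure used to compare word frequency profiles. As a minimal sketch of that measure only, the Python below computes it between two toy profiles. The function names `word_profile` and `jensen_shannon_divergence`, the lowercase whitespace tokenisation, and the use of base-2 logarithms are assumptions made for illustration. The record does not describe the paper's own preprocessing, and the graph partitioning step is not shown here.

```python
import math
from collections import Counter


def word_profile(text):
    """Relative word frequencies of a text.

    Assumption: lowercase, whitespace-split tokens; the paper's own
    tokenisation and word selection are not given in this record.
    """
    counts = Counter(text.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence, in bits, between two profiles.

    p and q map words to probabilities that each sum to 1.
    The result is symmetric and lies between 0 and 1.
    """
    jsd = 0.0
    for word in set(p) | set(q):
        pw = p.get(word, 0.0)
        qw = q.get(word, 0.0)
        mw = 0.5 * (pw + qw)  # mixture distribution M = (P + Q) / 2
        # Terms with zero probability contribute nothing (0 * log 0 = 0).
        if pw > 0:
            jsd += 0.5 * pw * math.log2(pw / mw)
        if qw > 0:
            jsd += 0.5 * qw * math.log2(qw / mw)
    return jsd


if __name__ == "__main__":
    a = word_profile("to be or not to be that is the question")
    b = word_profile("the question is not to be asked")
    print(jensen_shannon_divergence(a, b))
```

Computing this value for every pair of works would give a symmetric matrix of pairwise divergences, which is the kind of input a graph partitioning or other clustering step could then operate on.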
Full text: 1
Collections: 01-internacional
Database: MEDLINE
Main subject: Poetry as Topic / Authorship / Drama / Models, Theoretical
Study type: Prognostic_studies
Country/Region as subject: Europe
Language: En
Journal: PLoS One
Journal subject: SCIENCE / MEDICINE
Year of publication: 2014
Document type: Article
Country of affiliation: Australia