Good-Turing frequency estimation in a finite population.
Biom J
; 57(2): 321-39, 2015 Mar.
Article
in En
| MEDLINE
| ID: mdl-25394337
Good-Turing frequency estimation (Good, ) is a simple, effective method for predicting detection probabilities of objects of both observed and unobserved classes based on observed frequencies of classes in a sample. The method has been used widely in several disciplines, such as information retrieval, computational linguistics, text recognition, and ecological diversity estimation. Nevertheless, existing studies assume sampling with replacement or sampling from an infinite population, which might be inappropriate for many practical applications. In light of this limitation, this article presents a modification of the Good-Turing estimation method to account for finite population sampling. We provide three practical extensions of the modified method, and we examine performance of the modified method and its extensions in simulation experiments.
Key words
Full text:
1
Database:
MEDLINE
Main subject:
Statistics as Topic
Language:
En
Journal:
Biom J
Year:
2015
Type:
Article
Affiliation country:
Taiwan